Show simple item record

dc.contributor.authorGhaderi, Hamid
dc.contributor.authorForeman, Brandon
dc.contributor.authorNayebi, Amin
dc.contributor.authorTipirneni, Sindhu
dc.contributor.authorReddy, Chandan K
dc.contributor.authorSubbian, Vignesh
dc.date.accessioned2024-03-30T01:06:36Z
dc.date.available2024-03-30T01:06:36Z
dc.date.issued2023-05-22
dc.identifier.citationGhaderi, H., Foreman, B., Nayebi, A., Tipirneni, S., Reddy, C. K., & Subbian, V. (2023). A self-supervised learning-based approach to clustering multivariate time-series data with missing values (SLAC-Time): An application to TBI phenotyping. Journal of Biomedical Informatics, 143, 104401.en_US
dc.identifier.pmid37225066
dc.identifier.doi10.1016/j.jbi.2023.104401
dc.identifier.urihttp://hdl.handle.net/10150/671970
dc.description.abstractSelf-supervised learning approaches provide a promising direction for clustering multivariate time-series data. However, real-world time-series data often include missing values, and the existing approaches require imputing missing values before clustering, which may cause extensive computations and noise and result in invalid interpretations. To address these challenges, we present a Self-supervised Learning-based Approach to Clustering multivariate Time-series data with missing values (SLAC-Time). SLAC-Time is a Transformer-based clustering method that uses time-series forecasting as a proxy task for leveraging unlabeled data and learning more robust time-series representations. This method jointly learns the neural network parameters and the cluster assignments of the learned representations. It iteratively clusters the learned representations with the K-means method and then utilizes the subsequent cluster assignments as pseudo-labels to update the model parameters. To evaluate our proposed approach, we applied it to clustering and phenotyping Traumatic Brain Injury (TBI) patients in the Transforming Research and Clinical Knowledge in Traumatic Brain Injury (TRACK-TBI) study. Clinical data associated with TBI patients are often measured over time and represented as time-series variables characterized by missing values and irregular time intervals. Our experiments demonstrate that SLAC-Time outperforms the baseline K-means clustering algorithm in terms of silhouette coefficient, Calinski Harabasz index, Dunn index, and Davies Bouldin index. We identified three TBI phenotypes that are distinct from one another in terms of clinically significant variables as well as clinical outcomes, including the Extended Glasgow Outcome Scale (GOSE) score, Intensive Care Unit (ICU) length of stay, and mortality rate. The experiments show that the TBI phenotypes identified by SLAC-Time can be potentially used for developing targeted clinical trials and therapeutic strategies.en_US
dc.language.isoenen_US
dc.publisherAcademic Press Inc.en_US
dc.rights© 2023 Elsevier Inc. All rights reserved.en_US
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en_US
dc.subjectclusteringen_US
dc.subjectMultivariate time-series dataen_US
dc.subjectSelf-supervised Learningen_US
dc.subjectTransformeren_US
dc.subjectTraumatic brain injuryen_US
dc.titleA self-supervised learning-based approach to clustering multivariate time-series data with missing values (SLAC-Time): An application to TBI phenotypingen_US
dc.typeArticleen_US
dc.identifier.eissn1532-0480
dc.contributor.departmentDepartment of Systems and Industrial Engineering, University of Arizonaen_US
dc.contributor.departmentDepartment of Biomedical Engineering, University of Arizonaen_US
dc.identifier.journalJournal of biomedical informaticsen_US
dc.description.note12 month embargo; first published 22 May 2023en_US
dc.description.collectioninformationThis item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at repository@u.library.arizona.edu.en_US
dc.eprint.versionFinal accepted manuscripten_US
dc.source.journaltitleJournal of biomedical informatics
dc.source.volume143
dc.source.beginpage104401
dc.source.endpage
dc.source.countryUnited States
dc.source.countryUnited States


Files in this item

Thumbnail
Name:
Final version.pdf
Size:
1.788Mb
Format:
PDF
Description:
Final Accepted Manuscript

This item appears in the following Collection(s)

Show simple item record