Data Reconstruction Based on Temporal Expressions in Clinical Notes


Tang C. (second author). 11/18/2019. “Data Reconstruction Based on Temporal Expressions in Clinical Notes.” In 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Pp. 1004-1008. San Diego, CA, USA: IEEE.


Learning representations of clinical notes is challenging because their complex content requires preprocessing before the data are suitable for data mining. An important issue, addressed here, is the handling of temporal expressions, whose cues indicate when clinical events occur. We present a three-step data reconstruction algorithm that transforms similar clinical entities (e.g., symptoms, complications) into sequential data through unsupervised annotation of temporal expressions. First, the algorithm detects whether an expression has temporal intent. Second, it decomposes and rewrites the expression into a non-temporal sub-expression and temporal constraints. Finally, it clusters similar non-temporal sub-expressions using unsupervised sentence embedding under a modified K-medoids paradigm. We evaluated the proposed algorithm on clinical notes associated with chronic obstructive pulmonary disease (COPD). Visualizing the reconstruction results for cardiology reports from a longitudinal cohort of patients with COPD demonstrated that the algorithm is feasible.
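The three steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The regular expression `TEMPORAL`, the bag-of-words `embed` function (a stand-in for a real sentence embedding), the plain K-medoids loop (not the modified variant the paper refers to), and all function names and sample phrases are assumptions introduced here for illustration only.

```python
import re
from collections import Counter
from math import sqrt

# Assumption: a tiny, hypothetical set of temporal cues. Real clinical text
# needs far broader coverage than this pattern provides.
TEMPORAL = re.compile(
    r"\b(?:(?:for|since|after|before|within)\s+)?"
    r"(?:\d+\s+(?:day|week|month|year)s?(?:\s+ago)?"
    r"|yesterday|today|last\s+(?:week|month|year))\b",
    re.IGNORECASE,
)

def has_temporal_intent(expr):
    """Step 1: detect whether the expression contains a temporal cue."""
    return TEMPORAL.search(expr) is not None

def decompose(expr):
    """Step 2: split into a non-temporal core and a list of temporal constraints."""
    constraints = [m.group(0) for m in TEMPORAL.finditer(expr)]
    core = " ".join(TEMPORAL.sub(" ", expr).split()).strip(" ,.;")
    return core, constraints

def embed(text):
    """Assumption: bag-of-words counts stand in for a sentence embedding."""
    return Counter(text.lower().split())

def cosine_distance(a, b):
    dot = sum(a[t] * b[t] for t in a)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return max(0.0, 1.0 - dot / norm) if norm else 1.0

def k_medoids(items, k, iters=20):
    """Step 3: plain K-medoids over pairwise cosine distances."""
    k = min(k, len(items))
    vecs = [embed(s) for s in items]
    dist = [[cosine_distance(a, b) for b in vecs] for a in vecs]
    medoids = list(range(k))  # deterministic start: the first k items

    def assign():
        return [min(range(k), key=lambda c: dist[i][medoids[c]])
                for i in range(len(items))]

    for _ in range(iters):
        labels = assign()
        updated = []
        for c in range(k):
            members = [i for i, lab in enumerate(labels) if lab == c]
            # The new medoid minimises total distance to its cluster members.
            updated.append(
                min(members, key=lambda i: sum(dist[i][j] for j in members))
                if members else medoids[c]
            )
        if updated == medoids:
            break
        medoids = updated
    return assign()

def reconstruct(expressions, k):
    """Run all three steps; return (cluster_id, core, constraints) tuples."""
    parts = [decompose(e) for e in expressions if has_temporal_intent(e)]
    labels = k_medoids([core for core, _ in parts], k)
    return [(lab, core, cons) for lab, (core, cons) in zip(labels, parts)]

if __name__ == "__main__":
    notes = [
        "shortness of breath for 3 days",
        "chest pain since last week",
        "shortness of breath 2 weeks ago",
        "patient reports chest pain yesterday",
        "no known allergies",
    ]
    for row in reconstruct(notes, k=2):
        print(row)
```

In this sketch, expressions without a temporal cue are dropped at step 1, and each remaining expression keeps its temporal constraints alongside the cluster label, so entries in one cluster could later be ordered in time. Swapping `embed` for a learned sentence encoder would bring the sketch closer to the approach the abstract describes.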

Last updated on 03/11/2021