Unsupervised Clustering of Single-Lead ECGs Associates with Prevalent and Incident Heart Failure in Coronary Artery Disease
Aims
Clinical consequences of coronary artery disease (CAD) are varied (e.g., atrial fibrillation [AF] and heart failure [HF]) and current risk stratification tools are ineffective. We aimed to identify clusters of individuals with CAD exhibiting unique patterns on the electrocardiogram (ECG) in an unsupervised manner and assess their association with cardiovascular risk.
Methods
Twenty-one ECG markers were derived from single-lead median-beat ECGs of 1,928 individuals with CAD without a previous diagnosis of AF, HF or ventricular arrhythmia (VA) from the imaging study in UK Biobank (CAD-IMG-UKB). An unsupervised clustering algorithm was used to group these markers into distinct clusters. We characterized each cluster according to their demographic and ECG characteristics, as well as their prevalent and incident risk of AF, HF and VA (4-year median follow-up). Validation and association with prevalent diagnoses was performed in an independent cohort of 1,644 individuals.
Results
The model identified two clusters within the CAD-IMG-UKB cohort. Cluster 1 (N=359) exhibited prolonged QRS duration and QT intervals, along with greater morphological variations in QRS and T waves, compared to cluster 2 (N=1,569). Cluster 1, relative to cluster 2, had a significantly higher risk of incident HF (hazard ratio [HR]:2.40, 95%-confidence interval [CI]:1.51-3.83), confirmed by independent validation (HR:1.77, CI:1.31-2.41). It also showed a higher association with prevalent HF (odds ratio:4.10, CI:2.02-8.29), independent of clinical risk factors.
Conclusions
Our approach identified a cluster of individuals with CAD sharing ECG characteristics indicating HF risk, holding significant implications for targeted treatment and prevention enabling accessible large-scale screening.