A Novel Multi-Task Learning Framework for Simultaneous Digitization and Classification of Electrocardiogram Images

Jingsu Kang1 and Hao WEN2
1Tianjin Medical University, 2China Agricultural University


Abstract

Aim: This paper addresses the challenge of simultaneous digitization and classification (normal or abnormal) of electrocardiograms (ECGs) captured from images or paper printouts, as presented by the George B. Moody PhysioNet Challenge 2024. This problem is crucial in medical image analysis.

Methods: We propose an end-to-end multi-task learning framework to digitize and classify ECG images concurrently. The framework includes a shared feature extraction backbone network and two task-specific head networks. We experimented with various state-of-the-art pretrained backbone architectures, such as ResNet, ConvNeXt, and Swin Transformer. Model training employed a combination of cross-entropy loss for classification and signal-to-noise ratio (SNR) loss for digitization. Optimization was performed using the AdamW optimizer and OneCycle learning rate scheduler. ECG image data were synthesized from the PTB-XL ECG dataset, comprising 21,799 12-lead time-series ECG recordings. We allocated 10% of the data as a validation set for model selection, following the PTB-XL dataset's predefined split. The selection was based on a combined evaluation of classification and digitization performance.

Further study will focus on exploring and comparing an alternative several-stage solution to the digitization problem, consisting primarily of an object detection model and a self-adaptive edge sharpening algorithm for ECG curve extraction.

Results: The best submission entry of our team "Revenger" achieved an F1 score of 0.626 (ranked 16th) for the classification task and an SNR of -13.203 (ranked 30th) for the digitization task on the hidden validation set.

Conclusion: The proposed multi-task learning framework achieved competitive performance in the Challenge, providing an effective solution to the problem of simultaneous digitization and classification of ECG images. The alternative solution is expected to improve the digitization performance further.