Study on the Generalization Ability of Accelerometer Threshold-based Methods for Noise Detection in PPG Signals

Santiago Mula1, Roberto Zangróniz1, Jose J Rieta2, Raul Alcaraz1
1University of Castilla-La Mancha, 2BioMIT.org, Universitat Politecnica Valencia


Abstract

Continuous heart rate monitoring through wearable devices incorporating photoplethysmogram (PPG) sensors commonly provides very noisy signals. One common approach to deal with this limitation is the use of an accelerometer to discard periods where movement exceeds a certain threshold. Precisely, motion artifacts are one of the main noise sources sharing frequency band with the PPG signal and cannot be removed by filtering without discarding the desired signal as well. However, little research can be found in the literature about how extrapolable the accelerometer threshold-based methods are to successfully work with different databases. Hence, the aim of the present work is to quantitatively assess the performance of some of these methods on several datasets.

All recordings from the public datasets PPG DaLiA and WESAD were segmented and labeled as clean or noisy on the difference in the inter-beat interval between the available synchronous ECG and PPG signals, as this labeling method has been deemed more objective than manual annotation. Next, diverse acceleration thresholds were applied to two acceleration metrics, i.e., the acceleration vector module for the raw signal and the filtered signal. Finally, the classification obtained by each threshold was compared with the reference labels in terms of accuracy.

Values of maximum accuracy larger than 75% were obtained on both databases, but for accelerometer thresholds notably different. Precisely, when an error lower than 8 bpm the accelerometer threshold reporting the higest accuracy was 0.05g for PPG DaLiA and 0.26g for WESAD. Similarly, for an error of 3 bpm, maximum accuracy was obtained for the thresholds of 0.02g and 0.24g, respectively.

These results suggest that accelerometer threshold-based methods for PPG noise detection can be moderately effective for a single dataset, but fail to generalize to other databases acquired with different PPG sensors. This involves a large limitation for many real-world medical applications.