A study on heart sound classification algorithm based on improved Mel-frequency cepstrum coefficient feature extraction and deep Transformer_Journal of Biomedical Engineering

Authors：

MENG Xin ,  ZHANG Sunjie

School of Opto-Electronic Information & Computer Engineering, University of Shanghai for Science & Technology, Shanghai 200093, P. R. China;

Corresponding?author：

ZHANG Sunjie, Email: zhang_sunjie@126.com

Keywords：

Heart sound classification; Modified Mel frequency cepstrum coefficients; Deep Transformer; Global average pooling; Focal loss

DOI：

10.7507/1001-5515.202502053

Video：

Export PDF Favorites Scan Get Citation

Abstract Full text Figures/Tables Video References Cited by

Heart sounds are critical for early detection of cardiovascular diseases, yet existing studies mostly focus on traditional signal segmentation, feature extraction, and shallow classifiers, which often fail to sufficiently capture the dynamic and nonlinear characteristics of heart sounds, limit recognition of complex heart sound patterns, and are sensitive to data imbalance, resulting in poor classification performance. To address these limitations, this study proposes a novel heart sound classification method that integrates improved Mel-frequency cepstral coefficients (MFCC) for feature extraction with a convolutional neural network (CNN) and a deep Transformer model. In the preprocessing stage, a Butterworth filter is applied for denoising, and continuous heart sound signals are directly processed without segmenting the cardiac cycles, allowing the improved MFCC features to better capture dynamic characteristics. These features are then fed into a CNN for feature learning, followed by global average pooling (GAP) to reduce model complexity and mitigate overfitting. Lastly, a deep Transformer module is employed to further extract and fuse features, completing the heart sound classification. To handle data imbalance, the model uses focal loss as the objective function. Experiments on two public datasets demonstrate that the proposed method performs effectively in both binary and multi-class classification tasks. This approach enables efficient classification of continuous heart sound signals, provides a reference methodology for future heart sound research for disease classification, and supports the development of wearable devices and home monitoring systems.

1.	盧官明, 李齊健, 盧峻禾, 等. 基于雙向長短時記憶網絡和自注意力機制的心音分類. 數據采集與處理, 2025, 40(2): 456-468.
2.	He Y, Li W, Zhang W, et al. Research on segmentation and classification of heart sound signals based on deep learning. Appl Sci, 2021, 11(2): 651-665.
3.	Li S, Li F, Tang S, et al. A review of computer-aided heart sound detection techniques. Biomed Res Int, 2020, 2020: 5846191.
4.	夏文鑫, 蔣捷, 張大斌, 等. 基于WST-BiLSTM的心音信號分類研究. 計算機與數字工程, 2025, 53(3): 776-780,810.
5.	肖斌, 陳嘉博, 畢秀麗, 等. 基于一維卷積神經網絡與循環神經網絡串聯的心音分析方法. 電子學報, 2022, 50(10): 2425-2432.
6.	劉偉偉, 桑勝波, 張宏鵬. 基于 CNN+ LSTM 的改進心音分類模型研究. 電子設計工程, 2022, 30(2): 38-42.
7.	彭利勇, 全海燕. 基于雙譜特征提取和卷積神經網絡的心音分類算法. 生物醫學工程學雜志, 2024, 41(5): 977-985,994.
8.	Khan S I, Qaisar S M, Pachori R B. Automated classification of valvular heart diseases using FBSE-EWT and PSR based geometrical features. Biomedical Signal Processing and Control, 2022, 73: 103445.
9.	Jang Y, Jung J, Hong Y, et al. Fully convolutional hybrid fusion network with heterogeneous representations for identification of S1 and S2 from phonocardiogram. IEEE J Biomed Health Inform, 2024, 28(12): 7151-7163.
10.	Bozkurt B, Germanakis I, Stylianou Y. A study of time-frequency features for CNN-based automatic heart sound classification for pathology detection. Comput Biol Med, 2018, 100: 132-143.
11.	Fang Y, Leng H, Wang W, et al. Multi-level feature encoding algorithm based on FBPSI for heart sound classification. Sci Rep, 2024, 14(1): 29132.
12.	Abduh Z, Nehary E A, Wahed M A, et al. Classification of heart sounds using fractional Fourier transform based mel-frequency spectral coefficients and stacked autoencoder deep neural network. J Med Imaging Health Inform, 2019, 9(1): 1-8.
13.	Nogueira D M, Ferreira C A, Gomes E F, et al. Classifying heart sounds using images of motifs, MFCC and temporal features. J Med Syst, 2019, 43(6): 168.
14.	Chen X, Li H, Huang Y, et al. Heart sound classification based on equal scale frequency cepstral coefficients and deep learning. Biomed Tech, 2023, 68(3): 285-295.
15.	Malleswari D, Vazram B J, Shaik R. Chronicnet: random forest classifier-based chronic heart failure detection with CNN feature analysis//Proceedings of the 2024 IEEE 13th International Conference on Communication Systems and Network Technologies (CSNT), Jabalpur: IEEE, 2024: 1076-1081.
16.	Ren Z, Qiao Y, Yuan Y, et al. Time and time frequency features integrated CNN model for heart sound signal detection//Proceedings of the 2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Las Vegas: IEEE, 2022: 1138-1143.
17.	Riccio D, Brancati N, Sannino G, et al. CNN-based classification of phonocardiograms using fractal techniques. Biomedical Signal Processing and Control, 2023, 86: 105186.
18.	Deng M, Meng T, Cao J, et al. Heart sound classification based on improved MFCC features and convolutional recurrent neural networks. Neural Netw, 2020, 130: 22-32.
19.	Nguyen M T, Lin W W, Huang J H. Heart sound classification using deep learning techniques based on log-mel spectrogram. Circ Syst Signal Process, 2023, 42(1): 344-360.
20.	Zhu L, Qian K, Wang Z, et al. Heart sound classification based on residual shrinkage networks. Annu Int Conf IEEE Eng Med Biol Soc, 2022, 2022: 4469-4472.
21.	Tian G, Lian C, Zeng Z, et al. Imbalanced heart sound signal classification based on two-stage trained dsanet. Cogn Comput, 2022, 14(4): 1378-1391.
22.	Jamil S, Roy A M. An efficient and robust phonocardiography (PCG)-based valvular heart diseases (VHD) detection framework using vision Transformer (ViT). Comput Biol Med, 2023, 158: 106734.
23.	Ranipa K, Zhu W P, Swamy M N S. A novel feature-level fusion scheme with multimodal attention CNN for heart sound classification. Comput Methods Programs Biomed, 2024, 248: 108122.
24.	Liu C, Springer D, Li Q, et al. An open access database for the evaluation of heart sound algorithms. Physiol Meas, 2016, 37(12): 2181-2213.
25.	Yaseen, Son G Y, Kwon S. Classification of heart sound signal using multiple features. Appl Sci, 2018, 8(12): 2344.
26.	Rubin J, Abreu R, Ganguli A, et al. Classifying heart sound recordings using deep convolutional neural networks and mel-frequency cepstral coefficients//2016 Computing in Cardiology Conference (CinC), Durham: IEEE, 2016: 813-816.
27.	Li F, Zhang Z, Wang L, et al. Heart sound classification based on improved Mel-frequency spectral coefficients and deep residual learning. Front Physiol, 2022, 13: 1084420.
28.	Arora V, Leekha R, Singh R, et al. Heart sound classification using machine learning and phonocardiogram. Mod Phys Lett B, 2019, 33(26): 1950321.
29.	Lin M, Chen Q, Yan S. Network in network. arXiv preprint, 2013, arXiv: 1312.4400.
30.	王幸之, 楊宏波, 宗容, 等. 基于子帶包絡和卷積神經網絡的心音分類算法. 生物醫學工程學雜志, 2021, 38(5): 969-978.
31.	Shuvo S B, Ali S N, Swapnil S I, et al. CardioXNet: a novel lightweight deep learning framework for cardiovascular disease classification using heart sound recordings. IEEE Access, 2021, 9: 36955-36967.
32.	Alkhodari M, Fraiwan L. Convolutional and recurrent neural networks for the detection of valvular heart diseases in phonocardiogram recordings. Comput Methods Programs Biomed, 2021, 200: 105940.
33.	Karhade J, Dash S, Ghosh S K, et al. Time-frequency-domain deep learning framework for the automated detection of heart valve disorders using PCG signals. IEEE Trans Instrum Meas, 2022, 71: 1-11.
34.	Chen J, Guo Z, Xu X, et al. A robust deep learning framework based on spectrograms for heart sound classification. IEEE/ACM Trans Comput Biol Bioinform, 2024, 21(4): 936-947.
35.	Guo Z, Chen J, He T, et al. DS-CNN: dual-stream convolutional neural networks-based heart sound classification for wearable devices. IEEE Trans Consum Electron, 2023, 69(4): 1186-1194.

1. 盧官明, 李齊健, 盧峻禾, 等. 基于雙向長短時記憶網絡和自注意力機制的心音分類. 數據采集與處理, 2025, 40(2): 456-468.
2. He Y, Li W, Zhang W, et al. Research on segmentation and classification of heart sound signals based on deep learning. Appl Sci, 2021, 11(2): 651-665.
3. Li S, Li F, Tang S, et al. A review of computer-aided heart sound detection techniques. Biomed Res Int, 2020, 2020: 5846191.
4. 夏文鑫, 蔣捷, 張大斌, 等. 基于WST-BiLSTM的心音信號分類研究. 計算機與數字工程, 2025, 53(3): 776-780,810.
5. 肖斌, 陳嘉博, 畢秀麗, 等. 基于一維卷積神經網絡與循環神經網絡串聯的心音分析方法. 電子學報, 2022, 50(10): 2425-2432.
6. 劉偉偉, 桑勝波, 張宏鵬. 基于 CNN+ LSTM 的改進心音分類模型研究. 電子設計工程, 2022, 30(2): 38-42.
7. 彭利勇, 全海燕. 基于雙譜特征提取和卷積神經網絡的心音分類算法. 生物醫學工程學雜志, 2024, 41(5): 977-985,994.
8. Khan S I, Qaisar S M, Pachori R B. Automated classification of valvular heart diseases using FBSE-EWT and PSR based geometrical features. Biomedical Signal Processing and Control, 2022, 73: 103445.
9. Jang Y, Jung J, Hong Y, et al. Fully convolutional hybrid fusion network with heterogeneous representations for identification of S1 and S2 from phonocardiogram. IEEE J Biomed Health Inform, 2024, 28(12): 7151-7163.
10. Bozkurt B, Germanakis I, Stylianou Y. A study of time-frequency features for CNN-based automatic heart sound classification for pathology detection. Comput Biol Med, 2018, 100: 132-143.
11. Fang Y, Leng H, Wang W, et al. Multi-level feature encoding algorithm based on FBPSI for heart sound classification. Sci Rep, 2024, 14(1): 29132.
12. Abduh Z, Nehary E A, Wahed M A, et al. Classification of heart sounds using fractional Fourier transform based mel-frequency spectral coefficients and stacked autoencoder deep neural network. J Med Imaging Health Inform, 2019, 9(1): 1-8.
13. Nogueira D M, Ferreira C A, Gomes E F, et al. Classifying heart sounds using images of motifs, MFCC and temporal features. J Med Syst, 2019, 43(6): 168.
14. Chen X, Li H, Huang Y, et al. Heart sound classification based on equal scale frequency cepstral coefficients and deep learning. Biomed Tech, 2023, 68(3): 285-295.
15. Malleswari D, Vazram B J, Shaik R. Chronicnet: random forest classifier-based chronic heart failure detection with CNN feature analysis//Proceedings of the 2024 IEEE 13th International Conference on Communication Systems and Network Technologies (CSNT), Jabalpur: IEEE, 2024: 1076-1081.
16. Ren Z, Qiao Y, Yuan Y, et al. Time and time frequency features integrated CNN model for heart sound signal detection//Proceedings of the 2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Las Vegas: IEEE, 2022: 1138-1143.
17. Riccio D, Brancati N, Sannino G, et al. CNN-based classification of phonocardiograms using fractal techniques. Biomedical Signal Processing and Control, 2023, 86: 105186.
18. Deng M, Meng T, Cao J, et al. Heart sound classification based on improved MFCC features and convolutional recurrent neural networks. Neural Netw, 2020, 130: 22-32.
19. Nguyen M T, Lin W W, Huang J H. Heart sound classification using deep learning techniques based on log-mel spectrogram. Circ Syst Signal Process, 2023, 42(1): 344-360.
20. Zhu L, Qian K, Wang Z, et al. Heart sound classification based on residual shrinkage networks. Annu Int Conf IEEE Eng Med Biol Soc, 2022, 2022: 4469-4472.
21. Tian G, Lian C, Zeng Z, et al. Imbalanced heart sound signal classification based on two-stage trained dsanet. Cogn Comput, 2022, 14(4): 1378-1391.
22. Jamil S, Roy A M. An efficient and robust phonocardiography (PCG)-based valvular heart diseases (VHD) detection framework using vision Transformer (ViT). Comput Biol Med, 2023, 158: 106734.
23. Ranipa K, Zhu W P, Swamy M N S. A novel feature-level fusion scheme with multimodal attention CNN for heart sound classification. Comput Methods Programs Biomed, 2024, 248: 108122.
24. Liu C, Springer D, Li Q, et al. An open access database for the evaluation of heart sound algorithms. Physiol Meas, 2016, 37(12): 2181-2213.
25. Yaseen, Son G Y, Kwon S. Classification of heart sound signal using multiple features. Appl Sci, 2018, 8(12): 2344.
26. Rubin J, Abreu R, Ganguli A, et al. Classifying heart sound recordings using deep convolutional neural networks and mel-frequency cepstral coefficients//2016 Computing in Cardiology Conference (CinC), Durham: IEEE, 2016: 813-816.
27. Li F, Zhang Z, Wang L, et al. Heart sound classification based on improved Mel-frequency spectral coefficients and deep residual learning. Front Physiol, 2022, 13: 1084420.
28. Arora V, Leekha R, Singh R, et al. Heart sound classification using machine learning and phonocardiogram. Mod Phys Lett B, 2019, 33(26): 1950321.
29. Lin M, Chen Q, Yan S. Network in network. arXiv preprint, 2013, arXiv: 1312.4400.
30. 王幸之, 楊宏波, 宗容, 等. 基于子帶包絡和卷積神經網絡的心音分類算法. 生物醫學工程學雜志, 2021, 38(5): 969-978.
31. Shuvo S B, Ali S N, Swapnil S I, et al. CardioXNet: a novel lightweight deep learning framework for cardiovascular disease classification using heart sound recordings. IEEE Access, 2021, 9: 36955-36967.
32. Alkhodari M, Fraiwan L. Convolutional and recurrent neural networks for the detection of valvular heart diseases in phonocardiogram recordings. Comput Methods Programs Biomed, 2021, 200: 105940.
33. Karhade J, Dash S, Ghosh S K, et al. Time-frequency-domain deep learning framework for the automated detection of heart valve disorders using PCG signals. IEEE Trans Instrum Meas, 2022, 71: 1-11.
34. Chen J, Guo Z, Xu X, et al. A robust deep learning framework based on spectrograms for heart sound classification. IEEE/ACM Trans Comput Biol Bioinform, 2024, 21(4): 936-947.
35. Guo Z, Chen J, He T, et al. DS-CNN: dual-stream convolutional neural networks-based heart sound classification for wearable devices. IEEE Trans Consum Electron, 2023, 69(4): 1186-1194.

Journal of Biomedical Engineering

Latest ArticlesA study on heart sound classification algorithm based on improved Mel-frequency cepstrum coefficient feature extraction and deep Transformer

Abstract Full text Figures/Tables Video References Cited by

Format

Content