Conditional Electrocardiogram Generation using Hierarchical Variation-al Autoencoders
Main Article Content
Abstract
Cardiovascular diseases remain the leading cause of mortality, and automated electrocardiogram (ECG) analysis can ease clinical workloads but is limited by scarce and imbalanced data. Synthetic ECG can mitigate these issues, and while most methods use Generative Adversarial Networks (GANs), recent work show variational autoencoders (VAEs) perform comparably. We introduce cNVAE-ECG, a conditional Nouveau VAE (NVAE) that generates high-resolution, 12-lead, 10-second ECGs with multiple pathologies. Leveraging a compact channel-generation scheme and class embeddings for multi-label conditioning, cNVAE-ECG improves downstream binary and multi-label classification, achieving up to a 2% AUROC gain in transfer learning over GAN-based models.
Keywords:
Article Details
References
2. Liu X., Wang H., Li Z., Qin L. Deep learning in ECG diagnosis: A review // Knowledge-Based Systems. 2021. Vol. 227. P. 107187.
3. Gerke S., Minssen T., Cohen G. Ethical and legal challenges of artificial intelligence-driven healthcare // Artificial Intelligence in Healthcare / Eds. A. Bohr, K. Memarzadeh. Academic Press, 2020. P. 295–336.
4. Reyna M.A., Sadr N., Alday E.A.P., et al. Will two do? Varying dimensions in electrocardiography: The PhysioNet/Computing in Cardiology Challenge 2021 // Computing in Cardiology. 2021. Vol. 48. P. 1–4.
5. Friesen G., Jannett T., Jadallah M., Yates S., Quint S., Nagle H. A comparison of the noise sensitivity of nine QRS detection algorithms // IEEE Transactions on Biomedical Engineering. 1990. Vol. 37, No. 1. P. 85–98.
6. Maron B.J., Friedman R.A., Kligfield P., et al. Assessment of the 12-lead ECG as a screening test for detection of cardiovascular disease in healthy general populations of young people (12–25 years of age) // Circulation. 2014. Vol. 130, No. 15. P. 1303–1334.
7. Kingma D.P., Welling M. Auto-encoding variational Bayes. 2022.
8. Vahdat A., Kautz J. NVAE: A deep hierarchical variational autoencoder. 2020.
9. Golany T., Radinsky K. PGANs: Personalized generative adversarial networks for ECG synthesis to improve patient-specific deep ECG classification // Proceedings of the AAAI Conference on Artificial Intelligence. 2019. Vol. 33, No. 1. P. 557–564.
10. Yang H., Liu J., Zhang L., Li Y., Zhang H. ProEGAN-MS: A progressive growing generative adversarial networks for electrocardiogram generation // IEEE Access. 2021. Vol. 9. P. 52089–52100.
11. Golany T., Radinsky K., Freedman D. SimGANs: Simulator-based generative adversarial networks for ECG synthesis to improve deep ECG classification // Proceedings of the 37th International Conference on Machine Learning (PMLR). 2020. Vol. 119. P. 3597–3606.
12. Nankani D., Baruah R.D. Investigating deep convolution conditional GANs for electrocardiogram generation // 2020 International Joint Conference on Neural Networks (IJCNN). 2020. P. 1–8.
13. Thambawita V., Isaksen J.L., Hicks S.A., et al. DeepFake electrocardiograms using generative adversarial networks are the beginning of the end for privacy issues in medicine // Scientific Reports. 2021. Vol. 11. P. 21896.
14. Wu J., Wang L., Pan H., Wang B. MLCGAN: Multi-lead ECG synthesis with multi label conditional generative adversarial network // ICASSP 2023 – IEEE International Conference on Acoustics, Speech and Signal Processing. 2023. P. 1–5.
15. Alcaraz J.M.L., Strodthoff N. Diffusion-based conditional ECG generation with structured state space models. 2023.
16. Ho J., Jain A., Abbeel P. Denoising diffusion probabilistic models // CoRR. 2020. abs/2006.11239.
17. Dhariwal P., Nichol A. Diffusion models beat GANs on image synthesis. 2021.
18. Xia Y., Wang W., Wang K. ECG signal generation based on conditional generative models // Biomedical Signal Processing and Control. 2023. Vol. 82. P. 104587.
19. El-Kaddoury M., Mahmoudi A., Himmi M.M. Deep generative models for image generation: A practical comparison between variational autoencoders and generative adversarial networks // Mobile, Secure, and Programmable Networking. Cham: Springer, 2019. P. 1–8.
20. Kuznetsov V., Moskalenko V., Gribanov D., Zolotykh N. Interpretable feature generation in ECG using a variational autoencoder // Frontiers in Genetics. 2021. Vol. 12. P. 638191.
21. Sang Y., Beetz M., Grau V. Generation of 12-lead electrocardiogram with subject-specific, image-derived characteristics using a conditional variational autoencoder // 2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI). 2022. P. 1–5.
22. Beetz M., Banerjee A., Sang Y., Grau V. Combined generation of electrocardiogram and cardiac anatomy models using multi-modal variational autoencoders // 2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI). 2022. P. 1–4.
23. Salimans T., Karpathy A., Chen X., Kingma D.P. PixelCNN++: Improving the PixelCNN with discretized logistic mixture likelihood and other modifications. 2017.
24. Deng L. The MNIST database of handwritten digit images for machine learning research [Best of the Web] // IEEE Signal Processing Magazine. 2012. Vol. 29, No. 6. P. 141–142.
25. Krizhevsky A. Learning multiple layers of features from tiny images. 2009.
26. Larsen A.B.L., Sønderby S.K., Larochelle H., Winther O. Autoencoding beyond pixels using a learned similarity metric. 2016.
27. Karras T., Aila T., Laine S., Lehtinen J. Progressive growing of GANs for improved quality, stability, and variation. 2018.
28. Malmivuo J., Plonsey R. Bioelectromagnetism. 15. 12-Lead ECG System. 1975. P. 277–289.
29. Griffin D., Lim J. Signal estimation from modified short-time Fourier transform // IEEE Transactions on Acoustics, Speech, and Signal Processing. 1984. Vol. 32, No. 2. P. 236–243.
30. Strodthoff N., Wagner P., Schaeffter T., Samek W. Deep learning for ECG analysis: Benchmarks and insights from PTB-XL // IEEE Journal of Biomedical and Health Informatics. 2021. Vol. 25, No. 5. P. 1519–1528.
31. Donahue C., McAuley J., Puckette M. Adversarial audio synthesis. 2019.
32. Wu C.-J., Raghavendra R., Gupta U., et al. Sustainable AI: Environmental implications, challenges and opportunities // ArXiv. 2021. abs/2111.00364.
33. Wagner P., Strodthoff N., Bousseljot R.-D., et al. PTB-XL, a large publicly available electrocardiography dataset // Scientific Data. 2020. Vol. 7. P. 154.
34. Goldberger A., Amaral L., Glass L., et al. PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals // Circulation. 2000. Vol. 101. P. e215–e220.
35. Zheng J., Chu H., Struppa D., et al. Optimal multi-stage arrhythmia classification approach // Scientific Reports. 2020. Vol. 10.
36. El-Sappagh S., Franda F., Ali F., Kwak K.-S. SNOMED CT standard ontology based on the ontology for general medical science // BMC Medical Informatics and Decision Making. 2018. Vol. 18, No. 1. P. 76.
37. Berkaya S.K., Uysal A.K., Gunal E.S., et al. A survey on ECG analysis // Biomedical Signal Processing and Control. 2018. Vol. 43. P. 216–235.
38. Zhang M., Wang Y., Luo T. Federated learning for arrhythmia detection of non-IID ECG // 2020 IEEE 6th International Conference on Computer and Communications (ICCC). 2020. P. 1176–1180.

This work is licensed under a Creative Commons Attribution 4.0 International License.
Presenting an article for publication in the Russian Digital Libraries Journal (RDLJ), the authors automatically give consent to grant a limited license to use the materials of the Kazan (Volga) Federal University (KFU) (of course, only if the article is accepted for publication). This means that KFU has the right to publish an article in the next issue of the journal (on the website or in printed form), as well as to reprint this article in the archives of RDLJ CDs or to include in a particular information system or database, produced by KFU.
All copyrighted materials are placed in RDLJ with the consent of the authors. In the event that any of the authors have objected to its publication of materials on this site, the material can be removed, subject to notification to the Editor in writing.
Documents published in RDLJ are protected by copyright and all rights are reserved by the authors. Authors independently monitor compliance with their rights to reproduce or translate their papers published in the journal. If the material is published in RDLJ, reprinted with permission by another publisher or translated into another language, a reference to the original publication.
By submitting an article for publication in RDLJ, authors should take into account that the publication on the Internet, on the one hand, provide unique opportunities for access to their content, but on the other hand, are a new form of information exchange in the global information society where authors and publishers is not always provided with protection against unauthorized copying or other use of materials protected by copyright.
RDLJ is copyrighted. When using materials from the log must indicate the URL: index.phtml page = elbib / rus / journal?. Any change, addition or editing of the author's text are not allowed. Copying individual fragments of articles from the journal is allowed for distribute, remix, adapt, and build upon article, even commercially, as long as they credit that article for the original creation.
Request for the right to reproduce or use any of the materials published in RDLJ should be addressed to the Editor-in-Chief A.M. Elizarov at the following address: amelizarov@gmail.com.
The publishers of RDLJ is not responsible for the view, set out in the published opinion articles.
We suggest the authors of articles downloaded from this page, sign it and send it to the journal publisher's address by e-mail scan copyright agreements on the transfer of non-exclusive rights to use the work.