Diffusion Model-based Medical Image Generation as a Potential Data Augmentation Strategy for AI Applications

Zijian Cao; Jueye Zhang; Chen Lin; Tian Li; Hao Wu; Yibao Zhang

doi:10.2174/0115734056401610250827114351

ISSN: 1573-4056
E-ISSN: 1875-6603

HTML

oa Diffusion Model-based Medical Image Generation as a Potential Data Augmentation Strategy for AI Applications
Authors: Zijian Cao¹, Jueye Zhang², Chen Lin², Tian Li³, Hao Wu^1,4 and Yibao Zhang⁴
View Affiliations Hide Affiliations

¹ Institute of Medical Technology, Peking University Health Science Center, Beijing 100191, China ² State Key Laboratory of Nuclear Physics and Technology, Peking University School of Physics, Beijing 100871, China ³ Department of Health Technology and Informatics, The Hong Kong Polytechnic University, Hong Kong SAR, 999077, China ⁴ Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Department of Radiation Oncology, Peking University Cancer Hospital & Institute, Beijing 100142, China
Source: Current Medical Imaging, Volume 21, Issue 1, Jan 2025, E15734056401610
DOI: https://doi.org/10.2174/0115734056401610250827114351
- Received: 25 Mar 2025
- Accepted: 02 Jun 2025
- Available online: 01 Sep 2025

Abstract

Introduction

This study explored a generative image synthesis method based on diffusion models, potentially providing a low-cost and high-efficiency training data augmentation strategy for medical artificial intelligence (AI) applications.

Methods

The MedMNIST v2 dataset was utilized as a small-volume training dataset under low-performance computing conditions. Based on the characteristics of existing samples, new medical images were synthesized using the proposed annotated diffusion model. In addition to observational assessment, quantitative evaluation was performed based on the gradient descent of the loss function during the generation process and the Fréchet Inception Distance (FID), using various loss functions and feature vector dimensions.

Results

Compared to the original data, the proposed diffusion model successfully generated medical images of similar styles but with dramatically varied anatomic details. The model trained with the Huber loss function achieved a higher FID of 15.2 at a feature vector dimension of 2048, compared with the model trained with the L2 loss function, which achieved the best FID of 0.85 at a feature vector dimension of 64.

Discussion

The use of the Huber loss enhanced model robustness, while FID values indicated acceptable similarity between generated and real images. Future work should explore the application of these models to more complex datasets and clinical scenarios.

Conclusion

This study demonstrated that diffusion model-based medical image synthesis is potentially applicable as an augmentation strategy for AI, particularly in situations where access to real clinical data is limited. Optimal training parameters were also proposed by evaluating the dimensionality of feature vectors in FID calculations and the complexity of loss functions.

This is an open access article published under CC BY 4.0 https://creativecommons.org/licenses/by/4.0/legalcode

Article metrics loading...

/content/journals/cmir/10.2174/0115734056401610250827114351

2025-09-01

2026-03-06

From This Site

/content/journals/cmir/10.2174/0115734056401610250827114351

dcterms_title,dcterms_subject,pub_keyword

-contentType:Contributor -contentType:Concept -contentType:Institution

10

5

Full text loading...

/deliver/fulltext/cmir/21/1/CMIR-21-E15734056401610.html?itemId=/content/journals/cmir/10.2174/0115734056401610250827114351&mimeType=html&fmt=ahah

References

GuY. ChiJ. LiuJ. YangL. ZhangB. YuD. ZhaoY. LuX. A survey of computer-aided diagnosis of lung nodules from CT scans using deep learning.Comput. Biol. Med.202113710480610.1016/j.compbiomed.2021.10480634461501
[Google Scholar]
WangJ. ZhuH. WangS.H. ZhangY.D. A review of deep learning on medical image analysis.Mob. Netw. Appl.202126135138010.1007/s11036‑020‑01672‑7
[Google Scholar]
HollisK.F. To share or not to share: Ethical acquisition and use of medical data.AMIA Jt. Summits Transl. Sci. Proc.2016201642042727570683
[Google Scholar]
HasaniN. FarhadiF. MorrisM.A. NikpanahM. RahmimA. XuY. PariserA. CollinsM.T. SummersR.M. JonesE. SiegelE. SabouryB. Artificial intelligence in medical imaging and its impact on the rare disease community: Threats, challenges and opportunities.PET Clin.2022171132910.1016/j.cpet.2021.09.00934809862
[Google Scholar]
FuhrmanJ.D. GorreN. HuQ. LiH. El NaqaI. GigerM.L. A review of explainable and interpretable AI with applications in COVID-19 imaging.Med. Phys.202249111410.1002/mp.1535934796530
[Google Scholar]
KerthJ.L. HagemeisterM. BischopsA.C. ReinhartL. DukartJ. HeinrichsB. EickhoffS.B. MeissnerT. Artificial intelligence in the care of children and adolescents with chronic diseases: A systematic review.Eur. J. Pediatr.202418418310.1007/s00431‑024‑05846‑339672974
[Google Scholar]
DhariwalP. NicholA. Diffusion models beat GANs on image synthesis.Proceedings of the 35th International Conference on Neural Information Processing Systems (NIPS ’21)Red Hook, NY, USA, 06 December 2021, pp. 672-686.
[Google Scholar]
StypułkowskiM. VougioukasK. HeS. ZiębaM. PetridisS. PanticM. Diffused heads: Diffusion models beat gans on talking-face generation.Proceedings of the IEEE/CVF Winter Conference on Applications of Computer VisionWaikoloa, HI, USA, 03-08 January 2024, pp. 5089-5098.10.1109/WACV57701.2024.00502
[Google Scholar]
MukhopadhyayS. GwilliamM. AgarwalV. PadmanabhanN. SwaminathanA. HegdeS. ZhouT. ShrivastavaA. Diffusion models beat gans on image classification.arXiv2023arXiv:2307.08702v110.48550/arXiv.2307.08702
[Google Scholar]
KingmaD.P. WellingM. An introduction to variational autoencoders.Found. Trends Mach. Learn.201912430739210.1561/2200000056
[Google Scholar]
UemuraT. NäppiJ.J. RyuY. WatariC. KamiyaT. YoshidaH. A generative flow-based model for volumetric data augmentation in 3D deep learning for computed tomographic colonography.Int. J. CARS2021161818910.1007/s11548‑020‑02275‑z33150471
[Google Scholar]
HoJ. JainA. AbbeelP. Denoising diffusion probabilistic models. LarochelleH. RanzatoM. HadsellR. BalcanM.F. LinH. Advances in neural information processing systems.33Curran Associates, Inc.202068406851
[Google Scholar]
YangJ. ShiR. WeiD. LiuZ. ZhaoL. KeB. PfisterH. NiB. MedMNIST v2 - A large-scale lightweight benchmark for 2D and 3D biomedical image classification.Sci. Data20231014110.1038/s41597‑022‑01721‑836658144
[Google Scholar]
WangB. WangH. CaoG. Enhanced slicing prototype and hybrid metric transformer for few-shot medical image classification.2024 IEEE International Conference on Systems, Man, and Cybernetics (SMC)Kuching, Malaysia, 06-10 October 2024, pp. 2275-2281.10.1109/SMC54092.2024.10831734
[Google Scholar]
JainG. MittalD. ThakurD. MittalM.K. A deep learning approach to detect Covid-19 coronavirus with X-Ray images.Biocybern. Biomed. Eng.20204041391140510.1016/j.bbe.2020.08.00832921862
[Google Scholar]
YodaS. KawazoeH. KurokiY. Convolutional dictionary learning with Huber error and L1 regularization terms.2021 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)Hualien City, Taiwan, 16-19 November 2021, pp. 1-2.10.1109/ISPACS51563.2021.9651025
[Google Scholar]
KhrapovA. PopovV. SadekovaT. YermekovaA. KudinovM. Improving diffusion models’s data-corruption resistance using scheduled pseudo-huber loss.arXiv2024arXiv:2403.16728v110.48550/arXiv.2403.16728
[Google Scholar]
ZhangD. WangJ. LuoF. Directly denoising diffusion models.arXiv2024arXiv:2405.13540v210.48550/arXiv.2405.13540
[Google Scholar]
HurkmansC. BibaultJ.E. BrockK.K. van ElmptW. FengM. David FullerC. Jereczek-FossaB.A. KorremanS. LandryG. MadestaF. MayoC. McWilliamA. MouraF. MurenL.P. El NaqaI. SeuntjensJ. ValentiniV. VelecM. A joint ESTRO and AAPM guideline for development, clinical validation and reporting of artificial intelligence models in radiation therapy.Radiother. Oncol.202419711034510.1016/j.radonc.2024.11034538838989
[Google Scholar]

/content/journals/cmir/10.2174/0115734056401610250827114351

Diffusion Model-based Medical Image Generation as a Potential Data Augmentation Strategy for AI Applications

Curr. Med. Imaging 21, E15734056401610 (2025); https://doi.org/10.2174/0115734056401610250827114351

/content/journals/cmir/10.2174/0115734056401610250827114351

Data & Media loading...

Article Type: Research Article

Keyword(s): AI training; Artificial intelligence; Data augmentation; Diffusion models; Image generation; Medical radiology

oa Diffusion Model-based Medical Image Generation as a Potential Data Augmentation Strategy for AI Applications

Abstract

From This Site

Most Read This Month

Most Cited Most Cited RSS feed

Small Animal Computed Tomography Imaging

Brain Tumor Detection Using Machine Learning and Deep Learning: A Review

Low-dose COVID-19 CT Image Denoising Using CNN and its Method Noise Thresholding

How to Collect and Interpret Medical Pictures Captured in Highly Challenging Environments that Range from Nanoscale to Hyperspectral Imaging

SegEIR-Net: A Robust Histopathology Image Analysis Framework for Accurate Breast Cancer Classification

An Efficient Ensemble-based Machine Learning approach for Predicting Chronic Kidney Disease

Automated Diagnosis of Bone Metastasis by Classifying Bone Scintigrams Using a Self-defined Deep Learning Model

Prediction of Lumbar Pedicle Screw Loosening Using Hounsfield Units in Computed Tomography

AI-assisted Method for Efficiently Generating Breast Ultrasound Screening Reports

Is Gadoxetic Acid Disodium (Gd-EOB-DTPA)-Enhanced Magnetic Resonance Imaging an Accurate Diagnostic Method for Hepatocellular Carcinoma? A Systematic Review with Meta-Analysis