Skip to content
2000
Volume 21, Issue 1
  • ISSN: 1573-4056
  • E-ISSN: 1875-6603

Abstract

Introduction

This study explored a generative image synthesis method based on diffusion models, potentially providing a low-cost and high-efficiency training data augmentation strategy for medical artificial intelligence (AI) applications.

Methods

The MedMNIST v2 dataset was utilized as a small-volume training dataset under low-performance computing conditions. Based on the characteristics of existing samples, new medical images were synthesized using the proposed annotated diffusion model. In addition to observational assessment, quantitative evaluation was performed based on the gradient descent of the loss function during the generation process and the Fréchet Inception Distance (FID), using various loss functions and feature vector dimensions.

Results

Compared to the original data, the proposed diffusion model successfully generated medical images of similar styles but with dramatically varied anatomic details. The model trained with the Huber loss function achieved a higher FID of 15.2 at a feature vector dimension of 2048, compared with the model trained with the L2 loss function, which achieved the best FID of 0.85 at a feature vector dimension of 64.

Discussion

The use of the Huber loss enhanced model robustness, while FID values indicated acceptable similarity between generated and real images. Future work should explore the application of these models to more complex datasets and clinical scenarios.

Conclusion

This study demonstrated that diffusion model-based medical image synthesis is potentially applicable as an augmentation strategy for AI, particularly in situations where access to real clinical data is limited. Optimal training parameters were also proposed by evaluating the dimensionality of feature vectors in FID calculations and the complexity of loss functions.

This is an open access article published under CC BY 4.0 https://creativecommons.org/licenses/by/4.0/legalcode
Loading

Article metrics loading...

/content/journals/cmir/10.2174/0115734056401610250827114351
2025-09-01
2025-10-29
Loading full text...

Full text loading...

/deliver/fulltext/cmir/21/1/CMIR-21-E15734056401610.html?itemId=/content/journals/cmir/10.2174/0115734056401610250827114351&mimeType=html&fmt=ahah

References

  1. GuY. ChiJ. LiuJ. YangL. ZhangB. YuD. ZhaoY. LuX. A survey of computer-aided diagnosis of lung nodules from CT scans using deep learning.Comput. Biol. Med.202113710480610.1016/j.compbiomed.2021.10480634461501
    [Google Scholar]
  2. WangJ. ZhuH. WangS.H. ZhangY.D. A review of deep learning on medical image analysis.Mob. Netw. Appl.202126135138010.1007/s11036‑020‑01672‑7
    [Google Scholar]
  3. HollisK.F. To share or not to share: Ethical acquisition and use of medical data.AMIA Jt. Summits Transl. Sci. Proc.2016201642042727570683
    [Google Scholar]
  4. HasaniN. FarhadiF. MorrisM.A. NikpanahM. RahmimA. XuY. PariserA. CollinsM.T. SummersR.M. JonesE. SiegelE. SabouryB. Artificial intelligence in medical imaging and its impact on the rare disease community: Threats, challenges and opportunities.PET Clin.2022171132910.1016/j.cpet.2021.09.00934809862
    [Google Scholar]
  5. FuhrmanJ.D. GorreN. HuQ. LiH. El NaqaI. GigerM.L. A review of explainable and interpretable AI with applications in COVID-19 imaging.Med. Phys.202249111410.1002/mp.1535934796530
    [Google Scholar]
  6. KerthJ.L. HagemeisterM. BischopsA.C. ReinhartL. DukartJ. HeinrichsB. EickhoffS.B. MeissnerT. Artificial intelligence in the care of children and adolescents with chronic diseases: A systematic review.Eur. J. Pediatr.202418418310.1007/s00431‑024‑05846‑339672974
    [Google Scholar]
  7. DhariwalP. NicholA. Diffusion models beat GANs on image synthesis.Proceedings of the 35th International Conference on Neural Information Processing Systems (NIPS ’21)Red Hook, NY, USA, 06 December 2021, pp. 672-686.
    [Google Scholar]
  8. StypułkowskiM. VougioukasK. HeS. ZiębaM. PetridisS. PanticM. Diffused heads: Diffusion models beat gans on talking-face generation.Proceedings of the IEEE/CVF Winter Conference on Applications of Computer VisionWaikoloa, HI, USA, 03-08 January 2024, pp. 5089-5098.10.1109/WACV57701.2024.00502
    [Google Scholar]
  9. MukhopadhyayS. GwilliamM. AgarwalV. PadmanabhanN. SwaminathanA. HegdeS. ZhouT. ShrivastavaA. Diffusion models beat gans on image classification.arXiv2023arXiv:2307.08702v110.48550/arXiv.2307.08702
    [Google Scholar]
  10. KingmaD.P. WellingM. An introduction to variational autoencoders.Found. Trends Mach. Learn.201912430739210.1561/2200000056
    [Google Scholar]
  11. UemuraT. NäppiJ.J. RyuY. WatariC. KamiyaT. YoshidaH. A generative flow-based model for volumetric data augmentation in 3D deep learning for computed tomographic colonography.Int. J. CARS2021161818910.1007/s11548‑020‑02275‑z33150471
    [Google Scholar]
  12. HoJ. JainA. AbbeelP. Denoising diffusion probabilistic models. LarochelleH. RanzatoM. HadsellR. BalcanM.F. LinH. Advances in neural information processing systems.33Curran Associates, Inc.202068406851
    [Google Scholar]
  13. YangJ. ShiR. WeiD. LiuZ. ZhaoL. KeB. PfisterH. NiB. MedMNIST v2 - A large-scale lightweight benchmark for 2D and 3D biomedical image classification.Sci. Data20231014110.1038/s41597‑022‑01721‑836658144
    [Google Scholar]
  14. WangB. WangH. CaoG. Enhanced slicing prototype and hybrid metric transformer for few-shot medical image classification.2024 IEEE International Conference on Systems, Man, and Cybernetics (SMC)Kuching, Malaysia, 06-10 October 2024, pp. 2275-2281.10.1109/SMC54092.2024.10831734
    [Google Scholar]
  15. JainG. MittalD. ThakurD. MittalM.K. A deep learning approach to detect Covid-19 coronavirus with X-Ray images.Biocybern. Biomed. Eng.20204041391140510.1016/j.bbe.2020.08.00832921862
    [Google Scholar]
  16. YodaS. KawazoeH. KurokiY. Convolutional dictionary learning with Huber error and L1 regularization terms.2021 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)Hualien City, Taiwan, 16-19 November 2021, pp. 1-2.10.1109/ISPACS51563.2021.9651025
    [Google Scholar]
  17. KhrapovA. PopovV. SadekovaT. YermekovaA. KudinovM. Improving diffusion models’s data-corruption resistance using scheduled pseudo-huber loss.arXiv2024arXiv:2403.16728v110.48550/arXiv.2403.16728
    [Google Scholar]
  18. ZhangD. WangJ. LuoF. Directly denoising diffusion models.arXiv2024arXiv:2405.13540v210.48550/arXiv.2405.13540
    [Google Scholar]
  19. HurkmansC. BibaultJ.E. BrockK.K. van ElmptW. FengM. David FullerC. Jereczek-FossaB.A. KorremanS. LandryG. MadestaF. MayoC. McWilliamA. MouraF. MurenL.P. El NaqaI. SeuntjensJ. ValentiniV. VelecM. A joint ESTRO and AAPM guideline for development, clinical validation and reporting of artificial intelligence models in radiation therapy.Radiother. Oncol.202419711034510.1016/j.radonc.2024.11034538838989
    [Google Scholar]
/content/journals/cmir/10.2174/0115734056401610250827114351
Loading
/content/journals/cmir/10.2174/0115734056401610250827114351
Loading

Data & Media loading...

This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error
Please enter a valid_number test