Deep Learning in Medical Imaging: Application Progress and Future Challenges
DOI:
https://doi.org/10.54691/kskfkp57Keywords:
Medical Imaging; Deep Learning; Image Reconstruction; Computer-aided diagnosis; Image Segmentation; Federated Learning; Multimodal Large Models.Abstract
Medical Imaging occupies a central position in modern clinical diagnosis and treatment. However, faced with vast and continuously growing volumes of complex imaging data, traditional image processing algorithms reliant on manual features struggle to meet the demands for efficient and precise clinical applications. Against this backdrop, Deep Learning technology, leveraging its unique advantages, has demonstrated immense application potential in Medical Imaging, significantly enhancing the accuracy and efficiency of medical diagnosis. This paper systematically reviews the primary technological developments and practical application scenarios of Deep Learning within the Medical Imaging domain. The focus will be on key application areas such as image reconstruction, segmentation, and registration, alongside multimodal synthesis, intelligent lesion detection, and computer-aided diagnosis. Finally, the paper will delve into the current limitations of these technologies and explore future development directions, aiming to provide valuable reference for research and application in this field.
Downloads
References
[1] He K, Zhang X, Ren S, et al. Deep residual learning for image recognition[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2016: 770-778.
[2] Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition[J]. arXiv preprint arXiv:1409.1556, 2014.
[3] Szegedy C, Liu W, Jia Y, et al. Going deeper with convolutions[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2015: 1-9.
[4] Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need[J]. Advances in neural information processing systems, 2017, 30.
[5] Yi X, Walia E, Babyn P. Generative adversarial networks in Medical Imaging: A review[J]. Medical Image Analysis, 2019, 58: 101552.
[6] Milletari F, Navab N, Ahmadi S A. V-net: Fully Convolutional Neural Networks for volumetric medical image segmentation. . [C]//2016 fourth international conference on 3D vision (3DV). Ieee, 2016: 565-571.
[7] Niu Z, Zhong G, Yu H. A review on the attention mechanism of Deep Learning[J]. Neurocomputing, 2021, 452: 48-62.
[8] Hu J, Shen L, Sun G. Squeeze-and-excitation networks[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2018: 7132-7141.
[9] Chen J, Lu Y, Yu Q, et al. TransUNet: Transformers make strong encoders for medical image segmentation[J]. arXiv preprint arXiv:2102.04306, 2021.
[10] Cao H, Wang Y, Chen J, et al. SWIN-UNet: UNet-like pure transformer for medical image segmentation[C]//European Conference on Computer Vision. Cham: Springer Nature Switzerland, 2022: 205-218.
[11] Hatamizadeh A, Tang Y, Nath V, et al. UNETR: Transformers for 3D medical image segmentation [C] //Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 2022: 574-584.
[12] Zhu J Y, Park T, Isola P, et al. Unpaired image-to-image translation using cycle-consistent adversarial networks[C]//Proceedings of the IEEE international conference on computer vision. 2017: 2223-2232.
[13] Kazerouni A, Aghdam E K, Heidari M, et al. Diffusion models in Medical Imaging: A comprehensive survey[J]. Medical image analysis, 2023, 88: 102846.
[14] Huang Q, Zhao C, Jiang M, et al. Cascade-Net: A new Deep Learning architecture for OFDM detection[J]. arXiv preprint arXiv:1812.00023, 2018.
[15] Wang S, Cheng H, Ying L, et al. DeepcomplexMRI: Exploiting deep residual networks for fast parallel MR imaging with complex convolution[J]. Magnetic Resonance Imaging, 2020, 68: 136-147.
[16] Sun J, Li H, Xu Z. Deep ADMM-Net for compressive sensing MRI[J]. Advances in neural information processing systems, 2016, 29.
[17] Willemink MJ, Noël PB. The evolution of image reconstruction for CT—from filtered back projection to artificial intelligence[J]. European Radiology, 2019, 29(5): 2185-2195.
[18] Chen H, Zhang Y, Kalra M K, et al. Low-dose CT with a residual encoder-decoder convolutional neural network[J]. IEEE transactions on Medical Imaging, 2017, 36(12): 2524-2535.
[19] Kang E, Ye J C. Wavelet domain residual network (WavResNet) for low-dose X-ray CT reconstruction[J]. arXiv preprint arXiv:1703.01383, 2017.
[20] Adler J, Lunz S. Banach Wasserstein GAN[J]. Advances in Neural Information Processing Systems, 2018, 31.
[21] Sandfort V, Yan K, Pickhardt P J, et al. Data augmentation using generative adversarial networks (CycleGAN) to improve generalisability in CT segmentation tasks[J]. Scientific reports, 2019, 9(1): 16884.
[22] Li Z, Liu Q, Li Y, et al. A two-stage multi-loss super-resolution network for arterial spin labelling magnetic resonance imaging[C]//International conference on medical image computing and computer-assisted intervention. Cham: Springer International Publishing, 2019: 12-20.
[23] Gulshan V, Peng L, Coram M, et al. Development and validation of a Deep Learning algorithm for detection of diabetic retinopathy in retinal fundus photographs[J]. JAMA, 2016, 316(22): 2402-2410.
[24] Esteva A, Kuprel B, Novoa R A, et al. Dermatologist-level classification of skin cancer with deep neural networks[J]. Nature, 2017, 542(7639): 115-118.
[25] Zhu W, Liu C, Fan W, et al. Deeplung: Deep 3D dual path nets for automated pulmonary nodule detection and classification[C]//2018 IEEE Winter Conference on Applications of Computer Vision (WACV). IEEE, 2018: 673-681.
[26] Li Z, Zhang S, Zhang J, et al. MVP-Net: multi-view FPN with position-aware attention for deep universal lesion detection[C]//International conference on medical image computing and computer-assisted intervention. Cham: Springer International Publishing, 2019: 13-21.
[27] Suk H I, Lee S W, Shen D, et al. Hierarchical feature representation and multimodal fusion with Deep Learning for AD/MCI diagnosis[J]. NeuroImage, 2014, 101: 569-582.
[28] Ronneberger O, Fischer P, Brox T. U-net: Convolutional networks for biomedical image segmentation[C]//International Conference on Medical Image Computing and Computer-Assisted Intervention. Cham: Springer International Publishing, 2015: 234-241.
[29] Isensee F, Jaeger P F, Kohl S A A, et al. nnU-Net: a self-configuring method for Deep Learning-based biomedical image segmentation[J]. Nature methods, 2021, 18(2): 203-211.
[30] Wasserthal J, Breit H C, Meyer M T, et al. TotalSegmentator: robust segmentation of 104 anatomic structures in CT images[J]. Radiology: Artificial Intelligence, 2023, 5(5): e230024.
[31] Ma J, He Y, Li F, et al. Segment anything in medical images[J]. Nature communications, 2024, 15(1): 654.
[32] Balakrishnan G, Zhao A, Sabuncu M R, et al. Voxelmorph: a learning framework for deformable medical image registration[J]. IEEE Transactions on Medical Imaging, 2019, 38(8): 1788-1800.
[33] Luo X, Wang G, Liao W, et al. Semi-supervised medical image segmentation via uncertainty rectified pyramid consistency[J]. Medical Image Analysis, 2022, 80: 102517.
[34] Lu M Y, Williamson D F K, Chen T Y, et al. Data-efficient and weakly supervised computational pathology on whole-slide images: [J]. Nature Biomedical Engineering, 2021, 5(6): 555-570.
[35] Kaissis GA, Makowski MR, Rückert D, et al. Secure, privacy-preserving and federated machine learning in Medical Imaging. Nature Machine Intelligence, 2020, 2(6): 305-311.
[36] Rieke N, Hancox J, Li W, et al. The future of digital health with Federated Learning. NPJ Digital Medicine, 3, 119[J]. DOI: https://doi.org/10.1038/s41746-020-00323-1, 2020.
[37] Yan K, Wang X, Lu L, et al. Deeplesion: Automated deep mining, categorisation and detection of significant radiology image findings using large-scale clinical lesion annotations[J]. arXiv preprint arXiv:1710.01766, 2017.
[38] Saab K, Tu T, Weng W H, et al. Capabilities of Gemini models in medicine[J]. arXiv preprint arXiv:2404.18416, 2024.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Scientific Journal of Technology

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.






