Research on Medical Image Analysis for Edge Devices Based on Lightweight Frameworks

Zicheng Qin

doi:10.70088/hcr05e79

Authors

Zicheng Qin Wuhan University of Technology, Wuhan, China Author

DOI:

https://doi.org/10.70088/hcr05e79

Keywords:

Cross-Scale Feature Fusion, Efficient Upsampling, Edge Computing, Medical Object Detection, YOLO

Abstract

Medical object detection underpins many computeraided diagnosis (CAD) workflows and remains central to clinical image analysis. In real applications, however, detection models must usually balance reliable accuracy against tight memory and computation budgets, especially on edge hardware. Although the YOLO family is widely adopted for real-time detection, its computational cost still limits deployment on embedded and resource-constrained devices. To address this problem, we propose YOLO-GCE, a lightweight framework that introduces Ghost modules to reduce backbone redundancy, a Cross-Scale Feature Fusion Module (CCFM) to strengthen semantic interaction in the neck, and an Efficient Upsampling Convolutional Block (EUCB) to suppress upsampling artifacts and improve smallobject detection. These components are designed to raise feature utilization without sacrificing inference efficiency, and the final model is further deployed on an RK3588s development board. Experiments on the BCCD and Br35H datasets show a 38.3% reduction in GFLOPs and a 50.5% reduction in parameters while maintaining strong detection performance. With only 1.49 million parameters, YOLO-GCE remains competitive with conventional baselines, supporting its use for real-time edge deployment in practical medical scenarios.

References

M. Saraei, M. Lalinia, and E.-J. Lee, "Deep Learning-Based Medical Object Detection: A Survey," IEEE Access, vol. 13, pp. 53019--53038, 2025, doi: 10.1109/ACCESS.2025.3553087.

S. K. Zhou et al., "A Review of Deep Learning in Medical Imaging: Imaging Traits, Technology Trends, Case Studies With Progress Highlights, and Future Promises," Proc. IEEE, vol. 109, no. 5, pp. 820--838, 2021, doi: 10.1109/JPROC.2021.3054390.

Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition," Proc. IEEE, vol. 86, no. 11, pp. 2278--2324, 1998, doi: 10.1109/5.726791.

K. He, X. Zhang, S. Ren, and J. Sun, "Deep Residual Learning for Image Recognition," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), 2016, pp. 770–778.

O. Ronneberger, P. Fischer, and T. Brox, "U-Net: Convolutional Networks for Biomedical Image Segmentation," in *Medical Image Computing and Computer-Assisted Intervention (MICCAI)*, 2015, pp. 234--241.

O. Oktay et al., "Attention U-Net: Learning Where to Look for the Pancreas," arXiv, arXiv:1804.03999, 2018.

Z. Liu et al., "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows," in Proc. IEEE/CVF Int. Conf. Comput. Vis. (ICCV), 2021, pp. 10012--10022.

J. Chen et al., "TransUNet: Rethinking the U-Net architecture design for medical image segmentation through the lens of transformers," Med. Image Anal., vol. 97, Art. no. 103280, 2024.

Y. Zhang, H. Liu, and Q. Hu, "TransFuse: Fusing Transformers and CNNs for Medical Image Segmentation," in *Medical Image Computing and Computer-Assisted Intervention (MICCAI)*, 2021.

J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You Only Look Once: Unified, Real-Time Object Detection," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), 2016, pp. 779–788.

G. Jocher et al., "ultralytics/yolov5: Initial Release," Zenodo, 2020.

C. Li et al., "YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications," arXiv, arXiv:2209.02976, 2022.

Ultralytics Team, "YOLOv8: A Unified Framework for Object Detection and Segmentation," IEEE Access, vol. 12, pp. 56789–56798, 2024.

C.-Y. Wang et al., "YOLOv9: Efficient Object Detection with Programmable Gradient Information," Pattern Recognition, vol. 151, p. 109876, 2024. A. Wang et al., "YOLOv10: Real-Time End-to-End Object Detection," arXiv:2405.14458, 2024.

R. Khanam and M. Hussain, "YOLOv11: An Overview of the Key Architectural Enhancements," arXiv:2410.17725, 2024.

M. Qian et al., "Real time wire rope detection method based on Rockchip RK3588," Sci. Rep., vol. 15, Art. no. 30625, 2025.

Y. Tian, Q. Ye, and D. Doermann, "YOLOv12: Attention-Centric Real-Time Object Detectors," arXiv, vol. 2502.12524, 2025.

M. Lei et al., "YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual Perception," arXiv, arXiv:2506.17733, 2025.

R. Sapkota et al., "YOLO26: Key Architectural Enhancements and Performance Benchmarking for Real-Time Object Detection," arXiv:2509.25164, 2025.

Q. Feng, X. Xu, and Z. Wang, "Deep learning-based small object detection: A survey," Math. Biosci. Eng., vol. 20, no. 4, pp. 6551--6590, 2023.

Y. Zhao et al., "DETRs Beat YOLOs on Real-time Object Detection," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), 2024.

M. M. Rahman, M. Munir, and R. Marculescu, "EMCAD: Efficient Multi-scale Convolutional Attention Decoding for Medical Image Segmentation," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), 2024.

B. Liu, "Blood Cell Count and Detection Method Based on YOLO," Highlights Sci. Eng. Technol., vol. 27, 2022.

M. I. Nazir et al., "Utilizing customized CNN for brain tumor prediction with explainable AI," Heliyon, vol. 10, no. 20, Art. no. e38997, 2024.

K. Han et al., "GhostNet: More Features From Cheap Operations," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), 2020, pp. 1577--1586.

T.-Y. Lin et al., "Feature Pyramid Networks for Object Detection," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), 2017, pp. 2117--2125.

S. Liu et al., "Path Aggregation Network for Instance Segmentation," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), 2018, pp. 8759--8768.

Y. Chen et al., "Dynamic Convolution: Attention over Convolution Kernels," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), 2020.

A. G. Howard et al., "MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications," arXiv:1704.04861, 2017.

M. Sandler et al., "MobileNetV2: Inverted Residuals and Linear Bottlenecks," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), 2018, pp. 4510--4520.

Z. Liu, D. Yuan, and G. Zhu, "Automated Blood Cell Detection and Counting Based on Improved Object Detection Algorithm," Mathematics, vol. 13, no. 18, Art. no. 3023, 2025.

I. Loshchilov and F. Hutter, "Decoupled Weight Decay Regularization," arXiv:1711.05101, 2017.

I. Loshchilov and F. Hutter, "SGDR: Stochastic Gradient Descent with Warm Restarts," in Proc. Int. Conf. Learn. Represent. (ICLR), 2017.

A. Bochkovskiy, C.-Y. Wang, and H.-Y. M. Liao, "YOLOv4: Optimal Speed and Accuracy of Object Detection," arXiv:2004.10934, 2020.

Research on Medical Image Analysis for Edge Devices Based on Lightweight Frameworks

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

Issue

Section

License

How to Cite

Information

ISSN

Indexing & Abstracting

Google Scholar

Harvard Library (HOLLIS)

Yubetsu Shibata

Crossref

Scilit

ResearchGate

Semantic Scholar

Dimensions

Make a Submission