基于深度学习的小目标检测技术研究进展(特邀)

刘耿焕; 曾祥津; 豆嘉真; 任振波; 钟丽云; 邸江磊; 秦玉文

doi:10.3788/IRLA20240253

基于深度学习的小目标检测技术研究进展(特邀)

Review of advances in small object detection technology based on deep learning (invited)

摘要

摘要: 小目标检测在自动驾驶、安防等领域具有重要的应用价值。然而，由于小目标自身视觉特征不明显、复杂背景干扰以及信噪比低等因素，使得小目标检测一直以来都是一个极具挑战性的难题。笔者系统回顾了当前基于深度学习方法的小目标检测技术，对现有算法进行了系统地归类、分析和比较：界定了小目标检测的概念，总结了小目标检测所面临的主要挑战；着重讨论了几种主要的网络优化策略，如利用数据增强技术提高模型的泛化能力，通过超分辨率技术改善小目标可视性，采用多尺度信息融合技术提升检测精度，以及基于上下文信息学习和大核卷积策略改进特征表达能力、无锚框检测机制、DETR技术和针对特定应用场景的多模态小目标检测等方法并详细分析了其优缺点；全面介绍了现有小目标数据集，并在常用公共数据集上对目前经典的小目标检测算法进行了测试和性能评估；对小目标检测领域未来的研究方向进行了展望，旨在推动小目标检测技术的进一步发展和应用拓展。

Abstract:
Significance Small object detection holds a crucial position in various industrial fields and everyday life. In remote sensing, it is used to identify and track small objects, providing key support for military reconnaissance and national defense security. For instance, infrared small object detection can detect invasive targets and take subsequent interception measures against them. In autonomous driving, the system automatically detects objects such as traffic signs, vehicles, pedestrians, and obstacles, to help to deeply analyze the meaning of the driving scene and to predict the behavior of surrounding objects to ultimately make appropriate decisions. In public safety and surveillance, small object detection systems can accurately identify and track small objects hidden in the distance or complex backgrounds, enabling functionalities such as pedestrian face recognition, vehicle identification, and detection of illegal crowd lingering, counting of individuals, and estimation of crowd density. For industrial automation, small object detection is also necessary to locate visible small defects on the surface of materials. Overall, small object detection technology significantly enhances the work efficiency across various sectors, demonstrating its broad application prospects and profound research value.
Progress This paper provides a comprehensive review of current deep learning-based small object detection techniques and systematically categorizes, analyzes, and compares existing algorithms. We initially outline the definition of small object detection, the challenges, and its application areas. The definition of small objects is elaborated from two perspectives of based on absolute and relative scales. We also summarize the main challenges encountered in small object detection, including information compression loss, low signal-to-noise ratio and detectability in complex backgrounds, high sensitivity to minor deviations in bounding boxes, complexity in network structure and optimization, and the scarcity of large-scale small object datasets.
Following, we delve into several key optimization approaches. These include the enhancement of model robustness through data augmentation, the improvement of small object visibility via super-resolution methods, the augmentation of detection accuracy through the application of multi-scale information fusion, and the refinement of feature representation using contextual information and large-kernel convolution techniques. Moreover, the discussion extends to anchor-free detection frameworks, DETR technology, and dual-modal strategies for small object detection tailored to particular contexts, offering an exhaustive evaluation of their benefits and drawbacks.
We ultimately provide a comprehensive introduction to currently available small object datasets, encompassing twelve major datasets of DOTA, AI_TOD, DIOR, VisDrone2019, TT100K, BSTID, TinyPerson, CityPerson, WiderPerson, WIDER FACE, BIRDSAI, and MS COCO. These datasets offer a rich resource for research purpose and performance evaluation of small object detection. Further, we also conduct a detailed performance evaluation of existing small object detection algorithms on several widely-used public datasets, such as MS COCO, DOTA, AI_TOD, TinyPerson, and TT100K. Additionally, we forecast future research directions in this field, proposing four main potential challenges: feature fusion, contextual learning, optimization of large kernel convolution, and improvements in DETR technology. These directions not only illustrate the development trends of small object detection, but also highlight technical challenges that current research needs to overcome, providing guidance and inspiration for future studies.
Conclusions and Prospects Small object detection is one of the most critical and fundamental tasks in the field of computer vision, with broad application demands in the real world, such as military reconnaissance, autonomous driving, public safety and surveillance, and robotic vision. Although substantial algorithms have shown relatively satisfactory performance in specific applications and scenarios, overall, their effectiveness, robustness as well as speed still need improvement. This paper aims to provide references and bases for the development of algorithms through in-depth research and analysis of small object detection technology.

HTML全文

参考文献(222)

施引文献

资源附件(0)