Lightweight infrared dim vehicle target detection algorithm based on deep learning

蔡仁昊; 程宁; 彭志勇; 董施泽; 安建民; 金钢

doi: 10.3788/IRLA20220253

1. Tianjin Jinhang Institute of Technical Physics, Tianjin 300308, China
2. The Third Military Representative Office in Tianjin, Tianjin 300308, China
3. Tianjin University, Tianjin 300072, China

About the author: 蔡仁昊, female, engineer, master's degree; main research interests include intelligent electro-optical detection and imaging.

CLC number: TP391

Abstract: With the continuing development of high-speed aircraft, target detection and recognition, a key link in precision guidance, must localize and recognize targets with higher real-time performance and accuracy. The need for accurate detection and recognition of time-sensitive targets such as armored vehicles and vehicle positions is increasingly urgent, and deep learning algorithms have advantages in feature extraction and classifier design. This paper takes small infrared vehicle targets against specific complex backgrounds as the research object and studies a lightweight deep learning algorithm for detecting and recognizing infrared dim and small vehicle targets, addressing scarce sample data, limited platform resources, and demanding real-time and detection accuracy requirements. The YOLOv5 algorithm is trimmed into a lighter network, which shrinks the model structure and improves real-time performance. A hybrid-domain attention module, EPA, is proposed: through a local cross-channel interaction strategy without dimensionality reduction, it lets the algorithm attend to important channels and suppress ineffective ones more quickly and effectively, and it combines channel attention with spatial attention so that the algorithm focuses on pixel information related to the target. A residual dense attention block (RDAB) is also proposed. It consists of residual dense blocks and the EPA attention mechanism, extracts rich local features through dense convolutional layers, and obtains more effective channel and pixel information through attention, so that the algorithm achieves good detection performance with a small model structure. The designed network is applied to detect and recognize small infrared vehicle targets in the augmented dataset and is compared with several typical algorithms. The experimental results show that the proposed JH-YOLOv5-RDAB network compares favorably with the other networks: its weight file is only 6.6 MB, about half that of the YOLOv5s model, yet it detects better than YOLOv5s and comes close to the 93.7 MB YOLOv5l, with mAP50 reaching 95.1%. These results demonstrate the superiority and feasibility of the network for infrared dim and small vehicle target detection.

Keywords:
- infrared vehicle target
- target detection
- lightweight
- attention mechanism
- dense convolutional network
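
The channel-attention idea summarized in the abstract, local cross-channel interaction without dimensionality reduction, can be illustrated with a minimal ECA-style sketch in plain Python. This is not the paper's EPA module: the spatial (pixel) attention branch is left out because its design is not given here, and the function name `eca_channel_attention`, the kernel weights, and the toy feature map are all assumptions chosen for illustration.

```python
import math


def eca_channel_attention(feature_map, kernel):
    """Reweight the channels of a C x H x W feature map given as nested lists.

    `kernel` is a hypothetical odd-length list holding the shared 1D
    convolution weights that mix each channel with its neighbors.
    """
    channels = len(feature_map)
    k = len(kernel)
    assert k % 2 == 1, "kernel size must be odd"
    pad = k // 2

    # 1) Global average pooling: one scalar descriptor per channel.
    pooled = []
    for ch in feature_map:
        total = sum(sum(row) for row in ch)
        count = sum(len(row) for row in ch)
        pooled.append(total / count)

    # 2) 1D convolution across neighboring channels (zero padded).
    #    There is no bottleneck layer, so the channel dimension is never reduced.
    weights = []
    for c in range(channels):
        acc = 0.0
        for j in range(k):
            idx = c + j - pad
            if 0 <= idx < channels:
                acc += kernel[j] * pooled[idx]
        # 3) Sigmoid gate in (0, 1).
        weights.append(1.0 / (1.0 + math.exp(-acc)))

    # 4) Scale every pixel of a channel by that channel's gate value.
    scaled = [[[v * w for v in row] for row in ch]
              for ch, w in zip(feature_map, weights)]
    return scaled, weights


# Toy input: 4 channels of 2 x 2 pixels; channel 2 has the strongest response.
fmap = [
    [[0.0, 0.0], [0.0, 0.0]],
    [[1.0, 1.0], [1.0, 1.0]],
    [[4.0, 4.0], [4.0, 4.0]],
    [[0.0, 0.0], [0.0, 0.0]],
]
out, w = eca_channel_attention(fmap, kernel=[0.25, 0.5, 0.25])
```

In a trained network the kernel weights would be learned. With the illustrative weights above, channel 2 receives the largest gate value, so its features are suppressed the least.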

Figure 1. Development trend of deep learning object detection algorithms in recent years

Figure 2. Examples of data augmentation results

Figure 3. Structure of JH-YOLOv5-RDAB

Figure 4. Structure of YOLOv5s

Figure 5. Structure of the SPP module after fusing the EPA module

Figure 6. Structure of the CSP module after fusing the EPA module

Figure 7. Structure of the ECA module

Figure 8. Structure of the EPA module

Figure 9. Visible-light image (left) and infrared image (right) of the same scene

Figure 10. Structure of the Focus module after fusing the RDAB module

Figure 11. Structure of the SPP module after fusing the RDAB module

Figure 12. Structure of the residual dense attention block (RDAB)

Figure 13. Structure of the residual dense block (RDB)

Figure 14. Comparison of training results across algorithms

Figure 15. Comparison of selected detection results (left: ground-truth annotation; right: algorithm detection)

Table 1. Simulated image processing methods for a single image

Live flight imagery	Simulated image processing
Aircraft descending at high speed	Image magnification under a fixed field of view
Aircraft in level flight	Image translation
Aircraft rotation	Image rotation at various angles
Aircraft shaking	Image translation
Infrared imager affected by temperature and weather	Image contrast and brightness changes
Aerodynamic effects of high-speed flight	Random image blur, edge blur
Aircraft subject to interference	Random image occlusion
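
Three of the simulated operations in Table 1 (translation, contrast and brightness change, random occlusion) can be sketched in plain Python on a small grayscale image stored as nested lists. The function names, the 8-bit value range, and the constant-fill occlusion model are assumptions for illustration; the paper's actual augmentation code is not shown, and a real pipeline would normally use an imaging library.

```python
def translate(img, dx, dy, fill=0):
    """Shift a 2D grayscale image by (dx, dy) pixels; vacated pixels get `fill`."""
    h, w = len(img), len(img[0])
    out = [[fill] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            ny, nx = y + dy, x + dx
            if 0 <= ny < h and 0 <= nx < w:
                out[ny][nx] = img[y][x]
    return out


def adjust(img, contrast=1.0, brightness=0):
    """Apply v -> contrast * v + brightness, clipped to the 8-bit range."""
    return [[min(255, max(0, round(contrast * v + brightness))) for v in row]
            for row in img]


def occlude(img, top, left, height, width, fill=0):
    """Cover a rectangle with a constant value (hypothetical occlusion model)."""
    out = [row[:] for row in img]
    for y in range(top, min(top + height, len(img))):
        for x in range(left, min(left + width, len(img[0]))):
            out[y][x] = fill
    return out


def shift_box(box, dx, dy):
    """Move an (x_min, y_min, x_max, y_max) label together with the image."""
    x0, y0, x1, y1 = box
    return (x0 + dx, y0 + dy, x1 + dx, y1 + dy)


# Toy 4 x 4 image with a single bright "target" pixel at row 1, column 1.
img = [[0] * 4 for _ in range(4)]
img[1][1] = 200

moved = translate(img, dx=1, dy=2)           # target pixel moves to row 3, column 2
brighter = adjust(img, contrast=1.5, brightness=10)
hidden = occlude(img, top=1, left=1, height=2, width=2)
```

Geometric operations such as translation must be applied to the bounding-box labels as well, which is what `shift_box` is for; photometric operations leave the labels unchanged.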

Table 2. Distribution of the infrared dim and small vehicle image dataset

Target type	[car]
Target scene	Desert, city, field, highway, village
Data augmentation methods	Brightness change, contrast change, rotation, translation, scaling, flipping, cropping, splicing
Original dataset	5023 (before processing); 4986 (after processing)
Data augmentation	1000
Training set	49228
Validation set	2590

Table 3. Specific parameters of the experimental environment

Operating system	Ubuntu 18.04
CPU	Intel Xeon Gold 6133
GPU	NVIDIA TITAN RTX ×4
Memory	512 GB
Development environment	Python 3.7
Deep learning framework	PyTorch
CUDA	10.2
cuDNN	7.5.3

Table 4. Comparison of experimental results between JH-YOLOv5-RDAB and typical algorithm networks

Algorithm	Layers	Parameters	Size/MB (half-precision quantization)	mAP50/%	GFLOPs	Single test time/ms
YOLOv3	261	61497430	123.4	81.6	154.9	5.3
YOLOv4-tiny	99	5874116	23.6	88.2	16.1	1.8
YOLOv4	488	63937686	256.3	94.6	141.4	8.7
YOLOv5s	283	7063542	14.4	93.4	16.3	2.34
YOLOv5m	391	21056406	42.5	94.5	50.3	3.45
YOLOv5l	499	46631350	93.7	95.2	114.1	4.93
YOLOv5x	607	87244374	175.1	95.9	217.1	8.43
YOLO-Fastest	277	356700	4.8	32.1	0.96	1.6
JH-YOLOv5-RDAB	565	3117313	6.6	95.1	8.4	2.52
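
The size and accuracy columns of Table 4 can be checked with a short script. The sketch below copies the two columns from the table, computes the weight-size ratio to YOLOv5s quoted in the abstract, and lists the models that no other model beats on both size and mAP50 at once. The Pareto-style comparison and the helper name `pareto_front` are illustrative assumptions, not an analysis taken from the paper.

```python
# (name, weight size in MB, mAP50 in %) copied from Table 4.
models = [
    ("YOLOv3", 123.4, 81.6),
    ("YOLOv4-tiny", 23.6, 88.2),
    ("YOLOv4", 256.3, 94.6),
    ("YOLOv5s", 14.4, 93.4),
    ("YOLOv5m", 42.5, 94.5),
    ("YOLOv5l", 93.7, 95.2),
    ("YOLOv5x", 175.1, 95.9),
    ("YOLO-Fastest", 4.8, 32.1),
    ("JH-YOLOv5-RDAB", 6.6, 95.1),
]


def pareto_front(rows):
    """Keep models that no other model matches or beats on both size and accuracy."""
    front = []
    for name, size, acc in rows:
        dominated = any(
            (s <= size and a >= acc) and (s < size or a > acc)
            for n, s, a in rows
            if n != name
        )
        if not dominated:
            front.append(name)
    return front


sizes = {name: size for name, size, _ in models}
size_ratio = sizes["JH-YOLOv5-RDAB"] / sizes["YOLOv5s"]  # roughly 0.46

front = pareto_front(models)
print(f"size ratio to YOLOv5s: {size_ratio:.2f}")
print("not dominated on (size, mAP50):", front)
```

On these numbers only YOLOv5l and YOLOv5x reach a higher mAP50 than JH-YOLOv5-RDAB, and both need far larger weight files, while the only smaller model, YOLO-Fastest, is much less accurate.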

Table 5. Comparison of experimental results for lightweight trimming based on YOLOv5s

Algorithm	depth_multiple	width_multiple	Performance effect (× YOLOv5s)	Network size (× YOLOv5s)
YOLOv5x	1.33	1.25	2.3	12.2
YOLOv5l	1	1	1.8	6.5
YOLOv5m	0.67	0.75	1.2	2.9
YOLOv5s	0.33	0.50	1	1
Net 1	0.25	0.33	0.85	0.49
Net 2	0.2	0.25	0.63	0.35
Net 3	0.1	0.2	0.33	0.12
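
How the two multipliers in Table 5 shrink a network can be sketched as follows. The rounding rules mirror, to the best of my knowledge, the convention of the public YOLOv5 code (module repeats scaled by `depth_multiple`, channel counts scaled by `width_multiple` and rounded up to a multiple of 8). The base repeat counts and channel widths are hypothetical placeholders, and the paper's own trimming code is not shown.

```python
import math


def make_divisible(x, divisor=8):
    """Round x up to the nearest multiple of `divisor`."""
    return math.ceil(x / divisor) * divisor


def scale_depth(n, depth_multiple):
    """Scale a module repeat count; single modules are left untouched."""
    return max(round(n * depth_multiple), 1) if n > 1 else n


def scale_width(channels, width_multiple):
    """Scale a channel count and keep it a multiple of 8."""
    return make_divisible(channels * width_multiple, 8)


# (depth_multiple, width_multiple) pairs taken from Table 5.
configs = {
    "YOLOv5s": (0.33, 0.50),
    "Net 1": (0.25, 0.33),
    "Net 2": (0.2, 0.25),
    "Net 3": (0.1, 0.2),
}

# Hypothetical base values for illustration only.
base_repeats = [3, 9]
base_channels = [64, 128, 256, 512, 1024]

for name, (gd, gw) in configs.items():
    repeats = [scale_depth(n, gd) for n in base_repeats]
    widths = [scale_width(c, gw) for c in base_channels]
    print(name, repeats, widths)
```

Because repeat counts cannot fall below 1, very small depth multiples stop reducing depth once every repeated module has collapsed to a single copy; further savings then come mainly from the width multiplier.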

Table 6. Comparison of network experimental results for attention mechanism fusion

Fused operator	Performance effect with ECA (× Net 2)	Network size with ECA (× Net 2)	Performance effect with EPA (× Net 2)	Network size with EPA (× Net 2)
Net 2	1	1	1	1
CBL	1.1	1.6	1.3	2.4
SPP	1.2	1.12	1.43	1.18
CSP	1.3	1.1	1.33	1.11
Focus	1.02	1.01	1.11	1.01

Table 7. Comparison of network experimental results for RDAB fusion

Fused operator	Performance effect with RDAB (×)	Network size with RDAB (×)
Focus	1.28	1.06
SPP + Focus	1.35	1.08
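
The residual dense block that RDAB builds on can be illustrated by tracking channel counts. In a residual dense block, each convolutional layer receives the block input concatenated with the outputs of all earlier layers, a 1×1 convolution then fuses the concatenation back to the input width, and the block input is added as a local residual. The sketch below only does this bookkeeping; the growth rate, layer count, 3×3 kernels, and function names are hypothetical, and the EPA attention stage of RDAB is omitted.

```python
def rdb_channel_plan(in_channels, growth, num_layers):
    """Return per-layer (input, output) channel counts for one residual dense block."""
    layers = []
    for i in range(num_layers):
        # Dense connection: layer i sees the block input plus all i earlier outputs.
        layers.append((in_channels + i * growth, growth))
    # Local feature fusion: a 1x1 conv maps the full concatenation back to
    # in_channels so the block input can be added as a local residual.
    fusion = (in_channels + num_layers * growth, in_channels)
    return layers, fusion


def conv_params(c_in, c_out, kernel_size):
    """Weights plus biases of a standard 2D convolution."""
    return c_in * c_out * kernel_size * kernel_size + c_out


layers, fusion = rdb_channel_plan(in_channels=32, growth=16, num_layers=3)
total = sum(conv_params(ci, co, 3) for ci, co in layers) + conv_params(*fusion, 1)
print(layers, fusion, total)
```

The input width of each layer grows linearly with depth, which is why such blocks are usually kept shallow in lightweight networks.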


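4. Conclusion

(1) To meet the military need for fast, accurate detection and recognition of time-sensitive targets against specific complex backgrounds, and to address the small size, scarce samples, and high recognition difficulty of infrared vehicle targets, a deep learning detection and recognition algorithm is proposed that is a lightweight derivative of YOLOv5s fused with a feature attention mechanism and residual dense blocks. The network is trimmed to shrink its structure and improve real-time performance. A hybrid-domain attention module, EPA, is proposed; through a local cross-channel interaction strategy without dimensionality reduction, it lets the algorithm attend to important channels and suppress ineffective ones more quickly and effectively, while also focusing on pixel information related to the target. An attention-based RDAB is proposed, consisting of residual dense blocks and the EPA attention mechanism; it extracts rich local features through dense convolutional layers and lets less important information be bypassed through multiple local residual connections, so that the detector concentrates on target-related channel and pixel-position information and achieves good detection performance with a small model structure.

(2) A mid-/long-wave infrared vehicle target database was built from airborne captive-carry flight acquisition and UAV acquisition, and was augmented by brightness change, contrast change, rotation, translation, scaling, flipping, cropping, and random splicing. Fusion experiments with modules at different positions and in different numbers were designed to determine the structure of the best network, JH-YOLOv5-RDAB. On the dataset used in this paper, the proposed JH-YOLOv5-RDAB algorithm was compared with eight other typical network algorithms and achieved a detection accuracy (mAP50) of 95.1% with a model size of 6.6 MB; the experimental results demonstrate the superiority and feasibility of the network. The small model, high accuracy, and high real-time performance of JH-YOLOv5-RDAB are of great significance for improving the precise detection and recognition capability of high-speed aircraft and the intelligence level of detection systems.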
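
The mAP50 figure quoted above is the mean average precision with an intersection-over-union (IoU) threshold of 0.5: a predicted box counts as correct only if its IoU with a ground-truth box is at least 0.5. A minimal sketch of that matching criterion follows; the box format `(x_min, y_min, x_max, y_max)` and the example coordinates are assumptions for illustration, and the full average-precision computation is not reproduced.

```python
def iou(box_a, box_b):
    """Intersection over union of two (x_min, y_min, x_max, y_max) boxes."""
    ax0, ay0, ax1, ay1 = box_a
    bx0, by0, bx1, by1 = box_b
    inter_w = max(0.0, min(ax1, bx1) - max(ax0, bx0))
    inter_h = max(0.0, min(ay1, by1) - max(ay0, by0))
    inter = inter_w * inter_h
    union = (ax1 - ax0) * (ay1 - ay0) + (bx1 - bx0) * (by1 - by0) - inter
    return inter / union if union > 0 else 0.0


def is_true_positive(pred, truth, threshold=0.5):
    """A prediction matches a ground-truth box when IoU reaches the threshold."""
    return iou(pred, truth) >= threshold


# A 10 x 10 pixel ground-truth box and two predictions of the same size.
truth = (10, 10, 20, 20)
off_by_one = (11, 11, 21, 21)   # IoU = 81 / 119
off_by_two = (12, 12, 22, 22)   # IoU = 64 / 136

print(is_true_positive(off_by_one, truth), is_true_positive(off_by_two, truth))
```

For a target only 10 pixels wide, a two-pixel localization error already drops the IoU below 0.5, which illustrates why small targets are demanding under this metric.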
