基于改进YOLOv5的红外车辆检测方法

张学志; 赵红东; 刘伟娜; 赵一鸣; 关松

doi:10.3788/IRLA20230245

基于改进YOLOv5的红外车辆检测方法

doi: 10.3788/IRLA20230245

张学志^1,,
赵红东^{1, 2,},
刘伟娜¹,
赵一鸣¹,
关松²

1.
河北工业大学电子信息工程学院，天津 300401
2.
电磁空间安全全国重点实验室，天津 300308

基金项目: 天津市科技计划项目 (21YDTPJC00050)；电磁空间安全全国重点实验室基金项目 (2021JCJQLB055008)

详细信息

作者简介:
张学志，男，硕士生，主要从事计算机视觉、深度学习方面的研究

中图分类号: TP391.4

An infrared vehicle detection method based on improved YOLOv5

1.
School of Electronic and Information Engineering, Hebei University of Technology, Tianjin 300401, China
2.
National Key Laboratory of Electromagnetic Space Security, Tianjin 300308, China

Funds: Tianjin Science and Technology Project (21YDTPJC00050); National Key Laboratory of Electromagnetic Space Security Fund Project (2021JCJQLB055008)

摘要: 红外图像可在低照度、恶劣天气等条件下工作，红外车辆检测技术旨在使用红外传感器来监测道路上的车辆，实现对车辆数量、车速等信息的收集与分析，该技术不仅可应用于路面车辆，还可应用于铁路、机场、港口等场景，为交通运输行业的安全和便捷提供了有效的技术支持。然而，由于红外图像成像原理的局限和外部环境的干扰，通常导致红外图像成像质量不理想，红外车辆检测仍然存在许多问题。文中提出了一种改进的YOLOv5模型，在YOLOv5的主干部分引入了混合注意力机制，使模型能够更好地关注研究者感兴趣的区域，抑制图像噪声的干扰。此外，在BiFPN基础上提出了一种改进的Z-BiFPN特征融合结构，融合更多的浅层信息，提高浅层信息利用率，并增加一个四分之一下采样的小目标检测层，同时将YOLOv5的检测头替换为解耦头来提升模型的检测能力。在自建的七类红外车辆数据集INFrared-417上进行了实验，验证了算法的有效可行性。与原始YOLOv5相比，mAP从81.1%提升到了85.3%。
- 红外车辆 /
- 目标检测 /
- 注意力机制 /
- 特征融合 /
- YOLOv5
Abstract: Objective Infrared image technology is capable of working in low-light and adverse weather conditions. Infrared vehicle detection technology is designed to use infrared sensors to monitor vehicles on roads, enabling the collection and analysis of information related to vehicle quantity and speed, which can be used to achieve traffic management and safety control. This technology can be applied not only to road vehicles, but also to rail transport, airports, and ports, providing effective technical support for the safety and convenience of the transportation industries. However, infrared vehicle detection still faces many challenges due to the low resolution, low contrast, and blurred edges of small targets in infrared images. Traditional hand-crafted image feature extraction methods are not adaptable nor robust, require substantial prior knowledge and have low efficiency. Therefore, this paper aims to explore deep learning-based vehicle detection models, which plays an important role in traffic regulation. Methods YOLOv5 is a one-stage object detection algorithm that is characterized by its lightweight design, ease of deployment, and high accuracy, making it widely used in industrial applications. In this paper, a CFG mixed attention mechanism (Fig.2) is introduced into the model backbone to help the model better locate the vehicle area in the image and improve its feature extraction ability, due to the low resolution of infrared images. In the feature fusion part, an improved Z-BiFPN structure (Fig.5) is proposed to incorporate more information in the shallow fusion, thereby improving the utilization of shallow information. A small object detection layer is added, and the Decoupled Head (Fig.6) is used to separate classification and regression, improving the model's ability to detect small target vehicles. Results and Discussions In order to improve the model's generalization ability, an infrared image dataset INFrared-417 (Fig.7) consisting of seven categories of bus, truck, car, van, person, bicycle and elecmot, was constructed by collecting data and combining existing infrared datasets. The main evaluation metrics used were AP (Average Precision) and mAP (mean Average Precision), with P (Precision) and R (Recall) as secondary metrics for the experiments. The ablation experiment results (Tab.1) confirmed the effectiveness and feasibility of the proposed improvement methods, with mAP improving by 4.0%, and AP significantly improving for the van, person, and bicycle categories, while P increased by 1.7% and R increased by 3.6%. In addition, the comparison results (Fig.10) demonstrated that the improved model reduced false alarm and missed detection rates, while improving the detection of small targets. The comparison experiment results (Tab.2) also showed that the proposed improved model had excellent performance in terms of detection accuracy and model parameter count. Conclusions This paper proposes an improved infrared vehicle detection algorithm. By introducing the mixed attention mechanism, the model is able to better focus on the vehicle region in the image and enhance its feature extraction ability. The improved Z-BiFPN is used in the model neck to efficiently integrate context information. At the same time, the detection head is replaced with a more advanced Decoupled Head to improve the detection ability, and a small object detection layer is added to improve the ability to capture small targets. It is hoped that this model can be applied in traffic control.
- infrared vehicle /
- object detection /
- attention mechanism /
- feature fusion /
- YOLOv5

图 1 改进后的 YOLOv5 网络结构图

Figure 1. Improved YOLOv5 network architecture diagram

下载: 全尺寸图片幻灯片

图 2 CFG 结构图

Figure 2. Structure of CFG

下载: 全尺寸图片幻灯片

图 3 CA结构图

Figure 3. Structure of CA

下载: 全尺寸图片幻灯片

图 4 Spatial Attention 结构图

Figure 4. Structure of Spatial Attention

下载: 全尺寸图片幻灯片

图 5 (a) FPN 结构，增加了一条自小尺寸特征图向上的路径； (b) PANet结构，在 FPN 基础上增加了一条自大尺寸向下的路径；(c) BiFPN结构；(d) Z-BiFPN

Figure 5. (a) FPN structure, adds an upward path from small-sized feature map; (b) PANet structure, adds a downward path from large-sized feature map based on FPN; (c) BiFPN structure; (d) Z-BiFPN

下载: 全尺寸图片幻灯片

图 6 Decoupled Head 结构示意图

Figure 6. Diagram of Decoupled Head architecture

下载: 全尺寸图片幻灯片

图 7 (a) 自行采集数据集实例；(b) SCUT_FIR_Pedestrian_Dataset 实例；(c) MULTISPECTRAL DATASET 实例

Figure 7. (a) An example of a self-collected dataset; (b) An example of SCUT_FIR_Pedestrian_Dataset; (c) An example of MULTISPECTRAL DATASET

下载: 全尺寸图片幻灯片

图 8 PR 曲线。(a) YOLOv5；(b) 改进后的 YOLOv5

Figure 8. PR curve. (a) YOLOv5; (b) Improved YOLOv5

下载: 全尺寸图片幻灯片

图 9 混淆矩阵。(a) YOLOv5；(b) 改进后的 YOLOv5

Figure 9. Confusion matrices. (a) YOLOv5; (b) Improved YOLOv5

下载: 全尺寸图片幻灯片

图 10 检测结果对比。(a) 原始图像；(b) YOLOv5；(c) 改进后的YOLOv5

Figure 10. Comparison of detection results. (a) Original image; (b) YOLOv5; (c) Improved YOLOv5

下载: 全尺寸图片幻灯片

表 1 不同改进方法的实验结果

Table 1. Experimental results of different improvement methods

	a	b	c	d	e
YOLOv5s	√	√	√	√	√
CFG		√	√	√	√
Four Head			√	√	√
Z-BiFPN				√	√
Decoupled Head					√
AP-bus	88.5%	85.3%	87.1%	86.0%	89.4%
AP-truck	81.0%	81.2%	82.9%	81.0%	85.4%
AP-car	89.6%	88.7%	89.1%	89.6%	90.3%
AP-van	78.3%	76.4%	77.8%	79.8%	82.6%
AP-person	79.3%	82.6%	81.0%	79.7%	83.5%
AP-bicycle	72.0%	76.6%	75.7%	79.5%	86.2%
AP-elecmot	79.0%	80.1%	80.2%	82.5%	79.7%
P	86.5%	89.4%	85.8%	86.9%	88.2%
R	73.8%	75.4%	74.7%	76.7%	77.4%
*mAP*	81.1%	81.6%	82.0%	82.6%	85.3%

下载: 导出CSV

表 2 不同目标检测算法对比

Table 2. Comparison of different object detection algorithms

Models	SSD	YOLOv3	YOLOv5	YOLOR-W6	YOLOv7-tiny	YOLOX	Ours
AP-bus	76.4%	85.9%	88.5%	81.7%	85.7%	87.6%	89.4%
AP-truck	88.0%	83.8%	81.0%	82.3%	82.4%	84.2%	85.4%
AP-car	68.7%	83.3%	89.6%	90.1%	90.8%	90.3%	90.3%
AP-van	63.2%	71.9%	78.3%	80.3%	79.2%	82.1%	82.6%
AP-person	35.8%	70.1%	79.3%	76.9%	75.1%	81.5%	83.5%
AP-bicycle	41.9%	50.2%	72.0%	44.7%	53.0%	78.3%	86.2%
AP-elecmot	47.3%	65.6%	79.0%	80.5%	65.3%	80.7%	79.7%
mAP	60.2%	73.0%	81.1%	76.6%	75.9%	83.5%	85.3%
Parameters	24.4×10⁶	61.6×10⁶	7.0×10⁶	79.3×10⁶	6.0×10⁶	8.9×10⁶	10.4×10⁶
Weight/MB	93.7	235.2	13.7	151.8	11.7	17.3	20.3

下载: 导出CSV

[1]	Zhang X X, Zhu X. An efficient and scene-adaptive algorithm for vehicle detection in aerial images using an improved YOLOv3 framework [J]. ISPRS International Journal of Geo-information, 2019, 8(11): 483. doi: 10.3390/ijgi8110483
[2]	Zhu Q F, Zheng H F, Wang Y B, et al. Study on the evaluation method of sound phase cloud maps based on an improved YOLOv4 algorithm [J]. Sensors, 2020, 20(15): 4314. doi: 10.3390/s20154314
[3]	Girshick R, Donahue J, Darrell T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation[C]//2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014: 580-587.
[4]	Girshick R. Fast R-CNN [C]//2015 IEEE International Conference on Computer Vision (ICCV), 2015: 1440-1448.
[5]	Ren S, He K, Girshick R, et al. Faster R-CNN: towards real-time object detection with region proposal networks [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6): 1137-1149. doi: 10.1109/TPAMI.2016.2577031
[6]	Liu W, Anguelov D, Erhan D, et al. SSD: Single shot multibox detector [C]//Computer Vision-ECCV 2016, 2016, 9905: 21-37.
[7]	Redmon J, Divvala S, Girshick R, et al. You only look once: unified, real-time object detection [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016: 779-788.
[8]	Redmon J, Farhadi A. YOLO9000: better, faster, stronger [C]//30th IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017), 2017: 6517-6525.
[9]	Li S, Li Y, Li Y, et al. YOLO-FIRI: improved YOLOv5 for infrared image object detection [J]. IEEE Access, 2021, 9: 141861-141875. doi: 10.1109/ACCESS.2021.3120870
[10]	Zhou L, Gao S, Wang S, et al. IPD-Net: infrared pedestrian detection network via adaptive feature extraction and coordinate information fusion [J]. Sensors, 2022, 22(22): 8966. doi: 10.3390/s22228966
[11]	Bai Y, Li R, Gou S, et al. Cross-connected bidirectional pyramid network for infrared small-dim target detection [J]. IEEE Geoscience and Remote Sensing Letters, 2022, 19: 7506405.
[12]	Lv G, Dong L, Liang J, et al. Novel asymmetric pyramid aggregation network for infrared dim and small target detection [J]. Remote Sensing, 2022, 14(22): 5643. doi: 10.3390/rs14225643
[13]	Du S, Zhang P, Zhang B, et al. Weak and occluded vehicle detection in complex infrared environment based on improved YOLOv4 [J]. IEEE Access, 2021, 9: 25671-25680. doi: 10.1109/ACCESS.2021.3057723
[14]	Long Y, Jin D, Wu Z, et al. Accurate identification of infrared ship in island-shore background based on visual attention [C]//2022 IEEE Asia-Pacific Conference on Image Processing, Electronics and Computers (IPEC), 2022: 800-806.
[15]	Xu Z, Zhuang J, Liu Q, et al. Benchmarking a large-scale FIR dataset for on-road pedestrian detection [J]. Infrared Physics & Technology, 2019, 96: 199-208.
[16]	Karasawa T, Watanabe K, Ha Q, et al. Multispectral object detection for autonomous vehicles [C]//Proceedings of The Thematic Workshops of ACM Multimedia 2017 (Thematic Workshops' 17), 2017: 35-43.
[17]	Hu J, Shen L, Sun G, et al. Squeeze-and-excitation networks [C]//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2018: 7132-7141.
[18]	Woo S, Park J, Lee J-Y, et al. CBAM: convolutional block attention module [C]//Computer Vision-ECCV 2018, PT VII, 2018, 11211: 3-19.
[19]	Hou Q, Zhou D, Feng J, et al. Coordinate attention for efficient mobile network design [C]//2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2021, 2021: 13708-13717.
[20]	Song J Y, Zhao Y, Song W L, et al. Fisheye image detection of trees using improved YOLOX for tree height estimation [J]. Sensors, 2022, 22(10): 3636. doi: 10.3390/s22103636

[1]	薛珊, 安宏宇, 吕琼莹, 曹国华. 复杂背景下基于YOLOv7-tiny的图像目标检测算法 . 红外与激光工程, 2024, 53(1): 20230472-1-20230472-12. doi: 10.3788/IRLA20230472
[2]	刘芬, 孙杰, 张帅, 桑宏强, 孙秀军. 基于YOLOv5的红外船舶目标检测算法 . 红外与激光工程, 2023, 52(10): 20230006-1-20230006-12. doi: 10.3788/IRLA20230006
[3]	高凡, 杨小冈, 卢瑞涛, 王思宇, 高久安, 夏海. Anchor-free轻量级红外目标检测方法（特邀） . 红外与激光工程, 2022, 51(4): 20220193-1-20220193-9. doi: 10.3788/IRLA20220193
[4]	张景程, 乔新博, 赵永强. 红外偏振摄像机动目标检测跟踪系统（特邀） . 红外与激光工程, 2022, 51(4): 20220233-1-20220233-10. doi: 10.3788/IRLA20220233
[5]	蒋昕昊, 蔡伟, 杨志勇, 徐佩伟, 姜波. 基于YOLO-IDSTD算法的红外弱小目标检测 . 红外与激光工程, 2022, 51(3): 20210106-1-20210106-10. doi: 10.3788/IRLA20210106
[6]	韩金辉, 魏艳涛, 彭真明, 赵骞, 陈耀弘, 覃尧, 李楠. 红外弱小目标检测方法综述 . 红外与激光工程, 2022, 51(4): 20210393-1-20210393-24. doi: 10.3788/IRLA20210393
[7]	李奕铎, 郭子博, 刘凯, 孙逍遥. 基于误差限制的神经网络混合精度量化方法（特邀） . 红外与激光工程, 2022, 51(4): 20220166-1-20220166-8. doi: 10.3788/IRLA20220166
[8]	庞忠祥, 刘勰, 刘桂华, 龚泿军, 周晗, 罗洪伟. 并行多特征提取网络的红外图像增强方法 . 红外与激光工程, 2022, 51(8): 20210957-1-20210957-9. doi: 10.3788/IRLA20210957
[9]	孙鹏, 于跃, 陈嘉欣, 秦翰林. 基于深度空时域特征融合的高动态空中多形态目标检测方法(特邀) . 红外与激光工程, 2022, 51(4): 20220167-1-20220167-8. doi: 10.3788/IRLA20220167
[10]	蔡仁昊, 程宁, 彭志勇, 董施泽, 安建民, 金钢. 基于深度学习的轻量化红外弱小车辆目标检测算法研究 . 红外与激光工程, 2022, 51(12): 20220253-1-20220253-11. doi: 10.3788/IRLA20220253
[11]	贺其恭, 贾晓东. 激光/毫米波双模融合近炸探测目标检测技术 . 红外与激光工程, 2021, 50(7): 20200361-1-20200361-9. doi: 10.3788/IRLA20200361
[12]	徐云飞, 张笃周, 王立, 华宝成. 非合作目标局部特征识别轻量化特征融合网络设计 . 红外与激光工程, 2020, 49(7): 20200170-1-20200170-7. doi: 10.3788/IRLA20200170
[13]	陈明, 赵连飞, 苑立民, 徐峰, 韩默. 基于特征选择YOLOv3网络的红外图像绝缘子检测方法 . 红外与激光工程, 2020, 49(S2): 20200401-20200401. doi: 10.3788/IRLA20200401
[14]	赵晓枫, 徐明扬, 王聃漂, 杨佳星, 张志利. 基于改进SSD的特种车辆红外伪装检测方法 . 红外与激光工程, 2019, 48(11): 1104003-1104003(10). doi: 10.3788/IRLA201948.1104003
[15]	南天章, 耿建君, 陈旭, 陈颖. 基于邻域特征的红外低慢小目标检测 . 红外与激光工程, 2019, 48(S1): 174-180. doi: 10.3788/IRLA201948.S128002
[16]	程全, 樊宇, 刘玉春, 王志良. 多特征融合的车辆识别技术 . 红外与激光工程, 2018, 47(7): 726003-0726003(6). doi: 10.3788/IRLA201847.0726003
[17]	陈善静, 康青, 顾忠征, 王正刚, 沈志强, 蒲欢, 辛颖. 基于三维GMRF的高光谱图像空天融合目标检测 . 红外与激光工程, 2016, 45(S2): 132-139. doi: 10.3788/IRLA201645.S223003
[18]	孙照蕾, 惠斌, 秦莫凡, 常铮, 罗海波, 夏仁波. 红外图像显著目标检测算法 . 红外与激光工程, 2015, 44(9): 2633-2637.
[19]	黎志华, 李新国. 基于OpenCV的红外弱小运动目标检测与跟踪 . 红外与激光工程, 2013, 42(9): 2561-2565.
[20]	杨亚威, 李俊山, 杨威, 赵方舟. 利用稀疏化生物视觉特征的多类多视角目标检测方法 . 红外与激光工程, 2012, 41(1): 267-272.

点击查看大图

图(10) / 表(2)

计量

文章访问数: 207
HTML全文浏览量: 82
PDF下载量: 72
被引次数: 0

全文HTML

0. 引　言

红外车辆检测技术是一种基于红外技术的非接触式车辆检测技术，通过红外传感器对路面上的车辆进行实时监测和识别。红外车辆检测具有高效、准确、无人值守等优点，在交通中的作用尤为重要。

YOLO系列算法是一种基于单阶段(one-stage)的目标检测算法，并且在速度和准确率上具有优势。YOLOv3^[1]在前代基础上使用了特征金字塔结构（Feature Pyramid Network, FPN），增加了更多的先验框，进一步提升了准确率。YOLOv4^[2]是目标检测领域中先进的方法之一，通过改进网络结构并采用多种优化策略取得了更好的性能表现。而YOLOv5具有高精度、快速推理和易部署等特点，被广泛应用于工业界。

基于深度学习的目标检测算法可分为两阶段算法和一阶段算法。两阶段算法通过Region Proposal Network （RPN）结构生成一系列候选框，然后对这些候选框进行分类与位置回归，如R-CNN^[3]、Fast R-CNN^[4]、Faster R-CNN^[5]；而一阶段算法不需要产生候选框，直接生成类别概率和坐标等信息，如SSD^[6]、YOLO系列^[7-8]等。两阶段算法检测精度较高，但速度较慢，难以完成实时检测任务。一阶段算法单次检测直接得到最终结果，但精度稍低于两阶段检测算法。

通常情况下，红外图像的分辨率比可见光图像低。这主要是由于红外光的波长较长，相同大小的探测器下包含的像素数量减少。同时，由于红外光的波长长、传输距离远，在大气中传输时会衰减，这也导致了红外图像的对比度相对较低。因此，红外小目标的边缘轮廓通常更难以辨别。为此，Li等人提出了YOLO-FIRI，将注意力机制加入残差块中，并改进CSP结构，改善网络对鲁棒性特征的学习^[9]。Zhou等人设计了自适应特征提取模块，并在FPN中引入注意力机制（Coordinate Attention, CA），提高了行人红外图像的检测精度^[10]。Bai等人提出了CBP-Net，设计了交叉连接的双向金字塔与区域特征增强模块，提高了模型对红外弱小目标的检测性能^[11]。Lv等人在FPN中使用了双注意力机制，并使用门控聚合路径（Deep Aggregation-Gating Pathway, DAGP）增强模型对小目标的检测能力^[12]。Du等人在YOLOv4基础上进行改进，引入难例挖掘模块（Hard Example Mining Module），提高了模型对遮挡车辆的识别能力^[13]。Long等人通过在YOLOv5中添加CBAM注意力机制，并使用扩张卷积，解决了在复杂背景干扰下红外舰船识别率低的问题^[14]。

文中针对红外车辆检测的实际需要，提出了一种改进的YOLOv5检测方法。在主干施加混合注意力机制以优化提取到的特征。优化模型Neck部分的特征融合方式，从而更高效地利用提取到的特征。使用解耦头的同时，增加了一个小目标检测层，提高模型对小目标车辆的捕获能力。为了提高模型的泛化能力，使用了由笔者自行采集、华南理工大学的SCUT_FIR_Pedestrian_Dataset^[15]以及东京大学的MULTISPECTRAL DATASET^[16]三部分构成的红外车辆数据集INFrared-417来训练笔者的模型。

3. 结　论

在该研究中基于YOLOv5提出了一种改进的红外车辆检测模型。通过引入混合注意力来使模型能够更好地关注图像中的车辆区域，增强模型提取特征的能力。在模型颈部使用改进后的Z-BiFPN，充分利用提取到的特征，并将其进行高效地融合。同时将检测头更换为更先进的Decoupled Head以提高检测能力，并增加一个小目标检测层去捕获小目标。在模型参数量小幅度增加的同时，mAP值从81.1%提高至85.3%，准确率提高了1.7%，召回率提高了3.6%。后续将会在嵌入式设备上对模型进行部署。

参考文献 (20)

姓名
邮箱
手机号码
标题
留言内容
验证码

留言板

基于改进YOLOv5的红外车辆检测方法

doi: 10.3788/IRLA20230245

作者简介:
张学志，男，硕士生，主要从事计算机视觉、深度学习方面的研究

An infrared vehicle detection method based on improved YOLOv5

计量

基于改进YOLOv5的红外车辆检测方法

doi: 10.3788/IRLA20230245

1. 河北工业大学电子信息工程学院，天津 300401

2. 电磁空间安全全国重点实验室，天津 300308

作者简介:
张学志，男，硕士生，主要从事计算机视觉、深度学习方面的研究

English Abstract

An infrared vehicle detection method based on improved YOLOv5

1. School of Electronic and Information Engineering, Hebei University of Technology, Tianjin 300401, China

2. National Key Laboratory of Electromagnetic Space Security, Tianjin 300308, China

全文HTML

1.1. YOLOv5

1.2. 混合注意力机制

1.3. 改进的多尺度融合结构

1.4. 增强型YOLO Head

2.1. 实验环境配置

2.2. 红外数据集INFrared-417

2.3. 评估指标

2.4. 消融实验

2.5. 对比实验

2.6. 结果分析

目录

留言板

基于改进YOLOv5的红外车辆检测方法

doi: 10.3788/IRLA20230245

作者简介: 张学志，男，硕士生，主要从事计算机视觉、深度学习方面的研究

An infrared vehicle detection method based on improved YOLOv5

计量

出版历程

基于改进YOLOv5的红外车辆检测方法

doi: 10.3788/IRLA20230245

1. 河北工业大学 电子信息工程学院，天津 300401 2. 电磁空间安全全国重点实验室，天津 300308

作者简介: 张学志，男，硕士生，主要从事计算机视觉、深度学习方面的研究

English Abstract

An infrared vehicle detection method based on improved YOLOv5

1. School of Electronic and Information Engineering, Hebei University of Technology, Tianjin 300401, China 2. National Key Laboratory of Electromagnetic Space Security, Tianjin 300308, China

全文HTML

1.1. YOLOv5

1.2. 混合注意力机制

1.3. 改进的多尺度融合结构

1.4. 增强型YOLO Head

2.1. 实验环境配置

2.2. 红外数据集INFrared-417

2.3. 评估指标

2.4. 消融实验

2.5. 对比实验

2.6. 结果分析

目录

作者简介:
张学志，男，硕士生，主要从事计算机视觉、深度学习方面的研究

1. 河北工业大学电子信息工程学院，天津 300401

2. 电磁空间安全全国重点实验室，天津 300308

作者简介:
张学志，男，硕士生，主要从事计算机视觉、深度学习方面的研究

1. School of Electronic and Information Engineering, Hebei University of Technology, Tianjin 300401, China

2. National Key Laboratory of Electromagnetic Space Security, Tianjin 300308, China