Transformer-based multi-source image instance segmentation network for composite materials

Ke Yan (柯岩); Fu Yun (傅云); Zhou Weizhu (周玮珠); Zhu Weidong (朱伟东)

doi:10.3788/IRLA20220338

1. School of Mechanical Engineering, Zhejiang University, Hangzhou 310027, China
2. Xizi Spirit Aerospace Industry (Zhejiang) Ltd, Hangzhou 310018, China

Funds: "Pioneer" and "Leading Goose" R&D Program of Zhejiang (2022C01134)

Author biography: Ke Yan, male, master's student; research interests include machine vision and deep learning.

CLC number: V261.97; TB332; TP183

Abstract: To improve the quality of automated fiber placement and help on-site personnel detect defects quickly, this paper proposes Trans-Yolact, a Transformer-based real-time instance segmentation network for multi-source images of composite materials, which detects, classifies, and segments composite material defects. Building on the Yolact framework and targeting the characteristics of composite defects, the network's detection ability is strengthened along two dimensions, the spatial domain and the channel domain. In the spatial domain, conventional convolution kernels are limited in spatial extent and perform poorly on narrow, elongated defects and on large defects. BoTNet, which has a CNN+Transformer architecture, is therefore adopted as the backbone, and a Transformer is also introduced into the FPN structure of Yolact to strengthen the network's ability to gather information from non-local regions. In the channel domain, infrared and visible images are used jointly for detection, and the shallow layers of the backbone are restructured into a visible channel, an infrared channel, and a mixed channel; a channel attention mechanism is introduced into the mixed channel to further strengthen the network's joint interpretation of infrared and visible images. Experiments show that Trans-Yolact reaches an mAP of 88.0% on composite defects, 5.5 percentage points higher than the baseline Yolact; the AP of narrow, elongated defects such as missing tows and twists rises by 15.2 and 5.1 percentage points, and the AP of foreign-object defects, which include some large defects, rises by 9.1 percentage points. Finally, structured pruning is applied to Trans-Yolact. Compared with the baseline Yolact, the pruned network requires 26.5% fewer floating-point operations (FLOPs) and 44.7% fewer parameters, and its detection rate is 58% higher, reaching 57.67 fps. Online tests on a large gantry-type automated composite placement machine show that the network supports real-time defect detection and segmentation at the maximum production laying speed of 1.2 m/s.

Keywords:
- defect detection
- multi-spectrum fusion
- automated fiber placement
- deep learning
- instance segmentation

Figure 1. Acquisition platform for composite material defect images

Figure 2. (a) Original infrared image; (b) Enhanced reflection of the calibration plate; (c) Infrared image before distortion correction; (d) Infrared image after distortion correction

Figure 3. (a) Before profiling transformation; (b) After profiling transformation

Figure 4. (a) Visible image; (b) Infrared image; (c) Defect label

Figure 5. Network structure of Yolact

Figure 6. Network structure of Trans-Yolact

Figure 7. Multi-Head Self-Attention (MHSA) structure

Figure 8. FPN structure with Transformer

Figure 9. (a) Self-Transformer (ST); (b) Grounding Transformer (GT); (c) Rendering Transformer (RT)

Figure 10. Loss of Trans-Yolact

Figure 11. PR test curves of six composite defect types

Figure 12. (a) Visible image; (b) Infrared image; (c) Yolact detection results; (d) Trans-Yolact detection results

Figure 13. Performance of Yolact and Trans-Yolact at different network depths

Figure 14. Field detection experiment

Table 1. AP of six composite defect types (IR: infrared input; VI: visible input; Total: mean AP over the six types)

Method	Twist	Wrinkle	Bridge	Bubble	Miss	Foreign	Total
Yolact IR VI	0.771	0.843	0.959	0.783	0.736	0.856	0.825
Yolact IR	0.715	0.857	0.904	0.632	0.722	0.899	0.788
Yolact VI	0.501	0.872	0.929	0.730	0.685	0.820	0.756
Backbone improved	0.790	0.870	0.933	0.807	0.836	0.910	0.857
Transformer improved	0.806	0.886	0.952	0.740	0.857	0.905	0.858
Trans-Yolact	0.822	0.889	0.948	0.788	0.888	0.947	0.880
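The headline accuracy figures quoted in the abstract can be checked directly against Table 1. The short sketch below copies two rows of the table (the two-spectrum baseline and Trans-Yolact), takes the unweighted mean of the six per-class AP values, and reports per-class changes in percentage points. The function and variable names are illustrative only and do not come from the paper.

```python
# Per-class AP values copied from Table 1, in the column order
# Twist, Wrinkle, Bridge, Bubble, Miss, Foreign.
CLASSES = ["Twist", "Wrinkle", "Bridge", "Bubble", "Miss", "Foreign"]
AP = {
    "Yolact IR VI": [0.771, 0.843, 0.959, 0.783, 0.736, 0.856],
    "Trans-Yolact": [0.822, 0.889, 0.948, 0.788, 0.888, 0.947],
}


def mean_ap(per_class_ap):
    """Unweighted mean of the per-class AP values."""
    return sum(per_class_ap) / len(per_class_ap)


def gains_in_points(baseline, improved):
    """Per-class AP change in percentage points (improved minus baseline)."""
    return {
        name: round((new - old) * 100, 1)
        for name, old, new in zip(CLASSES, baseline, improved)
    }


baseline_map = round(mean_ap(AP["Yolact IR VI"]), 3)
improved_map = round(mean_ap(AP["Trans-Yolact"]), 3)
gains = gains_in_points(AP["Yolact IR VI"], AP["Trans-Yolact"])

print("mAP:", baseline_map, "->", improved_map)
for name in CLASSES:
    print(f"{name}: {gains[name]:+.1f} points")
```

The rounded means reproduce the Total column (0.825 and 0.880), and the Miss, Twist, and Foreign gains come out at 15.2, 5.1, and 9.1 points. The same calculation shows that Bridge changes by -1.1 points and Bubble by +0.5, so the improvement is concentrated in the narrow, elongated and large defect types.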

Table 2. Network structures and test data after pruning optimization (√: module used; ×: module not used)

Item	Trans-Yolact Ⅰ	Trans-Yolact Ⅱ	Trans-Yolact Ⅲ	Trans-Yolact Ⅳ	Trans-Yolact Ⅴ	Yolact Ⅰ	Yolact Ⅱ	Yolact Ⅲ
C1	1	1	1	1	1	1	1	1
C2(SE)	3(√)	3(√)	3(√)	3(√)	3(√)	3(×)	3(×)	3(×)
C3(SE)	4(√)	4(√)	4(√)	4(√)	4(√)	4(×)	4(×)	4(×)
C4	23	18	12	8	4	23	12	4
C5(MHSA)	3(√)	3(√)	3(√)	3(√)	3(√)	3(×)	3(×)	3(×)
FLOPs	79.79 G	71.95 G	63.74 G	58.27 G	50.11 G	79.28 G	64.22 G	53.27 G
Params	44.69 M	38.62 M	31.92 M	27.45 M	22.59 M	49.62 M	37.34 M	28.40 M
File size	176.6 M	152.8 M	126.5 M	108.9 M	89.8 M	194.5 M	146.3 M	111.2 M
mAP (%)	88.03	87.63	86.69	86.03	83.50	82.48	82.10	80.50
FPS	37.72	45.27	55.68	57.67	59.03	36.53	42.40	48.25
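The pruning figures quoted in the abstract (26.5% fewer FLOPs, 44.7% fewer parameters, 58% higher frame rate at 57.67 fps) correspond to the Trans-Yolact Ⅳ column compared with the unpruned Yolact Ⅰ column. A minimal sketch of that comparison follows; the values are copied from Table 2, while the dictionary layout and helper names are assumptions made for illustration.

```python
# (FLOPs in G, parameters in M, mAP in %, FPS), copied from Table 2.
TRANS_YOLACT = {
    "I": (79.79, 44.69, 88.03, 37.72),
    "II": (71.95, 38.62, 87.63, 45.27),
    "III": (63.74, 31.92, 86.69, 55.68),
    "IV": (58.27, 27.45, 86.03, 57.67),
    "V": (50.11, 22.59, 83.50, 59.03),
}
YOLACT_I = (79.28, 49.62, 82.48, 36.53)  # unpruned baseline


def percent_change(old, new):
    """Relative change from old to new, in percent."""
    return (new - old) / old * 100.0


def compare(variant, baseline=YOLACT_I):
    """Compare one Trans-Yolact variant with the baseline network."""
    flops, params, m_ap, fps = TRANS_YOLACT[variant]
    b_flops, b_params, b_map, b_fps = baseline
    return {
        "flops_pct": percent_change(b_flops, flops),
        "params_pct": percent_change(b_params, params),
        "fps_pct": percent_change(b_fps, fps),
        "map_points": m_ap - b_map,  # difference in percentage points
    }


for name in TRANS_YOLACT:
    r = compare(name)
    print(
        f"Trans-Yolact {name}: FLOPs {r['flops_pct']:+.1f}%, "
        f"params {r['params_pct']:+.1f}%, FPS {r['fps_pct']:+.1f}%, "
        f"mAP {r['map_points']:+.2f} points"
    )
```

For variant Ⅳ this gives roughly -26.5% FLOPs, -44.7% parameters, and +58% FPS. Note that the mAP of the pruned variant Ⅳ in the table is 86.03%, about 3.5 points above the baseline, whereas the 88.0% figure belongs to the unpruned variant Ⅰ.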

[1]	Brüning J, Denkena B, Dittrich M A, et al. Machine learning approach for optimization of automated fiber placement processes [J]. Procedia CIRP, 2017, 66: 74-78. doi: 10.1016/j.procir.2017.03.295
[2]	Harik R, Saidy C, Williams S J, et al. Automated fiber placement defect identity cards: Cause, anticipation, existence, significance, and progression[R]. 2018.
[3]	Rudberg T, Nielson J, Henscheid M, et al. Improving AFP cell performance [J]. SAE International Journal of Aerospace, 2014, 7(2): 317. doi: 10.4271/2014-01-2272
[4]	Wen L W, Song Q H, Qin L H, et al. Defect detection and closed-loop control system for automated fiber placement forming components based on machine vision and UMAC [J]. Acta Aeronautica et Astronautica Sinica, 2015, 36(12): 3991-4000. (in Chinese)
[5]	Wei T S. Research on image detection method for defects of composite prepreg tapes[D]. Zibo: Shandong University of Technology, 2018. (in Chinese)
[6]	Ritter J A, Sjogren J F. Real-time infrared thermography inspection and control for automated composite material layup: US Patent 7,513,964[P]. 2009-04-07.
[7]	Denkena B, Schmidt C, Völtzer K, et al. Thermographic online monitoring system for automated fiber placement processes [J]. Composites Part B: Engineering, 2016, 97: 239-243. doi: 10.1016/j.compositesb.2016.04.076
[8]	Schmidt C, Denkena B, Völtzer K, et al. Thermal image-based monitoring for the automated fiber placement process [J]. Procedia CIRP, 2017, 62: 27-32. doi: 10.1016/j.procir.2016.06.058
[9]	Chen M, Jiang M, Liu X, et al. Intelligent inspection system based on infrared vision for automated fiber placement[C]//2018 IEEE International Conference on Mechatronics and Automation (ICMA). IEEE, 2018: 918-923.
[10]	Wang X, Kang S, Zhu W D. Defect detection of laminated surface in the automated fiber placement process based on improved CenterNet [J]. Infrared and Laser Engineering, 2021, 50(10): 20210011. (in Chinese) doi: 10.3788/IRLA20210011
[11]	Gregory E D, Juarez P D. In-situ thermography of automated fiber placement parts[C]//AIP Conference Proceedings, 2018, 1949(1): 060005.
[12]	Juarez P D, Gregory E D. In situ thermal inspection of automated fiber placement manufacturing[C]//AIP Conference Proceedings, 2019, 2102(1): 120005.
[13]	Juarez P D, Gregory E D. In situ thermal inspection of automated fiber placement for manufacturing induced defects [J]. Composites Part B: Engineering, 2021, 220: 109002. doi: 10.1016/j.compositesb.2021.109002
[14]	Kang S, Ke Z Z, Wang X, et al. Detection method of defects in automatic fiber placement based on fusion of infrared and visible images [J]. Acta Aeronautica et Astronautica Sinica, 2022, 43(3): 556-568. (in Chinese)
[15]	Sacco C, Radwan A B, Anderson A, et al. Machine learning in composites manufacturing: A case study of automated fiber placement inspection [J]. Composite Structures, 2020, 250: 112514. doi: 10.1016/j.compstruct.2020.112514
[16]	Sacco C. Machine learning methods for rapid inspection of automated fiber placement manufactured composite structures[D]. US: University of South Carolina, 2019: 57-68.
[17]	Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need[C]//31st Annual Conference on Neural Information Processing Systems, 2017: 5999-6009.
[18]	Carion N, Massa F, Synnaeve G, et al. End-to-end object detection with transformers[C]//European Conference on Computer Vision. Springer, 2020: 213-229.
[19]	Zhu X, Su W, Lu L, et al. Deformable DETR: Deformable transformers for end-to-end object detection[EB/OL]. (2020-10-08)[2022-06-20]. https://arxiv.org/abs/2010.04159.
[20]	Zheng S, Lu J, Zhao H, et al. Rethinking semantic segmentation from a sequence-to-sequence perspective with transformers[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021: 6881-6890.
[21]	Dosovitskiy A, Beyer L, Kolesnikov A, et al. An image is worth 16x16 words: Transformers for image recognition at scale[C]//International Conference on Learning Representations, 2020.
[22]	Srinivas A, Lin T Y, Parmar N, et al. Bottleneck transformers for visual recognition[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021: 16519-16529.
[23]	Zhang Z. Flexible camera calibration by viewing a plane from unknown orientations[C]//Proceedings of the Seventh IEEE International Conference on Computer Vision. IEEE, 1999: 666-673.
[24]	Bolya D, Zhou C, Xiao F, et al. Yolact: Real-time instance segmentation[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019: 9157-9166.
[25]	Hu J, Shen L, Sun G. Squeeze-and-excitation networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018: 7132-7141.
[26]	Zhang D, Zhang H, Tang J, et al. Feature pyramid transformer[C]//European Conference on Computer Vision. Springer, 2020: 323-339.


0. Introduction

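Carbon fiber composites are widely used in aerospace because they are lightweight, strong, and resistant to high temperatures^[1]. As their use grows, more and more composite components are manufactured by automated fiber placement (AFP). Depending on the placement path, compaction roller pressure, preheating temperature, and other factors, defects such as missing tows, gaps, overlaps, wrinkles, twists, bridging, bubbles, and foreign objects can appear on the composite surface^[2]. Defects were initially inspected manually, but this took too long (about 32% of production time^[3]). To shorten inspection time, placement teams moved inspection upstream into the process and developed online inspection systems for composites.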

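By sensor type, these systems can be divided into visible-light inspection and thermal infrared inspection. Among systems based on visible images, Wen et al.^[4] combined a visible-light camera with UMAC to detect gap and overlap defects and to control the placement head in a closed loop. Wei et al.^[5] used an image segmentation algorithm to extract defect edges and classify the defects. Because visible images of composites have very low contrast, these methods work well only for defects whose brightness differs markedly from the background.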

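Inspection systems based on thermal infrared images exploit the fact that the composite must be heated by a thermal excitation source during placement: because different defect types differ in heat conduction mode and thermal conductivity, they stand out clearly from the normal placed surface in infrared images. Boeing was the first to integrate a thermal infrared camera into an AFP system^[6]. Denkena et al.^[7-8] set thresholds based on temperature extrema in the infrared image to segment defect regions, detecting twist, gap, overlap, bridging, and foreign-object defects. Chen et al.^[9] designed an infrared-vision-based composite defect inspection system that supports intelligent decision-making, parameter optimization, and quality tracing. Wang et al.^[10] designed AFP-CenterNet, an infrared-image-based surface inspection network for composites that detects gaps, missing tows, twists, bubbles, wrinkles, and foreign objects at 4.2 fps without GPU acceleration. Gregory et al.^[11] used information reconstruction and video analysis to locate and evaluate defects. Juarez et al.^[12] used machine learning with a thermal infrared camera to detect overlap, gap, wrinkle, debonding, and twist defects, and found that infrared thermography can also detect reduced tack between placed layers. More recently, NASA^[13] proposed ISTIS, an infrared-thermography-based composite defect inspection system that uses infrared thermal images to detect overlap and gap defects in real time during placement. The system was deployed on the AFP system at NASA Langley for online testing and could detect defects with actual sizes within 0.762 mm. These studies show that defects involving abrupt changes in heat conduction mode or thermal conductivity are detected more effectively by infrared-image-based systems.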

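Building on this work, Kang et al.^[14] were the first to establish a composite inspection system that uses infrared and visible images jointly. The infrared and visible images are fed into two separate CSP-DarkNet networks, and the resulting feature maps are passed to an improved feature pyramid network for multi-scale prediction. The mAP was 6.3% higher than with single-spectrum detection, confirming that joint infrared and visible detection further improves composite inspection.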

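In terms of output, composite inspection systems are evolving from pure object detection toward instance segmentation networks that recover defect contours. Conventional object detection networks provide only the size and position of a defect, not finer details such as its contour or area. To better help on-site personnel identify defects and to support product quality tracing, Sacco et al.^[15-16] proposed a deep-learning-based defect segmentation algorithm. Visible images are acquired online; missing-tow, gap, overlap, twist, and wrinkle defects are detected, and the images are semantically segmented to record defect contours. Embedded in ACSIS, an inspection system for AFP ply surface defects, the network achieves a detection accuracy above 75%.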

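Missing-tow and twist defects in composites have aspect ratios between 4 and 12, which makes them narrow, elongated defects. Conventional convolution kernels are limited in spatial extent and struggle to make judgments from a global view of the image, so detecting these defects is a difficult point in composite inspection. To overcome the limitation that convolution kernels aggregate only local spatial information, the mainstream approach in computer vision is to introduce the Transformer^[17] into CNNs to strengthen the integration of non-local spatial information. This has produced impressive results in object detection (DETR^[18], Deformable-DETR^[19]) and image segmentation (SETR^[20]). Networks that incorporate Transformers can be further divided into pure-Transformer architectures (such as ViT^[21]) and CNN+Transformer architectures (such as BoTNet^[22]). Pure-Transformer architectures must be pre-trained on large datasets and then fine-tuned on the target dataset, otherwise they are hard to converge; they also lack the inductive biases of CNNs. CNN+Transformer architectures instead turn the core Transformer mechanism (self-attention) into a module that is inserted into a CNN, combining the translation invariance of CNNs with the Transformer's ability to capture global information.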
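To make the contrast between local convolution and global self-attention concrete, here is a minimal sketch of single-head scaled dot-product self-attention in plain Python. It is a simplified illustration, not the MHSA block used in BoTNet or Trans-Yolact (which uses multiple heads and position encodings); the toy feature values and the identity projection matrices are assumptions chosen so the result is easy to inspect.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]


def self_attention(tokens, Wq, Wk, Wv):
    """Single-head scaled dot-product self-attention over token vectors."""
    Q = [matvec(Wq, t) for t in tokens]
    K = [matvec(Wk, t) for t in tokens]
    V = [matvec(Wv, t) for t in tokens]
    d = len(Q[0])
    outputs, weights = [], []
    for q in Q:
        # Each query is scored against every key, near or far.
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in K]
        w = softmax(scores)
        weights.append(w)
        outputs.append(
            [sum(wi * v[j] for wi, v in zip(w, V)) for j in range(len(V[0]))]
        )
    return outputs, weights


# Toy 1x8 strip of feature-map positions (hypothetical values): positions 0
# and 7 are the two ends of an elongated defect, the rest is background.
defect, background = [2.0, 0.0], [0.0, 1.0]
tokens = [defect] + [background] * 6 + [defect]
identity = [[1.0, 0.0], [0.0, 1.0]]  # hypothetical projection weights

outputs, weights = self_attention(tokens, identity, identity, identity)
print("attention from position 0:", [round(w, 3) for w in weights[0]])
```

In this toy case position 0 attends to position 7, seven steps away, as strongly as to itself, and more strongly than to its immediate neighbor. A 3-wide convolution centered on position 0 would only see positions 0 and 1, which is the spatial limitation the paragraph above describes.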

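This paper designs Trans-Yolact (Transformer Yolact), a Transformer-based real-time instance segmentation network for multi-source images of composite materials. The joint infrared and visible detection approach, which gives the best detection results, is adopted, and an image segmentation module is added in parallel in the detection network so that composite defects are detected, classified, and segmented in real time. To match the characteristics of composite defects, the Transformer is introduced in the spatial domain and the backbone's information extraction is improved in the channel domain, further optimizing the network. Section 1 describes how the composite image dataset was captured and how the infrared and visible images were rectified and registered, and analyzes the characteristics of composite defects. Section 2 proposes targeted improvements based on the defect characteristics and the properties of the baseline Yolact network, and explains in detail the mathematical principle by which the Transformer strengthens the network's access to global image information. Section 3 validates the network in terms of convergence stability and detection accuracy; on that basis, the network is pruned, deployed in a real-time composite inspection system, and validated online.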

4. Conclusion

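This paper proposes Trans-Yolact, a Transformer-based instance segmentation network for composite materials that uses infrared and visible images jointly. With the real-time instance segmentation network Yolact as the base framework, the network's ability to integrate information is strengthened along two dimensions, the spatial domain and the channel domain. The improved Trans-Yolact reaches a detection accuracy (mAP) of 88.0% on composite defects, 5.5 percentage points higher than the baseline. After structured pruning, the network requires 26.5% fewer floating-point operations (FLOPs) and 44.7% fewer parameters than the baseline Yolact, and its detection rate is 58% higher, reaching 57.67 fps. Finally, online tests on a large gantry-type automated composite placement machine show that the network supports real-time detection at the maximum production laying speed of 1.2 m/s. The following conclusions are drawn: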

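(1) Introducing the Transformer in the spatial domain markedly strengthens the network's ability to gather information from non-local regions. Because conventional convolution kernels are limited in spatial extent, BoTNet, which has a CNN+Transformer structure, is chosen as the backbone, and a Transformer-based FPN structure is adopted. Compared with the baseline network, Trans-Yolact improves detection accuracy (AP) by 15.2 and 5.1 percentage points for the narrow, elongated missing-tow and twist defects, and by 9.1 percentage points for foreign-object defects, which include some large defects.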

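(2) In the channel domain, joint infrared and visible detection raises mAP by 3.7 and 6.8 percentage points over infrared-only and visible-only images respectively, improving composite inspection. On this basis, restructuring the shallow layers of the backbone and introducing a channel attention mechanism (the SE module) raises detection accuracy by a further 3.2 percentage points, strengthening the mixed channel's joint interpretation of infrared and visible images.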
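The channel attention mentioned in conclusion (2) follows the squeeze-and-excitation (SE) idea of reference [25]: pool each channel to one number, pass the result through a small two-layer gate, and rescale the channels. The plain-Python sketch below shows that mechanism only. The channel layout (two visible, two infrared), the reduction to two hidden units, and all weight values are hypothetical; in a trained network the weights are learned, and the exact configuration used in Trans-Yolact is not given in this excerpt.

```python
import math


def squeeze(feature_maps):
    """Global average pooling: one scalar per channel."""
    return [
        sum(sum(row) for row in ch) / (len(ch) * len(ch[0]))
        for ch in feature_maps
    ]


def excite(z, W1, W2):
    """Two fully connected layers: ReLU, then sigmoid gate per channel."""
    hidden = [max(0.0, sum(w * v for w, v in zip(row, z))) for row in W1]
    logits = [sum(w * h for w, h in zip(row, hidden)) for row in W2]
    return [1.0 / (1.0 + math.exp(-x)) for x in logits]


def se_block(feature_maps, W1, W2):
    """Rescale every channel by its learned gate value."""
    gates = excite(squeeze(feature_maps), W1, W2)
    scaled = [
        [[v * g for v in row] for row in ch]
        for ch, g in zip(feature_maps, gates)
    ]
    return scaled, gates


# Hypothetical mixed-channel input: channels 0-1 from the visible image
# (weak response), channels 2-3 from the infrared image (strong response).
features = [
    [[0.0, 0.2], [0.1, 0.1]],
    [[0.1, 0.1], [0.1, 0.1]],
    [[1.0, 1.0], [0.5, 1.5]],
    [[2.0, 0.0], [1.0, 1.0]],
]
W1 = [[0.5, 0.5, 0.0, 0.0], [0.0, 0.0, 0.5, 0.5]]      # 4 -> 2 (hypothetical)
W2 = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]  # 2 -> 4 (hypothetical)

scaled, gates = se_block(features, W1, W2)
print("channel gates:", [round(g, 3) for g in gates])
```

With these made-up weights the infrared channels receive larger gate values than the visible channels, which illustrates how a channel gate can shift emphasis between the two spectra depending on the input.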

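Instance segmentation of composite defects provides, in addition to predicted bounding boxes, information such as defect area and contour masks, which better helps on-site personnel identify defects and supports product tracing of composite components. Some problems remain, such as blurred defect contour edges and repeated counting of defects at low laying speeds, and these require further optimization.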
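Both the real-time claim and the repeated-counting problem can be related to how far the placement head travels between two consecutive frames. The sketch below works through that arithmetic with the reported 1.2 m/s and 57.67 fps. The 100 mm field-of-view length along the laying direction and the 0.1 m/s slow speed are purely hypothetical values chosen for illustration; neither is stated in this excerpt.

```python
def travel_per_frame_mm(speed_m_per_s, fps):
    """Distance the head moves between consecutive frames, in mm."""
    return speed_m_per_s * 1000.0 / fps


def frames_seeing_point(fov_length_mm, speed_m_per_s, fps):
    """Approximate number of frames in which a fixed surface point is visible.

    Assumes constant, non-zero laying speed along the field-of-view length.
    """
    return fov_length_mm / travel_per_frame_mm(speed_m_per_s, fps)


MAX_SPEED = 1.2      # m/s, reported maximum laying speed
PRUNED_FPS = 57.67   # fps, reported for the pruned network
FOV_MM = 100.0       # hypothetical field-of-view length, not from the paper

step = travel_per_frame_mm(MAX_SPEED, PRUNED_FPS)
fast = frames_seeing_point(FOV_MM, MAX_SPEED, PRUNED_FPS)
slow = frames_seeing_point(FOV_MM, 0.1, PRUNED_FPS)  # hypothetical slow speed

print(f"travel per frame at max speed: {step:.1f} mm")
print(f"frames per surface point: {fast:.1f} (fast) vs {slow:.1f} (slow)")
```

At the maximum speed the head moves about 20.8 mm per frame. Under the assumed field of view, a defect appears in only a few frames at full speed but in many more at low speed, so without tracking or de-duplication across frames the same defect can be counted repeatedly, which matches the limitation noted above.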

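Contents (full-text sections)

1.1. Image acquisition
1.2. Image rectification and registration
1.3. Analysis of composite material defects
2.1. Overall design of the Trans-Yolact network
2.2. Introducing the Transformer into the deep layers of the backbone
2.3. Improving the shallow layers of the backbone
2.4. Transformer-based FPN structure
3.1. Training details
3.2. Analysis of experimental results
3.3. Network pruning optimization
3.4. Online deployment and validation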
