Multi-scale recurrent attention network for image motion deblurring

Wang Xiangjun; Ouyang Wensen

doi:10.3788/IRLA20210605

During image acquisition, blur caused by a moving subject or by motion of the camera itself degrades subsequent high-level vision tasks. To address the difficulty that current deep-learning deblurring methods have in balancing restoration quality against efficiency, a multi-scale recurrent attention network is proposed. It uses separable convolution to reduce the number of parameters and an improved attention module to allocate computing resources more reasonably. Convolutional layers are densely connected to improve parameter utilization, and an edge loss is introduced to sharpen edge detail in the generated image. Experiments show that the proposed method generalizes well and is robust. Compared with the best results of typical recent methods, SSIM and PSNR increase by about 1.15% and 0.86% on the Lai dataset and by about 0.91% and 1.04% on the Köhler dataset. The average single-frame running speed on the GoPro dataset is nearly 2.5 times faster than similar methods.
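
The parameter saving from separable convolution can be illustrated with a short calculation. This is a minimal sketch, assuming that "separable convolution" means a depthwise separable convolution (a per-channel k×k filter followed by a 1×1 pointwise convolution) and ignoring bias terms. The kernel size and channel counts below are hypothetical and are not taken from the paper.

```python
def conv_params(k, c_in, c_out):
    """Weights in a standard k x k convolution (bias ignored)."""
    return k * k * c_in * c_out


def separable_conv_params(k, c_in, c_out):
    """Weights in a depthwise k x k convolution plus a 1 x 1 pointwise convolution."""
    depthwise = k * k * c_in   # one k x k filter per input channel
    pointwise = c_in * c_out   # 1 x 1 convolution that mixes channels
    return depthwise + pointwise


# Hypothetical layer shape, chosen only for illustration.
k, c_in, c_out = 3, 64, 128
standard = conv_params(k, c_in, c_out)
separable = separable_conv_params(k, c_in, c_out)
ratio = separable / standard  # equals 1/c_out + 1/k**2

print(standard, separable, round(ratio, 4))
```

For this layer shape the separable form needs roughly 12% of the weights of the standard convolution; the saving grows with the number of output channels and the kernel size.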


3. Conclusion

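To address the difficulty of reconciling efficiency and quality in image motion deblurring, this paper proposes a motion deblurring method based on a multi-scale recurrent attention network. The method uses an encoder-decoder structure at each scale, with an LSTM at the neck providing cross-scale connections to emphasize information exchange between scales. The way the CBAM sub-modules are connected is redesigned so that computing resources are allocated more reasonably across different regions of the image. Dense connections between convolutional layers are introduced to improve parameter efficiency. An edge loss function is added as prior guidance, which effectively improves the edge sharpness of the generated images. Experiments show that, relative to the best results of typical recent methods, the proposed method improves the quantitative metrics SSIM and PSNR by 1.15% and 0.86%, and by 0.91% and 1.04%, respectively; it runs about 2.5 times faster than comparable methods and is more robust and generalizes better.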
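
The idea of an edge loss can be sketched as a comparison between the edge maps of the restored image and the sharp reference. The text here does not state which edge operator or norm the authors use, so the Sobel operator and the mean absolute difference below are assumptions made only for illustration; the function names and the tiny test images are hypothetical.

```python
# Hypothetical edge-loss sketch: Sobel gradient magnitude + mean absolute difference.
SOBEL_X = [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]]
SOBEL_Y = [[-1, -2, -1], [0, 0, 0], [1, 2, 1]]


def sobel_magnitude(img):
    """Gradient magnitude at the interior pixels of a 2-D grayscale image (list of rows)."""
    h, w = len(img), len(img[0])
    out = []
    for i in range(1, h - 1):
        row = []
        for j in range(1, w - 1):
            gx = sum(SOBEL_X[a][b] * img[i - 1 + a][j - 1 + b]
                     for a in range(3) for b in range(3))
            gy = sum(SOBEL_Y[a][b] * img[i - 1 + a][j - 1 + b]
                     for a in range(3) for b in range(3))
            row.append((gx * gx + gy * gy) ** 0.5)
        out.append(row)
    return out


def edge_loss(restored, sharp):
    """Mean absolute difference between the two Sobel edge maps."""
    e_r, e_s = sobel_magnitude(restored), sobel_magnitude(sharp)
    diffs = [abs(a - b) for row_r, row_s in zip(e_r, e_s) for a, b in zip(row_r, row_s)]
    return sum(diffs) / len(diffs)


# Toy 4 x 4 images: a sharp vertical step edge and a softened version of it.
sharp = [[0.0, 0.0, 1.0, 1.0] for _ in range(4)]
blurred = [[0.0, 0.25, 0.75, 1.0] for _ in range(4)]

print(edge_loss(sharp, sharp), edge_loss(blurred, sharp))
```

A loss of this kind is zero when the edge maps agree and grows as edges in the restored image become weaker or displaced, which is why it can act as a prior that favors sharp edges during training.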

CBAM connection	Output (y)
Proposed (CBAM-J)	$ x \cdot S(x) \cdot \left( 1 + C\left( x \cdot S(x) \right) \right) $
Channel + Spatial	$ x \cdot C(x) \cdot S\left( x \cdot C(x) \right) $
Spatial + Channel	$ x \cdot S(x) \cdot C\left( x \cdot S(x) \right) $
Spatial // Channel	$ x \cdot S(x) + x \cdot C(x) $
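
The four connection schemes in the table can be written out directly once S and C are available. The sketch below uses NumPy and deliberately simple stand-ins for the two attention branches (a sigmoid of the channel-wise mean for S, a sigmoid of the spatial mean for C). These stand-ins are assumptions for illustration only: the actual CBAM branches use pooling with learned convolution and MLP layers, and the tensor shape is hypothetical.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def S(x):
    """Toy spatial attention map of shape (1, H, W); stand-in for the learned branch."""
    return sigmoid(x.mean(axis=0, keepdims=True))


def C(x):
    """Toy channel attention vector of shape (C, 1, 1); stand-in for the learned branch."""
    return sigmoid(x.mean(axis=(1, 2), keepdims=True))


def channel_then_spatial(x):
    xc = x * C(x)
    return xc * S(xc)           # x . C(x) . S(x . C(x))


def spatial_then_channel(x):
    xs = x * S(x)
    return xs * C(xs)           # x . S(x) . C(x . S(x))


def parallel(x):
    return x * S(x) + x * C(x)  # x . S(x) + x . C(x)


def cbam_j(x):
    xs = x * S(x)
    return xs * (1.0 + C(xs))   # x . S(x) . (1 + C(x . S(x)))


rng = np.random.default_rng(0)
x = rng.standard_normal((8, 16, 16))  # hypothetical (channels, height, width) feature map
y = cbam_j(x)
print(y.shape)
```

Expanding the product shows that the proposed connection equals the Spatial + Channel output plus the spatially attended feature x · S(x), so it can be read as the sequential scheme with a residual path around the channel branch.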

Method	GoPro SSIM	GoPro PSNR/dB	Lai SSIM	Lai PSNR/dB	Köhler SSIM	Köhler PSNR/dB
Proposed method	0.9185	29.0284	0.6674	16.5491	0.7625	19.9943
DeblurGAN	0.8474	25.0200	0.6425	15.8905	0.7447	19.7570
DeblurGAN-v2(Inception)	0.9141	28.2701	0.6514	16.1121	0.7469	19.4994
DeblurGAN-v2(MobileNet)	0.8731	25.9644	0.6598	16.4073	0.7556	19.7882
SRN deblur net	0.9331	30.1513	0.6494	16.1000	0.7505	19.5238
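
The percentage gains quoted for the Lai and Köhler datasets can be reproduced from this table. The sketch below assumes that each gain is the relative improvement over the best competing method on the same dataset and metric; that reading is an inference, but it matches the reported figures. The dictionary layout and function name are illustrative.

```python
# (SSIM, PSNR in dB) values copied from the comparison table.
results = {
    "Proposed method":          {"Lai": (0.6674, 16.5491), "Köhler": (0.7625, 19.9943)},
    "DeblurGAN":                {"Lai": (0.6425, 15.8905), "Köhler": (0.7447, 19.7570)},
    "DeblurGAN-v2(Inception)":  {"Lai": (0.6514, 16.1121), "Köhler": (0.7469, 19.4994)},
    "DeblurGAN-v2(MobileNet)":  {"Lai": (0.6598, 16.4073), "Köhler": (0.7556, 19.7882)},
    "SRN deblur net":           {"Lai": (0.6494, 16.1000), "Köhler": (0.7505, 19.5238)},
}


def relative_gain(dataset, metric_index):
    """Percent gain of the proposed method over the best competing method."""
    proposed = results["Proposed method"][dataset][metric_index]
    best_other = max(
        scores[dataset][metric_index]
        for name, scores in results.items()
        if name != "Proposed method"
    )
    return 100.0 * (proposed - best_other) / best_other


gains = {
    (dataset, metric): round(relative_gain(dataset, index), 2)
    for dataset in ("Lai", "Köhler")
    for index, metric in enumerate(("SSIM", "PSNR"))
}
print(gains)
```

Under this reading the gains are 1.15% (SSIM) and 0.86% (PSNR) on Lai, and 0.91% (SSIM) and 1.04% (PSNR) on Köhler. On GoPro, the table shows SRN with higher SSIM and PSNR than the proposed method, so the quoted gains apply to the Lai and Köhler datasets only.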

Method	FLOPs/G	Size/MB	Time/s
Proposed method	261.19	12.3	0.206
DeblurGAN	678.29	45.6	0.694
DeblurGAN-v2(Inception)	411.34	244.7	0.212
DeblurGAN-v2(MobileNet)	43.75	13.6	0.068
SRN deblur net	1434.82	78.7	0.501
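
The "nearly 2.5 times" speed figure can be checked against the Time/s column. The sketch below assumes that the "similar method" used for comparison is SRN, the other multi-scale recurrent network in the table; the text does not say this explicitly.

```python
# Per-frame running time in seconds, copied from the efficiency table.
time_s = {
    "Proposed method": 0.206,
    "DeblurGAN": 0.694,
    "DeblurGAN-v2(Inception)": 0.212,
    "DeblurGAN-v2(MobileNet)": 0.068,
    "SRN deblur net": 0.501,
}

# Assumption: SRN is the "similar method" behind the quoted speed-up.
speedup_vs_srn = time_s["SRN deblur net"] / time_s["Proposed method"]

# Speed ratio of every other method relative to the proposed one (>1 means the proposed method is faster).
ratios = {
    name: round(t / time_s["Proposed method"], 2)
    for name, t in time_s.items()
    if name != "Proposed method"
}
print(round(speedup_vs_srn, 2), ratios)
```

The ratio against SRN is about 2.43, consistent with "nearly 2.5 times". The same column shows that DeblurGAN-v2 (MobileNet) is faster per frame than the proposed method, so the speed claim should be read as relative to SRN-style networks rather than to every method listed.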
