基于体素化图卷积网络的三维点云目标检测方法

赵毅强; 艾西丁·艾克白尔; 陈瑞; 周意遥; 张琦

doi:10.3788/IRLA20200500

基于体素化图卷积网络的三维点云目标检测方法

3D point cloud object detection method in view of voxel based on graph convolution network

摘要

摘要: 针对激光雷达点云的稀疏性和空间离散分布的特点，通过结合体素划分和图表示方法设计了新的图卷积特征提取模块，提出一种基于体素化图卷积神经网络的激光雷达三维点云目标检测算法。该方法通过消除传统3D卷积神经网络的计算冗余性，不仅提升了网络的目标检测能力，并且提高了点云拓扑信息的分析能力。文中设计的方法在KITTI公开数据集的车辆、行人、骑行者的3D目标检测和鸟瞰图目标检测任务的检测性能相比基准网络均有了有效提升，尤其在车辆3D目标检测任务上最高提升了13.75%。实验表明：该方法采用图卷积特征提取模块有效提高了网络整体检测性能和数据拓扑关系的学习能力，为三维点云目标检测任务提供了新的方法。

Abstract: In view of the sparsity and spatial discrete distribution of lidar point cloud, a graph convolution feature extraction module was designed by combining voxel partition and graph representation, and a 3D lidar point cloud object detection algorithm in view of voxel based graph convolution neural network was proposed. By eliminating the computational redundancy of the traditional 3D convolution neural network, this method not only improved the object detection ability of the network, but also improved the analysis ability of the point cloud topology information. Compared with the baseline network, the detection performance of vehicle, pedestrian and cyclist 3D object detection and bird’s eye view object detection tasks in KITTI public dataset were improved greatly, especially improved with 13.75% precision in 3D object detection task of vehicle at maximal. Experimental results show that the proposed method improves the detection performance of the network and the learning ability of data topological relationship via graph convolution feature extraction module, which provides a new method for 3D point cloud object detection task.

HTML全文

参考文献(11)

施引文献

资源附件(0)