融合UWB定位的矿井无人驾驶三维目标检测

3D object detection for mine autonomous driving integrated with UWB positioning

  • 摘要: 基于目标检测的障碍物感知是矿井无人驾驶的重要技术支撑。受类别不平衡和长程问题影响,矿井三维目标检测精度有限。针对该问题,提出一种基于改进体素化图注意力网络(VGAT)的三维目标检测网络:通过图注意力网络区分节点重要性,解决有效特征稀释问题;采用最邻近体素搜索方式,应对矿井点云的结构稀疏性;设计体素特征编码增强(VFE+)模块,解决体素内不同点分布可能被编码为相同特征的问题;设计体素特征编码补偿(VFE_C)模块,弥补体素划分的空间信息损失;优化损失函数,解决正负样本比例失衡、角度预测周期性歧义等问题。在此基础上,提出融合超宽带(UWB)定位与远距离点云上采样(PU)的矿井无人驾驶三维目标检测网络——UWB−PU−VGAT:利用UWB定位系统获取矿车、人员的位置先验信息,精准裁剪激光雷达点云以聚焦目标感兴趣区域,缓解类别不平衡问题;针对远距离稀疏点云,采用基于梯度下降的点云上采样(Grad−PU)网络生成高密度点云,解决长程检测难题。实验结果表明,UWB−PU−VGAT对矿车、矿工检测的平均精度分别达90.23%和83.67%,平均精度均值达86.95%,优于SECOND,PointRCNN等主流的点云三维目标检测网络,且帧率为32.3帧/s,满足无人驾驶实时避障需求。

     

    Abstract: Obstacle perception based on target detection is an important technical support for mine autonomous driving. Affected by class imbalance and long-distance detection problems, the accuracy of 3D object detection in mines is limited. To address this problem, a 3D object detection network based on an improved Voxel-Graph Attention Network (VGAT) was proposed. The graph attention network was used to distinguish node importance and alleviate the dilution of effective features. The nearest-neighbor voxel search method was adopted to cope with the structural sparsity of mine point clouds. A Voxel Feature Encoding Plus (VFE+) module was designed to solve the problem that different point distributions within voxels might be encoded into identical features. A Voxel Feature Encoding Compensation (VFE_C) module was designed to compensate for the loss of spatial information caused by voxel partitioning. The loss function was optimized to alleviate the imbalance between positive and negative samples and the periodic ambiguity in angle prediction. On this basis, a 3D object detection network for autonomous driving in mines integrating Ultra-Wideband (UWB) positioning and long-range Point Cloud Upsampling (PU), namely UWB-PU-VGAT, was proposed. The UWB positioning system was used to obtain prior location information of mine trucks and miners, and LiDAR point clouds were accurately cropped to focus on target regions of interest, thereby alleviating the class imbalance problem. For sparse long-range point clouds, the Gradient-descent-based Point Cloud Upsampling (Grad-PU) network was adopted to generate high-density point clouds and alleviate the long-range detection problem. Experimental results showed that the average precisions of UWB-PU-VGAT for mine trucks and miners reached 90.23% and 83.67%, respectively, and the mean average precision reached 86.95%, outperforming mainstream 3D point cloud object detection networks such as SECOND and PointRCNN. The frame rate reached 32.3 fps, meeting the real-time obstacle avoidance requirements of autonomous driving.

     

/

返回文章
返回