高级检索

用于铁路场景语义分割的改进动态图卷积神经网络

Improved Dynamic Graph Convolutional Neural Network for Semantic Segmentation of Railway Scenes

  • 摘要: 针对目前在铁路场景语义分割中存在的数据获取成本高、分割精度低、泛化能力差等问题,提出了一种基于改进动态图卷积神经网络的铁路场景语义分割方法.首先利用高分辨率的无人机采集铁路场景的多视角图像,并通过结构运动恢复与基于面片的多视角立体视觉算法生成铁路场景的三维点云;然后在动态图卷积神经网络中引入空间注意力模块,增强网络结构的分割精度与泛化性;最后通过改进后的图卷积神经网络对预处理后的铁路场景点云完成高精度的语义分割.实验阶段采用的铁路场景包括桥梁段、路基段与联络线,共计11个区域.以平均交并比为评价指标,与动态图卷积神经网络、PointNet++进行对比,研究结果表明,基于图像点云训练的改进动态图卷积神经网络对于铁路场景语义分割具有更高的精度,与动态图卷积神经网络、PointNet++相比,分割精度分别提高3.3个百分点与6.0个百分点,且具有更好的泛化能力.

     

    Abstract: A railway scene semantic segmentation method based on improved dynamic graph convolutional neural network is proposed to address the issues of high data acquisition cost, low segmentation accuracy, and poor generalization ability in current railway scene semantic segmentation. Firstly, high-resolution drones are used to capture multi view images of railway scenes, and 3D point clouds of railway scenes are generated through structural motion recovery and patch based multi view stereo vision algorithms. Then, a spatial attention module is introduced into the dynamic graph convolutional neural network to enhance the segmentation accuracy and generalization of the network structure. Finally, an improved graph convolutional neural network is used to achieve high-precision semantic segmentation of the pre-processed railway scene point cloud. The railway scenes used in the experimental stage include bridge sections, roadbed sections, and connecting lines, totaling 11 areas. Compared with dynamic graph convolutional neural networks and PointNet++, the research results show that the improved dynamic graph convolutional neural network based on image point cloud training has higher accuracy in semantic segmentation of railway scenes, with an average intersection over union ratio as the evaluation indicator. Compared with dynamic graph convolutional neural neural networks and PointNet++, the segmentation accuracy is improved by 3.3 percentage points and 6.0 percentage points, respectively, and it has better generalization ability.

     

/

返回文章
返回