基于语义重定位的语义分割并行网络

陈姝; 徐蕾; 邹北骥; 陈静

doi:10.3724/SP.J.1089.2022.18909

基于语义重定位的语义分割并行网络

Semantic Relocation Parallel Network for Semantic Segmentation

摘要

摘要: 语义分割任务是对图像进行像素级别的分类预测,其难点在于对像素级别的准确预测和物体的边缘划分.现有方法大多采用基于编解码结构的网络模型,通过下采样快速扩充网络的感受野,但连续的下采样对特征图的空间信息造成了不可逆转的损失,为此,提出一种基于语义重定位的并行网络.设计了一条全局空间路径,在保持高分辨率的情况下提取丰富的空间信息并缓解多次下采样带来的信息丢失.在另一条上下文信息提取路径中,采用一个特征提取器,通过快速下采样扩充网络的感受野.此外,设计基于物体类别的语义重定位模块弥补多次下采样造成的上下文信息缺失,使用粗分割结果中该类目标区域的所有像素分别对目标区域中的每个像素进行引导.同时,采用Dice loss缓解数据中存在的正负样本不平衡问题,以获得更好的分割性能.最后,在Cityscapes和CamVid数据集上对所提网络进行了评价.实验结果表明,与已有分割网络相比,在CamVid数据集上,SRPNet在mIoU指标上能提升3.1%,在Cityscapes数据集上,SRPNet在mIoU指标上能提升1.8%.

Abstract: Semantic segmentation is an essential issue in the computer vision field,the difficulty of which lies in the accurate prediction of the pixel level and the edge division of similar objects.The encoder-encoder structure is widely used in many methods to capture the global information of semantic objects.However,continuous subsampling causes irreversible loss of spatial information of the feature map.A parallel semantic relocation(SRPNet)based network is proposed.Specifically,a high-resolution global spatial path is designed to extract rich spatial information in which feature maps have high resolution.In feature extraction path,a powerful feature extractor is used to expand the receptive field by fast subsampling.Besides,a semantic relocation module(SRM)is designed to compensate for the lack of context information caused by multiple subsamples.Dice loss is employed to alleviate the imbalance of positive and negative samples in the dataset and obtain better segmentation performance.Finally,the proposed network is evaluated on the Cityscapes and CamVid dataset.The results show that SRPNet can improve the previous best result by approximately 3.1%and 1.8%measured by mIoU on the CamVid dataset and the Cityscapes dataset,respectively.

HTML全文

参考文献(0)

施引文献

资源附件(0)