全局特征感知与融合的多层次蒸馏学习道路提取模型
Multilevel distillation learning road extraction model with global feature awareness and fusion
-
摘要: 为提取空间特征更细节和语义信息更全面的道路信息, 提升道路信息提取的推理速度, 在端到端的卷积神经网络(CNN)基础上, 提出一种结合空间注意力机制、全局信息感知和特征融合模块的道路提取模型. 首先利用空间注意力机制和全局信息感知模块获取道路特征的上下文信息, 提高浅层特征的空间信息表达能力; 然后构建顾及通道和语义信息的特征融合模块, 消除基于端到端的CNN中浅层和深层特征之间的语义差距, 完成跨层特征的有效融合; 最后使用多层次知识蒸馏学习策略减少并降低所提模型的网络参数和计算复杂度, 快速准确地获取遥感影像中的道路信息. 在公开的Deep Globe和马萨诸塞州2个卫星遥感影像道路数据集以及京津新城无人机遥感影像道路数据集上, 进行训练、验证和评估的结果表明, 所提模型是一种提取精度高、提取效果好的道路提取模型, 无论是卫星遥感数据源还是无人机遥感数据源均具有较好的道路信息提取能力, 其分别达到79.36%, 78.42%和84.27%, 皆优于文中对比的道路提取模型; 同时, 多层次知识蒸馏学习策略能显著地提升模型的精度和泛化能力, 其指标IOU值分别提高了0.29%, 0.77%和0.46%, 在模型精度和网络参数方面都取得了较优的效果, 具有广阔的应用前景.Abstract: Based on an end-to-end convolutional neural network (CNN), the paper proposed a road extraction model combining spatial attention mechanism, global information perception, and feature fusion module. It can extract road information with detailed spatial features and comprehensive semantic information and improve the inference speed of road information extraction. Firstly, the spatial attention mechanism and global information awareness module were used to obtain road contextual information and improve the spatial information representation of low-level features. Then the feature fusion module was built to consider the channel and semantic information to eliminate the semantic gap between low- and high-level features. Finally, the multilevel knowledge distillation learning strategy was used to reduce network parameters and computational complexity to obtain road information quickly and accurately. The training, validation, and evaluation were conducted on the Deep Globe, Massachusetts, and Beijing-Tianjin New Town remote sensing image road datasets. The results show that the proposed model provides high accuracy and a suitable extraction effect. It achieves excellent road information extraction ability both in satellite remote sensing data sources and UAV remote sensing data sources. The scores reach 79.36%, 78.42%, and 84.27%, respectively, outperforming other road extraction models. Meanwhile, the multilevel knowledge distillation learning strategy significantly improves the accuracy and generalization ability of the model. The index IOU values were improved by 0.29%, 0.77%, and 0.46%, respectively. It achieves better results regarding model accuracy and network parameters and has broad application prospects.