高级检索
张静, 靳淇兆, 王洪振, 达铖, 向世明, 潘春洪. 多尺度信息融合的遥感图像语义分割模型[J]. 计算机辅助设计与图形学学报, 2019, 31(9): 1509-1517. DOI: 10.3724/SP.J.1089.2019.17645
引用本文: 张静, 靳淇兆, 王洪振, 达铖, 向世明, 潘春洪. 多尺度信息融合的遥感图像语义分割模型[J]. 计算机辅助设计与图形学学报, 2019, 31(9): 1509-1517. DOI: 10.3724/SP.J.1089.2019.17645
Zhang Jing, Jin Qizhao, Wang Hongzhen, Da Cheng, Xiang Shiming, Pan Chunhong. Semantic Segmentation on Remote Sensing Images with Multi-Scale Feature Fusion[J]. Journal of Computer-Aided Design & Computer Graphics, 2019, 31(9): 1509-1517. DOI: 10.3724/SP.J.1089.2019.17645
Citation: Zhang Jing, Jin Qizhao, Wang Hongzhen, Da Cheng, Xiang Shiming, Pan Chunhong. Semantic Segmentation on Remote Sensing Images with Multi-Scale Feature Fusion[J]. Journal of Computer-Aided Design & Computer Graphics, 2019, 31(9): 1509-1517. DOI: 10.3724/SP.J.1089.2019.17645

多尺度信息融合的遥感图像语义分割模型

Semantic Segmentation on Remote Sensing Images with Multi-Scale Feature Fusion

  • 摘要: 遥感图像分割是目前学术界和工业界的一个研究热点,在城市规划、变化检测以及GIS信息构建等方面有着十分广泛的应用.然而,诸多复杂因素(如多变的尺度、多样化的拓扑形状以及复杂的背景和阴影等)使得遥感图像语义分割成为一项具有挑战性的任务.为此,提出一种基于多尺度信息融合的遥感图像语义分割深层卷积神经网络模型,该模型分为编码器和解码器2部分.在编码阶段,设计了基于DenseNet网络的跨卷积层级的多尺度特征融合策略,采用子区域全局平均池化及多尺度卷积处理复杂的背景区域;在解码阶段,为了准确地恢复图像的细节信息,设计了能够融合不同层级卷积特征的短解码器;最后,在整体模型构建方面设计了一种具有多输出的分层监督机制网络模型,从不同层级获取监督信息,可在充分利用监督信息的同时更好地引导网络的训练.在ISPRS公开数据集以及北京市遥感数据集上,通过实验验证了文中模型的有效性.

     

    Abstract: Semantic segmentation of remote sensing images has drawn extensive attention both from academy and industry for its wide range of applications,such as urban planning,urban change detection and geographic information system.Nevertheless,many complicated factors,such as complex background,shadows,objects with various scales,topological shapes and appearances in different regions,make this task quite challenging.Accordingly,this paper proposes a deep convolutional neural network model with multi-scale information fusion for semantic segmentation of remote sensing images.The structure of our model is composed of two parts:encoder and decoder.In the encoder part,a strategy is proposed to fuse multi-scale features based on DenseNet network.Specifically,global average pooling is first used to extract regional semantic information of different sub-regions to make network understand complex background in remote sensing images;sub-region global average pooling and multiscale convolution are then used to deal with complex background areas.In the decoder part,we design a shorter decoder which can fuse features from different levels of convolution to accurately restore the image details.For the overall model construction,we design a hierarchical monitoring mechanism with multiple outputs.This trick allows our model to obtain supervised information from different levels,which can help guide the training of the network.Extensive experiments on ISPRS benchmark datasets and Beijing remote sensing dataset demonstrate the effectiveness of our approach.

     

/

返回文章
返回