多类别的边缘感知方法在图像分割中的应用

董子昊; 邵秀丽

doi:10.3724/SP.J.1089.2019.17496

多类别的边缘感知方法在图像分割中的应用

A Multi-Category Edge Perception Method for Semantic Segmentation

摘要

摘要: 针对图像语义分割方法预测出的目标大多存在边缘模糊和准确度较低的问题,提出多类别边缘感知的图像分割方法.首先设计一种用于多目标分割的Multi-sigmoid 损失函数,结合COCO 数据集预训练的FCN+CRF 网络,建立可优化类别边界的语义分割模型;然后在全局嵌套边缘检测(HED)模型的基础上,增加自底向上的信息解码部分,利用亚像素(subpixel)的图像增强算法实现上采样以及相邻尺度之间的特征融合,构建出可用于边缘检测的深度多尺度编解码模型(MSDF);最后将FCN+CRF 提取到的分割信息作为一元势, MSDF 检测到的边缘特征作为二元势,设计全局能量函数并计算最小值,实现分割结果的进一步优化.在2 个标准数据集Pascal context 和SIFT Flow 上进行了实验,结果表明,该模型的总体性能较为优越,可应用在图像语义分割和显著性目标检测等相关领域.

Abstract: The image semantic segmentation methods always had edge blurring and low accuracy problems. Addressing these problems, this paper proposes a multi-category edge perception method. Firstly, our novel Multi-sigmoid loss function is used for multi-category objects segmentation, which is combined with the pretrained FCN-CRF network on the COCO dataset to build a sematic segmentation model that can optimize the class boundaries. Then, the bottom-up information decoding subnet is added to HED model, the image enhancement algorithm called subpixel is employed to up-sample and fuse features between adjacent scales. Through above methods, we construct the edge detection model called Multi-Scale encode and decode Fusion (MSDF) model. Finally, the segmentation information extracted from FCN-CRF is taken as unary potentials and edge features detected from MSDF are taken as pairwise potential, the global energy function consists of these two components, which is minimized to further optimize the segmentation results. Subjective experiments are performed on the two most common datasets, including Pascal-context and SIFT Flow. The results show that the proposed model performs well, which can be applied to the relevant area such as image sematic segmentation and salient object detection.

HTML全文

参考文献(0)

施引文献

资源附件(0)