高级检索
郑柏伦, 冼楚华, 张东九. 融合RGB图像特征的多尺度深度图像补全方法[J]. 计算机辅助设计与图形学学报, 2021, 33(9): 1407-1417. DOI: 10.3724/SP.J.1089.2021.18861
引用本文: 郑柏伦, 冼楚华, 张东九. 融合RGB图像特征的多尺度深度图像补全方法[J]. 计算机辅助设计与图形学学报, 2021, 33(9): 1407-1417. DOI: 10.3724/SP.J.1089.2021.18861
Zheng Bolun, Xian Chuhua, Zhang Dongjiu. Multi-Scale Fusion Networks with RGB Image Features for Depth Map Comple-tion[J]. Journal of Computer-Aided Design & Computer Graphics, 2021, 33(9): 1407-1417. DOI: 10.3724/SP.J.1089.2021.18861
Citation: Zheng Bolun, Xian Chuhua, Zhang Dongjiu. Multi-Scale Fusion Networks with RGB Image Features for Depth Map Comple-tion[J]. Journal of Computer-Aided Design & Computer Graphics, 2021, 33(9): 1407-1417. DOI: 10.3724/SP.J.1089.2021.18861

融合RGB图像特征的多尺度深度图像补全方法

Multi-Scale Fusion Networks with RGB Image Features for Depth Map Comple-tion

  • 摘要: 针对目前因缺少配对的“缺失-完整”RGB-D数据集而不能直接训练端对端深度图像补全模型的问题,提出基于随机掩码构造对应的缺失-完整数据,结合真实数据集与合成数据集交替训练模型的策略.基于随机掩码生成不同缺失比例的深度图像,并且利用合成数据集构造具有可靠真值的深度图像缺失数据,从而得到具有可靠数据的缺失-完整RGB-D数据集.以此策略为基础,搭建融合对应RGB图像特征的多尺度深度图像补全网络,该网络分别从RGB图像特征提取分支和深度图像特征提取分支提取不同尺度的RGB图像特征和深度图像特征,再经过特征融合分支在不同尺度上对RGB图像特征和深度图像特征进行融合,进而能够充分地学习RGB图像丰富的语义信息和深度图像的信息补全缺失深度.在NYU-Depth V2数据集的实验表明,该方法在不同缺失比例的深度图像补全任务中,阈值精度平均值为0.98,平均相对误差约为0.061,与现有基于神经网络和优化稀疏方程组的方法相比,其在阈值精度上平均提升了0.02,平均相对误差平均下降了0.027.

     

    Abstract: Currently,researchers cannot directly train end-to-end model for depth image completion because of lacking paired“incomplete-complete”RGB-D datasets.To address this problem,a random mask-based method which is combined with“real-synthetic”data for joint training strategy is proposed to construct paired incom-plete-complete RGB-D data.This method generates depth maps with different missing ratios based on random masks,and uses synthetic scene datasets to construct missing regions of depth map with high fidelity truth values.Based on this strategy,a multi-scale depth map completion network is constructed,which fuses the corresponding RGB image features.The proposed network extracts RGB image features and depth map features with different scales from RGB image branches and depth map branches.Then,in the feature fusion branch,the RGB image features and depth map features are fused at different scales,which makes that the rich semantic features of RGB images and the information of depth maps can be effectively integrated for depth map completion.Experiments on the NYU-Depth V2 dataset show that in depth completion tasks with different missing ratios,the average threshold accuracy of this method is 0.98,and the mean relative error is about 0.061.Compared with the existing methods based on neural networks and optimizing sparse equations,this method improves the threshold accuracy by an average of 0.02,and the mean relative error decreases by an average of 0.027.

     

/

返回文章
返回