高级检索
冉清, 冯结青. 人体前景的自动抠图算法[J]. 计算机辅助设计与图形学学报, 2020, 32(2): 277-286. DOI: 10.3724/SP.J.1089.2020.17926
引用本文: 冉清, 冯结青. 人体前景的自动抠图算法[J]. 计算机辅助设计与图形学学报, 2020, 32(2): 277-286. DOI: 10.3724/SP.J.1089.2020.17926
Ran Qing, Feng Jieqing. Automatic Human Body Foreground Matting Algorithm[J]. Journal of Computer-Aided Design & Computer Graphics, 2020, 32(2): 277-286. DOI: 10.3724/SP.J.1089.2020.17926
Citation: Ran Qing, Feng Jieqing. Automatic Human Body Foreground Matting Algorithm[J]. Journal of Computer-Aided Design & Computer Graphics, 2020, 32(2): 277-286. DOI: 10.3724/SP.J.1089.2020.17926

人体前景的自动抠图算法

Automatic Human Body Foreground Matting Algorithm

  • 摘要: 在基于立体视觉的人体建模系统中,背景像素的移除可以减少不必要的立体匹配计算,提高人体模型重建效率.为此,在给定大量具有前景Alpha蒙板真值的人体图像作为训练数据的前提下,提出了一个端到端的深度学习网络,以实现系统采集图像中人体前景自动抠图.该深度学习网络包括2个阶段:人体前景分割阶段和人体前景Alpha抠图阶段.在人体前景分割阶段,采用Mask R-CNN网络中的目标检测和掩码生成2个负载,并结合训练数据进行迁移学习,得到了适用于人体前景二值化分割的模型网络.在人体前景Alpha抠图阶段,采用Encoder-Decoder网络架构实现Alpha蒙板的自动预测.首先引入核为5的非学习卷积层,以上一个阶段的二值化分割结果作为输入,自动得到三分图Trimap,再和人体前景训练数据一起作为此阶段抠图网络的输入;经过学习迭代,获得能够预测人体前景Alpha蒙板的模型网络.在实验部分,以单幅系统采集人体图像为输入,无需额外先验和人工交互,可以自动估计人体前景Alpha掩码结果.用户测试结果以及与其他方法的对比和分析证明了文中算法的可靠性和鲁棒性;同时,该自动抠图算法还对其他公开数据集的人体图像进行了掩码预测,实验结果表明该算法具有一定的泛化能力.

     

    Abstract: In human body modeling system via stereo vision,removing background pixels could reduce the computational costs of stereo matching and improve human model reconstruction efficiency.For this purpose,an end-to-end deep learning network is proposed to automatically estimate Alpha mask of human body foreground in captured images,where a large number of ground-truth Alpha masks of human body foreground are given.The proposed network includes two stages:human body foreground segmentation stage and Alpha matting of human body foreground stage.In the first stage,a tailored Mask R-CNN,where its object detection and mask regression are reserved,is fined-tuned with the prepared training data to generate a binary mask of the human body foreground.In the second stage,an Encoder-Decoder network is used to address the task of Alpha matting.First,a non-trainable convolutional layer with a kernel size of 5 is applied to the binary mask to generate a Trimap of human body foreground as an input of this stage.After iteratively training with the Trimap data and the human body foreground data,the network can estimate the human foreground alpha mask.In experimental part,taking an acquisition image of human body as an input,the proposed network could output the foreground Alpha matting result fully automatically.Experimental results and detailed comparison with other commonly used matting algorithms are provided to demonstrate accuracy and robustness of the proposed method from both qualitative and quantitative aspects.Meanwhile,the proposed algorithm is also applied to some public data sets.The matting result demonstrates that the proposed method could deal with general images containing human body foreground.

     

/

返回文章
返回