联合注意力和条件GAN的被遮挡人体姿态和体形估计方法

朱妍; 汪楷; 汪粼波; 方贤勇

doi:10.3724/SP.J.1089.2024.19863

联合注意力和条件GAN的被遮挡人体姿态和体形估计方法

Pose and Shape Estimation of Occluded Humans with Attention and Conditional GAN

摘要

摘要: 基于图像的人体姿态和体形估计常常因人体被遮挡而充满挑战.为此,提出一种基于单幅图像的姿态和体形估计方法.首先提出多尺度的注意力模块策略,输出具有丰富上下文信息的多尺度注意力特征,以有效地获得不受遮挡影响的全局的姿态和体形分布;然后提出基于热图的条件生成对抗网络策略,将由关节热图得到的姿态估计作为约束,实现网格精细调整;最后借助这2个策略得到的姿态和体形估计方法实现全局预测和局部细节求精的结合.在Ubuntu环境下,在3DPW,3DOH50K和Human3.6M公开数据集上的实验结果表明,与SMPLify,GraphCMR和SPIN等方法相比,所提方法在身体部分被遮挡时重建效果更好,并在ACK,AVE和PA-MPJPE等定量评价指标上取得了更好的结果.

Abstract: The occlusions of body parts often appear in the images, which makes the human pose and shape estimation from single images difficult. This paper proposes a single-image oriented framework to tackle this problem, where two effective tactics are proposed. One is a multi-scale attention module which generates the enhanced multi-scale attention features with rich contextual information, so that efficient global pose and shape distribution can be obtained without the affection of occlusion. The other is heatmap based conditional generative adversarial networks (GAN) which utilize the poses from the joint heatmaps as constraints and thus can refine the mesh of the occluded subject accurately. Combining these two tactics can make the proposed human pose and shape estimation method robustly recover the body meshes with both global prediction and local details. Qualitative and quantitative experiments with the training based on public datasets show the efficiency of the proposed method for occluded humans.

HTML全文

参考文献(41)

施引文献

资源附件(0)