基于法线贴图增强隐式函数的单图像三维头部重建

刘乐元; 刘旭; 孙见弛; 陈靓影

基于法线贴图增强隐式函数的单图像三维头部重建

Single-image 3D Head Reconstruction Using Normals Enhanced Implicit Function

摘要

摘要: 摘要: 三维头部重建是建构元宇宙的基础技术之一, 在影视制作、游戏娱乐、智能教育等领域也具备广阔的应用前景. 从单图像重建三维头部模型可以最大程度地节省成本并大幅提高操作的便捷性. 然而, 单图像三维头部重建是一个非适定问题, 现有方法普遍存在重建的三维头部模型保真度低、细节少以及算法泛化能力差等问题. 为此, 文中提出了一种基于法线贴图增强隐式函数的单图像三维头部重建方法. 首先, 设计了一个法线贴图估计子网络从单幅图像估计头部法线贴图; 其次, 将三维头部模型表面看作由隐式函数描述的水平集, 建立了端到端的深度神经网络, 从输入图像及法线贴图中提取视觉特征并判别三维空间中各点位于该水平集等值面的概率. 文中方法在FCH (FaceScape、CoMA、HeadSpace) 和FaceVerse公共数据集上所重建三维头部模型的平均倒角距离分别为0.769 6 mm和1.308 0 mm, 大幅优于现有单图像三维头部重建方法; 在2个从互联网采集的数据集上的实验结果也表明, 文中方法能从包含不同年龄、人种、性别、面部表情的单幅肖像图像中重建出具备高保真度和丰富细节的三维头部模型, 且具备强大泛化能力.

Abstract: Abstract: 3D head reconstruction is one of the fundamental techniques for building the metaverse, and it also has broad applications in the fields of film and cartoon creation, game design, and intelligent education. Compared to modeling 3D heads manually by artists or capturing head scans using 3D scanners or stereo imaging systems, reconstructing 3D heads from a single image is much more economical and practical. However, single-image 3D head reconstruction is an ill-posed problem, and existing methods often suffer from low fidelity, fewer details, and poor generalization ability. To this end, this paper proposes a single-image 3D head reconstruction method using normals enhanced implicit function. First, a deep network for estimating normal maps of heads from a single image is designed. Second, the surface of a 3D head model is defined as a level set described by an implicit function, and an end-to-end deep neural network is built to extract visual features from the input image and the estimated normal map and predict whether a 3D point lies on the head surface or not. Our method achieves Chamfer distances of 0.769 6 mm and 1.308 0 mm respectively on the FCH (FaceScape、CoMA、HeadSpace) and FaceVerse datasets, which outperform other single-image 3D head reconstruction methods by a large margin; experimental results on images collected from the Internet also show that our method can reconstruct 3D head models with high fidelity and rich details from single images with different ages, human races, genders, and facial expressions, and has a strong generalization capability.

HTML全文

参考文献(0)

施引文献

资源附件(0)