高级检索

抵抗第二次人脸属性编辑的不可感知主动防御算法

Imperceptible Proactive Defense against Second Facial Attribute Editing

  • 摘要: 人脸伪造的主动防御技术通过向待保护人脸添加难以察觉的扰动来破坏伪造模型的生成能力. 最近的潜在对抗性探索(latent adversarial exploration, LAE)算法实现了较好扰动不可感知性, 但其生成防御人脸的语义信息易被改变且其防御输出易被第二次编辑, 为此提出一种抵抗第二次人脸属性编辑的不可感知主动防御算法. 针对一些人脸所有者无法接受的人脸语义信息改变的问题, 将LAE中非可逆的编码器-生成器结构替换为正交的离散小波变换, 并在离散小波变换域施加扰动; 针对成功防御后的无效输出易被伪造模型第二次成功编辑的问题, 将LAE的无效攻击替换为无目标攻击. 此外, 为了提高防御人脸的视觉质量, 利用人眼视觉系统对亮度通道扰动更敏感的特点, 在YCbCr色彩空间的色度通道中加入扰动; 在提升防御人脸的通用性方面, 采用权重动态更新的集成策略进行训练. 在CelebA-HQ数据集上与一些主流算法进行实验的结果表明, 相比于2种非集成算法, 所提算法针对5个人脸属性编辑模型的平均攻击成功率提升了约30%~43%, 相比于3种集成算法则提升了约6%~9%, 更好地平衡了扰动不可感知性和防御通用性.

     

    Abstract: Proactive defense against face forgery disrupts the generative ability of forgery models by adding imper-ceptible perturbations to the faces to be protected. The recent latent adversarial exploration (LAE) algo-rithm achieves better perturbation imperceptibility but the semantic information of its defensed faces is prone to be altered and its nullifying output after successful defense is prone to be edited by the forgery models. Therefore, this paper proposes an imperceptible proactive defense algorithm against second facial attribute editing. To address the problem of face semantic information alteration that is unacceptable to some face owners, the incompletely reversible encoder-generator structure in LAE is replaced by an or-thogonal discrete wavelet transform, and the perturbations are performed in the discrete wavelet transform domain; to address the problem that the nullifying outputs after successful defense are easily edited by the forgery models again, the nullifying attack in LAE is replaced by the non-targeted attack. Furthermore, to improve the visual quality of the defensed faces, the perturbations are added in chrominance channels of YCbCr color space because the human visual system is more sensitive to the perturbations in luminance channel; to increase the universality of the defensed faces, an ensemble strategy with dynamically updated weights is used for training. Experiments on the CelebA-HQ dataset with some mainstream algorithms show that the proposed algorithm improves the average attack success rate of the five face attribute editing mod-els by about 30%~43% compared to the two non-integrated algorithms, and 6%~9% compared to the three integrated algorithms, which better balances the imperceptibility of perturbation and the universality of defensed faces.

     

/

返回文章
返回