Spatial Domain Enhanced Channel Adaptive Deepfake Image Detection Method
-
摘要: 针对现有的深度伪造检测方法缺少关注图像空间域信息及模型复杂度较高的问题,提出一种使用空间域特征增强的通道自适应深度伪造图像检测方法.首先提取图像空间域信息并归一化空域特征图,将空域信息作为第4个通道传入网络;其次,在主干网络中加入SE-Layer模块,对4个通道的权重进行重建,解决通道间的异构性问题;最后,设计了半自动的预训练策略,进一步提升了模型训练效率和准确率.以深度伪造人脸检测为例,在7个不同来源的数据集上进行检测实验.结果表明,即使在未使用数据增强技术的情况下,文中方法优于基线方法,在效果最差的StyleGAN2数据集上准确率也达到99.60%,AP指标达到98.00%.Abstract: Aiming at the problems that the current deepfake detection method lacks attention to the spatial domain information of the image and the high complexity of the model, a channel adaptive deepfake image detection method using spatial domain feature enhancement is proposed. Firstly, spatial domain features of the images are extracted and normalized, which will be passed into the network as the fourth channel; secondly, SE-Layer module is incorporated into the backbone network to reconstruct the weights of the four channels, effectively addressing the heterogeneity issue between channels; finally, a semi-automatic pre-training strategy is designed to further improve the training efficiency and accuracy of the model. Taking deepfake face detection as an example, the detection experiment is conducted on seven datasets from different sources. The results show that even without data augmentation, the method in this article is better than the baseline method, with an accuracy of 99.60% and an AP index of 98.00% on the StyleGAN2 dataset with the worst performance.
-
Keywords:
- deepfake detection /
- spatial domain feature /
- channel adaptive network /
- pre-training
-
-
[1] Westerlund M. The emergence of deepfake technology:a review[J]. Technology Innovation Management Review, 2019, 9(11):39-52
[2] Wei Xiaoyan, Zuo Xinlan, Dan Zhiping, et al. FCR-CNN model to improve selection performance of image manipulation detection region[J]. Journal of Computer-Aided Design&Computer Graphics, 2021, 33(4):560-568(in Chinese)(魏晓燕,左鑫兰,但志平,等.提高图像篡改检测区域选取性能的FCR-CNN模型[J].计算机辅助设计与图形学学报, 2021, 33(4):560-568) [3] Liu X H, Liu Y J, Chen J, et al. PSCC-Net:progressive spatio-channel correlation network for image manipulation detection and localization[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2022, 32(11):7505-7517
[4] Gragnaniello D, Cozzolino D, Marra F, et al. Are GAN generated images easy to detect?A critical analysis of the state-of-the-art[C]//Proceedings of the IEEE International Conference on Multimedia and Expo. Los Alamitos:IEEE Computer Society Press, 2021:1-6
[5] Wang S Y, Wang O, Zhang R, et al. CNN-generated images are surprisingly easy to spot… for now[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Los Alamitos:IEEE Computer Society Press, 2020:8692-8701
[6] Li W C, He P S, Li H L, et al. Detection of GAN-generated images by estimating artifact similarity[J]. IEEE Signal Processing Letters, 2022, 29:862-866
[7] Wang B, Wu X H, Tang Y L, et al. Frequency domain filtered residual network for deepfake detection[J]. Mathematics, 2023, 11(4):Article No.816
[8] Hu J, Shen L, Sun G. Squeeze-and-Excitation networks[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Los Alamitos:IEEE Computer Society Press, 2018:7132-7141
[9] Chollet F. Xception:deep learning with depthwise separable convolutions[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Los Alamitos:IEEE Computer Society Press, 2017:1800-1807
[10] Liu H G, Li X D, Zhou W B, et al. Spatial-phase shallow learning:rethinking face forgery detection in frequency domain[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Los Alamitos:IEEE Computer Society Press, 2021:772-781
[11] Odena A, Dumoulin V, Olah C. Deconvolution and checkerboard artifacts[EB/OL].[2023-05-23] . https://distill.pub/2016/deconv-checkerboard
[12] Wesselkamp V, Rieck K, Arp D, et al. Misleading deep-fake detection with GAN fingerprints[C]//Proceedings of the IEEE Security and Privacy Workshops. Los Alamitos:IEEE Computer Society Press, 2022:59-65
[13] Patel Y, Tanwar S, Bhattacharya P, et al. An improved dense CNN architecture for deepfake image detection[J]. IEEE Access, 2023, 11:22081-22095
[14] Guarnera L, Giudice O, Guarnera F, et al. The face deepfake detection challenge[J]. Journal of Imaging, 2022, 8(10):Article No.263
[15] Liu Z W, Luo P, Wang X G, et al. Deep learning face attributes in the wild[C]//Proceedings of the IEEE International Conference on Computer Vision. Los Alamitos:IEEE Computer Society Press, 2015:3730-3738
[16] Karras T, Laine S, Aila T. A style-based generator architecture for generative adversarial networks[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Los Alamitos:IEEE Computer Society Press, 2019:4396-4405
[17] Choi Y, Choi M, Kim M, et al. StarGAN:unified generative adversarial networks for multi-domain image-to-image translation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Los Alamitos:IEEE Computer Society Press, 2018:8789-8797
[18] Cho W, Choi S, Park D K, et al. Image-to-image translation via group-wise deep whitening-and-coloring transformation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Los Alamitos:IEEE Computer Society Press, 2019:10631-10639
[19] He Z L, Zuo W M, Kan M N, et al. AttGAN:facial attribute editing by only changing what you want[J]. IEEE Transactions on Image Processing, 2019, 28(11):5464-5478
[20] Karras T, Laine S, Aittala M, et al. Analyzing and improving the image quality of StyleGAN[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Los Alamitos:IEEE Computer Society Press, 2020:8107-8116
[21] Carreira J, Zisserman A. Quo vadis, action recognition?A new model and the Kinetics dataset[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Los Alamitos:IEEE Computer Society Press, 2017:4724-4733
计量
- 文章访问数: 194
- HTML全文浏览量: 22
- PDF下载量: 70