Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Comparison of CNN, ResNet50, and Xception for Deepfake Image Detection

View through CrossRef
This study compares the performance of three deep learning architectures—Convolutional Neural Network , ResNet50, and Xception—for frame-based deepfake image detection and identifies the most effective model in terms of accuracy, precision, recall, F1-score, and generalization. The study followed the Knowledge Discovery in Databases (KDD) framework using the Deepfake Detection Dataset (DFD Entire Original) from Kaggle, which consists of 3,432 videos, including 3,068 fake and 364 real videos. Videos were converted into frames using OpenCV, followed by face detection and cropping using MTCNN. The resulting face images were resized to 224×224 pixels, normalized, augmented, and labeled. To reduce classification bias caused by class imbalance, the training data were balanced using random undersampling, resulting in real frames and  fake frames. The dataset was then split into training, validation, and testing sets using a stratified 60:20:20 ratio. The results show that Xception achieved the best performance among the three models, with an accuracy of 95.21%, precision of 0.95, recall of 0.95, and F1-score of 0.95, followed by ResNet50 with an accuracy of 93.42% and CNN with an accuracy of 87.65%. These findings indicate that transfer learning-based architectures, particularly Xception, are more effective than conventional CNNs for deepfake image detection under a consistent experimental setting. This study is limited to a single dataset and frame-based evaluation, thus future work will explore the potential of hybrid models, such as Vision Transformer (ViT) combined with Capsule Networks , to improve detection performance and address challenges like temporal analysis and cross-dataset validation.
Title: Comparison of CNN, ResNet50, and Xception for Deepfake Image Detection
Description:
This study compares the performance of three deep learning architectures—Convolutional Neural Network , ResNet50, and Xception—for frame-based deepfake image detection and identifies the most effective model in terms of accuracy, precision, recall, F1-score, and generalization.
The study followed the Knowledge Discovery in Databases (KDD) framework using the Deepfake Detection Dataset (DFD Entire Original) from Kaggle, which consists of 3,432 videos, including 3,068 fake and 364 real videos.
Videos were converted into frames using OpenCV, followed by face detection and cropping using MTCNN.
The resulting face images were resized to 224×224 pixels, normalized, augmented, and labeled.
To reduce classification bias caused by class imbalance, the training data were balanced using random undersampling, resulting in real frames and  fake frames.
The dataset was then split into training, validation, and testing sets using a stratified 60:20:20 ratio.
The results show that Xception achieved the best performance among the three models, with an accuracy of 95.
21%, precision of 0.
95, recall of 0.
95, and F1-score of 0.
95, followed by ResNet50 with an accuracy of 93.
42% and CNN with an accuracy of 87.
65%.
These findings indicate that transfer learning-based architectures, particularly Xception, are more effective than conventional CNNs for deepfake image detection under a consistent experimental setting.
This study is limited to a single dataset and frame-based evaluation, thus future work will explore the potential of hybrid models, such as Vision Transformer (ViT) combined with Capsule Networks , to improve detection performance and address challenges like temporal analysis and cross-dataset validation.

Related Results

Evaluating the Threshold of Authenticity in Deepfake Audio and Its Implications Within Criminal Justice
Evaluating the Threshold of Authenticity in Deepfake Audio and Its Implications Within Criminal Justice
Deepfake technology has come a long way in recent years and the world has already seen cases where it has been used maliciously. After a deepfake of UK independent financial adviso...
Deepfake attack prevention using steganography GANs
Deepfake attack prevention using steganography GANs
Background Deepfakes are fake images or videos generated by deep learning algorithms. Ongoing progress in deep learning techniques like auto-encoders and generative adversarial net...
Analysis of deepfake crime trends using BIGKinds
Analysis of deepfake crime trends using BIGKinds
This study is significant for analyzing criminal trends using deepfake technology based on media reports. A total of 478 articles related to crimes using deepfake technology were e...
Deepfake Detection using Deep Learning with InceptionV3
Deepfake Detection using Deep Learning with InceptionV3
Deepfake technology has rapidly evolved, making it increasingly difficult to distinguish between real and manipulated videos. This poses serious risks, including misinformation, id...
Deepfake Detection with Choquet Fuzzy Integral
Deepfake Detection with Choquet Fuzzy Integral
Deep forgery has been spreading quite quickly in recent years and continues to develop. The development of deep forgery has been used in films. This development and spread have beg...
An Intelligent System for Analysing and Detecting Deepfake Videos: A Deep Learning Approach
An Intelligent System for Analysing and Detecting Deepfake Videos: A Deep Learning Approach
The swift rise of Artificial Intelligence (AI) has brought about remarkable technological progress in numerous fields such as media, entertainment, and communication. Among the var...
A New Deepfake Detection Method Based on Compound Scaling Dual-Stream Attention Network
A New Deepfake Detection Method Based on Compound Scaling Dual-Stream Attention Network
INTRODUCTION: Deepfake technology allows for the overlaying of existing images or videos onto target images or videos. The misuse of this technology has led to increasing complexit...
Deepfake image detection and classification model using Bayesian deep learning with coronavirus herd immunity optimizer
Deepfake image detection and classification model using Bayesian deep learning with coronavirus herd immunity optimizer
<p>Deepfake images are combined media constructed from deep learning (DL) methods, usually Generative Adversarial Networks (GANs), to manipulate visual content, often giving ...

Back to Top