Javascript must be enabled to continue!

GenConViT: Deepfake Video Detection Using Generative Convolutional Vision Transformer

Deepfakes have raised significant concerns due to their potential to spread false information and compromise the integrity of digital media. Current deepfake detection models often struggle to generalize across a diverse range of deepfake generation techniques and video content. In this work, we propose a Generative Convolutional Vision Transformer (GenConViT) for deepfake video detection. Our model combines ConvNeXt and Swin Transformer models for feature extraction, and it utilizes an Autoencoder and Variational Autoencoder to learn from latent data distributions. By learning from the visual artifacts and latent data distribution, GenConViT achieves an improved performance in detecting a wide range of deepfake videos. The model is trained and evaluated on DFDC, FF++, TM, DeepfakeTIMIT, and Celeb-DF (v2) datasets. The proposed GenConViT model demonstrates strong performance in deepfake video detection, achieving high accuracy across the tested datasets. While our model shows promising results in deepfake video detection by leveraging visual and latent features, we demonstrate that further work is needed to improve its generalizability when encountering out-of-distribution data. Our model provides an effective solution for identifying a wide range of fake videos while preserving the integrity of media.

MDPI AG

Deressa Wodajo Deressa Hannes Mareen Peter Lambert Solomon Atnafu Zahid Akhtar Glenn Van Wallendael

Applied Sciences

2025

Title: GenConViT: Deepfake Video Detection Using Generative Convolutional Vision Transformer

Description:

Deepfakes have raised significant concerns due to their potential to spread false information and compromise the integrity of digital media.

Current deepfake detection models often struggle to generalize across a diverse range of deepfake generation techniques and video content.

In this work, we propose a Generative Convolutional Vision Transformer (GenConViT) for deepfake video detection.

Our model combines ConvNeXt and Swin Transformer models for feature extraction, and it utilizes an Autoencoder and Variational Autoencoder to learn from latent data distributions.

By learning from the visual artifacts and latent data distribution, GenConViT achieves an improved performance in detecting a wide range of deepfake videos.

The model is trained and evaluated on DFDC, FF++, TM, DeepfakeTIMIT, and Celeb-DF (v2) datasets.

The proposed GenConViT model demonstrates strong performance in deepfake video detection, achieving high accuracy across the tested datasets.

While our model shows promising results in deepfake video detection by leveraging visual and latent features, we demonstrate that further work is needed to improve its generalizability when encountering out-of-distribution data.

Our model provides an effective solution for identifying a wide range of fake videos while preserving the integrity of media.

Back

Deepfake technology has come a long way in recent years and the world has already seen cases where it has been used maliciously. After a deepfake of UK independent financial adviso...

Automatic Load Sharing of Transformer

Transformer plays a major role in the power system. It works 24 hours a day and provides power to the load. The transformer is excessive full, its windings are overheated which lea...

Deepfake Detection using Deep Learning with InceptionV3

Deepfake technology has rapidly evolved, making it increasingly difficult to distinguish between real and manipulated videos. This poses serious risks, including misinformation, id...

Depth-aware salient object segmentation

Object segmentation is an important task which is widely employed in many computer vision applications such as object detection, tracking, recognition, and ret...

Deepfake Detection with Choquet Fuzzy Integral

Deep forgery has been spreading quite quickly in recent years and continues to develop. The development of deep forgery has been used in films. This development and spread have beg...

A New Deepfake Detection Method Based on Compound Scaling Dual-Stream Attention Network

INTRODUCTION: Deepfake technology allows for the overlaying of existing images or videos onto target images or videos. The misuse of this technology has led to increasing complexit...

High frequency modeling of power transformers under transients

This thesis presents the results related to high frequency modeling of power transformers. First, a 25kVA distribution transformer under lightning surges is tested in the laborator...

Detecting Deepfake Media with AI and ML

Deepfake technology has rapidly advanced, enabling the creation of highly realistic yet manipulated digital media. These artificial videos and images pose significant risks to digi...

Email:
Password:

Email:

GenConViT: Deepfake Video Detection Using Generative Convolutional Vision Transformer

Related Results