Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Deep Fake Audio Detection Using Deep Learning

View through CrossRef
With the proliferation of deep learning-based voice synthesis technologies, deep fake audio has become an emerging cybersecurity and ethical threat. This project proposes a deep learning-based system for the detection of deep fake audio using advanced neural network architectures. The model leverages spectrogram analysis and convolutional neural networks (CNNs) to learn distinguishing features between authentic and synthesized audio. A dataset consisting of both real and AI-generated audio samples was compiled, preprocessed, and used to train and evaluate the model. Feature extraction techniques, such as Mel-frequency cepstral coefficients (MFCC) and log-mel spectrograms, were employed to convert audio into image-like representations for input into the CNN. Experimental results indicate that the model achieves high classification accuracy in detecting deep fake audio across various speakers and synthetic voice generation techniques. This research demonstrates the potential of deep learning in combating the challenges posed by deep fake technologies. Future work may involve incorporating recurrent neural networks (RNNs) for better temporal feature analysis, as well as expanding the model’s robustness across languages and background noise conditions.
Title: Deep Fake Audio Detection Using Deep Learning
Description:
With the proliferation of deep learning-based voice synthesis technologies, deep fake audio has become an emerging cybersecurity and ethical threat.
This project proposes a deep learning-based system for the detection of deep fake audio using advanced neural network architectures.
The model leverages spectrogram analysis and convolutional neural networks (CNNs) to learn distinguishing features between authentic and synthesized audio.
A dataset consisting of both real and AI-generated audio samples was compiled, preprocessed, and used to train and evaluate the model.
Feature extraction techniques, such as Mel-frequency cepstral coefficients (MFCC) and log-mel spectrograms, were employed to convert audio into image-like representations for input into the CNN.
Experimental results indicate that the model achieves high classification accuracy in detecting deep fake audio across various speakers and synthetic voice generation techniques.
This research demonstrates the potential of deep learning in combating the challenges posed by deep fake technologies.
Future work may involve incorporating recurrent neural networks (RNNs) for better temporal feature analysis, as well as expanding the model’s robustness across languages and background noise conditions.

Related Results

Feature selection for multimodal: acoustic event detection
Feature selection for multimodal: acoustic event detection
The detection of the Acoustic Events (AEs) naturally produced in a meeting room may help to describe the human and social activity. The automatic description of interactions betwee...
Analisis Saddu Dzari’ah terhadap Penggunaan Aplikasi Fake Global Positioning System (GPS) pada Shopeefood Driver
Analisis Saddu Dzari’ah terhadap Penggunaan Aplikasi Fake Global Positioning System (GPS) pada Shopeefood Driver
Abstract. Shopee is a company engaged in online-based buying and selling services. One of the latest features of Shopee is the ShopeeFood service and has standard rules that must b...
Deep Learning for Forgery Face Detection Using Fuzzy Fisher Capsule Dual Graph
Deep Learning for Forgery Face Detection Using Fuzzy Fisher Capsule Dual Graph
In digital manipulation, creating fake images/videos or swapping face images/videos with another person is done by using a deep learning algorithm is termed deep fake. Fake pornogr...
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
The pandemic Covid-19 currently demands teachers to be able to use technology in teaching and learning process. But in reality there are still many teachers who have not been able ...
Deep Fake Audio Detection Using Deep Learning
Deep Fake Audio Detection Using Deep Learning
This project is titled as “Deep Fake Audio Detection Using Deep Learning”. The rapid advancement of artificial intelligence and deep learning technologies, deepfake audio has emerg...
DISCOURSE: KNOWLEDGE, NEWS, AND FAKE INTERTWINED
DISCOURSE: KNOWLEDGE, NEWS, AND FAKE INTERTWINED
Discourse has been a focal point for linguists over an extended period. The multidisciplinary character of the term ‘discourse’ has resulted in diverse approaches aiming to define ...
Effects of Intervention Timing on Health-Related Fake News: Simulation Study
Effects of Intervention Timing on Health-Related Fake News: Simulation Study
Background Fake health-related news has spread rapidly through the internet, causing harm to individuals and society. Despite interventions, a fenbendazole scan...
Effects of Intervention Timing on Health-Related Fake News: Simulation Study (Preprint)
Effects of Intervention Timing on Health-Related Fake News: Simulation Study (Preprint)
BACKGROUND Fake health-related news has spread rapidly through the internet, causing harm to individuals and society. Despite interventions, a fenbendazole ...

Back to Top