
Detecting deepfakes using emotional irregularities

Recent advances in deep learning have spawned a new class of media forgeries known as deepfakes, which typically consist of artificially generated human faces or voices. The creation and distribution of deepfakes raise many legal and ethical concerns. As a result, the ability to distinguish between deepfakes and authentic media is vital. Since deepfake generation methods are numerous and rapidly evolving, it is unrealistic to hope that a single method will be able to detect all deepfakes. Rather, a multiplicity of detection methods is needed, so that the burden of creating an undetectable deepfake becomes extremely high. While deepfakes can create plausible video and audio, it may be challenging for them to accurately synthesize high-level concepts such as emotion. Unnatural displays of emotion, as measured by valence and arousal, can provide important evidence that a video has been synthesized. In this work, we propose a novel method for detecting deepfakes of a human speaker using the emotion predicted from the face and voice. First, we train long short-term memory (LSTM) networks to predict emotion from low-level descriptors (LLDs) of audio and video of human speakers from the SEMAINE database. Then, we use the trained networks as feature extractors to extract sequences of valence and arousal from videos in a subset of the Deepfake Detection Challenge (DFDC) dataset. Finally, we classify the videos as authentic videos or deepfakes with high accuracy using features derived from the predicted emotion.
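The abstract does not say which features are derived from the predicted emotion or which classifier is used, so the following is only a rough sketch of the last two steps under stated assumptions. It assumes the two trained networks have already produced per-frame (valence, arousal) predictions for the face and the voice of one video. The function names `emotion_features` and `looks_synthetic`, the chosen summary statistics, and the thresholds are all hypothetical and are not taken from the paper.

```python
from statistics import mean, pstdev


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / (var_x * var_y) ** 0.5


def emotion_features(face, voice):
    """Summarise per-frame (valence, arousal) predictions from face and voice.

    `face` and `voice` are equal-length lists of (valence, arousal) pairs.
    They stand in for the outputs of the two emotion networks; the feature
    set below is an illustrative assumption, not the paper's feature set.
    """
    if len(face) != len(voice) or not face:
        raise ValueError("need two non-empty, equal-length sequences")
    features = {}
    for idx, name in enumerate(("valence", "arousal")):
        f = [frame[idx] for frame in face]
        v = [frame[idx] for frame in voice]
        features[f"{name}_face_mean"] = mean(f)
        features[f"{name}_face_std"] = pstdev(f)
        features[f"{name}_voice_mean"] = mean(v)
        features[f"{name}_voice_std"] = pstdev(v)
        # Cross-modal disagreement: average gap and correlation between
        # what the face and the voice appear to express.
        features[f"{name}_gap"] = mean([abs(a - b) for a, b in zip(f, v)])
        features[f"{name}_corr"] = pearson(f, v)
    return features


def looks_synthetic(features, max_gap=0.25, min_corr=0.0):
    """Toy decision rule (hypothetical thresholds).

    A placeholder for a trained classifier: flags a video when face and
    voice emotion disagree strongly on either dimension.
    """
    for name in ("valence", "arousal"):
        if features[f"{name}_gap"] > max_gap or features[f"{name}_corr"] < min_corr:
            return True
    return False
```

In practice the feature dictionary would more plausibly be fed to a classifier trained on labelled videos; the threshold rule is only there to make the final classification step concrete.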

Related Results

Deepfakes and the epistemic apocalypse
It is widely thought that deepfake videos are a significant and unprecedented threat to our epistemic practices. In some writing about deepfakes, manipulated videos appear ...
The Socio-Political Implications of Deepfakes in Developing Countries
Highly realistic media created through Artificial Intelligence and Deep Learning, commonly known as deepfakes, presents a serious risk to political stability and the integrity of i...
Tax Aggressiveness and Accounting and Financial Irregularities in Brazil
This paper aimed to analyse whether tax aggressiveness increases a company's probability of incurring accounting and financial irregularities. It was used as a quantitative and desc...
The effects of magnetic storm phases on F layer irregularities below the auroral oval
Through the study of two periods in September and October 1981 it was possible to observe F layer irregularity development and intensity primarily over subauroral latitudes in the ...
Emotional Intelligence: Understanding, Assessing, and Cultivating the Key to Personal and Professional Success
A complicated idea that has received a lot of attention in workplace behavior and psychology is emotional intelligence (EI). It includes the capacity to recognize, comprehend, cont...
Statistical Study on the Effect of Meridional Neutral Wind on the Occurrence of Post‐Sunset Equatorial Ionospheric Irregularities
The navigation and radio communication systems experience significant disruptions due to post‐sunset equatorial ionospheric irregularities. There is ongoing debate regardin...
Kabir Wealth Code: The combined effect of emotional awareness (EA) and intelligence quotient (IQ)
Background: Intelligence quotient (IQ) is a measure of intellectual ability of performing, comprehension, and learning. Previous studies reported that intelligence measures predict...
