Javascript must be enabled to continue!
DDA-MSLD: A Multi-Feature Speech Lie Detection Algorithm Based on a Dual-Stream Deep Architecture
View through CrossRef
Speech lie detection is a technique that analyzes speech signals in detail to determine whether a speaker is lying. It has significant application value and has attracted attention from various fields. However, existing speech lie detection algorithms still have certain limitations. These algorithms fail to fully explore manually extracted features based on prior knowledge and also neglect the dynamic characteristics of speech as well as the impact of temporal context, resulting in reduced detection accuracy and generalization. To address these issues, this paper proposes a multi-feature speech lie detection algorithm based on the dual-stream deep architecture (DDA-MSLD).This algorithm employs a dual-stream structure to learn different types of features simultaneously. Firstly, it combines a gated recurrent unit (GRU) network with the attention mechanism. This combination enables the network to more comprehensively capture the context of speech signals and focus on the parts that are more critical for lie detection. It can perform in-depth sequence pattern analysis on manually extracted static prosodic features and nonlinear dynamic features, obtaining high-order dynamic features related to lies. Secondly, the encoder part of the transformer is used to simultaneously capture the macroscopic structure and microscopic details of speech signals, specifically for high-precision feature extraction of Mel spectrogram features of speech signals, obtaining deep features related to lies. This dual-stream structure processes various features of speech simultaneously, describing the subjective state of speech signals from different perspectives and thereby improving detection accuracy and generalization. Experiments were conducted on the multi-person scenario lie detection dataset CSC, and the results show that this algorithm outperformed existing state-of-the-art algorithms in detection performance. Considering the significant differences in lie speech in different lying scenarios, and to further evaluate the algorithm’s generalization performance, a single-person scenario Chinese lie speech dataset Local was constructed, and experiments were conducted on it. The results indicate that the algorithm has a strong generalization ability in different scenarios.
Title: DDA-MSLD: A Multi-Feature Speech Lie Detection Algorithm Based on a Dual-Stream Deep Architecture
Description:
Speech lie detection is a technique that analyzes speech signals in detail to determine whether a speaker is lying.
It has significant application value and has attracted attention from various fields.
However, existing speech lie detection algorithms still have certain limitations.
These algorithms fail to fully explore manually extracted features based on prior knowledge and also neglect the dynamic characteristics of speech as well as the impact of temporal context, resulting in reduced detection accuracy and generalization.
To address these issues, this paper proposes a multi-feature speech lie detection algorithm based on the dual-stream deep architecture (DDA-MSLD).
This algorithm employs a dual-stream structure to learn different types of features simultaneously.
Firstly, it combines a gated recurrent unit (GRU) network with the attention mechanism.
This combination enables the network to more comprehensively capture the context of speech signals and focus on the parts that are more critical for lie detection.
It can perform in-depth sequence pattern analysis on manually extracted static prosodic features and nonlinear dynamic features, obtaining high-order dynamic features related to lies.
Secondly, the encoder part of the transformer is used to simultaneously capture the macroscopic structure and microscopic details of speech signals, specifically for high-precision feature extraction of Mel spectrogram features of speech signals, obtaining deep features related to lies.
This dual-stream structure processes various features of speech simultaneously, describing the subjective state of speech signals from different perspectives and thereby improving detection accuracy and generalization.
Experiments were conducted on the multi-person scenario lie detection dataset CSC, and the results show that this algorithm outperformed existing state-of-the-art algorithms in detection performance.
Considering the significant differences in lie speech in different lying scenarios, and to further evaluate the algorithm’s generalization performance, a single-person scenario Chinese lie speech dataset Local was constructed, and experiments were conducted on it.
The results indicate that the algorithm has a strong generalization ability in different scenarios.
Related Results
Hydrophobic Coating of Vegetable-Tanned Leather with Dodecylamine
Hydrophobic Coating of Vegetable-Tanned Leather with Dodecylamine
Vegetable-tanned leather exhibits a hydrophilic character due to the abundance of hydroxyl groups (-OH). In this study, the surface of vegetable-tanned leather was modified to beco...
Quasi-pre-Lie bialgebras and twisting of pre-Lie algebras
Quasi-pre-Lie bialgebras and twisting of pre-Lie algebras
Given a (quasi-)twilled pre-Lie algebra, we first construct a differential graded Lie algebra ([Formula: see text]-algebra). Then we study the twisting theory of (quasi-)twilled pr...
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
<p dir="ltr">Most individuals with Parkinson's disease (PD) experience changes in speech, voice or communication. Speech changes often manifest as hypokinetic dysarthria, a m...
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
<p dir="ltr">Most individuals with Parkinson's disease (PD) experience changes in speech, voice or communication. Speech changes often manifest as hypokinetic dysarthria, a m...
Speech, communication, and neuroimaging in Parkinson's disease : Characterisation and intervention outcomes
Speech, communication, and neuroimaging in Parkinson's disease : Characterisation and intervention outcomes
<p dir="ltr">Most individuals with Parkinson's disease (PD) experience changes in speech, voice or communication. Speech changes often manifest as hypokinetic dysarthria, a m...
MetaDIA: A Novel Database Reduction Strategy for DIA Human Gut Metaproteomics
MetaDIA: A Novel Database Reduction Strategy for DIA Human Gut Metaproteomics
AbstractBackgroundMicrobiomes, especially within the gut, are complex and may comprise hundreds of species. The identification of peptides in metaproteomics presents a significant ...
Self-Assembly of NaOL-DDA Mixtures in Aqueous Solution: A Molecular Dynamics Simulation Study
Self-Assembly of NaOL-DDA Mixtures in Aqueous Solution: A Molecular Dynamics Simulation Study
The self-assembly behaviors of sodium oleate (NaOL), dodecylamine (DDA), and their mixtures in aqueous solution were systematically investigated by large-scale molecular dynamics s...
The Neural Mechanisms of Private Speech in Second Language Learners’ Oral Production: An fNIRS Study
The Neural Mechanisms of Private Speech in Second Language Learners’ Oral Production: An fNIRS Study
Background: According to Vygotsky’s sociocultural theory, private speech functions both as a tool for thought regulation and as a transitional form between outer and inner speech. ...

