Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

An Attention-Erasing Stripe Pyramid Network for Face Forgery Detection

View through CrossRef
Abstract ace forgery detection aims to distinguish between real and fake facial images or videos by identifying manipulated or forged visual media. The main challenge in face forgery detection is achieving high model generalization ability, i.e., satisfactory performance under cross-database scenarios where the training and testing datasets are from different forgery methods. To achieve this goal, this paper presents an Attention-Erasing Stripe Pyramid Network (ASPNet) to utilize high-frequency noises and exploit both the RGB and fine-grained frequency clues. First, since separately extracting features from different scales and granularities will ignore their complementarity, we employ a Stripe Pyramid Block (SPB) to learn multi-scale and multi-granularity features simultaneously. Second, to make the model to focus on useful information and suppress noise, a Two-Stage Attention Block (TSAB) is introduced by combining spatial attention and channel attention to filter out the pixel-wise and channel-wise noise in the learned feature maps. Finally, to dynamically guide the model to pay attention to different areas of the human face, an Attention Erasing (AE) scheme is adopted by randomly erasing units in attention maps. Sufficient experiments demonstrate that ASPNet has superior performance than F3-Net on the FaceForensics++ dataset. The Area Under the Receiver Operating Characteristic Curve (AUC) and the Accuracy (ACC) of our model reach 77.4% and 70.85% respectively, whichare improved by 0.83% and 1.28% compared with F3-Net. Our code is available at: https://github.com/NWPU-Zwu.
Title: An Attention-Erasing Stripe Pyramid Network for Face Forgery Detection
Description:
Abstract ace forgery detection aims to distinguish between real and fake facial images or videos by identifying manipulated or forged visual media.
The main challenge in face forgery detection is achieving high model generalization ability, i.
e.
, satisfactory performance under cross-database scenarios where the training and testing datasets are from different forgery methods.
To achieve this goal, this paper presents an Attention-Erasing Stripe Pyramid Network (ASPNet) to utilize high-frequency noises and exploit both the RGB and fine-grained frequency clues.
First, since separately extracting features from different scales and granularities will ignore their complementarity, we employ a Stripe Pyramid Block (SPB) to learn multi-scale and multi-granularity features simultaneously.
Second, to make the model to focus on useful information and suppress noise, a Two-Stage Attention Block (TSAB) is introduced by combining spatial attention and channel attention to filter out the pixel-wise and channel-wise noise in the learned feature maps.
Finally, to dynamically guide the model to pay attention to different areas of the human face, an Attention Erasing (AE) scheme is adopted by randomly erasing units in attention maps.
Sufficient experiments demonstrate that ASPNet has superior performance than F3-Net on the FaceForensics++ dataset.
The Area Under the Receiver Operating Characteristic Curve (AUC) and the Accuracy (ACC) of our model reach 77.
4% and 70.
85% respectively, whichare improved by 0.
83% and 1.
28% compared with F3-Net.
Our code is available at: https://github.
com/NWPU-Zwu.

Related Results

CorrDetail: Visual Detail Enhanced Self-Correction for Face Forgery Detection
CorrDetail: Visual Detail Enhanced Self-Correction for Face Forgery Detection
With the swift progression of image generation technology, the widespread emergence of facial deepfakes poses significant challenges to the field of security, thus amplifying the u...
Hierarchical Categorization and Review of Recent Techniques on Image Forgery Detection
Hierarchical Categorization and Review of Recent Techniques on Image Forgery Detection
Abstract Information in the form of the image conveys more details than any other form of information. Several software packages are available to manipulate the imag...
Reinterpreting the Great Pyramid of Cholula, Mexico
Reinterpreting the Great Pyramid of Cholula, Mexico
AbstractThe Great Pyramid of Cholula is both the largest and oldest continuously occupied building in Mesoamerica. Initial occupation of the ceremonial precinct began in the Late F...
The network characteristics of classic red tourist attractions in Shaanxi province, China
The network characteristics of classic red tourist attractions in Shaanxi province, China
Red tourism is a distinctive form of tourism in China. Its network attention serves as a typical indicator to measure the level of promotion and publicity for red tourism, as well ...
Depth-aware salient object segmentation
Depth-aware salient object segmentation
Object segmentation is an important task which is widely employed in many computer vision applications such as object detection, tracking, recognition, and ret...
The Cultural Messages of Pyramid House in Palembang
The Cultural Messages of Pyramid House in Palembang
Abstract Pyramid House in Palembang is one of the works of art that is still very interesting to be researched today. The problem raised in this study aims to analyze the cu...
Identification of QTLs for Stripe Rust Resistance in a Recombinant Inbred Line Population
Identification of QTLs for Stripe Rust Resistance in a Recombinant Inbred Line Population
Stripe rust, caused by Puccinia striiformis f. sp. tritici (Pst), is one of the most devastating fungal diseases of wheat worldwide. It is essential to discover more sources of str...
EVALUASI PELAKSANAAN PEMBELAJARAN TATAP MUKA TERBATAS DENGAN MODEL CIPP
EVALUASI PELAKSANAAN PEMBELAJARAN TATAP MUKA TERBATAS DENGAN MODEL CIPP
This study aims to: (1) find out the context in limited face-to-face learning, (2) find out the input in limited face-to-face learning, (3) find out the process in limited face-to-...

Back to Top