Benchmarking Multi-dimensional AIGC Video Quality Assessment: A Dataset and Unified Model
In recent years, AI-driven video generation has gained significant attention due to great advancements in visual and language generative techniques. Consequently, there is a growing need for accurate Video Quality Assessment (VQA) metrics to evaluate the perceptual quality of AI-generated content (AIGC) videos and optimize video generation models. However, assessing the quality of AIGC videos remains a significant challenge because these videos often exhibit highly complex distortions, such as unnatural actions and irrational objects. To address this challenge, we systematically investigate the AIGC-VQA problem in this article, considering both subjective and objective quality assessment perspectives. For the subjective perspective, we construct the Large-scale Generated Video Quality Assessment (LGVQ) dataset, consisting of 2,808 AIGC videos generated by six video generation models using 468 carefully curated text prompts. Unlike previous subjective VQA experiments, we evaluate the perceptual quality of AIGC videos from three critical dimensions: spatial quality, temporal quality, and text-video alignment, which hold utmost importance for current video generation techniques. For the objective perspective, we establish a benchmark for evaluating existing quality assessment metrics on the LGVQ dataset. Our findings show that current metrics perform poorly on this dataset, highlighting a gap in effective evaluation tools. To bridge this gap, we propose the Unify Generated Video Quality Assessment (UGVQ) model, designed to accurately evaluate the multi-dimensional quality of AIGC videos. The UGVQ model integrates the visual and motion features of videos with the textual features of their corresponding prompts, forming a unified quality-aware feature representation tailored to AIGC videos. Experimental results demonstrate that UGVQ achieves state-of-the-art performance on the LGVQ dataset across all three quality dimensions, validating its effectiveness as an accurate quality metric for AIGC videos. We hope that our benchmark can promote the development of AIGC-VQA studies. Both the LGVQ dataset and the UGVQ model are publicly available on https://github.com/zczhang-sjtu/UGVQ.git.
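The abstract describes benchmarking objective metrics against subjective scores on three dimensions but does not state which agreement criteria are used. As a minimal sketch, assuming Spearman rank-order correlation (a common choice in VQA benchmarks, not confirmed by this abstract), per-dimension agreement could be computed as follows. The function names, dimension keys, and toy scores are hypothetical and are not taken from the LGVQ dataset or the UGVQ code.

```python
from math import sqrt

# Dataset size implied by the abstract: six generation models, 468 prompts each.
NUM_MODELS = 6
NUM_PROMPTS = 468
NUM_VIDEOS = NUM_MODELS * NUM_PROMPTS  # 2808, matching the stated 2,808 videos


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson linear correlation between two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)


def srcc(predicted, subjective):
    """Spearman correlation: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(predicted), average_ranks(subjective))


# Hypothetical toy scores for five videos (illustrative only, not LGVQ data).
subjective_scores = {
    "spatial": [3.1, 4.2, 2.5, 3.8, 1.9],
    "temporal": [2.8, 3.9, 3.0, 4.1, 2.2],
    "alignment": [4.0, 3.5, 2.1, 4.4, 3.0],
}
metric_predictions = {
    "spatial": [0.52, 0.71, 0.40, 0.66, 0.35],
    "temporal": [0.61, 0.58, 0.49, 0.80, 0.30],
    "alignment": [0.70, 0.72, 0.20, 0.90, 0.45],
}

for dimension, mos in subjective_scores.items():
    score = srcc(metric_predictions[dimension], mos)
    print(f"{dimension}: SRCC = {score:.3f}")
```

Reporting one correlation per dimension, rather than a single pooled value, mirrors the multi-dimensional setup described above: a metric may track spatial quality well while agreeing poorly with text-video alignment ratings.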
Association for Computing Machinery (ACM)
Related Results
Research on the Impact of AI-Generated Content Technology on Design Education in Chinese Universities
The purpose of this study is to investigate the impact of emerging AI Generated Content (AIGC) technology on teachers, students, and teaching in design and related majors in Chinese...
Strategic Integration of AIGC in Asian Elderly Fashion: Human-Centric Design Enhancement and Algorithmic Bias Neutralization
The advent of Artificial Intelligence Generated Content (AIGC) has catalyzed transformative shifts in the domain of fashion design, providing novel opportunities for customization ...
Innovative Design Research on Jiaodong Peninsula's Marine Folk Culture Based on AIGC
This study explores the digital innovation design pathways for the marine folk culture of the Jiaodong Peninsula through Artificial Intelligence Generated Content (AIGC) technology...
Evolving benchmarking practices: a review for research perspectives
Purpose: The purpose of this study is to review a major section of the literature on benchmarking practices in order to achieve better perspectives for emerging benchmarking research...
A Human-Centered AIGC Framework for Inclusive Fashion Design: Mitigating Bias for East Asian (Chinese) Elderly
This paper applies human-centered Artificial Intelligence-Generated Content (AIGC) techniques to fashion design for the East Asian elderly population, with a specific focus on the ...
Research on the Application of AIGC Technology in Three-Dimensional Animation Creation
In recent years, AIGC technology has seen widespread application within the field of animated content production. This paper focuses on exploring the application of AIGC technology...
Perceptions about benchmarking best practices among French managers: an exploratory survey
Purpose: The purpose of this study is to present a discussion on the most commonly accepted benchmarking norms in the USA, the lessons learned from benchmarking experiences and see h...
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
AI-generated content (AIGC) methods aim at producing text, images, videos, 3D assets, and other media using AI algorithms. Due to its wide range of applications and the potential o...

