Javascript must be enabled to continue!

Better Understanding: Stylized Image Captioning with Style Attention and Adversarial Training

Compared with traditional image captioning technology, stylized image captioning has broader application scenarios, such as a better understanding of images. However, stylized image captioning faces many challenges, the most important of which is how to make the model take into account both the image meta information and the style factor of the generated captions. In this paper, we propose a novel end-to-end stylized image captioning framework (ST-BR). Specifically, we first use a style transformer to model the factual information of images, and the style attention module learns style factor form a multi-style corpus, it is a symmetric structure on the whole. At the same time, we use back-reinforcement to evaluate the degree of consistency between the generated stylized captions with the image knowledge and specified style, respectively. These two parts further enhance the learning ability of the model through adversarial learning. Our experiment has achieved effective performance on the benchmark dataset.

MDPI AG

Zhenyu Yang Qiao Liu Guojing Liu

Symmetry

2020

Title: Better Understanding: Stylized Image Captioning with Style Attention and Adversarial Training

Description:

Compared with traditional image captioning technology, stylized image captioning has broader application scenarios, such as a better understanding of images.

However, stylized image captioning faces many challenges, the most important of which is how to make the model take into account both the image meta information and the style factor of the generated captions.

In this paper, we propose a novel end-to-end stylized image captioning framework (ST-BR).

Specifically, we first use a style transformer to model the factual information of images, and the style attention module learns style factor form a multi-style corpus, it is a symmetric structure on the whole.

At the same time, we use back-reinforcement to evaluate the degree of consistency between the generated stylized captions with the image knowledge and specified style, respectively.

These two parts further enhance the learning ability of the model through adversarial learning.

Our experiment has achieved effective performance on the benchmark dataset.

Back

<spa...

Crescimento de feijoeiro sob influência de carvão vegetal e esterco bovino

É indiscutível a import...

Hubungan Perilaku Pola Makan dengan Kejadian Anak Obesitas

<span style="font-size: 11.0pt; font-family: 'Times New Roman',serif; mso-fareast-font-family: 'Times New Roman'; mso-ansi-language: EN-US; mso-fareast-langua...

Increased life expectancy of heart failure patients in a rural center by a multidisciplinary program

Abstract Funding Acknowledgements Type of funding sources: None. INTRODUCTION Patients with heart failure (HF)...

=== PAPER RETRACTED === === PAPER RETRACTED === === PAPER RETRACTED === === PAPER RETRACTED === === PAPER RETRACTED === === PAPER RETRACTED === Knowledge of the Problem and Intention to Act on Student Environmentally Responsible Behavior

=== PAPER RETRACTED === </span...

Image Captioning with External Knowledge

This dissertation is dedicated to image captioning, the task of automatically generating a natural language description of a given image. Most modern automatic caption generators a...

A Comprehensive Survey on Image Captioning for Indian Languages: Techniques, Datasets, and Challenges

Abstract In image captioning, we generate visual descriptions from an image. Image Cap-tioning requires identifying the key entity, feature, and association in an image. Th...

Even Star Decomposition of Complete Bipartite Graphs

A decomposition (<span style="font-family: 宋体; font-size: medi...

Email:
Password:

Email:

Better Understanding: Stylized Image Captioning with Style Attention and Adversarial Training

Related Results