Javascript must be enabled to continue!
Better Understanding: Stylized Image Captioning with Style Attention and Adversarial Training
View through CrossRef
Compared with traditional image captioning technology, stylized image captioning has broader application scenarios, such as a better understanding of images. However, stylized image captioning faces many challenges, the most important of which is how to make the model take into account both the image meta information and the style factor of the generated captions. In this paper, we propose a novel end-to-end stylized image captioning framework (ST-BR). Specifically, we first use a style transformer to model the factual information of images, and the style attention module learns style factor form a multi-style corpus, it is a symmetric structure on the whole. At the same time, we use back-reinforcement to evaluate the degree of consistency between the generated stylized captions with the image knowledge and specified style, respectively. These two parts further enhance the learning ability of the model through adversarial learning. Our experiment has achieved effective performance on the benchmark dataset.
Title: Better Understanding: Stylized Image Captioning with Style Attention and Adversarial Training
Description:
Compared with traditional image captioning technology, stylized image captioning has broader application scenarios, such as a better understanding of images.
However, stylized image captioning faces many challenges, the most important of which is how to make the model take into account both the image meta information and the style factor of the generated captions.
In this paper, we propose a novel end-to-end stylized image captioning framework (ST-BR).
Specifically, we first use a style transformer to model the factual information of images, and the style attention module learns style factor form a multi-style corpus, it is a symmetric structure on the whole.
At the same time, we use back-reinforcement to evaluate the degree of consistency between the generated stylized captions with the image knowledge and specified style, respectively.
These two parts further enhance the learning ability of the model through adversarial learning.
Our experiment has achieved effective performance on the benchmark dataset.
Related Results
On Flores Island, do "ape-men" still exist? https://www.sapiens.org/biology/flores-island-ape-men/
On Flores Island, do "ape-men" still exist? https://www.sapiens.org/biology/flores-island-ape-men/
<span style="font-size:11pt"><span style="background:#f9f9f4"><span style="line-height:normal"><span style="font-family:Calibri,sans-serif"><b><spa...
Crescimento de feijoeiro sob influência de carvão vegetal e esterco bovino
Crescimento de feijoeiro sob influência de carvão vegetal e esterco bovino
<p align="justify"><span style="color: #000000;"><span style="font-family: 'Times New Roman', serif;"><span><span lang="pt-BR">É indiscutível a import...
Hubungan Perilaku Pola Makan dengan Kejadian Anak Obesitas
Hubungan Perilaku Pola Makan dengan Kejadian Anak Obesitas
<p><em><span style="font-size: 11.0pt; font-family: 'Times New Roman',serif; mso-fareast-font-family: 'Times New Roman'; mso-ansi-language: EN-US; mso-fareast-langua...
Increased life expectancy of heart failure patients in a rural center by a multidisciplinary program
Increased life expectancy of heart failure patients in a rural center by a multidisciplinary program
Abstract
Funding Acknowledgements
Type of funding sources: None.
INTRODUCTION Patients with heart failure (HF)...
=== PAPER RETRACTED === === PAPER RETRACTED === === PAPER RETRACTED === === PAPER RETRACTED === === PAPER RETRACTED === === PAPER RETRACTED === Knowledge of the Problem and Intention to Act on Student Environmentally Responsible Behavior
=== PAPER RETRACTED === === PAPER RETRACTED === === PAPER RETRACTED === === PAPER RETRACTED === === PAPER RETRACTED === === PAPER RETRACTED === Knowledge of the Problem and Intention to Act on Student Environmentally Responsible Behavior
<p><span lang="IN"><span style="vertical-align: inherit;"><span style="vertical-align: inherit;">=== PAPER RETRACTED === </span></span></span...
Image Captioning with External Knowledge
Image Captioning with External Knowledge
This dissertation is dedicated to image captioning, the task of automatically generating a natural language description of a given image. Most modern automatic caption generators a...
A Comprehensive Survey on Image Captioning for Indian Languages: Techniques, Datasets, and Challenges
A Comprehensive Survey on Image Captioning for Indian Languages: Techniques, Datasets, and Challenges
Abstract
In image captioning, we generate visual descriptions from an image. Image Cap-tioning requires identifying the key entity, feature, and association in an image. Th...
Even Star Decomposition of Complete Bipartite Graphs
Even Star Decomposition of Complete Bipartite Graphs
<p><span lang="EN-US"><span style="font-family: 宋体; font-size: medium;">A decomposition (</span><span><span style="font-family: 宋体; font-size: medi...

