Image Captioning with Convolutional Neural Networks and Autoencoder-Transformer Model

This study examines image captioning with deep learning, combining convolutional neural networks (CNNs) with Transformers that use an encoder-decoder mechanism. It describes in detail the methodologies, algorithms, and procedures involved in captioning images, and it explores and implements the techniques found most effective at producing relevant captions. The aim is a detailed understanding of how Transformers and CNNs can be combined for this task. The methods and resources used are predefined CNN models, the COCO dataset, a Transformer with a BERT encoder and a GPT decoder, and machine learning algorithms for visualizing and analyzing model performance, with the goal of contributing to more accurate and effective image captioning models. The model's performance is assessed by evaluating and comparing metrics computed on the generated captions.
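The abstract does not name the metrics applied to the generated captions. As an illustration only, the sketch below assumes a clipped n-gram precision, the building block of BLEU-style scores, to show how a generated caption can be compared against reference captions. The function names and the example captions are hypothetical and are not taken from the study.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return the list of n-grams (as tuples) in a token list."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def clipped_precision(candidate, references, n=1):
    """Clipped n-gram precision of a candidate caption against references.

    Each candidate n-gram is credited at most as many times as it occurs
    in the single reference where it is most frequent. This is an assumed
    illustrative metric, not the one reported by the study.
    """
    cand_counts = Counter(ngrams(candidate.lower().split(), n))
    if not cand_counts:
        return 0.0

    # For every n-gram, keep the highest count seen in any one reference.
    max_ref = Counter()
    for ref in references:
        ref_counts = Counter(ngrams(ref.lower().split(), n))
        for gram, count in ref_counts.items():
            max_ref[gram] = max(max_ref[gram], count)

    clipped = sum(min(count, max_ref[gram]) for gram, count in cand_counts.items())
    return clipped / sum(cand_counts.values())


# Hypothetical captions for one image.
references = [
    "a dog is running on the grass",
    "the dog runs across a field",
]
candidate = "a dog runs on the grass"

unigram = clipped_precision(candidate, references, n=1)
bigram = clipped_precision(candidate, references, n=2)
print(f"unigram precision: {unigram:.2f}, bigram precision: {bigram:.2f}")
```

A full BLEU score would additionally combine several n-gram orders and apply a brevity penalty; this sketch covers only the precision term.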

International Journal of Experimental Research and Review

Selvani Deepthi Kavila Moni Sushma Deep Kavila Kanaka Raghu Sreerama Sai Harsha Vardhan Pittada Krishna Rupendra Singh Badugu Samatha Mahanty Rashmita

2024
