The automatic prediction of image captions is a very challenging task in natural language processing (NLP). Many studies have employed convolutional neural networks as encoders and decoders. Nevertheless, to accurately predict image captions, a model must comprehend the semantic relationships among the numerous objects available in the given image. Attention-based mechanism carries out a linear grouping of encoder and decoder state operations. It places equal emphasis on the semantic information that is found in the caption as well as the visual knowledge that is contained within a given image. In this research paper, we integrated the local attention approach with two pre-trained convolutional neural networks (CNN) known as VGG19 and Inception_V3 in order to provide the textual description of any given image. These models are employed as an encoder, while the recurrent neural network serves as the decoder in the system. Together with the attention mechanism, these encoders are capable of transmitting the semantic-context knowledge to the decoder and achieved the BLUE Score of 64.7.
Skip Nav Destination
Article navigation
22 August 2024
THE 1ST INTERNATIONAL CONFERENCE ON INNOVATIONS IN ENGINEERING, SCIENCE AND TECHNOLOGY FOR SUSTAINABLE DEVELOPMENT (ICEST 2023)
15–17 November 2023
Male’ City, Maldives
Research Article|
August 22 2024
Image captioning using attention network and machine learning approaches
Manju Pandey
Manju Pandey
a)
Department of Computer Applications
, NIT, Raipur, India
a)Corresponding author: [email protected]
Search for other works by this author on:
Manju Pandey
a)
Department of Computer Applications
, NIT, Raipur, India
a)Corresponding author: [email protected]
AIP Conf. Proc. 3245, 050003 (2024)
Citation
Manju Pandey; Image captioning using attention network and machine learning approaches. AIP Conf. Proc. 22 August 2024; 3245 (1): 050003. https://doi.org/10.1063/5.0232649
Download citation file:
Pay-Per-View Access
$40.00
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
9
Views
Citing articles via
Inkjet- and flextrail-printing of silicon polymer-based inks for local passivating contacts
Zohreh Kiaee, Andreas Lösel, et al.
The implementation of reflective assessment using Gibbs’ reflective cycle in assessing students’ writing skill
Lala Nurlatifah, Pupung Purnawarman, et al.
Effect of coupling agent type on the self-cleaning and anti-reflective behaviour of advance nanocoating for PV panels application
Taha Tareq Mohammed, Hadia Kadhim Judran, et al.
Related Content
Image caption generator using CNN & LSTM
AIP Conf. Proc. (September 2023)
Enhancing image captioning performance based on efficientnet B0 model and transformer encoder-decoder
AIP Conf. Proc. (March 2024)
Generating image captions based on deep learning and natural language processing
AIP Conf. Proc. (February 2025)
Deep learning model for automatic image captioning
AIP Conf. Proc. (May 2022)
Comprehensive study of 3D dense captioning
AIP Conf. Proc. (December 2024)