A Hindi Image Caption Generation Framework Using Deep Learning

Santosh Kumar Mishra 1
Rijul Dhir 1
Sriparna Saha 1
Pushpak Bhattacharyya 1
Publication typeJournal Article
Publication date2021-03-15
scimago Q2
wos Q3
SJR0.503
CiteScore5.1
Impact factor2.0
ISSN23754699, 23754702
General Computer Science
Abstract

Image captioning is the process of generating a textual description of an image that aims to describe the salient parts of the given image. It is an important problem, as it involves computer vision and natural language processing, where computer vision is used for understanding images, and natural language processing is used for language modeling. A lot of works have been done for image captioning for the English language. In this article, we have developed a model for image captioning in the Hindi language. Hindi is the official language of India, and it is the fourth most spoken language in the world, spoken in India and South Asia. To the best of our knowledge, this is the first attempt to generate image captions in the Hindi language. A dataset is manually created by translating well known MSCOCO dataset from English to Hindi. Finally, different types of attention-based architectures are developed for image captioning in the Hindi language. These attention mechanisms are new for the Hindi language, as those have never been used for the Hindi language. The obtained results of the proposed model are compared with several baselines in terms of BLEU scores, and the results show that our model performs better than others. Manual evaluation of the obtained captions in terms of adequacy and fluency also reveals the effectiveness of our proposed approach.

Availability of resources : The codes of the article are available at https://github.com/santosh1821cs03/Image_Captioning_Hindi_Language ; The dataset will be made available: http://www.iitp.ac.in/∼ai-nlp-ml/resources.html .

Found 
Found 

Top-30

Journals

1
2
3
4
5
6
7
8
9
ACM Transactions on Asian and Low-Resource Language Information Processing
9 publications, 27.27%
Communications in Computer and Information Science
3 publications, 9.09%
Lecture Notes in Networks and Systems
2 publications, 6.06%
Journal of Intelligent and Fuzzy Systems
1 publication, 3.03%
Computer Systems Science and Engineering
1 publication, 3.03%
Journal of Ambient Intelligence and Humanized Computing
1 publication, 3.03%
Displays
1 publication, 3.03%
Neural Computing and Applications
1 publication, 3.03%
Applied Mathematics and Nonlinear Sciences
1 publication, 3.03%
Handbook of Research on Innovations in Systems and Software Engineering
1 publication, 3.03%
International Journal of Multimedia Information Retrieval
1 publication, 3.03%
Information (Switzerland)
1 publication, 3.03%
Expert Systems with Applications
1 publication, 3.03%
Scientific Reports
1 publication, 3.03%
1
2
3
4
5
6
7
8
9

Publishers

2
4
6
8
10
Association for Computing Machinery (ACM)
10 publications, 30.3%
Springer Nature
9 publications, 27.27%
Institute of Electrical and Electronics Engineers (IEEE)
7 publications, 21.21%
Elsevier
2 publications, 6.06%
SAGE
1 publication, 3.03%
Tech Science Press
1 publication, 3.03%
Walter de Gruyter
1 publication, 3.03%
IGI Global
1 publication, 3.03%
MDPI
1 publication, 3.03%
2
4
6
8
10
  • We do not take into account publications without a DOI.
  • Statistics recalculated weekly.

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
33
Share
Cite this
GOST |
Cite this
GOST Copy
Mishra S. K. et al. A Hindi Image Caption Generation Framework Using Deep Learning // ACM Transactions on Asian and Low-Resource Language Information Processing. 2021. Vol. 20. No. 2. pp. 1-19.
GOST all authors (up to 50) Copy
Mishra S. K., Dhir R., Saha S., Bhattacharyya P. A Hindi Image Caption Generation Framework Using Deep Learning // ACM Transactions on Asian and Low-Resource Language Information Processing. 2021. Vol. 20. No. 2. pp. 1-19.
RIS |
Cite this
RIS Copy
TY - JOUR
DO - 10.1145/3432246
UR - https://doi.org/10.1145/3432246
TI - A Hindi Image Caption Generation Framework Using Deep Learning
T2 - ACM Transactions on Asian and Low-Resource Language Information Processing
AU - Mishra, Santosh Kumar
AU - Dhir, Rijul
AU - Saha, Sriparna
AU - Bhattacharyya, Pushpak
PY - 2021
DA - 2021/03/15
PB - Association for Computing Machinery (ACM)
SP - 1-19
IS - 2
VL - 20
SN - 2375-4699
SN - 2375-4702
ER -
BibTex |
Cite this
BibTex (up to 50 authors) Copy
@article{2021_Mishra,
author = {Santosh Kumar Mishra and Rijul Dhir and Sriparna Saha and Pushpak Bhattacharyya},
title = {A Hindi Image Caption Generation Framework Using Deep Learning},
journal = {ACM Transactions on Asian and Low-Resource Language Information Processing},
year = {2021},
volume = {20},
publisher = {Association for Computing Machinery (ACM)},
month = {mar},
url = {https://doi.org/10.1145/3432246},
number = {2},
pages = {1--19},
doi = {10.1145/3432246}
}
MLA
Cite this
MLA Copy
Mishra, Santosh Kumar, et al. “A Hindi Image Caption Generation Framework Using Deep Learning.” ACM Transactions on Asian and Low-Resource Language Information Processing, vol. 20, no. 2, Mar. 2021, pp. 1-19. https://doi.org/10.1145/3432246.