Open Access
Lecture Notes in Computer Science, volume 13068 LNAI, pages 31-45
Question Answering for Visual Navigation in Human-Centered Environments
Publication type: Book Chapter
Publication date: 2021-10-20
Journal: Lecture Notes in Computer Science
Quartile SCImago: Q3
Quartile WOS: n/a
Impact factor: n/a
ISSN: 03029743, 16113349, 18612075, 18612083
Abstract
In this paper, we propose the HISNav VQA dataset, a challenging dataset for the Visual Question Answering task that is aimed at the needs of Visual Navigation in human-centered environments. The dataset consists of images of various room scenes captured in the Habitat virtual environment, paired with questions that matter for navigation tasks relying only on visual information. We also propose a baseline for the HISNav VQA dataset, a Vector Semiotic Architecture, and demonstrate its performance. The Vector Semiotic Architecture combines a Sign-Based World Model with Vector Symbolic Architectures. The Sign-Based World Model represents various aspects of an agent's knowledge, while Vector Symbolic Architectures operate at the low computational level. The Vector Semiotic Architecture addresses the symbol grounding problem, which plays an important role in the Visual Question Answering task.
Citations by journals
- ACM Computing Surveys: 1 publication, 50%
- Lecture Notes in Computer Science: 1 publication, 50%
Citations by publishers
- Association for Computing Machinery (ACM): 1 publication, 50%
- Springer Nature: 1 publication, 50%
- We do not take into account publications without a DOI.
- Statistics are recalculated only for publications connected to researchers, organizations and labs registered on the platform.
- Statistics are recalculated weekly.
Citations per year
- 2023: 2 citations
GOST
Kirilenko D. E. et al. Question Answering for Visual Navigation in Human-Centered Environments // Lecture Notes in Computer Science. 2021. Vol. 13068 LNAI. pp. 31-45.
GOST all authors (up to 50)
Kirilenko D. E., Kovalev A. K., Osipov E., Panov A. Question Answering for Visual Navigation in Human-Centered Environments // Lecture Notes in Computer Science. 2021. Vol. 13068 LNAI. pp. 31-45.
RIS
TY  - CHAP
DO  - 10.1007/978-3-030-89820-5_3
UR  - https://doi.org/10.1007%2F978-3-030-89820-5_3
TI  - Question Answering for Visual Navigation in Human-Centered Environments
T2  - Lecture Notes in Computer Science
AU  - Kirilenko, Daniil E
AU  - Kovalev, Alexey K
AU  - Osipov, Evgeny
AU  - Panov, Aleksandr
PY  - 2021
DA  - 2021/10/20
PB  - Springer Nature
SP  - 31
EP  - 45
VL  - 13068 LNAI
SN  - 0302-9743
SN  - 1611-3349
SN  - 1861-2075
SN  - 1861-2083
ER  -
BibTex
@incollection{2021_Kirilenko,
  author = {Daniil E Kirilenko and Alexey K Kovalev and Evgeny Osipov and Aleksandr Panov},
  title = {Question Answering for Visual Navigation in Human-Centered Environments},
  booktitle = {Lecture Notes in Computer Science},
  publisher = {Springer Nature},
  year = {2021},
  volume = {13068 LNAI},
  pages = {31--45},
  month = {oct},
  doi = {10.1007/978-3-030-89820-5_3}
}