Open Access
Lecture Notes in Computer Science, volume 13068 LNAI, pages 31-45

Question Answering for Visual Navigation in Human-Centered Environments

Publication type: Book Chapter
Publication date: 2021-10-20
Quartile SCImago: Q3
ISSN: 0302-9743, 1611-3349, 1861-2075, 1861-2083
Abstract
In this paper, we propose the HISNav VQA dataset, a challenging dataset for the Visual Question Answering task aimed at the needs of Visual Navigation in human-centered environments. The dataset consists of images of various room scenes captured in the Habitat virtual environment and of questions important for navigation tasks that use only visual information. We also propose a baseline for the HISNav VQA dataset, a Vector Semiotic Architecture, and demonstrate its performance. The Vector Semiotic Architecture combines a Sign-Based World Model with Vector Symbolic Architectures. The Sign-Based World Model allows representing various aspects of an agent's knowledge, while Vector Symbolic Architectures operate at a low computational level. The Vector Semiotic Architecture addresses the symbol grounding problem, which plays an important role in the Visual Question Answering task.

Citations by journals

ACM Computing Surveys: 1 publication, 50%
Lecture Notes in Computer Science: 1 publication, 50%

Citations by publishers

Association for Computing Machinery (ACM): 1 publication, 50%
Springer Nature: 1 publication, 50%
  • Publications without a DOI are not taken into account.
  • Statistics are recalculated only for publications connected to researchers, organizations, and labs registered on the platform.
  • Statistics are recalculated weekly.
Cite this
GOST
Kirilenko D. E. et al. Question Answering for Visual Navigation in Human-Centered Environments // Lecture Notes in Computer Science. 2021. Vol. 13068 LNAI. pp. 31-45.
GOST, all authors (up to 50)
Kirilenko D. E., Kovalev A. K., Osipov E., Panov A. Question Answering for Visual Navigation in Human-Centered Environments // Lecture Notes in Computer Science. 2021. Vol. 13068 LNAI. pp. 31-45.
RIS
TY  - CHAP
DO  - 10.1007/978-3-030-89820-5_3
UR  - https://doi.org/10.1007/978-3-030-89820-5_3
TI  - Question Answering for Visual Navigation in Human-Centered Environments
T2  - Lecture Notes in Computer Science
AU  - Kirilenko, Daniil E.
AU  - Kovalev, Alexey K.
AU  - Osipov, Evgeny
AU  - Panov, Aleksandr
PY  - 2021
DA  - 2021/10/20
PB  - Springer Nature
SP  - 31
EP  - 45
VL  - 13068 LNAI
SN  - 0302-9743
SN  - 1611-3349
SN  - 1861-2075
SN  - 1861-2083
ER  - 
BibTeX
@incollection{2021_Kirilenko,
author = {Daniil E. Kirilenko and Alexey K. Kovalev and Evgeny Osipov and Aleksandr Panov},
title = {Question Answering for Visual Navigation in Human-Centered Environments},
series = {Lecture Notes in Computer Science},
publisher = {Springer Nature},
year = {2021},
volume = {13068 LNAI},
pages = {31--45},
month = oct,
doi = {10.1007/978-3-030-89820-5_3}
}