Open Access

Lecture Notes in Computer Science, volume 13068 LNAI, pages 31-45

Question Answering for Visual Navigation in Human-Centered Environments

Kirilenko Daniil E ¹

Kovalev Alexey K ^{2, 3}

Osipov Evgeny ⁴

Panov Aleksandr ^{1, 3}

Hide authors affiliations Show authors affiliations: 4 affiliations

Moscow Institute of Physics and Technology, Moscow, russia |

Hse University, Moscow, Russia |

Artificial Intelligence Research Institute FRC CSC RAS, Moscow, Russia

Artificial Intelligence Research Institute

Federal Research Center Computer Science and Control of the Russian Academy of Sciences

⁴

Lulea University of Technology, Lulea, Sweden |

Publication type: Book Chapter

Publication date: 2021-10-20

Springer Nature

Journal: Lecture Notes in Computer Science

Quartile SCImago

Quartile WOS

—

Impact factor: —

ISSN: 03029743, 16113349, 18612075, 18612083

DOI: 10.1007/978-3-030-89820-5_3

Copy DOI

Abstract

In this paper, we propose an HISNav VQA dataset – a challenging dataset for a Visual Question Answering task that is aimed at the needs of Visual Navigation in human-centered environments. The dataset consists of images of various room scenes that were captured using the Habitat virtual environment and of questions important for navigation tasks using only visual information. We also propose a baseline for a HISNav VQA dataset, a Vector Semiotic Architecture, and demonstrate its performance. The Vector Semiotic Architecture is a combination of a Sign-Based World Model and Vector Symbolic Architectures. The Sign-Based World Model allows representing various aspects of an agent’s knowledge, and Vector Symbolic Architectures serve on a low computational level. The Vector Semiotic Architecture addresses the symbol grounding problem that plays an important role in the Visual Question Answering Task.

By date By citations

Citations by journals

	1
ACM Computing Surveys	ACM Computing Surveys, 1, 50% ACM Computing Surveys 1 publication, 50%
Lecture Notes in Computer Science	Lecture Notes in Computer Science, 1, 50% Lecture Notes in Computer Science 1 publication, 50%
	1

Citations by publishers

	1
Association for Computing Machinery (ACM)	Association for Computing Machinery (ACM), 1, 50% Association for Computing Machinery (ACM) 1 publication, 50%
Springer Nature	Springer Nature, 1, 50% Springer Nature 1 publication, 50%
	1

We do not take into account publications that without a DOI.
Statistics recalculated only for publications connected to researchers, organizations and labs registered on the platform.
Statistics recalculated weekly.

{"yearsCitations":{"type":"bar","data":{"show":true,"labels":[2023],"ids":[0],"codes":[0],"imageUrls":[""],"datasets":[{"label":"Citations number","data":[2],"backgroundColor":["#3B82F6"],"percentage":["100"],"barThickness":null}]},"options":{"indexAxis":"x","maintainAspectRatio":true,"scales":{"y":{"ticks":{"precision":0,"autoSkip":false,"font":{"family":"Montserrat"},"color":"#000000"}},"x":{"ticks":{"stepSize":1,"precision":0,"font":{"family":"Montserrat"},"color":"#000000"}}},"plugins":{"legend":{"position":"top","labels":{"font":{"family":"Montserrat"},"color":"#000000"}},"title":{"display":true,"text":"Citations per year","font":{"size":24,"family":"Montserrat","weight":600},"color":"#000000"}}}},"journals":{"type":"bar","data":{"show":true,"labels":["ACM Computing Surveys","Lecture Notes in Computer Science"],"ids":[2817,1022],"codes":[0,0],"imageUrls":["\/storage\/images\/resized\/XZDD1UbkaHV0BImS1Dm7kQfvovjiljgbqNi7vyqK_medium.webp","\/storage\/images\/resized\/voXLqlsvTwv5p3iMQ8Dhs95nqB4AXOG7Taj7G4ra_medium.webp"],"datasets":[{"label":"","data":[1,1],"backgroundColor":["#3B82F6","#3B82F6"],"percentage":[50,50],"barThickness":13}]},"options":{"indexAxis":"y","maintainAspectRatio":false,"scales":{"y":{"ticks":{"precision":0,"autoSkip":false,"font":{"family":"Montserrat"},"color":"#000000"}},"x":{"ticks":{"stepSize":null,"precision":0,"font":{"family":"Montserrat"},"color":"#000000"}}},"plugins":{"legend":{"position":"top","labels":{"font":{"family":"Montserrat"},"color":"#000000"}},"title":{"display":true,"text":"Journals","font":{"size":24,"family":"Montserrat","weight":600},"color":"#000000"}}}},"publishers":{"type":"bar","data":{"show":true,"labels":["Association for Computing Machinery (ACM)","Springer Nature"],"ids":[1141,8],"codes":[0,0],"imageUrls":["\/storage\/images\/resized\/XZDD1UbkaHV0BImS1Dm7kQfvovjiljgbqNi7vyqK_medium.webp","\/storage\/images\/resized\/voXLqlsvTwv5p3iMQ8Dhs95nqB4AXOG7Taj7G4ra_medium.webp"],"datasets":[{"label":"","data":[1,1],"backgroundColor":["#3B82F6","#3B82F6"],"percentage":[50,50],"barThickness":13}]},"options":{"indexAxis":"y","maintainAspectRatio":false,"scales":{"y":{"ticks":{"precision":0,"autoSkip":false,"font":{"family":"Montserrat"},"color":"#000000"}},"x":{"ticks":{"stepSize":null,"precision":0,"font":{"family":"Montserrat"},"color":"#000000"}}},"plugins":{"legend":{"position":"top","labels":{"font":{"family":"Montserrat"},"color":"#000000"}},"title":{"display":true,"text":"Publishers","font":{"size":24,"family":"Montserrat","weight":600},"color":"#000000"}}}}}

Metrics

Cite this

GOST |

Cite this

GOST Copy

Kirilenko D. E. et al. Question Answering for Visual Navigation in Human-Centered Environments // Lecture Notes in Computer Science. 2021. Vol. 13068 LNAI. pp. 31-45.

GOST all authors (up to 50) Copy

Kirilenko D. E., Kovalev A. K., Osipov E., Panov A. Question Answering for Visual Navigation in Human-Centered Environments // Lecture Notes in Computer Science. 2021. Vol. 13068 LNAI. pp. 31-45.

RIS |

Cite this

RIS Copy

TY - GENERIC

DO - 10.1007/978-3-030-89820-5_3

UR - https://doi.org/10.1007%2F978-3-030-89820-5_3

TI - Question Answering for Visual Navigation in Human-Centered Environments

T2 - Lecture Notes in Computer Science

AU - Kirilenko, Daniil E

AU - Kovalev, Alexey K

AU - Osipov, Evgeny

AU - Panov, Aleksandr

PY - 2021

DA - 2021/10/20 00:00:00

PB - Springer Nature

SP - 31-45

VL - 13068 LNAI

SN - 0302-9743

SN - 1611-3349

SN - 1861-2075

SN - 1861-2083

ER -

BibTex

Cite this

BibTex Copy

@incollection{2021_Kirilenko,

author = {Daniil E Kirilenko and Alexey K Kovalev and Evgeny Osipov and Aleksandr Panov},

title = {Question Answering for Visual Navigation in Human-Centered Environments},

publisher = {Springer Nature},

year = {2021},

volume = {13068 LNAI},

pages = {31--45},

month = {oct}

}

Found error?

Publisher

Springer Nature

Journal

Lecture Notes in Computer Science

Quartile SCImago

Quartile WOS

—

Impact factor

—

ISSN

03029743 (Print)
16113349 (Electronic)
18612075 (Print)
18612083 (Electronic)

Labs

MIPT Center for Cognitive Modeling

Profiles