Open Access
Lecture Notes in Computer Science, volume 13154 LNAI, pages 344-354
Case-Based Task Generalization in Model-Based Reinforcement Learning
Zholus Artem
1
,
Publication type: Book Chapter
Publication date: 2022-01-06
Journal:
Lecture Notes in Computer Science
Quartile SCImago
Q3
Quartile WOS
—
Impact factor: —
ISSN: 03029743, 16113349, 18612075, 18612083
Abstract
Model-based reinforcement learning has recently demonstrated significant advances in solving complex problems of sequential decision-making. Updating the model using the case of solving the current task allows the agent to update the model and apply it to improve the efficiency of solving the following similar tasks. This approach also aligns with case-based planning methods, which already have mechanisms for retrieving and reusing precedents. In this work, we propose a meta-learned case retrieval mechanism that provides case-based samples for the agent to accelerate the learning process. We have tested the performance of the proposed approach on the well-known MuJoCo dataset and have shown results at the level of methods using pre-generated expert data.
Citations by journals
1
|
|
Studies in Computational Intelligence
|
Studies in Computational Intelligence
1 publication, 50%
|
Pattern Recognition and Image Analysis
|
Pattern Recognition and Image Analysis
1 publication, 50%
|
1
|
Citations by publishers
1
|
|
Springer Nature
|
Springer Nature
1 publication, 50%
|
Pleiades Publishing
|
Pleiades Publishing
1 publication, 50%
|
1
|
- We do not take into account publications that without a DOI.
- Statistics recalculated only for publications connected to researchers, organizations and labs registered on the platform.
- Statistics recalculated weekly.
{"yearsCitations":{"type":"bar","data":{"show":true,"labels":[2022,2023],"ids":[0,0],"codes":[0,0],"imageUrls":["",""],"datasets":[{"label":"Citations number","data":[1,1],"backgroundColor":["#3B82F6","#3B82F6"],"percentage":["50","50"],"barThickness":null}]},"options":{"indexAxis":"x","maintainAspectRatio":true,"scales":{"y":{"ticks":{"precision":0,"autoSkip":false,"font":{"family":"Montserrat"},"color":"#000000"}},"x":{"ticks":{"stepSize":1,"precision":0,"font":{"family":"Montserrat"},"color":"#000000"}}},"plugins":{"legend":{"position":"top","labels":{"font":{"family":"Montserrat"},"color":"#000000"}},"title":{"display":true,"text":"Citations per year","font":{"size":24,"family":"Montserrat","weight":600},"color":"#000000"}}}},"journals":{"type":"bar","data":{"show":true,"labels":["Studies in Computational Intelligence","Pattern Recognition and Image Analysis"],"ids":[2714,9753],"codes":[0,0],"imageUrls":["\/storage\/images\/resized\/voXLqlsvTwv5p3iMQ8Dhs95nqB4AXOG7Taj7G4ra_medium.webp","\/storage\/images\/resized\/oZgeErrVFhuDksyqFURLvYS1wtVSBWczh001igGo_medium.webp"],"datasets":[{"label":"","data":[1,1],"backgroundColor":["#3B82F6","#3B82F6"],"percentage":[50,50],"barThickness":13}]},"options":{"indexAxis":"y","maintainAspectRatio":false,"scales":{"y":{"ticks":{"precision":0,"autoSkip":false,"font":{"family":"Montserrat"},"color":"#000000"}},"x":{"ticks":{"stepSize":null,"precision":0,"font":{"family":"Montserrat"},"color":"#000000"}}},"plugins":{"legend":{"position":"top","labels":{"font":{"family":"Montserrat"},"color":"#000000"}},"title":{"display":true,"text":"Journals","font":{"size":24,"family":"Montserrat","weight":600},"color":"#000000"}}}},"publishers":{"type":"bar","data":{"show":true,"labels":["Springer Nature","Pleiades Publishing"],"ids":[8,101],"codes":[0,0],"imageUrls":["\/storage\/images\/resized\/voXLqlsvTwv5p3iMQ8Dhs95nqB4AXOG7Taj7G4ra_medium.webp","\/storage\/images\/resized\/oZgeErrVFhuDksyqFURLvYS1wtVSBWczh001igGo_medium.webp"],"datasets":[{"label":"","data":[1,1],"backgroundColor":["#3B82F6","#3B82F6"],"percentage":[50,50],"barThickness":13}]},"options":{"indexAxis":"y","maintainAspectRatio":false,"scales":{"y":{"ticks":{"precision":0,"autoSkip":false,"font":{"family":"Montserrat"},"color":"#000000"}},"x":{"ticks":{"stepSize":null,"precision":0,"font":{"family":"Montserrat"},"color":"#000000"}}},"plugins":{"legend":{"position":"top","labels":{"font":{"family":"Montserrat"},"color":"#000000"}},"title":{"display":true,"text":"Publishers","font":{"size":24,"family":"Montserrat","weight":600},"color":"#000000"}}}}}
Metrics
Cite this
GOST |
RIS |
BibTex
Cite this
GOST
Copy
Zholus A., Panov A. Case-Based Task Generalization in Model-Based Reinforcement Learning // Lecture Notes in Computer Science. 2022. Vol. 13154 LNAI. pp. 344-354.
GOST all authors (up to 50)
Copy
Zholus A., Panov A. Case-Based Task Generalization in Model-Based Reinforcement Learning // Lecture Notes in Computer Science. 2022. Vol. 13154 LNAI. pp. 344-354.
Cite this
RIS
Copy
TY - GENERIC
DO - 10.1007/978-3-030-93758-4_35
UR - https://doi.org/10.1007%2F978-3-030-93758-4_35
TI - Case-Based Task Generalization in Model-Based Reinforcement Learning
T2 - Lecture Notes in Computer Science
AU - Zholus, Artem
AU - Panov, Aleksandr
PY - 2022
DA - 2022/01/06 00:00:00
PB - Springer Nature
SP - 344-354
VL - 13154 LNAI
SN - 0302-9743
SN - 1611-3349
SN - 1861-2075
SN - 1861-2083
ER -
Cite this
BibTex
Copy
@incollection{2022_Zholus
author = {Artem Zholus and Aleksandr Panov},
title = {Case-Based Task Generalization in Model-Based Reinforcement Learning},
publisher = {Springer Nature},
year = {2022},
volume = {13154 LNAI},
pages = {344--354},
month = {jan}
}
Profiles