An Interactive Framework for Learning Continuous Actions Policies Based on Corrective Feedback

Celemin, Carlos; Ruiz del Solar, Javier

Author	dc.contributor.author	Celemin, Carlos
Author	dc.contributor.author	Ruiz del Solar, Javier
Admission date	dc.date.accessioned	2019-10-11T17:31:07Z
Available date	dc.date.available	2019-10-11T17:31:07Z
Publication date	dc.date.issued	2019
Cita de ítem	dc.identifier.citation	Journal of Intelligent and Robotic Systems: Theory and Applications, Volumen 95, Issue 1, 2019, Pages 77-97
Identifier	dc.identifier.issn	15730409
Identifier	dc.identifier.issn	09210296
Identifier	dc.identifier.other	10.1007/s10846-018-0839-z
Identifier	dc.identifier.uri	https://repositorio.uchile.cl/handle/2250/171299
Abstract	dc.description.abstract	© 2018, Springer Science+Business Media B.V., part of Springer Nature.The main goal of this article is to present COACH (COrrective Advice Communicated by Humans), a new learning framework that allows non-expert humans to advise an agent while it interacts with the environment in continuous action problems. The human feedback is given in the action domain as binary corrective signals (increase/decrease the current action magnitude), and COACH is able to adjust the amount of correction that a given action receives adaptively, taking state-dependent past feedback into consideration. COACH also manages the credit assignment problem that normally arises when actions in continuous time receive delayed corrections. The proposed framework is characterized and validated extensively using four well-known learning problems. The experimental analysis includes comparisons with other interactive learning frameworks, with classical reinforcement learning approaches, and with human teleoperators tryi
Lenguage	dc.language.iso	en
Publisher	dc.publisher	Springer Netherlands
Type of license	dc.rights	Attribution-NonCommercial-NoDerivs 3.0 Chile
Link to License	dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/cl/
Source	dc.source	Journal of Intelligent and Robotic Systems: Theory and Applications
Keywords	dc.subject	Decision making systems
Keywords	dc.subject	Human feedback
Keywords	dc.subject	Human teachers
Keywords	dc.subject	Interactive machine learning
Keywords	dc.subject	Learning from demonstration
Título	dc.title	An Interactive Framework for Learning Continuous Actions Policies Based on Corrective Feedback
Document type	dc.type	Artículo de revista
Cataloguer	uchile.catalogador	SCOPUS
Indexation	uchile.index	Artículo de publicación SCOPUS
uchile.cosecha	uchile.cosecha	SI

Files in this item

Name:: item_85046824032.pdf
Size:: 2.010Kb
Format:: PDF

This item appears in the following Collection(s)

Artículos de revistas
Artículos de revistas

Show simple item record

Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 Chile