Show simple item record
Author | dc.contributor.author | Celemin, Carlos | |
Author | dc.contributor.author | Ruiz del Solar, Javier | |
Admission date | dc.date.accessioned | 2019-10-11T17:31:07Z | |
Available date | dc.date.available | 2019-10-11T17:31:07Z | |
Publication date | dc.date.issued | 2019 | |
Cita de ítem | dc.identifier.citation | Journal of Intelligent and Robotic Systems: Theory and Applications, Volumen 95, Issue 1, 2019, Pages 77-97 | |
Identifier | dc.identifier.issn | 15730409 | |
Identifier | dc.identifier.issn | 09210296 | |
Identifier | dc.identifier.other | 10.1007/s10846-018-0839-z | |
Identifier | dc.identifier.uri | https://repositorio.uchile.cl/handle/2250/171299 | |
Abstract | dc.description.abstract | © 2018, Springer Science+Business Media B.V., part of Springer Nature.The main goal of this article is to present COACH (COrrective Advice Communicated by Humans), a new learning framework that allows non-expert humans to advise an agent while it interacts with the environment in continuous action problems. The human feedback is given in the action domain as binary corrective signals (increase/decrease the current action magnitude), and COACH is able to adjust the amount of correction that a given action receives adaptively, taking state-dependent past feedback into consideration. COACH also manages the credit assignment problem that normally arises when actions in continuous time receive delayed corrections. The proposed framework is characterized and validated extensively using four well-known learning problems. The experimental analysis includes comparisons with other interactive learning frameworks, with classical reinforcement learning approaches, and with human teleoperators tryi | |
Lenguage | dc.language.iso | en | |
Publisher | dc.publisher | Springer Netherlands | |
Type of license | dc.rights | Attribution-NonCommercial-NoDerivs 3.0 Chile | |
Link to License | dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/cl/ | |
Source | dc.source | Journal of Intelligent and Robotic Systems: Theory and Applications | |
Keywords | dc.subject | Decision making systems | |
Keywords | dc.subject | Human feedback | |
Keywords | dc.subject | Human teachers | |
Keywords | dc.subject | Interactive machine learning | |
Keywords | dc.subject | Learning from demonstration | |
Título | dc.title | An Interactive Framework for Learning Continuous Actions Policies Based on Corrective Feedback | |
Document type | dc.type | Artículo de revista | |
Cataloguer | uchile.catalogador | SCOPUS | |
Indexation | uchile.index | Artículo de publicación SCOPUS | |
uchile.cosecha | uchile.cosecha | SI | |
Files in this item
- Name:
- item_85046824032.pdf
- Size:
- 2.010Kb
- Format:
- PDF
This item appears in the following Collection(s)
Show simple item record
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 Chile