Decentralized reinforcement learning of robot behaviors
Author
dc.contributor.author
Leottau, David L.
Author
dc.contributor.author
Ruíz del Solar San Martín, Javier
Author
dc.contributor.author
Babuška, Robert
Accession date
dc.date.accessioned
2018-07-26T15:24:00Z
Available date
dc.date.available
2018-07-26T15:24:00Z
Publication date
dc.date.issued
2018
Item citation
dc.identifier.citation
Artificial Intelligence, 256 (2018): 130–159
Identifier
dc.identifier.other
10.1016/j.artint.2017.12.001
Identifier
dc.identifier.uri
https://repositorio.uchile.cl/handle/2250/150316
Abstract
dc.description.abstract
A multi-agent methodology is proposed for Decentralized Reinforcement Learning (DRL) of individual behaviors in problems that involve multi-dimensional action spaces. With this methodology, sub-tasks are learned in parallel by individual agents working toward a common goal. In addition to the methodology itself, three specific multi-agent DRL approaches are considered: DRL-Independent, DRL-Cooperative Adaptive (DRL-CA), and DRL-Lenient. These approaches are validated and analyzed in an extensive empirical study using four different problems: 3D Mountain Car, SCARA Real-Time Trajectory Generation, Ball-Dribbling in humanoid soccer robotics, and Ball Pushing using differential drive robots. The experimental validation provides evidence that DRL implementations achieve better performance and faster learning times than their centralized counterparts, while using fewer computational resources. The DRL-Lenient and DRL-CA algorithms achieve the best final performance on the four tested problems, outperforming their DRL-Independent counterparts. Furthermore, the benefits of DRL-Lenient and DRL-CA are more noticeable when the problem complexity increases and the centralized scheme becomes intractable given the available computational resources and training time.
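As an illustration of the decentralized idea described in the abstract, the following minimal sketch lets two independent tabular Q-learners each control one dimension of a two-dimensional action while sharing the state and a common reward. It is not the authors' implementation: the grid task, the names `QAgent`, `step`, `run_episode`, and `train`, and all parameter values are assumptions chosen for this example, and the DRL-CA and DRL-Lenient variants are not shown. In this toy setting, each agent stores values for 3 actions per state, whereas a centralized learner would need 3 × 3 = 9 joint actions per state.

```python
import random

SIZE = 4            # hypothetical 4x4 grid; the goal is the far corner
MOVES = (-1, 0, 1)  # per-dimension actions: step back, stay, step forward


class QAgent:
    """Tabular Q-learner that controls a single action dimension."""

    def __init__(self, n_states, n_actions, alpha=0.1, gamma=0.95, epsilon=0.1):
        self.q = [[0.0] * n_actions for _ in range(n_states)]
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon

    def act(self, state, rng, greedy=False):
        # Epsilon-greedy selection over this agent's own action dimension.
        if not greedy and rng.random() < self.epsilon:
            return rng.randrange(len(self.q[state]))
        row = self.q[state]
        return row.index(max(row))

    def update(self, s, a, r, s_next, done):
        # Standard Q-learning update; the other agent is treated as part
        # of the environment (independent-learner assumption).
        target = r if done else r + self.gamma * max(self.q[s_next])
        self.q[s][a] += self.alpha * (target - self.q[s][a])


def step(x, y, ax, ay):
    """Apply one action index per dimension; the reward is shared."""
    x = min(SIZE - 1, max(0, x + MOVES[ax]))
    y = min(SIZE - 1, max(0, y + MOVES[ay]))
    done = x == SIZE - 1 and y == SIZE - 1
    return x, y, (0.0 if done else -1.0), done


def run_episode(agents, rng, greedy=False, max_steps=100):
    """Run one episode from (0, 0); return the number of steps taken."""
    x = y = 0
    for t in range(1, max_steps + 1):
        s = x * SIZE + y
        ax = agents[0].act(s, rng, greedy)  # agent 0 controls the x move
        ay = agents[1].act(s, rng, greedy)  # agent 1 controls the y move
        x, y, r, done = step(x, y, ax, ay)
        if not greedy:
            s_next = x * SIZE + y
            # Both agents learn in parallel from the same common reward.
            agents[0].update(s, ax, r, s_next, done)
            agents[1].update(s, ay, r, s_next, done)
        if done:
            return t
    return max_steps


def train(episodes=3000, seed=0):
    rng = random.Random(seed)
    agents = [QAgent(SIZE * SIZE, len(MOVES)) for _ in range(2)]
    for _ in range(episodes):
        run_episode(agents, rng)
    return agents, rng
```

Calling `train()` and then `run_episode(agents, rng, greedy=True)` rolls out the learned greedy policies from the start cell. The saving in table size grows with the number of action dimensions, which is one plausible reading of the abstract's claim about computational resources.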
Sponsor
dc.description.sponsorship
CONICYT, grant CONICYT-PCHA/Doctorado Nacional/2013-63130183
FONDECYT, grant 1161500
European Regional Development Fund under the project Robotics 4 Industry 4.0, grant CZ.02.1.01/0.0/0.0/15_003/0000470