Black-box data-efficient policy search for robotics

Konstantinos Chatzilygeroudis; Roberto Rama; Rituraj Kaushik; Dorian Goepp; Vassilis Vassiliades; Jean-Baptiste Mouret

DOI: https://doi.org/10.1109/iros.2017.8202137


Published: Sep 1, 2017

Abstract

The most data-efficient algorithms for reinforcement learning (RL) in robotics are based on uncertain dynamical models: after each episode, they first learn a dynamical model of the robot, then they use an optimization algorithm to find a policy that maximizes the expected return given the model and its uncertainties. It is often believed that this optimization can be tractable only if analytical, gradient-based algorithms are used; however,...
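The loop described in the abstract (learn a model after each episode, then search for a policy that maximizes the expected return under the model and its uncertainty) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the 1-D linear toy system, the linear policy `u = -k*x`, a bootstrap ensemble of least-squares fits as a stand-in for an uncertain dynamical model, and plain random search as a stand-in for a black-box, gradient-free optimizer.

```python
import random

TRUE_A, TRUE_B = 0.9, 0.5  # hidden toy dynamics x' = a*x + b*u (assumed)
HORIZON = 20


def run_episode(policy, rng):
    """Roll a policy out on the 'real' system; return transitions and return."""
    x, data, ret = 1.0, [], 0.0
    for _ in range(HORIZON):
        u = policy(x)
        x_next = TRUE_A * x + TRUE_B * u + rng.gauss(0.0, 0.01)
        data.append((x, u, x_next))
        ret -= x_next ** 2  # reward: stay close to the origin
        x = x_next
    return data, ret


def fit_linear(data, ridge=1e-6):
    """Ridge least-squares fit of x' = a*x + b*u via the 2x2 normal equations."""
    sxx = sum(x * x for x, _, _ in data) + ridge
    suu = sum(u * u for _, u, _ in data) + ridge
    sxu = sum(x * u for x, u, _ in data)
    sxy = sum(x * y for x, _, y in data)
    suy = sum(u * y for _, u, y in data)
    det = sxx * suu - sxu * sxu
    return (suu * sxy - sxu * suy) / det, (sxx * suy - sxu * sxy) / det


def fit_ensemble(data, rng, members=10):
    """Bootstrap ensemble; disagreement between members stands in for model uncertainty."""
    return [fit_linear([rng.choice(data) for _ in data]) for _ in range(members)]


def expected_return(k, ensemble):
    """Average model-predicted return of the policy u = -k*x over the ensemble."""
    total = 0.0
    for a, b in ensemble:
        x = 1.0
        for _ in range(HORIZON):
            x = a * x + b * (-k * x)
            total -= x ** 2
    return total / len(ensemble)


def policy_search(episodes=3, candidates=200, seed=0):
    rng = random.Random(seed)
    # First episode uses random actions, since no model exists yet.
    data, _ = run_episode(lambda x: rng.uniform(-1.0, 1.0), rng)
    k, ret = 0.0, None
    for _ in range(episodes):
        ensemble = fit_ensemble(data, rng)  # 1) learn the model from all data so far
        # 2) black-box search: score random gains using only model rollouts
        k = max((rng.uniform(-2.0, 4.0) for _ in range(candidates)),
                key=lambda c: expected_return(c, ensemble))
        # 3) try the best policy on the system and keep the new data
        new_data, ret = run_episode(lambda x: -k * x, rng)
        data += new_data
    return k, ret


if __name__ == "__main__":
    gain, real_return = policy_search()
    print(f"learned gain k={gain:.2f}, real return={real_return:.4f}")
```

The optimizer only ever calls `expected_return`, so it needs no gradients and no analytical form of the model; swapping in a different model class or search method would leave the outer loop unchanged. In this toy setup the best gain for the true system is `TRUE_A / TRUE_B = 1.8`, so the learned gain should end up close to that value.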
