Self-adapting recurrent models for object pushing from learning in simulation

University of Hamburg

Recurrent Model Predictive Path Integral on Real Robot.

Abstract

Planar pushing remains a challenging research topic, where building the dynamic model of the interaction is the core issue. Even an accurate analytical dynamic model is inherently unstable because physics parameters such as inertia and friction can only be approximated. Data-driven models usually rely on large amounts of training data, but data collection is time consuming when working with real robots.In this paper, we collect all training data in a physics simulator and build an LSTM-based model to fit the pushing dynamics. Domain Randomization is applied to capture the pushing trajectories of a generalized class of objects. When executed on the real robot, the trained recursive model adapts to the real dynamics within a few steps. We propose the algorithm Recurrent Model Predictive Path Integral (RMPPI) as a variation of the original MPPI approach, employing state-dependent recurrent models. As a comparison, we also train a Deep Deterministic Policy Gradient (DDPG) network as a model-free baseline, which is also used as the action generator in the data collection phase. During policy training, Hindsight Experience Replay is used to improve exploration efficiency. Pushing experiments on our UR5 platform demonstrate the model’s adaptability and the effectiveness of the proposed framework.

Video

BibTeX

@inproceedings{cong2020self,
  title={Self-adapting recurrent models for object pushing from learning in simulation},
  author={Cong, Lin and Grner, Michael and Ruppel, Philipp and Liang, Hongzhuo and Hendrich, Norman and Zhang, Jianwei},
  booktitle={2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)},
  pages={5304--5310},
  year={2020},
  organization={IEEE}
}