Interactively shaping robot behaviour with unlabeled human instructions
作者:Anis Najar, Olivier Sigaud, Mohamed Chetouani
摘要
In this paper, we propose a framework that enables a human teacher to shape a robot behaviour by interactively providing it with unlabeled instructions. We ground the meaning of instruction signals in the task-learning process, and use them simultaneously for guiding the latter. We implement our framework as a modular architecture, named TICS (Task-Instruction-Contingency-Shaping) that combines different information sources: a predefined reward function, human evaluative feedback and unlabeled instructions. This approach provides a novel perspective for robotic task learning that lies between Reinforcement Learning and Supervised Learning paradigms. We evaluate our framework both in simulation and with a real robot. The experimental results demonstrate the effectiveness of our framework in accelerating the task-learning process and in reducing the number of required teaching signals.
论文关键词:Interactive machine learning, Human–robot interaction, Shaping, Reinforcement learning, Unlabeled instructions
论文评审过程:
论文官网地址:https://doi.org/10.1007/s10458-020-09459-6