When trained with multiple objects of random position and orientation for 200 episodes (~10 minutes), the human-as-adversary model achieved a 52% grasping success rate. The rate was significantly higher than those of a simulated adversary trained in the same environment for the same number of episodes, which had 28% grasping success rate.
1 Comments