
A toolkit for Reinforcement Learning using ROS and Gazebo

For those interested in Reinforcement Learning, here are some recent results obtained at Erle.

Briefly,

This work presents an extension of the OpenAI Gym for robotics using the Robot Operating System (ROS) and the Gazebo simulator. The content discusses the software architecture proposed and the results obtained by using two Reinforcement Learning techniques: Q-Learning and Sarsa. Ultimately, the output of this work presents a benchmarking system for robotics that allows different techniques and algorithms to be compared using the same virtual conditions.
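The two techniques benchmarked in the paper differ only in their TD target: Q-Learning bootstraps off the greedy next action (off-policy), while Sarsa uses the action the agent actually takes next (on-policy). A minimal tabular sketch of that difference (the constants and toy transitions are illustrative, not taken from the paper):

```python
from collections import defaultdict

ALPHA, GAMMA = 0.1, 0.99  # learning rate and discount factor

def q_learning_update(Q, s, a, r, s_next, actions):
    """Off-policy TD update: the target uses the greedy next action."""
    best_next = max(Q[(s_next, b)] for b in actions)
    Q[(s, a)] += ALPHA * (r + GAMMA * best_next - Q[(s, a)])

def sarsa_update(Q, s, a, r, s_next, a_next):
    """On-policy TD update: the target uses the action actually taken."""
    Q[(s, a)] += ALPHA * (r + GAMMA * Q[(s_next, a_next)] - Q[(s, a)])

# Tiny demonstration on hand-written transitions.
Q = defaultdict(float)
actions = [0, 1]
q_learning_update(Q, "s0", 0, 1.0, "s1", actions)  # Q[(s0,0)] -> 0.1
sarsa_update(Q, "s0", 1, 0.5, "s1", 0)             # Q[(s0,1)] -> 0.05
```

Because both envs run under identical Gazebo conditions, this one-line difference in the target is the only variable being compared.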

Until the paper is published on arXiv, you can temporarily access a summary of this work at http://erlerobotics.com/whitepaper/robot_gym.pdf


Hi, very cool, but why (and for what) do you use OpenAI Gym? Do you know http://wiki.ros.org/reinforcement_learning ?

hi @flobotics

Given the recent popularity of the OpenAI Gym, we've used the gym to provide a common interface for RL problems in robotics. That is, to let the AI people (who don't necessarily know about ROS) focus on the AI problems.

Do you have code for this? I am trying to use Gazebo for reinforcement learning to do hand-object grasping with DQN. It would be great if you could share how you bound Gazebo and OpenAI Gym together.

If I'm not mistaken, the folks at Erle have the code on GitHub: https://github.com/erlerobot/gym-gazebo/
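For context, gym-gazebo environments expose the standard Gym control loop (`reset`/`step`); the real ones launch Gazebo and exchange data over ROS topics, so the sketch below swaps in a mock environment to show just the interface. The observation shape, rewards, and horizon are illustrative assumptions, not from the repo:

```python
import random

class MockGazeboEnv:
    """Stand-in for a gym-gazebo environment. The real class would
    launch a Gazebo world and read sensors / publish commands via ROS."""
    def __init__(self, horizon=5):
        self.horizon = horizon
        self.t = 0

    def reset(self):
        self.t = 0
        return [0.0] * 3                  # e.g. binned lidar ranges

    def step(self, action):
        self.t += 1
        obs = [random.random() for _ in range(3)]
        reward = 1.0 if action == 0 else -0.1
        done = self.t >= self.horizon
        return obs, reward, done, {}

# The generic Gym loop an RL agent runs against any such environment:
env = MockGazeboEnv()
obs, total, done = env.reset(), 0.0, False
while not done:
    action = 0                            # a real agent would choose from obs
    obs, reward, done, info = env.step(action)
    total += reward
```

The point of the common interface is exactly this: the training loop above stays identical whether the environment is a toy mock or a full Gazebo simulation.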


Thank you very much. This is really helpful.

I would strongly recommend looking into DDPG, TRPO, A3C, or FAN for a grasping task. DQN doesn’t perform well for robotics tasks due to its discrete action space and poor data efficiency.
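One way to see the discrete-action problem mentioned above: DQN needs one output per action, so discretizing each joint into even a few bins makes the joint action set grow exponentially with the degrees of freedom. A quick back-of-the-envelope (the bin counts are illustrative):

```python
def discrete_action_count(dof, bins_per_joint):
    """Number of joint actions when each DoF is discretized
    independently into bins_per_joint levels."""
    return bins_per_joint ** dof

# Even coarse discretization explodes for manipulation:
print(discrete_action_count(3, 5))    # 3-DoF arm, 5 bins/joint -> 125
print(discrete_action_count(7, 5))    # 7-DoF arm -> 78125
print(discrete_action_count(20, 3))   # ~20-DoF hand, 3 bins -> 3486784401
```

Methods like DDPG and TRPO sidestep this by emitting continuous action vectors directly, which is why they are usually preferred for grasping.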

@Random-Word, that’s interesting, thanks for sharing.
Can you point out any benchmark or results that compares DQN with these different techniques? A pointer to a paper would also do.

Hi @spk921, that sounds interesting. I built a robotic humanoid hand (Robotic Humanoid hand) and I am also interested in building AI software for hand grasping. Do you already have any code, and where? Thanks.

No, I am still working on it. I was not able to find any related work, and I don't have code yet.

Thank you for your idea.

@Random-Word What does FAN stand for? Could you point me to links or the full name of FAN?

Perhaps, as a start, you're interested in the code at https://github.com/flobotics/flobotics_tensorflow_controller . It's a ROS node with TensorFlow. The code projects the angle of a phalanx into a picture and then feeds a DQN.
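A minimal sketch of that idea, i.e. rasterizing a joint angle into a small image so an image-based DQN can consume it. The grid size and line-drawing encoding here are my own illustrative assumptions, not the repo's actual code:

```python
import math

def angle_to_image(angle_rad, size=16):
    """Render a phalanx at the given angle as a line of pixels from
    the image centre; returns a size x size grid of 0/1 values."""
    img = [[0] * size for _ in range(size)]
    cx = cy = size // 2
    for r in range(size // 2):          # walk outward along the phalanx
        x = cx + int(round(r * math.cos(angle_rad)))
        y = cy + int(round(r * math.sin(angle_rad)))
        if 0 <= x < size and 0 <= y < size:
            img[y][x] = 1
    return img

img = angle_to_image(0.0)   # horizontal phalanx: pixels along centre row
```

Encoding the state as an image lets you reuse the convolutional DQN architecture unchanged, at the cost of throwing away the fact that the underlying state is just one scalar angle.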

Sure, here are some references and reading material.

A benchmark that sadly doesn’t include DQN, but does include TRPO and DDPG:

DDPG:

Asynchronous RL work showing improved performance of asynchronous actor-critic over asynchronous Q-learning:

@spk921 Apologies, it’s NAF not FAN. It was designed for robotic manipulation and outperforms DDPG. Paper is here:


Hi Victor,

is your code connecting ROS/Gazebo to OpenAI Gym easy to extend to other robots and tasks? I would like to use the OpenAI interface with a Robotis Mini robot that I am simulating in Gazebo.

Thanks!

–roberto

Also, I saw this other project: https://github.com/openai/rosbridge
Did you know about it?

For those interested, a follow up work on this topic that will be presented at ROSCon this year:

Accelerated robot training through simulation in the cloud with ROS and Gazebo
Rather than programming, training allows robots to achieve behaviors that generalize better and are capable of responding to real-world needs. However, such training requires a large amount of experimentation, which is not always feasible for a physical robot. In this work, we present robot_gym, a framework to accelerate robot training through simulation in the cloud that makes use of roboticists' tools, simplifying the development and deployment processes on real robots. We show that, for simple tasks, simple 3-DoF robots require more than 140 attempts to learn. For more complex 6-DoF robots, the number of attempts increases to more than 900 for the same task. We demonstrate that, for simple tasks, our framework accelerates robot training time by more than 33% while maintaining similar levels of accuracy and repeatability.

Full article available at https://arxiv.org/pdf/1808.10369.pdf.
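The speed-up described above comes from running many simulator instances in parallel and pooling their experience. A toy sketch of that pattern, with thread workers standing in for cloud-hosted Gazebo instances (the worker count and episode model are illustrative, not robot_gym's actual implementation):

```python
from concurrent.futures import ThreadPoolExecutor

def run_episode(seed):
    """Stand-in for one Gazebo rollout; a real worker would drive a
    simulator instance and return its (state, action, reward) trace."""
    return [(seed, t, 0.1 * t) for t in range(3)]   # (episode, step, reward)

def collect(n_episodes, n_workers):
    """Farm episodes out to parallel workers and pool the experience,
    the way a cloud deployment pools many simulator instances."""
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        batches = pool.map(run_episode, range(n_episodes))
        return [transition for batch in batches for transition in batch]

replay = collect(n_episodes=8, n_workers=4)   # 8 episodes, 4 at a time
```

Since each rollout is independent, experience collection scales out almost linearly with the number of simulator instances; only the learner that consumes the pooled replay stays centralized.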


That training speed up sounds very interesting.
I tried searching for the framework but couldn't find it, though. Is it published yet?

Hello @MohmadAyman,

The proposal of the paper, named "robot_gym", is built upon gym-gazebo. There are no plans to release our particular setup (which is what's discussed in this paper).

If you're interested, you should be able to reproduce such a setup yourself using gym-gazebo and customize it to your needs. There are some community contributions that will facilitate the process.


Hi Victor,
does your presentation have any relation to the ROS package openai_ros, or is it something different?