A toolkit for Reinforcement Learning using ROS and Gazebo

As a starting point, you might be interested in the code at https://github.com/flobotics/flobotics_tensorflow_controller (a ROS node that uses TensorFlow). The code projects the angle of a phalanx into an image and then feeds that image to a DQN.
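
To give a rough idea of what "projecting an angle into a picture" could look like, here is a minimal sketch. It is not taken from the linked repository: the function name `angle_to_image`, the 32x32 size, and the choice to draw the angle as a line from the image centre are all my own assumptions for illustration. It uses only the Python standard library; in practice you would probably convert the result to an array before handing it to a network.

```python
import math


def angle_to_image(angle_deg, size=32):
    """Render a joint angle as a size x size binary image (list of rows).

    Hypothetical encoding: a line is drawn from the image centre in the
    direction given by angle_deg (0 deg points right, 90 deg points up).
    """
    image = [[0.0] * size for _ in range(size)]
    cx = cy = size // 2          # pivot pixel at the image centre
    length = size // 2 - 1       # keeps the line inside the image
    theta = math.radians(angle_deg)
    for r in range(length + 1):
        x = cx + round(r * math.cos(theta))
        y = cy - round(r * math.sin(theta))  # image rows grow downward
        image[y][x] = 1.0
    return image


# Example: one frame that could serve as the observation for a DQN.
frame = angle_to_image(45.0)
```

The idea behind such an encoding is that the DQN then sees the joint state as pixels, like in the Atari setups, instead of as a raw number. Whether that is better than feeding the angle directly is something you would have to try out.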