Repository Summary
| Field | Value |
|---|---|
| Description | DRL_Navigation_Robot_ROS2_Foxy |
| Checkout URI | https://github.com/toxuandung/drl_navigation_robot_ros2_foxy.git |
| VCS Type | git |
| VCS Version | main |
| Last Updated | 2023-10-20 |
| Dev Status | UNKNOWN |
| CI status | No Continuous Integration |
| Released | UNRELEASED |
| Tags | No category tags. |
| Contributing | Help Wanted (0), Good First Issues (0), Pull Requests to Review (0) |
Packages
| Name | Version |
|---|---|
| td3 | 0.0.0 |
| velodyne_description | 1.0.9 |
| velodyne_gazebo_plugins | 1.0.9 |
| velodyne_simulator | 1.0.9 |
README
DRL_Navigation_Robot_ROS2_Foxy
Deep Reinforcement Learning for mobile robot navigation in the ROS Gazebo 11 simulator. Using a Twin Delayed Deep Deterministic Policy Gradient (TD3) neural network, a robot learns to navigate to a random goal point in a simulated environment while avoiding obstacles. Obstacles are detected by a LIDAR (Light Detection and Ranging) sensor, and the goal is given to the robot in polar coordinates. Trained in the ROS Gazebo 11 simulator with PyTorch. Tested with ROS 2 Foxy on Ubuntu 20.04 with Python 3.8.10 and PyTorch 1.10.
<img width=70% src="https://github.com/toxuandung/DRL_Navigation_Robot_ROS2_Foxy/blob/main/Test_example_env1.gif">
TD3 Network Implementation:
TD3 is an actor-critic type of network, similar to DDPG. This means there is an "actor" network that calculates an action to perform, and a "critic" network that estimates how good that action is. In simple terms, the TD3 architecture extends the DDPG architecture to solve the problem of overestimating the Q-value. It does so by introducing a second critic network into the loop and selecting the output of the critic that produces the lower Q-value estimate. (Once again, a mathematical and algorithmic background overview can be obtained here.) Therefore, we need to create an actor network that takes the environment state as input and outputs the action for the robot to take. We also need to create two critic networks that take the environment state and the action from the actor network as inputs and output the estimated value of that state-action pair.
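To make the "lower of the two Q-value estimates" idea concrete, here is a minimal sketch of how a TD3 target value can be computed; it is not the repository's exact training loop (see src/td3/scripts), and the function names `actor_target` and `critic_target` (a critic assumed to return a pair of Q estimates) are placeholders.

```python
import torch

def td3_target(actor_target, critic_target, next_state, reward, done,
               gamma=0.99, policy_noise=0.2, noise_clip=0.5, max_action=1.0):
    """Sketch of the TD3 target for a replay batch (reward/done shaped [batch, 1])."""
    with torch.no_grad():
        next_action = actor_target(next_state)
        # Target policy smoothing: perturb the target action with clipped noise
        noise = (torch.randn_like(next_action) * policy_noise).clamp(-noise_clip, noise_clip)
        next_action = (next_action + noise).clamp(-max_action, max_action)
        # Clipped double Q-learning: keep the smaller of the two critic estimates
        q1, q2 = critic_target(next_state, next_action)
        target_q = reward + gamma * (1.0 - done) * torch.min(q1, q2)
    return target_q
```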
<img width=100% src="https://github.com/toxuandung/DRL_Navigation_Robot_ROS2_Foxy/assets/101309710/484631fb-669f-44b5-8c6d-b9e7d1db250e">
<img width=50% src="https://github.com/toxuandung/DRL_Navigation_Robot_ROS2_Foxy/blob/main/Actor.png">
<img width=60% src="https://github.com/toxuandung/DRL_Navigation_Robot_ROS2_Foxy/blob/main/Critic.png">
<img width=90% src="https://github.com/toxuandung/DRL_Navigation_Robot_ROS2_Foxy/blob/main/Td3.png">
The details of the network can be found in src/td3/scripts.
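As a rough guide to what lives in those scripts, the following is a minimal PyTorch sketch of an actor and a double-critic, assuming a flattened state vector and a 2-dimensional action (v, ω); the layer sizes are illustrative, not the repository's exact architecture.

```python
import torch
import torch.nn as nn

class Actor(nn.Module):
    """Maps the environment state to an action in [-1, 1] per dimension."""
    def __init__(self, state_dim, action_dim):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, 800), nn.ReLU(),
            nn.Linear(800, 600), nn.ReLU(),
            nn.Linear(600, action_dim), nn.Tanh(),
        )

    def forward(self, state):
        return self.net(state)

class Critic(nn.Module):
    """Two independent Q-networks; TD3 trains against the minimum of the two."""
    def __init__(self, state_dim, action_dim):
        super().__init__()
        def q_net():
            return nn.Sequential(
                nn.Linear(state_dim + action_dim, 800), nn.ReLU(),
                nn.Linear(800, 600), nn.ReLU(),
                nn.Linear(600, 1),
            )
        self.q1, self.q2 = q_net(), q_net()

    def forward(self, state, action):
        sa = torch.cat([state, action], dim=1)
        return self.q1(sa), self.q2(sa)
```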
The Robot and the Environment:
We are trying to find the optimal sequence of actions that leads the robot to a given goal. There are two things to consider: the action and the environment that the action acts on. In a mobile robot setting, it is quite easy to express the action in mathematical form: it is the force applied to each actuator for each controllable degree of freedom. Put simply, it is how much we want to move in each controllable direction.
a = (v, ω)
s = (laser_state + distance_to_goal + theta + previous_action)
- a is the action tuple, where v is the translational velocity and ω is the angular velocity
- s is the state, where laser_state holds the distances to the nearest obstacle in each 9-degree interval of the 180-degree range in front of the robot (from the LIDAR sensor) and theta is the angle between the robot heading and the heading towards the goal (see the sketch below)
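The sketch below shows one way the state vector could be assembled from these pieces; the variable names (laser_ranges, distance_to_goal, theta, last_action) are placeholders, not the repository's identifiers.

```python
import numpy as np

def build_state(laser_ranges, distance_to_goal, theta, last_action):
    # laser_ranges: 20 readings, one per 9-degree sector of the 180° front scan
    laser_state = np.asarray(laser_ranges, dtype=np.float32)
    # distance and angle to the goal (polar coordinates) plus the previous action (v, ω)
    robot_state = np.array([distance_to_goal, theta,
                            last_action[0], last_action[1]], dtype=np.float32)
    return np.concatenate([laser_state, robot_state])  # s
```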
Reward:
if robot_reach_the_goal:
    r = 100
elif collision:
    r = -100.0
else:
    r = v - |ω| - r3   # r3 = (1 - smallest distance to an obstacle) if that distance < 1 m, else r3 = 0
r is the reward for each time step. The idea behind it is that the robot needs to realize that it should be moving around and not just sitting in a single spot. By setting a positive reward for linear motion, the robot first learns that moving forward is good and rotating is not. Additionally, we add the term r3, calculated by a lambda function, which gives an additional negative reward whenever the robot is closer than 1 meter to any obstacle.
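A minimal Python sketch of this reward logic is shown below; the argument names (reached_goal, collision, v, omega, min_laser) are placeholders rather than the repository's exact identifiers.

```python
def get_reward(reached_goal, collision, v, omega, min_laser):
    if reached_goal:
        return 100.0
    if collision:
        return -100.0
    # r3 penalizes proximity: non-zero only when closer than 1 m to an obstacle
    r3 = (1.0 - min_laser) if min_laser < 1.0 else 0.0
    return v - abs(omega) - r3
```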
Training environment:
<img width=50% src="https://github.com/toxuandung/DRL_Navigation_Robot_ROS2_Foxy/blob/main/Training_env.png">
Installation
Main dependencies:
$ sudo apt install python3-colcon-common-extensions
$ sudo apt install ros-foxy-gazebo-ros-pkgs
$ sudo apt install ros-foxy-xacro
Clone the repository:
$ git clone https://github.com/toxuandung/DRL_Navigation_Robot_ROS2_Foxy.git
$ cd DRL_Navigation_Robot_ROS2_Foxy
Compile the workspace:
$ source /opt/ros/foxy/setup.bash
$ colcon build
$ source install/setup.bash
Training:
$ ros2 launch td3 training_simulation.launch.py
Monitor the training process with TensorBoard. Open a new terminal:
$ tensorboard dev upload --logdir './src/td3/runs/train/tensorboard'
<img width=70% src="https://github.com/toxuandung/DRL_Navigation_Robot_ROS2_Foxy/blob/main/Tensorboard.PNG">
Training example:
<img width=70% src="https://github.com/toxuandung/DRL_Navigation_Robot_ROS2_Foxy/blob/main/Training_example.gif">
Testing:
$ ros2 launch td3 test_simulation.launch.py
Test example:
<img width=70% src="https://github.com/toxuandung/DRL_Navigation_Robot_ROS2_Foxy/blob/main/Test_example_env1.gif">