No version for distro humble. Known supported distros are highlighted in the buttons above.
No version for distro jazzy. Known supported distros are highlighted in the buttons above.
No version for distro rolling. Known supported distros are highlighted in the buttons above.

Package Summary

Tags No category tags.
Version 1.0.0
License GPLv3
Build type AMENT_CMAKE
Use RECOMMENDED

Repository Summary

Checkout URI https://github.com/macstepien/furuta_pendulum.git
VCS Type git
VCS Version master
Last Updated 2024-04-14
Dev Status UNMAINTAINED
CI status No Continuous Integration
Released UNRELEASED
Tags No category tags.
Contributing Help Wanted (0)
Good First Issues (0)
Pull Requests to Review (0)

Package Description

TODO: Package description

Additional Links

No additional links.

Maintainers

  • Maciej Stępień

Authors

  • Maciej Stępień

furuta_pendulum_rl

Setup

Mujoco

Copy mujoco210 to ~/.mujoco/mujoco210

Imitation

imitation library isn’t currently installed automatically, so it is necessary to do it manually:

pip install imitation

[!WARNING]
It may result in upgrading gym, and it will be necessary to downgrade it again (by once again executing install of the specific stable baselines version, refer to Dockerfile). Here is the message caused by this problem:

ImportError: Missing shimmy installation. You provided an OpenAI Gym environment. Stable-Baselines3 (SB3) has transitioned to using Gymnasium internally. In order to use OpenAI Gym environments with SB3, you need to install shimmy (`pip install 'shimmy>=0.2.1'`).

Training

To solve Furuta control, I created three environments with progressive level of difficulty:

  • furuta_pendulum_upward_stabilization - pendulum starts with some angle that is a little off upward position and with some velocity, the agent’s task is to stabilize in the 0 position. A reward is received for every episode, in which the angle isn’t greater than some threshold.
  • furuta_pendulum_swing_up - pendulum starts in the downward stable position, the agent’s task is to swing it up close to the upward position
  • furuta_pendulum_full - this is a full Furuta pendulum control problem - the pendulum starts in the downward stable position, and the agent’s task is to swing it up and stabilize it upward.

Solutions

In scripts with the name of the environment and the name of the algorithm you can find example solutions.

Exporting model

Due to problems with ROS2 performance in Python I decided to use C++. To export the actor model from SAC used to solve the full control problem, I used export_sac_model.py script.

Development

Problems

High variance of actions - in gaussian exploration actions can be different in each timestep which results in high variance and oscillations of the motor. gSDE results in smoother actions.

CHANGELOG

Changelog for package furuta_pendulum_rl

1.0.0 (2023-07-30)

  • Working simulation
  • Contributors: Maciej Stępień

Wiki Tutorials

This package does not provide any links to tutorials in it's rosindex metadata. You can check on the ROS Wiki Tutorials page for the package.

Package Dependencies

System Dependencies

No direct system dependencies.

Dependant Packages

Launch files

No launch files found

Messages

No message files found.

Services

No service files found

Plugins

No plugins found.

Recent questions tagged furuta_pendulum_rl at Robotics Stack Exchange

No version for distro noetic. Known supported distros are highlighted in the buttons above.
No version for distro ardent. Known supported distros are highlighted in the buttons above.
No version for distro bouncy. Known supported distros are highlighted in the buttons above.
No version for distro crystal. Known supported distros are highlighted in the buttons above.
No version for distro eloquent. Known supported distros are highlighted in the buttons above.
No version for distro dashing. Known supported distros are highlighted in the buttons above.
No version for distro galactic. Known supported distros are highlighted in the buttons above.
No version for distro foxy. Known supported distros are highlighted in the buttons above.
No version for distro iron. Known supported distros are highlighted in the buttons above.
No version for distro lunar. Known supported distros are highlighted in the buttons above.
No version for distro jade. Known supported distros are highlighted in the buttons above.
No version for distro indigo. Known supported distros are highlighted in the buttons above.
No version for distro hydro. Known supported distros are highlighted in the buttons above.
No version for distro kinetic. Known supported distros are highlighted in the buttons above.
No version for distro melodic. Known supported distros are highlighted in the buttons above.