Skip to content

Latest commit

 

History

History
30 lines (23 loc) · 1.26 KB

File metadata and controls

30 lines (23 loc) · 1.26 KB

Breakout

Libraries:

Stable Baselines:

https://stable-baselines3.readthedocs.io/en/master/index.html

Stable Baselines contrib

https://sb3-contrib.readthedocs.io/en/master/index.html

Algorithms:

  1. PPO
  2. Recurrent PPO
  3. TRPO
  4. A2C
  5. DQN
  6. QR-DQN

Description:

Another famous Atari game. The dynamics are similar to pong: You move a paddle and hit the ball in a brick wall at the top of the screen. Your goal is to destroy the brick wall. You can try to break through the wall and let the ball wreak havoc on the other side, all on its own! You have five lives. Detailed documentation can be found on the AtariAge page.

Training steps:

  1. Inital exploration across algorithms - 200K
  2. Final training for PPO and RecurrentPPO - 5M

Results:

Randomly acting agent:

Initial

Modelled agent

Trained model