Breakout

Libraries:

Stable Baselines:

https://stable-baselines3.readthedocs.io/en/master/index.html

Stable Baselines contrib

https://sb3-contrib.readthedocs.io/en/master/index.html

Algorithms:

PPO
Recurrent PPO
TRPO
A2C
DQN
QR-DQN

Description:

Another famous Atari game. The dynamics are similar to pong: You move a paddle and hit the ball in a brick wall at the top of the screen. Your goal is to destroy the brick wall. You can try to break through the wall and let the ball wreak havoc on the other side, all on its own! You have five lives. Detailed documentation can be found on the AtariAge page.

Training steps:

Inital exploration across algorithms - 200K
Final training for PPO and RecurrentPPO - 5M

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Readme.md

Readme.md

Breakout

Libraries:

Stable Baselines:

Stable Baselines contrib

Algorithms:

Description:

Training steps:

Results:

Randomly acting agent:

Modelled agent

Files

Readme.md

Latest commit

History

Readme.md

File metadata and controls

Breakout

Libraries:

Stable Baselines:

Stable Baselines contrib

Algorithms:

Description:

Training steps:

Results:

Randomly acting agent:

Modelled agent