P

Ppo BreakoutNoFrameskip V4

Developed by ThomasSimonini
A deep reinforcement learning model trained using the PPO algorithm in the Atari Breakout environment
Downloads 459
Release Time : 3/2/2022

Model Overview

This model is implemented based on the stable-baselines3 library, trained using the PPO algorithm in the BreakoutNoFrameskip-v4 environment, capable of playing the classic Atari Breakout game.

Model Features

Based on PPO Algorithm
Uses the Proximal Policy Optimization (PPO) algorithm, a widely-used policy gradient method in reinforcement learning
Frame Stacking Processing
Employs 4-frame stacking technology to process game screens, enabling the model to perceive temporal dynamics
Parallel Environment Training
Uses 8 parallel environments for training to improve sample collection efficiency
Stable Training
Adopts various stabilization techniques such as gradient clipping and value function coefficients to ensure training stability

Model Capabilities

Atari Game Control
Reinforcement Learning Decision Making
Real-time Game Interaction

Use Cases

Game AI
Breakout Game AI
Acts as an automatic player for the Breakout game, capable of consistently achieving high scores
Average reward reaches 339 points
Reinforcement Learning Research
Algorithm Benchmarking
Can serve as a performance benchmark for the PPO algorithm on Atari games
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase