PPO Pyramids Agent

This model was trained using PPO with Unity ML-Agents in the Pyramids environment.

Environment

Pyramids

Algorithm

PPO (Proximal Policy Optimization)
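
For readers unfamiliar with PPO, its core idea is the clipped surrogate objective, which limits how far a single update can move the policy away from the one that collected the data. The following is a minimal sketch of that objective in plain Python. It illustrates the general algorithm and is not the ML-Agents implementation. The function name `ppo_clipped_objective` and the default `clip_eps=0.2` are assumptions chosen for illustration, not values taken from this model's training configuration.

```python
import math


def ppo_clipped_objective(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    """Mean PPO clipped surrogate objective (a quantity to maximize).

    new_log_probs / old_log_probs: log-probabilities of the taken actions
    under the current policy and the data-collecting policy.
    advantages: advantage estimates for those actions.
    clip_eps: clipping range (assumed value, for illustration only).
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_log_probs, old_log_probs, advantages):
        # Probability ratio between the current and the old policy.
        ratio = math.exp(new_lp - old_lp)
        # Restrict the ratio to [1 - clip_eps, 1 + clip_eps].
        clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Take the more pessimistic of the unclipped and clipped terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(advantages)
```

When the two policies agree, the ratio is 1 and the objective reduces to the mean advantage. When the ratio drifts outside the clipping range in the direction that would increase the objective, the clipped term takes over and the gradient incentive to move further disappears.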

Training

The agent was trained with Unity ML-Agents, following the Hugging Face Deep RL course materials.

Files

  • Pyramids.onnx — trained policy network

Watch

To watch the trained agent, go to https://huggingface.co/spaces/unity/ML-Agents-Pyramids and search for RyanAA.

Author

Ryan Aparicio
