0x05a4 / DeepRL-PPO-LLv2

Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results
DeepRL-PPO-LLv2 / LunarLander-v2-PPO
  • 1 contributor
History: 4 commits
0x05a4
Baseline: LR=3e-4/.996, epochs=2e6
678a575 over 3 years ago
  • _stable_baselines3_version
    5 Bytes
    Baseline 1M epochs over 3 years ago
  • data
    16.3 kB
    Baseline: LR=3e-4/.996, epochs=2e6 over 3 years ago
  • policy.optimizer.pth
    84.9 kB
    Baseline: LR=3e-4/.996, epochs=2e6 over 3 years ago
  • policy.pth
    43.2 kB
    Baseline: LR=3e-4/.996, epochs=2e6 over 3 years ago
  • pytorch_variables.pth

    Pickle imports: no problematic imports detected
    431 Bytes
    Baseline 1M epochs over 3 years ago
  • system_info.txt
    193 Bytes
    Baseline 1M epochs over 3 years ago