0x05a4/DeepRL-PPO-LLv2

Reinforcement Learning

stable-baselines3

deep-reinforcement-learning

DeepRL-PPO-LLv2 / LunarLander-v2-PPO

1 contributor

History: 4 commits

Baseline: LR=3e-4/.996, epochs=2e6

678a575 over 3 years ago

File                        Size       Last commit                         Updated
_stable_baselines3_version  5 Bytes    Baseline 1M epochs                  over 3 years ago
data                        16.3 kB    Baseline: LR=3e-4/.996, epochs=2e6  over 3 years ago
policy.optimizer.pth        84.9 kB    Baseline: LR=3e-4/.996, epochs=2e6  over 3 years ago
policy.pth                  43.2 kB    Baseline: LR=3e-4/.996, epochs=2e6  over 3 years ago
pytorch_variables.pth       431 Bytes  Baseline 1M epochs                  over 3 years ago
system_info.txt             193 Bytes  Baseline 1M epochs                  over 3 years ago

pytorch_variables.pth pickle imports: no problematic imports detected.