ray/rllib/tuned_examples/regression_tests/cartpole-ppo.yaml at c51fbfb453d4b63f9d914b6875273ded8c78ba0b - hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-09 04:46:38 -04:00

Eric Liang 5d7afe8092

[rllib] Try moving RLlib to top level dir (#5324)

2019-08-05 23:25:49 -07:00


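cartpole-ppo:
    env: CartPole-v0
    run: PPO
    # The trial stops as soon as either criterion below is met.
    stop:
        episode_reward_mean: 150  # target mean episode reward
        timesteps_total: 100000   # cap on total sampled environment steps
    config:
        num_workers: 1                     # one rollout worker process
        batch_mode: complete_episodes      # sample batches contain only whole episodes
        observation_filter: MeanStdFilter  # normalize observations with a running mean/std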
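
# Usage sketch (an assumption about the local setup, not part of the original file):
# with Ray and RLlib installed and the repository root as the working directory,
# an experiment file like this one can typically be launched through the RLlib CLI:
#
#     rllib train -f rllib/tuned_examples/regression_tests/cartpole-ppo.yaml
#
# The path above follows the location shown for this file after the move to the
# top-level rllib/ directory; adjust it if the checkout is laid out differently.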