ray/rllib/tuned_examples/es/cartpole-es.yaml at 7e3ded7439bde825f00cfc914ad71b751a4889c7

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 18:41:40 -05:00

Sven Mika 2589309cf0

[RLlib] Make sure torch and tf behave the same wrt conv2d nets. (#8785)

2020-06-20 00:05:19 +02:00

12 lines

278 B

YAML

cartpole-es:
    env: CartPole-v0
    run: ES
    stop:
        episode_reward_mean: 150
        timesteps_total: 1000000
    config:
        # Works for both torch and tf.
        framework: tf
        num_workers: 2
        noise_size: 25000000
        episodes_per_batch: 50
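
The experiment above could also be launched from Python instead of through the YAML file. The following is a minimal sketch, not part of the repository: the names EXPERIMENT, to_tune_kwargs and run_experiment are hypothetical, and it assumes a Ray release from around this commit, where tune.run() accepts a registered algorithm name such as "ES" and the "framework" config key is supported. The only transformation it illustrates is moving the top-level env key into the algorithm config before the call.

```python
# Hypothetical sketch: the cartpole-es experiment expressed as a Python dict.
# The values are copied from the YAML file; the helper names are illustrative.
EXPERIMENT = {
    "cartpole-es": {
        "env": "CartPole-v0",
        "run": "ES",
        "stop": {
            "episode_reward_mean": 150,
            "timesteps_total": 1000000,
        },
        "config": {
            # Works for both torch and tf.
            "framework": "tf",
            "num_workers": 2,
            "noise_size": 25000000,
            "episodes_per_batch": 50,
        },
    }
}


def to_tune_kwargs(spec):
    """Turn one experiment spec into keyword arguments for tune.run().

    The top-level "env" key is copied into the algorithm config, because
    the trainer reads the environment name from its config dict.
    The input spec is left unmodified.
    """
    config = dict(spec["config"], env=spec["env"])
    return {
        "run_or_experiment": spec["run"],
        "stop": dict(spec["stop"]),
        "config": config,
    }


def run_experiment():
    """Launch every experiment in EXPERIMENT (requires Ray with RLlib)."""
    # Imported here so the helpers above can be used without Ray installed.
    import ray
    from ray import tune

    ray.init()
    for name, spec in EXPERIMENT.items():
        # Training stops as soon as either stop criterion is reached.
        tune.run(name=name, **to_tune_kwargs(spec))
```

Calling run_experiment() in an environment with Ray, RLlib and TensorFlow installed should start training. With the YAML file itself, the usual route is the command-line entry point, e.g. rllib train -f cartpole-es.yaml.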