ray/rllib/tuned_examples/regression_tests/cartpole-a2c-microbatch.yaml at 31b40b00f67163e9723648fac8a4de5cb99063dc - hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 18:41:40 -05:00

Eric Liang 243b1b7281

[rllib] Add microbatch optimizer with A2C example (#6161)

2019-11-14 12:14:00 -08:00

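cartpole-a2c-microbatch:
    env: CartPole-v0                  # Gym environment to train on
    run: A2C                          # RLlib algorithm
    stop:                             # the trial stops once any criterion below is met
        episode_reward_mean: 100      # target mean episode reward
        timesteps_total: 100000       # budget of sampled environment steps
    config:
        num_workers: 1                # one rollout worker
        gamma: 0.95                   # discount factor
        microbatch_size: 50           # accumulate gradients over microbatches of 50 steps
        train_batch_size: 100         # apply an update per 100 steps, i.e. 2 microbatches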
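
The relation between `microbatch_size` and `train_batch_size` in the config above can be sketched in plain Python. This is a minimal illustration, not RLlib's optimizer code: the helper names `microbatches_per_update` and `accumulate` are hypothetical, and rounding up when the sizes do not divide evenly is an assumption.

```python
import math

# Values copied from the config above.
config = {"microbatch_size": 50, "train_batch_size": 100}


def microbatches_per_update(microbatch_size: int, train_batch_size: int) -> int:
    """Microbatches accumulated before one gradient update.

    Assumption: round up when train_batch_size is not a multiple
    of microbatch_size.
    """
    return math.ceil(train_batch_size / microbatch_size)


def accumulate(grads):
    """Sum per-microbatch gradients elementwise (a stand-in for accumulation)."""
    total = [0.0] * len(grads[0])
    for g in grads:
        total = [t + x for t, x in zip(total, g)]
    return total


n = microbatches_per_update(**config)
print(f"{n} microbatches of {config['microbatch_size']} steps per update")

# Toy gradients, one list per microbatch; a single update would apply their sum.
print(accumulate([[1.0, 2.0], [3.0, 4.0]]))
```

With the values in this file, each update would combine gradients from two microbatches of 50 steps, so only 50 steps need to be processed at a time.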