ray/rllib/tuned_examples/impala/cartpole-impala.yaml at master - hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-05 10:01:43 -05:00

Sven Mika e6ae08f416

[RLlib] Optionally don't drop last ts in v-trace calculations (APPO and IMPALA). (#19601 )

2021-11-03 10:01:34 +01:00

11 lines

257 B

YAML

Raw Permalink Blame History

 cartpole-impala:
     env: CartPole-v0
     run: IMPALA
     stop:
         episode_reward_mean: 150
         timesteps_total: 500000
     config:
         # Works for both torch and tf.
         framework: tf
         num_gpus: 0
         vtrace_drop_last_ts: false