ray/rllib/tuned_examples/impala/pendulum-impala.yaml

pendulum-impala-tf:
    env: Pendulum-v0
    run: IMPALA
    # Stop the trial as soon as either criterion below is reached.
    stop:
        episode_reward_mean: -700
        timesteps_total: 500000