# ray/rllib/tuned_examples/impala/pong-impala-fast.yaml

# Reaches a reward of 18-19 in about 3 minutes with a p3.16xlarge head node
# and m4.16xlarge workers. Approximate time to reach that reward by worker count:
#   128 workers -> 3 minutes (best case)
#    64 workers -> 4 minutes
#    32 workers -> 7 minutes
# See also: pong-impala.yaml, pong-impala-vectorized.yaml
pong-impala-fast:
    env: PongNoFrameskip-v4
    run: IMPALA
    config:
        rollout_fragment_length: 50
        train_batch_size: 1000
        num_workers: 128
        num_envs_per_worker: 5
        broadcast_interval: 5
        max_sample_requests_in_flight_per_worker: 1
        num_multi_gpu_tower_stacks: 4
        num_gpus: 2
        model:
          dim: 42
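# Batch-size arithmetic for the settings above. This is a rough sketch, not a
# statement of how IMPALA schedules work internally; it assumes RLlib's default
# truncate_episodes batch mode, which this file does not set explicitly.
#   steps per worker sample        = rollout_fragment_length * num_envs_per_worker
#                                  = 50 * 5 = 250
#   worker samples per train batch = train_batch_size / 250 = 1000 / 250 = 4
#   environments stepped at once   = num_workers * num_envs_per_worker
#                                  = 128 * 5 = 640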