mirror of
https://github.com/vale981/ray
synced 2025-03-07 02:51:39 -05:00

- Translate all vtrace functionality to torch and added torch to the framework_iterator-loop in all existing vtrace test cases. - Add learning test cases for APPO torch (both w/ and w/o v-trace). - Add quick compilation tests for APPO (tf and torch, v-trace and no v-trace).
14 lines
328 B
YAML
14 lines
328 B
YAML
cartpole-appo-tf:
|
|
env: CartPole-v0
|
|
run: APPO
|
|
stop:
|
|
episode_reward_mean: 150
|
|
timesteps_total: 200000
|
|
config:
|
|
use_pytorch: false
|
|
rollout_fragment_length: 10
|
|
train_batch_size: 10
|
|
num_envs_per_worker: 5
|
|
num_workers: 1
|
|
num_gpus: 0
|
|
vtrace: false
|