ray/rllib/tuned_examples/regression_tests/cartpole-appo-tf.yaml at ae2e1f0883e3bb98344c2d3838a415f9986b6d75

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-07 02:51:39 -05:00

Sven Mika 499ad5fbe4

[RLlib] PyTorch version of APPO. (#8120)

- Translate all v-trace functionality to torch and add torch to the framework_iterator loop in all existing v-trace test cases.
- Add learning test cases for APPO torch (both w/ and w/o v-trace).
- Add quick compilation tests for APPO (tf and torch, v-trace and no v-trace).

2020-04-23 09:11:12 +02:00

14 lines, 328 B, YAML

cartpole-appo-tf:
    env: CartPole-v0
    run: APPO
    stop:
        episode_reward_mean: 150
        timesteps_total: 200000
    config:
        use_pytorch: false
        rollout_fragment_length: 10
        train_batch_size: 10
        num_envs_per_worker: 5
        num_workers: 1
        num_gpus: 0
        vtrace: false
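
The commit message above also mentions learning tests for APPO on torch, with and without v-trace. As a minimal sketch, assuming the torch counterpart differs from this file only in the `use_pytorch` flag, such a variant might look like the following. The experiment name `cartpole-appo-torch` is hypothetical, and reusing the same stop criteria and worker settings is an assumption that this file does not confirm.

```yaml
# Hypothetical torch variant of the config above (name is illustrative only).
cartpole-appo-torch:
    env: CartPole-v0
    run: APPO
    stop:
        # Assumption: same thresholds as the tf test; not confirmed by the source.
        episode_reward_mean: 150
        timesteps_total: 200000
    config:
        # The only intended change: select the PyTorch implementation.
        use_pytorch: true
        rollout_fragment_length: 10
        train_batch_size: 10
        num_envs_per_worker: 5
        num_workers: 1
        num_gpus: 0
        # Keep v-trace off to mirror the tf config; set to true for the v-trace case.
        vtrace: false
```

Keeping every other key identical makes the two runs differ in the framework alone, so a difference in learning behavior can be attributed to the torch implementation rather than to the hyperparameters.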