ray/rllib/tuned_examples/regression_tests/cartpole-appo-tf.yaml

cartpole-appo-tf:
    env: CartPole-v0
    run: APPO
    stop:
        episode_reward_mean: 150
        timesteps_total: 200000
    config:
        use_pytorch: false
        rollout_fragment_length: 10
        train_batch_size: 10
        num_envs_per_worker: 5
        num_workers: 1
        num_gpus: 0
        vtrace: false
[RLlib] PyTorch version of APPO. (#8120) - Translate all vtrace functionality to torch and added torch to the framework_iterator-loop in all existing vtrace test cases. - Add learning test cases for APPO torch (both w/ and w/o v-trace). - Add quick compilation tests for APPO (tf and torch, v-trace and no v-trace). 2020-04-23 09:11:12 +02:00			`cartpole-appo-tf:`
Appo (#3779) * Deleted old fork, updated new ray and moved PPO-impala to APPO in ppo folder * Deleted unneccesary vtrace.py file * Update pong-impala.yaml * Cleaned PPO Code * Update pong-impala.yaml * Update pong-impala.yaml * wip * new ifle * refactor * add vtrace off option * revert * support any space * docs * fix comment * remove kl * Update cartpole-appo-vtrace.yaml 2019-01-18 13:40:26 -08:00			`env: CartPole-v0`
			`run: APPO`
			`stop:`
[RLlib] PyTorch version of APPO. (#8120) - Translate all vtrace functionality to torch and added torch to the framework_iterator-loop in all existing vtrace test cases. - Add learning test cases for APPO torch (both w/ and w/o v-trace). - Add quick compilation tests for APPO (tf and torch, v-trace and no v-trace). 2020-04-23 09:11:12 +02:00			`episode_reward_mean: 150`
			`timesteps_total: 200000`
Appo (#3779) * Deleted old fork, updated new ray and moved PPO-impala to APPO in ppo folder * Deleted unneccesary vtrace.py file * Update pong-impala.yaml * Cleaned PPO Code * Update pong-impala.yaml * Update pong-impala.yaml * wip * new ifle * refactor * add vtrace off option * revert * support any space * docs * fix comment * remove kl * Update cartpole-appo-vtrace.yaml 2019-01-18 13:40:26 -08:00			`config:`
[RLlib] PyTorch version of APPO. (#8120) - Translate all vtrace functionality to torch and added torch to the framework_iterator-loop in all existing vtrace test cases. - Add learning test cases for APPO torch (both w/ and w/o v-trace). - Add quick compilation tests for APPO (tf and torch, v-trace and no v-trace). 2020-04-23 09:11:12 +02:00			`use_pytorch: false`
[rllib] Rename sample_batch_size => rollout_fragment_length (#7503) * bulk rename * deprecation warn * update doc * update fig * line length * rename * make pytest comptaible * fix test * fi sys * rename * wip * fix more * lint * update svg * comments * lint * fix use of batch steps 2020-03-14 12:05:04 -07:00			`rollout_fragment_length: 10`
Appo (#3779) * Deleted old fork, updated new ray and moved PPO-impala to APPO in ppo folder * Deleted unneccesary vtrace.py file * Update pong-impala.yaml * Cleaned PPO Code * Update pong-impala.yaml * Update pong-impala.yaml * wip * new ifle * refactor * add vtrace off option * revert * support any space * docs * fix comment * remove kl * Update cartpole-appo-vtrace.yaml 2019-01-18 13:40:26 -08:00			`train_batch_size: 10`
			`num_envs_per_worker: 5`
			`num_workers: 1`
			`num_gpus: 0`
			`vtrace: false`