ray/rllib/agents/ppo
2020-08-21 12:35:16 +02:00
..
tests [rllib] Learning rate schedule for DDPPO. (#10006) 2020-08-15 00:51:45 -07:00
__init__.py [RLlib] Examples folder restructuring (models) part 1 (#8353) 2020-05-08 08:20:18 +02:00
appo.py [RLlib] First attempt at cleaning up algo code in RLlib: PG. (#10115) 2020-08-20 17:05:57 +02:00
appo_tf_policy.py [RLlib] Tf2x preparation; part 2 (upgrading try_import_tf()). (#9136) 2020-06-30 10:13:20 +02:00
appo_torch_policy.py [RLlib] Minor rllib.utils cleanup. (#8932) 2020-06-16 08:52:20 +02:00
ddppo.py [RLlib] First attempt at cleaning up algo code in RLlib: PG. (#10115) 2020-08-20 17:05:57 +02:00
ppo.py [RLlib] Trajectory View API (preparatory cleanup and enhancements). (#9678) 2020-07-29 21:15:09 +02:00
ppo_tf_policy.py [RLlib] Trajectory View API (preparatory cleanup and enhancements). (#9678) 2020-07-29 21:15:09 +02:00
ppo_torch_policy.py [RLlib] Trajectory view API - 03 Fast LSTM + prev actions/rewards (#9950) 2020-08-21 12:35:16 +02:00