ray/rllib/agents/ppo at 93120e034725717608939f600c9da020de7b7671 - hiro/ray

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 18:41:40 -05:00

History

Sven Mika e968b52cb7 [RLlib] Trajectory view API - 03 Fast LSTM + prev actions/rewards (#9950 )		2020-08-21 12:35:16 +02:00
..
tests	[rllib] Learning rate schedule for DDPPO. (#10006 )	2020-08-15 00:51:45 -07:00
__init__.py	[RLlib] Examples folder restructuring (models) part 1 (#8353 )	2020-05-08 08:20:18 +02:00
appo.py	[RLlib] First attempt at cleaning up algo code in RLlib: PG. (#10115 )	2020-08-20 17:05:57 +02:00
appo_tf_policy.py	[RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136 )	2020-06-30 10:13:20 +02:00
appo_torch_policy.py	[RLlib] Minor `rllib.utils` cleanup. (#8932 )	2020-06-16 08:52:20 +02:00
ddppo.py	[RLlib] First attempt at cleaning up algo code in RLlib: PG. (#10115 )	2020-08-20 17:05:57 +02:00
ppo.py	[RLlib] Trajectory View API (preparatory cleanup and enhancements). (#9678 )	2020-07-29 21:15:09 +02:00
ppo_tf_policy.py	[RLlib] Trajectory View API (preparatory cleanup and enhancements). (#9678 )	2020-07-29 21:15:09 +02:00
ppo_torch_policy.py	[RLlib] Trajectory view API - 03 Fast LSTM + prev actions/rewards (#9950 )	2020-08-21 12:35:16 +02:00