ray/rllib/agents/ppo at 019030cb4d589846c8954dc0f39cde9325ba8876 - hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-10 13:26:39 -04:00

Eric Liang baadbdf8d4 [rllib] Execute PPO using training workflow (#8206) * wip * add kl * kl * works now * doc update * reorg * add ddppo * add stats * fix fetch * comment * fix learner stat regression * test fixes * fix test	2020-04-30 01:18:09 -07:00
tests	[RLlib] PyTorch version of APPO. (#8120)	2020-04-23 09:11:12 +02:00
__init__.py	[rllib] Add Decentralized DDPPO trainer and documentation (#7088)	2020-02-10 15:28:27 -08:00
appo.py	[RLlib] PyTorch version of APPO. (#8120)	2020-04-23 09:11:12 +02:00
appo_tf_policy.py	[RLlib] PyTorch version of APPO. (#8120)	2020-04-23 09:11:12 +02:00
appo_torch_policy.py	[RLlib] PyTorch version of APPO. (#8120)	2020-04-23 09:11:12 +02:00
ddppo.py	[rllib] Execute PPO using training workflow (#8206)	2020-04-30 01:18:09 -07:00
ppo.py	[rllib] Execute PPO using training workflow (#8206)	2020-04-30 01:18:09 -07:00
ppo_tf_policy.py	[RLlib] Deprecate all Model(v1) usage. (#8146)	2020-04-29 12:12:59 +02:00
ppo_torch_policy.py	[RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798)	2020-04-01 00:43:21 -07:00
utils.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00