ray/rllib/agents/ppo
Eric Liang baadbdf8d4
[rllib] Execute PPO using training workflow (#8206)
* wip

* add kl

* kl

* works now

* doc update

* reorg

* add ddppo

* add stats

* fix fetch

* comment

* fix learner stat regression

* test fixes

* fix test
2020-04-30 01:18:09 -07:00
..
tests [RLlib] PyTorch version of APPO. (#8120) 2020-04-23 09:11:12 +02:00
__init__.py [rllib] Add Decentralized DDPPO trainer and documentation (#7088) 2020-02-10 15:28:27 -08:00
appo.py [RLlib] PyTorch version of APPO. (#8120) 2020-04-23 09:11:12 +02:00
appo_tf_policy.py [RLlib] PyTorch version of APPO. (#8120) 2020-04-23 09:11:12 +02:00
appo_torch_policy.py [RLlib] PyTorch version of APPO. (#8120) 2020-04-23 09:11:12 +02:00
ddppo.py [rllib] Execute PPO using training workflow (#8206) 2020-04-30 01:18:09 -07:00
ppo.py [rllib] Execute PPO using training workflow (#8206) 2020-04-30 01:18:09 -07:00
ppo_tf_policy.py [RLlib] Deprecate all Model(v1) usage. (#8146) 2020-04-29 12:12:59 +02:00
ppo_torch_policy.py [RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798) 2020-04-01 00:43:21 -07:00
utils.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00