ray/rllib/models
Eric Liang baadbdf8d4
[rllib] Execute PPO using training workflow (#8206)
* wip

* add kl

* kl

* works now

* doc update

* reorg

* add ddppo

* add stats

* fix fetch

* comment

* fix learner stat regression

* test fixes

* fix test
2020-04-30 01:18:09 -07:00
..
tests [RLlib] Nested action space PR (minimally invasive; torch only + test). (#8101) 2020-04-23 09:09:22 +02:00
tf [RLlib] Deprecate all Model(v1) usage. (#8146) 2020-04-29 12:12:59 +02:00
torch [RLlib] Deprecate all Model(v1) usage. (#8146) 2020-04-29 12:12:59 +02:00
__init__.py [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00
action_dist.py [RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155) 2020-02-19 12:18:45 -08:00
catalog.py [RLlib] Deprecate all Model(v1) usage. (#8146) 2020-04-29 12:12:59 +02:00
extra_spaces.py [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00
model.py [RLlib] Deprecate all Model(v1) usage. (#8146) 2020-04-29 12:12:59 +02:00
modelv2.py [rllib] Execute PPO using training workflow (#8206) 2020-04-30 01:18:09 -07:00
preprocessors.py [RLlib] Fix broken example: tf-eager with custom-RNN (#6732). (#7021) 2020-02-06 09:44:08 -08:00
README.txt [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00

Shared neural network models for RLlib.