ray/rllib/agents
Eric Liang 9b8218aabd
[docs] Move all /latest links to /master (#11897)
* use master link

* remae

* revert non-ray

* more

* mre
2020-11-10 10:53:28 -08:00
a3c [RLlib] Fix PyTorch A3C / A2C loss function using mixed reduced sum / mean (#11449) 2020-10-22 12:39:34 -07:00
ars [RLlib] ARS/ES eval workers not working: Issue 9933. (#11308) 2020-10-12 13:49:48 -07:00
ddpg [RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033) 2020-10-06 20:28:16 +02:00
dqn [RLlib] Make sure, DQN torch actions are of type=long before torch.nn.functional.one_hot() op. (#11800) 2020-11-04 18:04:03 +01:00
dreamer [rllib] Forgot to pass ioctx to child json readers (#11839) 2020-11-05 22:07:57 -08:00
es [RLlib] ARS/ES eval workers not working: Issue 9933. (#11308) 2020-10-12 13:49:48 -07:00
impala [RLlib] Minor fixes (torch GPU bugs + some cleanup). (#11609) 2020-10-27 10:00:24 +01:00
maml [RLlib] Do not create env on driver iff num_workers > 0. (#11307) 2020-10-15 18:21:30 +02:00
marwil [RLlib] Fix test_bc.py test case. (#11722) 2020-10-31 00:16:09 -07:00
mbmpo [RLlib] Do not create env on driver iff num_workers > 0. (#11307) 2020-10-15 18:21:30 +02:00
pg [docs] Move all /latest links to /master (#11897) 2020-11-10 10:53:28 -08:00
ppo [RLlib] Minor fixes (torch GPU bugs + some cleanup). (#11609) 2020-10-27 10:00:24 +01:00
qmix [RLlib] Minor fixes (torch GPU bugs + some cleanup). (#11609) 2020-10-27 10:00:24 +01:00
sac [docs] Move all /latest links to /master (#11897) 2020-11-10 10:53:28 -08:00
slateq [RLlib] Implement the SlateQ algorithm (#11450) 2020-11-03 09:52:04 +01:00
__init__.py [RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033) 2020-10-06 20:28:16 +02:00
callbacks.py [RLlib] Issue 10469: Callbacks should receive env idx ... (#10477) 2020-09-03 17:27:05 +02:00
mock.py [tune] Use public methods for trainable (#9184) 2020-07-01 11:00:00 -07:00
registry.py [RLlib] Implement the SlateQ algorithm (#11450) 2020-11-03 09:52:04 +01:00
trainer.py Remove memory quota enforcement from actors (#11480) 2020-10-21 14:29:03 -07:00
trainer_template.py [RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033) 2020-10-06 20:28:16 +02:00