.. |
a3c
|
[RLlib] SAC Torch (incl. Atari learning) (#7984)
|
2020-04-15 13:25:16 +02:00 |
ars
|
[RLlib] Deprecate all Model(v1) usage. (#8146)
|
2020-04-29 12:12:59 +02:00 |
ddpg
|
[RLlib] Fix issue 8135 (DDPG inf actions when using [-inf,inf] action space). (#8302)
|
2020-05-04 22:27:30 +02:00 |
dqn
|
[RLlib] APEX_DDPG (PyTorch) test case and docs. (#8288)
|
2020-05-04 09:36:27 +02:00 |
es
|
[RLlib] Deprecate all Model(v1) usage. (#8146)
|
2020-04-29 12:12:59 +02:00 |
impala
|
[RLlib] IMPALA PyTorch (#8287)
|
2020-05-03 13:44:25 +02:00 |
marwil
|
[RLlib] MARWIL torch. (#7836)
|
2020-04-06 16:38:50 -07:00 |
pg
|
[RLlib] SAC Torch (incl. Atari learning) (#7984)
|
2020-04-15 13:25:16 +02:00 |
ppo
|
[RLlib] IMPALA PyTorch (#8287)
|
2020-05-03 13:44:25 +02:00 |
qmix
|
[RLlib] rllib/examples folder restructuring (#8250)
|
2020-05-01 22:59:34 +02:00 |
sac
|
[RLlib] Beta distribution. (#8229)
|
2020-04-30 11:09:33 -07:00 |
__init__.py
|
[rllib] Try moving RLlib to top level dir (#5324)
|
2019-08-05 23:25:49 -07:00 |
agent.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |
callbacks.py
|
[RLlib] Added DefaultCallbacks which replaces old callbacks dict interface (#6972)
|
2020-04-16 16:06:42 -07:00 |
mock.py
|
[RLlib] Remove tf.py_function from all Schedule classes (not differentiable and causes other bugs in MA setups). (#8304)
|
2020-05-04 23:53:38 +02:00 |
registry.py
|
[rllib] Support multi-agent training in pipeline impls, add easy flag to enable (#7338)
|
2020-03-02 15:16:37 -08:00 |
trainer.py
|
[rllib] Execute PPO using training workflow (#8206)
|
2020-04-30 01:18:09 -07:00 |
trainer_template.py
|
[rllib] Execute PPO using training workflow (#8206)
|
2020-04-30 01:18:09 -07:00 |