a3c | [RLlib] Minor rllib.utils cleanup. (#8932) | 2020-06-16 08:52:20 +02:00
ars | [RLlib] Add testing Policy.compute_single_action() for all agents. (#8903) | 2020-06-13 17:51:50 +02:00
ddpg | [RLlib] Minor rllib.utils cleanup. (#8932) | 2020-06-16 08:52:20 +02:00
dqn | [RLlib] Minor rllib.utils cleanup. (#8932) | 2020-06-16 08:52:20 +02:00
dyna | [RLlib] Prototype of a DynaTrainer (for env dynamics learning in upcoming MBMPO algo). (#8860) | 2020-06-16 09:01:20 +02:00
es | [RLlib] Add testing Policy.compute_single_action() for all agents. (#8903) | 2020-06-13 17:51:50 +02:00
impala | [RLlib] Minor rllib.utils cleanup. (#8932) | 2020-06-16 08:52:20 +02:00
marwil | [RLlib] Minor rllib.utils cleanup. (#8932) | 2020-06-16 08:52:20 +02:00
pg | [RLlib] Minor rllib.utils cleanup. (#8932) | 2020-06-16 08:52:20 +02:00
ppo | [RLlib] Minor rllib.utils cleanup. (#8932) | 2020-06-16 08:52:20 +02:00
qmix | [RLlib] Minor rllib.utils cleanup. (#8932) | 2020-06-16 08:52:20 +02:00
sac | [RLlib] Minor rllib.utils cleanup. (#8932) | 2020-06-16 08:52:20 +02:00
__init__.py | [rllib] Try moving RLlib to top level dir (#5324) | 2019-08-05 23:25:49 -07:00
agent.py | Remove future imports (#6724) | 2020-01-09 00:15:48 -08:00
callbacks.py | [rllib] observation function api for multi-agent (#8236) | 2020-05-04 22:13:49 -07:00
mock.py | [RLlib] Fix broken tune tests in master due to framework=auto errors. (#8672) | 2020-05-29 11:55:47 +02:00
registry.py | [rllib] Deprecate policy optimizers (#8345) | 2020-05-21 10:16:18 -07:00
trainer.py | [rllib] Flexible multi-agent replay modes and replay_sequence_length (#8893) | 2020-06-12 20:17:27 -07:00
trainer_template.py | [rllib] Deprecate policy optimizers (#8345) | 2020-05-21 10:16:18 -07:00