.. |
a3c
|
[RLlib] Minor cleanup in preparation to tf2.x support. (#9130)
|
2020-06-25 19:01:32 +02:00 |
ars
|
[RLlib] Add testing Policy.compute_single_action() for all agents. (#8903)
|
2020-06-13 17:51:50 +02:00 |
ddpg
|
[RLlib] Minor cleanup in preparation to tf2.x support. (#9130)
|
2020-06-25 19:01:32 +02:00 |
dqn
|
[RLlib] Minor cleanup in preparation to tf2.x support. (#9130)
|
2020-06-25 19:01:32 +02:00 |
dyna
|
[RLlib] Minor cleanup in preparation to tf2.x support. (#9130)
|
2020-06-25 19:01:32 +02:00 |
es
|
[RLlib] Add testing Policy.compute_single_action() for all agents. (#8903)
|
2020-06-13 17:51:50 +02:00 |
impala
|
This PR fixes the currently broken lstm_use_prev_action_reward flag for default lstm models (model.use_lstm=True). (#8970)
|
2020-06-27 20:50:01 +02:00 |
maml
|
[rllib] MAML Agent (#8862)
|
2020-06-23 09:48:23 -07:00 |
marwil
|
[RLlib] Minor cleanup in preparation to tf2.x support. (#9130)
|
2020-06-25 19:01:32 +02:00 |
pg
|
[RLlib] Minor rllib.utils cleanup. (#8932)
|
2020-06-16 08:52:20 +02:00 |
ppo
|
This PR fixes the currently broken lstm_use_prev_action_reward flag for default lstm models (model.use_lstm=True). (#8970)
|
2020-06-27 20:50:01 +02:00 |
qmix
|
Issue 8407: RNN sequencing error in QMIX (#9139)
|
2020-06-26 09:50:31 +02:00 |
sac
|
This PR fixes the currently broken lstm_use_prev_action_reward flag for default lstm models (model.use_lstm=True). (#8970)
|
2020-06-27 20:50:01 +02:00 |
__init__.py
|
[rllib] Try moving RLlib to top level dir (#5324)
|
2019-08-05 23:25:49 -07:00 |
agent.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |
callbacks.py
|
[rllib] Add type annotations for evaluation/, env/ packages (#9003)
|
2020-06-19 13:09:05 -07:00 |
mock.py
|
[RLlib] Fix broken tune tests in master due to framework=auto errors. (#8672)
|
2020-05-29 11:55:47 +02:00 |
registry.py
|
[rllib] MAML Agent (#8862)
|
2020-06-23 09:48:23 -07:00 |
trainer.py
|
[RLlib] Minor cleanup in preparation to tf2.x support. (#9130)
|
2020-06-25 19:01:32 +02:00 |
trainer_template.py
|
[RLlib] Minor cleanup in preparation to tf2.x support. (#9130)
|
2020-06-25 19:01:32 +02:00 |