a3c | [RLLib] Readme.md Documentation for Almost All Algorithms in rllib/agents (#13035) | 2020-12-29 18:45:55 -05:00
ars | [RLLib] Readme.md Documentation for Almost All Algorithms in rllib/agents (#13035) | 2020-12-29 18:45:55 -05:00
cql | [RLlib] New Offline RL Algorithm: CQL (based on SAC) (#13118) | 2020-12-30 10:11:57 -05:00
ddpg | [RLlib] JAXPolicy prep PR #2 (move get_activation_fn (backward-compatibly), minor fixes and preparations). (#13091) | 2020-12-30 22:30:52 -05:00
dqn | [RLLib] Readme.md Documentation for Almost All Algorithms in rllib/agents (#13035) | 2020-12-29 18:45:55 -05:00
dreamer | [RLLib] Readme.md Documentation for Almost All Algorithms in rllib/agents (#13035) | 2020-12-29 18:45:55 -05:00
es | [RLlib] JAXPolicy prep. PR #1. (#13077) | 2020-12-26 20:14:18 -05:00
impala | [RLlib] BC/MARWIL/recurrent nets minor cleanups and bug fixes. (#13064) | 2020-12-27 09:46:03 -05:00
maml | [RLlib] JAXPolicy prep PR #2 (move get_activation_fn (backward-compatibly), minor fixes and preparations). (#13091) | 2020-12-30 22:30:52 -05:00
marwil | [RLlib] BC/MARWIL/recurrent nets minor cleanups and bug fixes. (#13064) | 2020-12-27 09:46:03 -05:00
mbmpo | [RLLib] Readme.md Documentation for Almost All Algorithms in rllib/agents (#13035) | 2020-12-29 18:45:55 -05:00
pg | [RLlib] JAXPolicy prep. PR #1. (#13077) | 2020-12-26 20:14:18 -05:00
ppo | [RLlib] BC/MARWIL/recurrent nets minor cleanups and bug fixes. (#13064) | 2020-12-27 09:46:03 -05:00
qmix | [RLlib] JAXPolicy prep. PR #1. (#13077) | 2020-12-26 20:14:18 -05:00
sac | [RLlib] JAXPolicy prep PR #2 (move get_activation_fn (backward-compatibly), minor fixes and preparations). (#13091) | 2020-12-30 22:30:52 -05:00
slateq | [RLlib] JAXPolicy prep. PR #1. (#13077) | 2020-12-26 20:14:18 -05:00
__init__.py | [RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033) | 2020-10-06 20:28:16 +02:00
callbacks.py | [RLlib] Attention Nets: tf (#12753) | 2020-12-20 20:22:32 -05:00
mock.py | [tune] Use public methods for trainable (#9184) | 2020-07-01 11:00:00 -07:00
registry.py | [RLlib] New Offline RL Algorithm: CQL (based on SAC) (#13118) | 2020-12-30 10:11:57 -05:00
trainer.py | [RLlib] Trajectory view API docs. (#12718) | 2020-12-30 17:32:21 -08:00
trainer_template.py | WIP. (#12706) | 2020-12-09 11:49:21 -08:00