.. |
a3c
|
[RLlib] R2D2 Implementation. (#13933)
|
2021-02-25 12:18:11 +01:00 |
ars
|
[RLLib] Readme.md Documentation for Almost All Algorithms in rllib/agents (#13035)
|
2020-12-29 18:45:55 -05:00 |
cql
|
[RLlib] CQL for HalfCheetah-Random-v0 + Hopper-Random-v0 + CQL Bug Fixes (#14243)
|
2021-02-22 17:30:18 +01:00 |
ddpg
|
[RLlib] DDPG: Support simplex action space. (#14011)
|
2021-02-10 15:10:01 +01:00 |
dqn
|
[RLlib] R2D2 Implementation. (#13933)
|
2021-02-25 12:18:11 +01:00 |
dreamer
|
[RLlib] Dreamer: Fix broken import and add compilation test case. (#13553)
|
2021-01-21 16:30:26 +01:00 |
es
|
[RLlib] JAXPolicy prep. PR #1. (#13077)
|
2020-12-26 20:14:18 -05:00 |
impala
|
Revert "[RLlib + Tune] Add placement group support to RLlib. (#14289)" (#14360)
|
2021-02-25 14:27:35 -08:00 |
maml
|
[RLlib] R2D2 Implementation. (#13933)
|
2021-02-25 12:18:11 +01:00 |
marwil
|
[RLlib] Extend on_learn_on_batch callback to allow for custom metrics to be added. (#13584)
|
2021-02-08 15:02:19 +01:00 |
mbmpo
|
[RLlib] Issue #13507: Fix MB-MPO CartPole Env's reward function as well as MB-MPO running into a traj. view API related issue. (#14037)
|
2021-02-11 18:58:46 +01:00 |
pg
|
[RLlib] JAXPolicy prep. PR #1. (#13077)
|
2020-12-26 20:14:18 -05:00 |
ppo
|
[RLlib] R2D2 Implementation. (#13933)
|
2021-02-25 12:18:11 +01:00 |
qmix
|
[RLlib] Env directory cleanup and tests. (#13082)
|
2021-01-19 10:09:39 +01:00 |
sac
|
Revert "[RLlib + Tune] Add placement group support to RLlib. (#14289)" (#14360)
|
2021-02-25 14:27:35 -08:00 |
slateq
|
[RLlib] R2D2 Implementation. (#13933)
|
2021-02-25 12:18:11 +01:00 |
__init__.py
|
[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033)
|
2020-10-06 20:28:16 +02:00 |
callbacks.py
|
[RLlib] Extend on_learn_on_batch callback to allow for custom metrics to be added. (#13584)
|
2021-02-08 15:02:19 +01:00 |
mock.py
|
[RLlib] Allow rllib rollout to run distributed via evaluation workers. (#13718)
|
2021-02-08 12:05:16 +01:00 |
registry.py
|
[RLlib] R2D2 Implementation. (#13933)
|
2021-02-25 12:18:11 +01:00 |
trainer.py
|
Revert "[RLlib + Tune] Add placement group support to RLlib. (#14289)" (#14360)
|
2021-02-25 14:27:35 -08:00 |
trainer_template.py
|
[RLlib] move evaluation to trainer.step() such that the result is properly logged (#12708)
|
2021-01-25 12:56:00 +01:00 |