.. |
a3c
|
[RLlib] Issue 9071 A3C w/ RNN not working due to VF assuming no RNN. (#13238)
|
2021-01-19 14:22:36 +01:00 |
ars
|
[RLLib] Readme.md Documentation for Almost All Algorithms in rllib/agents (#13035)
|
2020-12-29 18:45:55 -05:00 |
cql
|
[RLlib] CQL for HalfCheetah-Random-v0 + Hopper-Random-v0 + CQL Bug Fixes (#14243)
|
2021-02-22 17:30:18 +01:00 |
ddpg
|
[RLlib] DDPG: Support simplex action space. (#14011)
|
2021-02-10 15:10:01 +01:00 |
dqn
|
[RLlib] Issue #13824: compress_observations=True crashes for all algos not using a replay buffer. (#14034)
|
2021-02-18 21:36:32 +01:00 |
dreamer
|
[RLlib] Dreamer: Fix broken import and add compilation test case. (#13553)
|
2021-01-21 16:30:26 +01:00 |
es
|
[RLlib] JAXPolicy prep. PR #1. (#13077)
|
2020-12-26 20:14:18 -05:00 |
impala
|
[RLlib] Issue 9071 A3C w/ RNN not working due to VF assuming no RNN. (#13238)
|
2021-01-19 14:22:36 +01:00 |
maml
|
[RLlib] Pytorch MAML fix for more than two workers with discrete actions (#13835)
|
2021-02-08 12:06:02 +01:00 |
marwil
|
[RLlib] Extend on_learn_on_batch callback to allow for custom metrics to be added. (#13584)
|
2021-02-08 15:02:19 +01:00 |
mbmpo
|
[RLlib] Issue #13507: Fix MB-MPO CartPole Env's reward function as well as MB-MPO running into a traj. view API related issue. (#14037)
|
2021-02-11 18:58:46 +01:00 |
pg
|
[RLlib] JAXPolicy prep. PR #1. (#13077)
|
2020-12-26 20:14:18 -05:00 |
ppo
|
[RLlib] Issue #13824: compress_observations=True crashes for all algos not using a replay buffer. (#14034)
|
2021-02-18 21:36:32 +01:00 |
qmix
|
[RLlib] Env directory cleanup and tests. (#13082)
|
2021-01-19 10:09:39 +01:00 |
sac
|
[RLlib] Issue #13342: Add validate_spaces to MB-MPO. (#14038)
|
2021-02-11 11:36:53 +01:00 |
slateq
|
[RLlib] JAXPolicy prep. PR #1. (#13077)
|
2020-12-26 20:14:18 -05:00 |
__init__.py
|
[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033)
|
2020-10-06 20:28:16 +02:00 |
callbacks.py
|
[RLlib] Extend on_learn_on_batch callback to allow for custom metrics to be added. (#13584)
|
2021-02-08 15:02:19 +01:00 |
mock.py
|
[RLlib] Allow rllib rollout to run distributed via evaluation workers. (#13718)
|
2021-02-08 12:05:16 +01:00 |
registry.py
|
[RLlib] Allow rllib rollout to run distributed via evaluation workers. (#13718)
|
2021-02-08 12:05:16 +01:00 |
trainer.py
|
[RLlib] Allow rllib rollout to run distributed via evaluation workers. (#13718)
|
2021-02-08 12:05:16 +01:00 |
trainer_template.py
|
[RLlib] move evaluation to trainer.step() such that the result is properly logged (#12708)
|
2021-01-25 12:56:00 +01:00 |