ray/rllib/agents at 6475297bd3774fa1bdad5233164d2aa65e2c86e5 - hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

Sven Mika 6475297bd3 [RLlib] Torch LR schedule not working. Fix and added test case. (#12396)	2020-11-26 13:14:11 +01:00
a3c	[RLlib] Trajectory view API: Enable by default for PPO, IMPALA, PG, A3C (tf and torch). (#11747)	2020-11-12 16:27:34 +01:00
ars	[RLlib] Trajectory view API: enable by default for ES and ARS (#11826)	2020-11-12 10:33:10 -08:00
ddpg	[RLlib] Trajectory view API: enable by default for SAC, DDPG, DQN, SimpleQ (#11827)	2020-11-16 10:54:35 -08:00
dqn	[RLlib] Fix inconsistency wrt batch size in SampleCollector (traj. view API). Makes DD-PPO work with traj. view API. (#12063)	2020-11-19 19:01:14 +01:00
dreamer	[rllib] Forgot to pass ioctx to child json readers (#11839)	2020-11-05 22:07:57 -08:00
es	[RLlib] Trajectory view API: enable by default for ES and ARS (#11826)	2020-11-12 10:33:10 -08:00
impala	[RLlib] Issue 12118: LSTM prev-a/r should be separately configurable. Fix missing prev-a one-hot encoding. (#12397)	2020-11-25 11:27:46 -08:00
maml	[RLLib] MAML extension for all models except RNNs (#11337)	2020-11-12 16:51:40 -08:00
marwil	[RLlib] Fix test_bc.py test case. (#11722)	2020-10-31 00:16:09 -07:00
mbmpo	MBMPO Cartpole (#11832)	2020-11-12 10:30:41 -08:00
pg	[RLlib] Trajectory view API: Enable by default for PPO, IMPALA, PG, A3C (tf and torch). (#11747)	2020-11-12 16:27:34 +01:00
ppo	[RLlib] Torch LR schedule not working. Fix and added test case. (#12396)	2020-11-26 13:14:11 +01:00
qmix	[RLlib] Minor fixes (torch GPU bugs + some cleanup). (#11609)	2020-10-27 10:00:24 +01:00
sac	[RLlib] Issue 11591: SAC loss does not use PR-weights in critic loss term. (#12394)	2020-11-25 11:28:46 -08:00
slateq	[RLlib] Implement the SlateQ algorithm (#11450)	2020-11-03 09:52:04 +01:00
__init__.py	[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033)	2020-10-06 20:28:16 +02:00
callbacks.py	[RLlib] Fix inconsistency wrt batch size in SampleCollector (traj. view API). Makes DD-PPO work with traj. view API. (#12063)	2020-11-19 19:01:14 +01:00
mock.py	[tune] Use public methods for trainable (#9184)	2020-07-01 11:00:00 -07:00
registry.py	[RLlib] Implement the SlateQ algorithm (#11450)	2020-11-03 09:52:04 +01:00
trainer.py	[RLlib] Issue 12118: LSTM prev-a/r should be separately configurable. Fix missing prev-a one-hot encoding. (#12397)	2020-11-25 11:27:46 -08:00
trainer_template.py	[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033)	2020-10-06 20:28:16 +02:00