ray/rllib/agents at 1d01c668f0ed06d04585afe622649f50a78d693e - hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-08 19:41:38 -05:00

Sven Mika 57690a3a9f [RLlib] Trajectory view API - 02 actual API scaffold (#9753)	2020-08-06 10:54:20 +02:00
a3c	[LINT] Except RLlib from checking for flake8 error F821 (#9946)	2020-08-06 10:44:37 +02:00
ars	[RLlib] tf-eager support for ES and ARS (tf2.x preparation). (#9207)	2020-07-02 13:03:10 +02:00
ddpg	[RLlib] Issue 9667 DDPG Torch bugs and enhancements. (#9680)	2020-07-28 14:15:03 +02:00
dqn	[RLlib] Implement DQN PyTorch distributional head. (#9589)	2020-07-25 09:29:24 +02:00
dyna	[RLlib] Minor cleanup in preparation to tf2.x support. (#9130)	2020-06-25 19:01:32 +02:00
es	[rllib] Fix torch TD error, IMPALA LR updates (#9477)	2020-07-23 12:50:25 -07:00
impala	[rllib] Fix torch TD error, IMPALA LR updates (#9477)	2020-07-23 12:50:25 -07:00
maml	[RLlib] Implementation of "Model-based Meta Policy Optimization" (MB MPO) (#9409)	2020-08-02 18:12:09 +02:00
marwil	[RLlib] Issue 9402 MARWIL producing nan rewards. (#9429)	2020-07-14 05:07:16 +02:00
mbmpo	[RLlib] Implementation of "Model-based Meta Policy Optimization" (MB MPO) (#9409)	2020-08-02 18:12:09 +02:00
pg	[RLlib] Tf2.x native. (#8752)	2020-07-11 22:06:35 +02:00
ppo	[RLlib] Trajectory view API - 02 actual API scaffold (#9753)	2020-08-06 10:54:20 +02:00
qmix	[RLlib] Issue 8384: QMIX doesn't learn anything. (#9527)	2020-07-17 12:14:34 +02:00
sac	[RLlib] Tf2.x native. (#8752)	2020-07-11 22:06:35 +02:00
__init__.py	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
agent.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
callbacks.py	[rllib] Add type annotations for evaluation/, env/ packages (#9003)	2020-06-19 13:09:05 -07:00
mock.py	[tune] Use public methods for trainable (#9184)	2020-07-01 11:00:00 -07:00
registry.py	[RLlib] Implementation of "Model-based Meta Policy Optimization" (MB MPO) (#9409)	2020-08-02 18:12:09 +02:00
trainer.py	[RLlib] Trajectory View API (preparatory cleanup and enhancements). (#9678)	2020-07-29 21:15:09 +02:00
trainer_template.py	[rllib] Remove deprecated policy optimizer package. (#9262)	2020-07-02 14:39:40 -07:00