ray/rllib/agents at 06c3518aa1701ffc68570ba5e31143816190142c - hiro/ray

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 02:21:39 -05:00

History

Eric Liang 5acd3e66dd [rllib] Fix torch TD error, IMPALA LR updates (#9477 ) * update * add test * lint * fix super call * speed es test up		2020-07-23 12:50:25 -07:00
..
a3c	[RLlib] Tf2.x native. (#8752 )	2020-07-11 22:06:35 +02:00
ars	[RLlib] tf-eager support for ES and ARS (tf2.x preparation). (#9207 )	2020-07-02 13:03:10 +02:00
ddpg	[RLlib] Tf2.x native. (#8752 )	2020-07-11 22:06:35 +02:00
dqn	[rllib] Fix torch TD error, IMPALA LR updates (#9477 )	2020-07-23 12:50:25 -07:00
dyna	[RLlib] Minor cleanup in preparation to tf2.x support. (#9130 )	2020-06-25 19:01:32 +02:00
es	[rllib] Fix torch TD error, IMPALA LR updates (#9477 )	2020-07-23 12:50:25 -07:00
impala	[rllib] Fix torch TD error, IMPALA LR updates (#9477 )	2020-07-23 12:50:25 -07:00
maml	[rllib] MAML Transform (#9463 )	2020-07-16 11:11:33 -07:00
marwil	[RLlib] Issue 9402 MARWIL producing nan rewards. (#9429 )	2020-07-14 05:07:16 +02:00
pg	[RLlib] Tf2.x native. (#8752 )	2020-07-11 22:06:35 +02:00
ppo	[RLlib] Issue #9437 (PyTorch converts to CPU tensor, even if on GPU). (#9497 )	2020-07-16 14:55:50 +02:00
qmix	[RLlib] Issue 8384: QMIX doesn't learn anything. (#9527 )	2020-07-17 12:14:34 +02:00
sac	[RLlib] Tf2.x native. (#8752 )	2020-07-11 22:06:35 +02:00
__init__.py	[rllib] Try moving RLlib to top level dir (#5324 )	2019-08-05 23:25:49 -07:00
agent.py	Remove future imports (#6724 )	2020-01-09 00:15:48 -08:00
callbacks.py	[rllib] Add type annotations for evaluation/, env/ packages (#9003 )	2020-06-19 13:09:05 -07:00
mock.py	[tune] Use public methods for trainable (#9184 )	2020-07-01 11:00:00 -07:00
registry.py	[rllib] MAML Agent (#8862 )	2020-06-23 09:48:23 -07:00
trainer.py	[RLlib] Layout of Trajectory View API (new class: Trajectory; not used yet). (#9269 )	2020-07-14 04:27:49 +02:00
trainer_template.py	[rllib] Remove deprecated policy optimizer package. (#9262 )	2020-07-02 14:39:40 -07:00