ray/rllib/agents at be9a584a9436e147849445355f297b41b9d412fd - hiro/ray

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

History

Richard Liaw a2d2275ee1 Revert "[RLlib + Tune] Add placement group support to RLlib. (#14289 )" (#14360 ) This reverts commit `6cd0cd3bd9`.		2021-02-25 14:27:35 -08:00
..
a3c	[RLlib] R2D2 Implementation. (#13933 )	2021-02-25 12:18:11 +01:00
ars	[RLLib] Readme.md Documentation for Almost All Algorithms in rllib/agents (#13035 )	2020-12-29 18:45:55 -05:00
cql	[RLlib] CQL for HalfCheetah-Random-v0 + Hopper-Random-v0 + CQL Bug Fixes (#14243 )	2021-02-22 17:30:18 +01:00
ddpg	[RLlib] DDPG: Support simplex action space. (#14011 )	2021-02-10 15:10:01 +01:00
dqn	[RLlib] R2D2 Implementation. (#13933 )	2021-02-25 12:18:11 +01:00
dreamer	[RLlib] Dreamer: Fix broken import and add compilation test case. (#13553 )	2021-01-21 16:30:26 +01:00
es	[RLlib] JAXPolicy prep. PR #1 . (#13077 )	2020-12-26 20:14:18 -05:00
impala	Revert "[RLlib + Tune] Add placement group support to RLlib. (#14289 )" (#14360 )	2021-02-25 14:27:35 -08:00
maml	[RLlib] R2D2 Implementation. (#13933 )	2021-02-25 12:18:11 +01:00
marwil	[RLlib] Extend on_learn_on_batch callback to allow for custom metrics to be added. (#13584 )	2021-02-08 15:02:19 +01:00
mbmpo	[RLlib] Issue #13507 : Fix MB-MPO CartPole Env's reward function as well as MB-MPO running into a traj. view API related issue. (#14037 )	2021-02-11 18:58:46 +01:00
pg	[RLlib] JAXPolicy prep. PR #1 . (#13077 )	2020-12-26 20:14:18 -05:00
ppo	[RLlib] R2D2 Implementation. (#13933 )	2021-02-25 12:18:11 +01:00
qmix	[RLlib] Env directory cleanup and tests. (#13082 )	2021-01-19 10:09:39 +01:00
sac	Revert "[RLlib + Tune] Add placement group support to RLlib. (#14289 )" (#14360 )	2021-02-25 14:27:35 -08:00
slateq	[RLlib] R2D2 Implementation. (#13933 )	2021-02-25 12:18:11 +01:00
__init__.py	[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033 )	2020-10-06 20:28:16 +02:00
callbacks.py	[RLlib] Extend on_learn_on_batch callback to allow for custom metrics to be added. (#13584 )	2021-02-08 15:02:19 +01:00
mock.py	[RLlib] Allow `rllib rollout` to run distributed via evaluation workers. (#13718 )	2021-02-08 12:05:16 +01:00
registry.py	[RLlib] R2D2 Implementation. (#13933 )	2021-02-25 12:18:11 +01:00
trainer.py	Revert "[RLlib + Tune] Add placement group support to RLlib. (#14289 )" (#14360 )	2021-02-25 14:27:35 -08:00
trainer_template.py	[RLlib] move evaluation to trainer.step() such that the result is properly logged (#12708 )	2021-01-25 12:56:00 +01:00