ray/rllib/agents at ec2c10309bb3676f5a77f817c1d8a38037b6ae6d - hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

Michael Luo ec2c10309b [RLlib] CQL for HalfCheetah-Random-v0 + Hopper-Random-v0 + CQL Bug Fixes (#14243)		2021-02-22 17:30:18 +01:00
a3c	[RLlib] Issue 9071 A3C w/ RNN not working due to VF assuming no RNN. (#13238)	2021-01-19 14:22:36 +01:00
ars	[RLLib] Readme.md Documentation for Almost All Algorithms in rllib/agents (#13035)	2020-12-29 18:45:55 -05:00
cql	[RLlib] CQL for HalfCheetah-Random-v0 + Hopper-Random-v0 + CQL Bug Fixes (#14243)	2021-02-22 17:30:18 +01:00
ddpg	[RLlib] DDPG: Support simplex action space. (#14011)	2021-02-10 15:10:01 +01:00
dqn	[RLlib] Issue #13824: `compress_observations=True` crashes for all algos not using a replay buffer. (#14034)	2021-02-18 21:36:32 +01:00
dreamer	[RLlib] Dreamer: Fix broken import and add compilation test case. (#13553)	2021-01-21 16:30:26 +01:00
es	[RLlib] JAXPolicy prep. PR #1. (#13077)	2020-12-26 20:14:18 -05:00
impala	[RLlib] Issue 9071 A3C w/ RNN not working due to VF assuming no RNN. (#13238)	2021-01-19 14:22:36 +01:00
maml	[RLlib] Pytorch MAML fix for more than two workers with discrete actions (#13835)	2021-02-08 12:06:02 +01:00
marwil	[RLlib] Extend on_learn_on_batch callback to allow for custom metrics to be added. (#13584)	2021-02-08 15:02:19 +01:00
mbmpo	[RLlib] Issue #13507: Fix MB-MPO CartPole Env's reward function as well as MB-MPO running into a traj. view API related issue. (#14037)	2021-02-11 18:58:46 +01:00
pg	[RLlib] JAXPolicy prep. PR #1. (#13077)	2020-12-26 20:14:18 -05:00
ppo	[RLlib] Issue #13824: `compress_observations=True` crashes for all algos not using a replay buffer. (#14034)	2021-02-18 21:36:32 +01:00
qmix	[RLlib] Env directory cleanup and tests. (#13082)	2021-01-19 10:09:39 +01:00
sac	[RLlib] Issue #13342: Add `validate_spaces` to MB-MPO. (#14038)	2021-02-11 11:36:53 +01:00
slateq	[RLlib] JAXPolicy prep. PR #1. (#13077)	2020-12-26 20:14:18 -05:00
__init__.py	[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033)	2020-10-06 20:28:16 +02:00
callbacks.py	[RLlib] Extend on_learn_on_batch callback to allow for custom metrics to be added. (#13584)	2021-02-08 15:02:19 +01:00
mock.py	[RLlib] Allow `rllib rollout` to run distributed via evaluation workers. (#13718)	2021-02-08 12:05:16 +01:00
registry.py	[RLlib] Allow `rllib rollout` to run distributed via evaluation workers. (#13718)	2021-02-08 12:05:16 +01:00
trainer.py	[RLlib] Allow `rllib rollout` to run distributed via evaluation workers. (#13718)	2021-02-08 12:05:16 +01:00
trainer_template.py	[RLlib] move evaluation to trainer.step() such that the result is properly logged (#12708)	2021-01-25 12:56:00 +01:00