ray/rllib/agents at bc120730e58cf933e2b4ca0837a7cda2a38c65a1 - hiro/ray

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-09 04:46:38 -04:00

History

Eric Liang be48e1964b [rllib] Fix per-worker exploration in Ape-X; make more kwargs required for future safety (#7504 ) * fix sched * lintc * lint * fix * add unit test * fix * format * fix test * fix test		2020-03-10 11:14:14 -07:00
..
a3c	[rllib] Support multi-agent training in pipeline impls, add easy flag to enable (#7338 )	2020-03-02 15:16:37 -08:00
ars	[RLlib] Issue 7136: rollout not working for ES and ARS. (#7444 )	2020-03-04 23:57:44 -08:00
ddpg	[rllib] Make timestep a required arg for exploration classes (#7380 )	2020-03-04 13:00:37 -08:00
dqn	[rllib] Fix per-worker exploration in Ape-X; make more kwargs required for future safety (#7504 )	2020-03-10 11:14:14 -07:00
es	[RLlib] Issue 7136: rollout not working for ES and ARS. (#7444 )	2020-03-04 23:57:44 -08:00
impala	[RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155 )	2020-02-19 12:18:45 -08:00
marwil	[rllib] implemented compute_advantages without gae (#6941 )	2020-01-31 22:25:45 -08:00
pg	[rllib] Support multi-agent training in pipeline impls, add easy flag to enable (#7338 )	2020-03-02 15:16:37 -08:00
ppo	Fix issue with torch PPO not handling action spaces of shape=(>1,). (#7398 )	2020-03-02 10:53:19 -08:00
qmix	[RLlib] Policy.compute_log_likelihoods() and SAC refactor. (issue #7107 ) (#7124 )	2020-02-22 14:19:49 -08:00
sac	[RLlib] SAC add discrete action support. (#7320 )	2020-03-06 10:37:12 -08:00
__init__.py	[rllib] Try moving RLlib to top level dir (#5324 )	2019-08-05 23:25:49 -07:00
agent.py	Remove future imports (#6724 )	2020-01-09 00:15:48 -08:00
mock.py	Remove future imports (#6724 )	2020-01-09 00:15:48 -08:00
registry.py	[rllib] Support multi-agent training in pipeline impls, add easy flag to enable (#7338 )	2020-03-02 15:16:37 -08:00
trainer.py	[RLlib] Make rollout always use `evaluation_config`. (#7396 )	2020-03-03 17:20:35 -08:00
trainer_template.py	[rllib] First pass at pipeline implementation of DQN (#7433 )	2020-03-07 14:47:58 -08:00