ray/rllib/agents at 2b2eb4debb381f9261a615125399c66c9413bbfb - hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

Ashwinee Panda 946ebfaa3c [rllib] Validate that entropy coeff is not an integer (#5687)	2019-09-11 14:35:42 -07:00
* Validate that entropy coeff is not an integer
  Passing an integer value for entropy coeff such as 0 raises an error somewhere inside the TF policy graph, so this checks to make sure the entropy coeff is a float.
* Cast to float instead
  Also move this check after the negative value check
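
The behavior described in the commit message above can be sketched roughly as follows. This is a minimal illustration, not the code from the commit: the function name `validate_entropy_coeff`, the plain-dict config, and the choice of `ValueError` are assumptions made for the example. Only the `entropy_coeff` key, the negative-value check, and the cast to float come from the message.

```python
def validate_entropy_coeff(config):
    """Hypothetical helper: normalize the entropy coefficient in a config dict.

    Mirrors the order described in the commit message: reject negative
    values first, then cast an integer value (such as 0) to float.
    """
    coeff = config["entropy_coeff"]

    # Negative-value check comes first (exception type is an assumption).
    if coeff < 0:
        raise ValueError("entropy_coeff must be >= 0")

    # Cast instead of rejecting: an int such as 0 becomes 0.0, so code
    # further down that expects a float does not fail.
    if isinstance(coeff, int):
        config["entropy_coeff"] = float(coeff)

    return config


config = validate_entropy_coeff({"entropy_coeff": 0})
print(type(config["entropy_coeff"]).__name__)  # prints "float"
```

Casting rather than raising keeps a config such as `{"entropy_coeff": 0}` usable, which appears to be the intent of the second bullet in the message.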
a3c	[rllib] Adds eager support with a generic `TFEagerPolicy` class (#5436)	2019-08-23 14:21:11 +08:00
ars	[rllib] Autoregressive action distributions (#5304)	2019-08-10 14:05:12 -07:00
ddpg	[rllib] Revert [rllib] Port DDPG to the build_tf_policy pattern (#5626)	2019-09-04 21:39:22 -07:00
dqn	[rllib] Fix TF2 import of EagerVariableStore (#5625)	2019-09-07 12:10:03 -07:00
es	[rllib] Autoregressive action distributions (#5304)	2019-08-10 14:05:12 -07:00
impala	[rllib] Adds eager support with a generic `TFEagerPolicy` class (#5436)	2019-08-23 14:21:11 +08:00
marwil	[rllib] Autoregressive action distributions (#5304)	2019-08-10 14:05:12 -07:00
pg	[rllib] Adds eager support with a generic `TFEagerPolicy` class (#5436)	2019-08-23 14:21:11 +08:00
ppo	[rllib] Validate that entropy coeff is not an integer (#5687)	2019-09-11 14:35:42 -07:00
qmix	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
sac	[rllib] Revert [rllib] Port DDPG to the build_tf_policy pattern (#5626)	2019-09-04 21:39:22 -07:00
__init__.py	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
agent.py	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
mock.py	Ray, Tune, and RLlib support for memory, object_store_memory options (#5226)	2019-08-21 23:01:10 -07:00
registry.py	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
trainer.py	Fix impala stress test (#5596)	2019-08-31 01:20:53 -07:00
trainer_template.py	[rllib] Adds eager support with a generic `TFEagerPolicy` class (#5436)	2019-08-23 14:21:11 +08:00