..
a3c
[rllib] Adds eager support with a generic TFEagerPolicy class ( #5436 )
2019-08-23 14:21:11 +08:00
ars
[rllib] Autoregressive action distributions ( #5304 )
2019-08-10 14:05:12 -07:00
ddpg
[rllib] Revert [rllib] Port DDPG to the build_tf_policy pattern ( #5626 )
2019-09-04 21:39:22 -07:00
dqn
[rllib] Fix TF2 import of EagerVariableStore ( #5625 )
2019-09-07 12:10:03 -07:00
es
[rllib] Autoregressive action distributions ( #5304 )
2019-08-10 14:05:12 -07:00
impala
[rllib] Adds eager support with a generic TFEagerPolicy class ( #5436 )
2019-08-23 14:21:11 +08:00
marwil
[rllib] Autoregressive action distributions ( #5304 )
2019-08-10 14:05:12 -07:00
pg
[rllib] Adds eager support with a generic TFEagerPolicy class ( #5436 )
2019-08-23 14:21:11 +08:00
ppo
[rllib] Validate that entropy coeff is not an integer ( #5687 )
2019-09-11 14:35:42 -07:00
qmix
[rllib] Try moving RLlib to top level dir ( #5324 )
2019-08-05 23:25:49 -07:00
sac
[rllib] Revert [rllib] Port DDPG to the build_tf_policy pattern ( #5626 )
2019-09-04 21:39:22 -07:00
__init__.py
[rllib] Try moving RLlib to top level dir ( #5324 )
2019-08-05 23:25:49 -07:00
agent.py
[rllib] Try moving RLlib to top level dir ( #5324 )
2019-08-05 23:25:49 -07:00
mock.py
Ray, Tune, and RLlib support for memory, object_store_memory options ( #5226 )
2019-08-21 23:01:10 -07:00
registry.py
[rllib] Try moving RLlib to top level dir ( #5324 )
2019-08-05 23:25:49 -07:00
trainer.py
Fix impala stress test ( #5596 )
2019-08-31 01:20:53 -07:00
trainer_template.py
[rllib] Adds eager support with a generic TFEagerPolicy class ( #5436 )
2019-08-23 14:21:11 +08:00