ray/rllib/agents at 6c2b9a4cfac5f6b971b79566d4475e0e7d7ad82a - hiro/ray

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

History

Sven Mika 6c2b9a4cfa [RLlib] Remove tf.py_function from all Schedule classes (not differentiable and causes other bugs in MA setups). (#8304 ) Remove tf.py_function from all Schedule classes (not differentiable and causes other bugs in MA setups). (#8304)		2020-05-04 23:53:38 +02:00
..
a3c	[RLlib] SAC Torch (incl. Atari learning) (#7984 )	2020-04-15 13:25:16 +02:00
ars	[RLlib] Deprecate all Model(v1) usage. (#8146 )	2020-04-29 12:12:59 +02:00
ddpg	[RLlib] Fix issue 8135 (DDPG inf actions when using [-inf,inf] action space). (#8302 )	2020-05-04 22:27:30 +02:00
dqn	[RLlib] APEX_DDPG (PyTorch) test case and docs. (#8288 )	2020-05-04 09:36:27 +02:00
es	[RLlib] Deprecate all Model(v1) usage. (#8146 )	2020-04-29 12:12:59 +02:00
impala	[RLlib] IMPALA PyTorch (#8287 )	2020-05-03 13:44:25 +02:00
marwil	[RLlib] MARWIL torch. (#7836 )	2020-04-06 16:38:50 -07:00
pg	[RLlib] SAC Torch (incl. Atari learning) (#7984 )	2020-04-15 13:25:16 +02:00
ppo	[RLlib] IMPALA PyTorch (#8287 )	2020-05-03 13:44:25 +02:00
qmix	[RLlib] rllib/examples folder restructuring (#8250 )	2020-05-01 22:59:34 +02:00
sac	[RLlib] Beta distribution. (#8229 )	2020-04-30 11:09:33 -07:00
__init__.py	[rllib] Try moving RLlib to top level dir (#5324 )	2019-08-05 23:25:49 -07:00
agent.py	Remove future imports (#6724 )	2020-01-09 00:15:48 -08:00
callbacks.py	[RLlib] Added DefaultCallbacks which replaces old callbacks dict interface (#6972 )	2020-04-16 16:06:42 -07:00
mock.py	[RLlib] Remove tf.py_function from all Schedule classes (not differentiable and causes other bugs in MA setups). (#8304 )	2020-05-04 23:53:38 +02:00
registry.py	[rllib] Support multi-agent training in pipeline impls, add easy flag to enable (#7338 )	2020-03-02 15:16:37 -08:00
trainer.py	[rllib] Execute PPO using training workflow (#8206 )	2020-04-30 01:18:09 -07:00
trainer_template.py	[rllib] Execute PPO using training workflow (#8206 )	2020-04-30 01:18:09 -07:00