ray/rllib/agents
maxco2 b8436f0f00
[rllib] Fix SAC and DDPG tensorflow policy can't do grad_clip (#10499)
* Fix sac_tf_policy clip_by_norm missing argument

* Fix ddpg_tf_policy clip_by_norm missing argument

* Fix format
2020-09-11 12:04:44 -07:00
a3c [RLlib] Curiosity enhancements. (#10373) 2020-09-05 13:14:24 +02:00
ars [RLlib] Deprecate old classes, methods, functions, config keys (in prep for RLlib 1.0). (#10544) 2020-09-06 10:58:00 +02:00
ddpg [rllib] Fix SAC and DDPG tensorflow policy can't do grad_clip (#10499) 2020-09-11 12:04:44 -07:00
dqn [RLlib] Add type annotations for agents/dqn (#10626) 2020-09-09 18:55:26 +02:00
dreamer [RLlib] Dreamer (#10172) 2020-08-26 13:24:05 +02:00
es [rllib] Fix torch TD error, IMPALA LR updates (#9477) 2020-07-23 12:50:25 -07:00
impala [RLlib] PPO, APPO, and DD-PPO code cleanup. (#10420) 2020-09-02 14:03:01 +02:00
maml [RLLib] MBMPO Fixes (#10296) 2020-09-09 09:34:34 +02:00
marwil [RLLib] Add missing .to() for MARWIL on PyTorch (#10685) 2020-09-09 18:52:55 -07:00
mbmpo [RLLib] MBMPO Fixes (#10296) 2020-09-09 09:34:34 +02:00
pg [RLlib] PPO, APPO, and DD-PPO code cleanup. (#10420) 2020-09-02 14:03:01 +02:00
ppo [RLlib] PPO, APPO, and DD-PPO code cleanup. (#10420) 2020-09-02 14:03:01 +02:00
qmix [RLlib] Deprecate old classes, methods, functions, config keys (in prep for RLlib 1.0). (#10544) 2020-09-06 10:58:00 +02:00
sac [rllib] Fix SAC and DDPG tensorflow policy can't do grad_clip (#10499) 2020-09-11 12:04:44 -07:00
__init__.py [RLlib] Deprecate old classes, methods, functions, config keys (in prep for RLlib 1.0). (#10544) 2020-09-06 10:58:00 +02:00
callbacks.py [RLlib] Issue 10469: Callbacks should receive env idx ... (#10477) 2020-09-03 17:27:05 +02:00
mock.py [tune] Use public methods for trainable (#9184) 2020-07-01 11:00:00 -07:00
registry.py [RLlib] Behavioral Cloning (from MARWIL). (#10619) 2020-09-09 17:33:21 +02:00
trainer.py [RLlib] Behavioral Cloning (from MARWIL). (#10619) 2020-09-09 17:33:21 +02:00
trainer_template.py [RLlib] Behavioral Cloning (from MARWIL). (#10619) 2020-09-09 17:33:21 +02:00