ray/rllib/agents at 7ce138a6dcc38f466222f87ea4927c3e93a6e247 - hiro/ray

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

History

Eric Liang 7ce138a6dc [rllib] Support free_log_std in ModelV2 (#8380 ) * update * factor * update * fix test failures * fix torch net		2020-05-12 10:14:05 -07:00
..
a3c	[RLlib] Examples folder restructuring (Model examples; final part). (#8278 )	2020-05-12 08:23:10 +02:00
ars	[RLlib] Add light-weight `Trainer.compute_action()` tests for all Algos. (#8356 )	2020-05-08 16:31:31 +02:00
ddpg	[RLlib] Examples folder restructuring (Model examples; final part). (#8278 )	2020-05-12 08:23:10 +02:00
dqn	[rllib] Distributed exec workflow for impala (#8321 )	2020-05-11 20:24:43 -07:00
es	[RLlib] Add light-weight `Trainer.compute_action()` tests for all Algos. (#8356 )	2020-05-08 16:31:31 +02:00
impala	[rllib] Distributed exec workflow for impala (#8321 )	2020-05-11 20:24:43 -07:00
marwil	[RLlib] Add light-weight `Trainer.compute_action()` tests for all Algos. (#8356 )	2020-05-08 16:31:31 +02:00
pg	[RLlib] Add light-weight `Trainer.compute_action()` tests for all Algos. (#8356 )	2020-05-08 16:31:31 +02:00
ppo	[rllib] Support free_log_std in ModelV2 (#8380 )	2020-05-12 10:14:05 -07:00
qmix	[rllib] Port QMIX, MADDPG to new execution API (#8344 )	2020-05-07 23:41:10 -07:00
sac	[RLlib] Add light-weight `Trainer.compute_action()` tests for all Algos. (#8356 )	2020-05-08 16:31:31 +02:00
__init__.py	[rllib] Try moving RLlib to top level dir (#5324 )	2019-08-05 23:25:49 -07:00
agent.py	Remove future imports (#6724 )	2020-01-09 00:15:48 -08:00
callbacks.py	[rllib] observation function api for multi-agent (#8236 )	2020-05-04 22:13:49 -07:00
mock.py	[RLlib] Remove tf.py_function from all Schedule classes (not differentiable and causes other bugs in MA setups). (#8304 )	2020-05-04 23:53:38 +02:00
registry.py	[rllib] Support multi-agent training in pipeline impls, add easy flag to enable (#7338 )	2020-03-02 15:16:37 -08:00
trainer.py	[rllib] Distributed exec workflow for impala (#8321 )	2020-05-11 20:24:43 -07:00
trainer_template.py	[rllib] Execute PPO using training workflow (#8206 )	2020-04-30 01:18:09 -07:00