a3c | [RLlib] Examples folder restructuring (Model examples; final part). (#8278) | 2020-05-12 08:23:10 +02:00
ars | [RLlib] Add light-weight Trainer.compute_action() tests for all Algos. (#8356) | 2020-05-08 16:31:31 +02:00
ddpg | [RLlib] Examples folder restructuring (Model examples; final part). (#8278) | 2020-05-12 08:23:10 +02:00
dqn | [rllib] Distributed exec workflow for impala (#8321) | 2020-05-11 20:24:43 -07:00
es | [RLlib] Add light-weight Trainer.compute_action() tests for all Algos. (#8356) | 2020-05-08 16:31:31 +02:00
impala | [rllib] Distributed exec workflow for impala (#8321) | 2020-05-11 20:24:43 -07:00
marwil | [RLlib] Add light-weight Trainer.compute_action() tests for all Algos. (#8356) | 2020-05-08 16:31:31 +02:00
pg | [RLlib] Add light-weight Trainer.compute_action() tests for all Algos. (#8356) | 2020-05-08 16:31:31 +02:00
ppo | [rllib] Support free_log_std in ModelV2 (#8380) | 2020-05-12 10:14:05 -07:00
qmix | [rllib] Port QMIX, MADDPG to new execution API (#8344) | 2020-05-07 23:41:10 -07:00
sac | [RLlib] Add light-weight Trainer.compute_action() tests for all Algos. (#8356) | 2020-05-08 16:31:31 +02:00
__init__.py | [rllib] Try moving RLlib to top level dir (#5324) | 2019-08-05 23:25:49 -07:00
agent.py | Remove future imports (#6724) | 2020-01-09 00:15:48 -08:00
callbacks.py | [rllib] observation function api for multi-agent (#8236) | 2020-05-04 22:13:49 -07:00
mock.py | [RLlib] Remove tf.py_function from all Schedule classes (not differentiable and causes other bugs in MA setups). (#8304) | 2020-05-04 23:53:38 +02:00
registry.py | [rllib] Support multi-agent training in pipeline impls, add easy flag to enable (#7338) | 2020-03-02 15:16:37 -08:00
trainer.py | [rllib] Distributed exec workflow for impala (#8321) | 2020-05-11 20:24:43 -07:00
trainer_template.py | [rllib] Execute PPO using training workflow (#8206) | 2020-04-30 01:18:09 -07:00