Balaji Veeramani
|
7f1bacc7dc
|
[CI] Format Python code with Black (#21975)
See #21316 and #21311 for the motivation behind these changes.
|
2022-01-29 18:41:57 -08:00 |
|
Sven Mika
|
fd13bac9b3
|
[RLlib] Add worker arg (optional) to policy_mapping_fn. (#18184)
|
2021-09-17 12:07:11 +02:00 |
|
Sven Mika
|
1520c3d147
|
[RLlib] Deepcopy env_ctx for vectorized sub-envs AND add eval-worker-option to Trainer.add_policy() (#18428)
|
2021-09-09 07:10:06 +02:00 |
|
Sven Mika
|
45f60e51a9
|
[RLlib] DDPPO fixes and benchmarks. (#18390)
|
2021-09-08 19:39:01 +02:00 |
|
Sven Mika
|
f18213712f
|
[RLlib] Redo: "fix self play example scripts" PR (17566) (#17895)
* wip.
* wip.
* wip.
* wip.
* wip.
* wip.
* wip.
* wip.
* wip.
|
2021-08-17 09:13:35 -07:00 |
|
Amog Kamsetty
|
77f28f1c30
|
Revert "[RLlib] Fix Trainer.add_policy for num_workers>0 (self play example scripts). (#17566)" (#17709)
This reverts commit 3b447265d8.
|
2021-08-10 10:50:01 -07:00 |
|
Sven Mika
|
3b447265d8
|
[RLlib] Fix Trainer.add_policy for num_workers>0 (self play example scripts). (#17566)
|
2021-08-05 11:41:18 -04:00 |
|
Sven Mika
|
7bc4376466
|
[RLlib] Example script: Simple league-based self-play w/ open spiel env (markov soccer or connect-4). (#17077)
|
2021-07-22 10:59:13 -04:00 |
|