Balaji Veeramani
|
7f1bacc7dc
|
[CI] Format Python code with Black (#21975)
See #21316 and #21311 for the motivation behind these changes.
|
2022-01-29 18:41:57 -08:00 |
|
Sven Mika
|
e37afe0425
|
[RLlib; Docs] Auto API reference pages overhaul: rllib/policy and rllib/agents packages. (#20537)
|
2021-11-25 09:35:19 +01:00 |
|
Sven Mika
|
0b308719f8
|
[RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829)
|
2021-11-01 21:46:02 +01:00 |
|
Sven Mika
|
cd22a7d1bb
|
[RLlib] Add locking to PolicyMap in case it is accessed by a RolloutWorker and the same worker's AsyncSampler or the main LearnerThread. (#18444)
|
2021-09-08 23:32:23 +02:00 |
|
Sven Mika
|
f18213712f
|
[RLlib] Redo: "fix self play example scripts" PR (17566) (#17895)
* wip.
* wip.
* wip.
* wip.
* wip.
* wip.
* wip.
* wip.
* wip.
|
2021-08-17 09:13:35 -07:00 |
|
Amog Kamsetty
|
77f28f1c30
|
Revert "[RLlib] Fix Trainer.add_policy for num_workers>0 (self play example scripts). (#17566)" (#17709)
This reverts commit 3b447265d8 .
|
2021-08-10 10:50:01 -07:00 |
|
Sven Mika
|
3b447265d8
|
[RLlib] Fix Trainer.add_policy for num_workers>0 (self play example scripts). (#17566)
|
2021-08-05 11:41:18 -04:00 |
|
Sven Mika
|
18d173b172
|
[RLlib] Implement policy_maps (multi-agent case) in RolloutWorkers as LRU caches. (#17031)
|
2021-07-19 13:16:03 -04:00 |
|