Sven Mika
|
7c39aa5fac
|
[RLlib] Trainer.training_iteration -> Trainer.training_step; Iterations vs reportings: Clarification of terms. (#25076)
|
2022-06-10 17:09:18 +02:00 |
|
Artur Niederfahrenhorst
|
fb2915d26a
|
[RLlib] Replay Buffer API and Ape-X. (#24506)
|
2022-05-17 13:43:49 +02:00 |
|
Sven Mika
|
f066180ed5
|
[RLlib] Deprecate timesteps_per_iteration config key (in favor of min_[sample|train]_timesteps_per_reporting). (#24372)
|
2022-05-02 12:51:14 +02:00 |
|
Artur Niederfahrenhorst
|
9a64bd4e9b
|
[RLlib] Simple-Q uses training iteration fn (instead of execution_plan); ReplayBuffer API for Simple-Q (#22842)
|
2022-03-29 14:44:40 +02:00 |
|
Balaji Veeramani
|
7f1bacc7dc
|
[CI] Format Python code with Black (#21975)
See #21316 and #21311 for the motivation behind these changes.
|
2022-01-29 18:41:57 -08:00 |
|
Sven Mika
|
c01245763e
|
[RLlib] Revert "Revert "updated pettingzoo wrappers, env versions, urls"" (#21339)
|
2022-01-04 18:30:26 +01:00 |
|
Kai Fricke
|
489e6945a6
|
Revert "[RLlib] Updated pettingzoo wrappers, env versions, urls (#20113)" (#21338)
This reverts commit 327eb84154.
|
2022-01-03 10:21:25 +00:00 |
|
Benjamin Black
|
327eb84154
|
[RLlib] Updated pettingzoo wrappers, env versions, urls (#20113)
|
2022-01-02 21:29:09 +01:00 |
|
Sven Mika
|
649580d735
|
[RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). (#17046)
|
2021-07-15 05:51:24 -04:00 |
|
Amog Kamsetty
|
38b5b6d24c
|
Revert "[RLlib] Simplify multiagent config (automatically infer class/spaces/config). (#16565)" (#17036)
This reverts commit e4123fff27.
|
2021-07-13 09:57:15 -07:00 |
|
Sven Mika
|
e4123fff27
|
[RLlib] Simplify multiagent config (automatically infer class/spaces/config). (#16565)
|
2021-07-13 06:38:14 -04:00 |
|
Sven Mika
|
be6db06485
|
[RLlib] Re-do: Trainer: Support add and delete Policies. (#16569)
|
2021-06-21 13:46:01 +02:00 |
|
Amog Kamsetty
|
bd3cbfc56a
|
Revert "[RLlib] Allow policies to be added/deleted on the fly. (#16359)" (#16543)
This reverts commit e78ec370a9.
|
2021-06-18 12:21:49 -07:00 |
|
Sven Mika
|
e78ec370a9
|
[RLlib] Allow policies to be added/deleted on the fly. (#16359)
|
2021-06-18 10:31:30 +02:00 |
|
Sven Mika
|
e74947cc94
|
[RLlib] Env directory cleanup and tests. (#13082)
|
2021-01-19 10:09:39 +01:00 |
|
Sven Mika
|
28ab797cf5
|
[RLlib] Deprecate old classes, methods, functions, config keys (in prep for RLlib 1.0). (#10544)
|
2020-09-06 10:58:00 +02:00 |
|
Justin Terry
|
352718610d
|
Multi-agent Algorithm Documentation Updates (#9722)
|
2020-09-03 22:37:46 -07:00 |
|