Balaji Veeramani
|
7f1bacc7dc
|
[CI] Format Python code with Black (#21975)
See #21316 and #21311 for the motivation behind these changes.
|
2022-01-29 18:41:57 -08:00 |
|
Sven Mika
|
d5bfb7b7da
|
[RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652)
|
2022-01-25 14:16:58 +01:00 |
|
Sven Mika
|
828f5d26b7
|
[RLlib] Custom view requirements (e.g. for prev-n-obs) work with compute_single_action and compute_actions_from_input_dict . (#18921)
|
2021-09-30 15:03:37 +02:00 |
|
Sven Mika
|
0c0f67c14d
|
[RLlib] ARS/ES eval workers not working: Issue 9933. (#11308)
|
2020-10-12 13:49:48 -07:00 |
|
Sven Mika
|
c4ccbfdfa9
|
[RLlib] tf-eager support for ES and ARS (tf2.x preparation). (#9207)
|
2020-07-02 13:03:10 +02:00 |
|
Sven Mika
|
4ed796a7d6
|
[RLlib] Add testing Policy.compute_single_action() for all agents. (#8903)
|
2020-06-13 17:51:50 +02:00 |
|
Sven Mika
|
754290daad
|
[RLlib] Add light-weight Trainer.compute_action() tests for all Algos. (#8356)
|
2020-05-08 16:31:31 +02:00 |
|