Sven Mika
|
f3bbe4ea44
|
[RLlib] Test cases/BUILD cleanup; split "everything else" (longest running one rn) tests in 2. (#17640)
|
2021-08-16 22:01:01 +02:00 |
|
Sven Mika
|
0bd69edd71
|
[RLlib] Trajectory view API: enable by default for ES and ARS (#11826)
|
2020-11-12 10:33:10 -08:00 |
|
Sven Mika
|
0c0f67c14d
|
[RLlib] ARS/ES eval workers not working: Issue 9933. (#11308)
|
2020-10-12 13:49:48 -07:00 |
|
Eric Liang
|
5acd3e66dd
|
[rllib] Fix torch TD error, IMPALA LR updates (#9477)
* update
* add test
* lint
* fix super call
* speed es test up
|
2020-07-23 12:50:25 -07:00 |
|
Sven Mika
|
c4ccbfdfa9
|
[RLlib] tf-eager support for ES and ARS (tf2.x preparation). (#9207)
|
2020-07-02 13:03:10 +02:00 |
|
Sven Mika
|
4ed796a7d6
|
[RLlib] Add testing Policy.compute_single_action() for all agents. (#8903)
|
2020-06-13 17:51:50 +02:00 |
|
Sven Mika
|
2746fc0476
|
[RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520)
|
2020-05-27 16:19:13 +02:00 |
|
Sven Mika
|
754290daad
|
[RLlib] Add light-weight Trainer.compute_action() tests for all Algos. (#8356)
|
2020-05-08 16:31:31 +02:00 |
|
Sven Mika
|
3812bfedda
|
[RLlib] PyTorch version of ES (Evolution Strategies). (#8104)
PyTorch version of Evolution Strategies (ES) Algo.
|
2020-04-20 21:47:28 +02:00 |
|