Sven Mika
|
599e589481
|
[RLlib] Move existing fake multi-GPU learning tests into separate buildkite job. (#18065)
|
2021-08-31 14:56:53 +02:00 |
|
Sven Mika
|
a428f10ebe
|
[RLlib] Add multi-GPU learning tests to nightly. (#17778)
|
2021-08-18 17:21:01 +02:00 |
|
Sven Mika
|
53206dd440
|
[RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531)
|
2021-06-30 12:32:11 +02:00 |
|
Sven Mika
|
69202c6a7d
|
[RLlib] Obsolete usage tracking dict via sample batch. (#13065)
|
2021-03-17 08:18:15 +01:00 |
|
Sven Mika
|
ce96b03b07
|
[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033)
|
2020-10-06 20:28:16 +02:00 |
|
Sven Mika
|
4da0e542d5
|
[RLlib] DDPG and SAC eager support (preparation for tf2.x) (#9204)
|
2020-07-08 16:12:20 +02:00 |
|
Sven Mika
|
43043ee4d5
|
[RLlib] Tf2x preparation; part 2 (upgrading try_import_tf() ). (#9136)
* WIP.
* Fixes.
* LINT.
* WIP.
* WIP.
* Fixes.
* Fixes.
* Fixes.
* Fixes.
* WIP.
* Fixes.
* Test
* Fix.
* Fixes and LINT.
* Fixes and LINT.
* LINT.
|
2020-06-30 10:13:20 +02:00 |
|
Sven Mika
|
4ed796a7d6
|
[RLlib] Add testing Policy.compute_single_action() for all agents. (#8903)
|
2020-06-13 17:51:50 +02:00 |
|
Sven Mika
|
754290daad
|
[RLlib] Add light-weight Trainer.compute_action() tests for all Algos. (#8356)
|
2020-05-08 16:31:31 +02:00 |
|
Sven Mika
|
1d4823c0ec
|
[RLlib] Add testing framework_iterator. (#7852)
* Add testing framework_iterator.
* LINT.
* WIP.
* Fix and LINT.
* LINT fix.
|
2020-04-03 12:24:25 -07:00 |
|
Sven Mika
|
20ef4a8603
|
[RLlib] Cleanup/unify all test cases. (#7533)
|
2020-03-11 20:39:47 -07:00 |
|
Sven Mika
|
83e06cd30a
|
[RLlib] DDPG refactor and Exploration API action noise classes. (#7314)
* WIP.
* WIP.
* WIP.
* WIP.
* WIP.
* Fix
* WIP.
* Add TD3 quick Pendulum regresison.
* Cleanup.
* Fix.
* LINT.
* Fix.
* Sort quick_learning test cases, add TD3.
* Sort quick_learning test cases, add TD3.
* Revert test_checkpoint_restore.py (debugging) changes.
* Fix old soft_q settings in documentation and test configs.
* More doc fixes.
* Fix test case.
* Fix test case.
* Lower test load.
* WIP.
|
2020-03-01 11:53:35 -08:00 |
|