Sven Mika
|
e2edca45d4
|
[RLlib] PPO torch memory leak and unnecessary torch.Tensor creation and gc'ing. (#7238)
* Take out stats to analyze memory leak in torch PPO.
* WIP
* WIP
* WIP
* WIP
* WIP
* WIP.
* WIP.
* WIP.
* WIP.
* WIP.
* WIP.
* WIP.
* WIP.
* WIP.
* WIP.
* WIP.
* WIP.
* WIP.
* WIP.
* WIP.
* WIP.
* WIP.
* WIP.
* WIP.
* WIP.
* WIP.
* LINT.
* Fix determine_tests_to_run.py.
* minor change to re-test after determine_tests_to_run.py.
* LINT.
* update comments.
* WIP
* WIP
* WIP
* FIX.
* Fix sequence_mask being dependent on torch being installed.
* Fix strange ray-core tf-error in test_memory_scheduling test case.
* Fix strange ray-core tf-error in test_memory_scheduling test case.
* Fix strange ray-core tf-error in test_memory_scheduling test case.
* Fix strange ray-core tf-error in test_memory_scheduling test case.
|
2020-02-22 11:02:31 -08:00 |
|
Eric Liang
|
46af992efd
|
[rllib] [experimental] custom RL training pipelines (PG_pl, A2C_pl) (#7213)
|
2020-02-19 16:07:37 -08:00 |
|
Sven Mika
|
d537e9f0d8
|
[RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155)
|
2020-02-19 12:18:45 -08:00 |
|
Sven Mika
|
2e60f0d4d8
|
[RLlib] Move all jenkins RLlib-tests into bazel (rllib/BUILD). (#7178)
* commit
* comment
|
2020-02-15 14:50:44 -08:00 |
|
Adrian O'Grady
|
fe6ce714a0
|
[rllib] - TaskPool.completed_prefetch() no longer returns stale object ids after an error (#7139)
|
2020-02-13 22:30:44 -08:00 |
|
Sven Mika
|
4c97348cb6
|
[RLlib] Schedule-classes multi-framework support. (#6926)
|
2020-01-28 11:07:55 -08:00 |
|
Sven Mika
|
ae9a3a2237
|
[RLlib] from_config util method for framework agnostic components; start moving RLlib tests into Bazel. (#6865)
|
2020-01-22 17:02:58 -08:00 |
|