Sven Mika
|
59a967a3a0
|
[RLlib] Cleanup some deprecated metric keys and classes. (#26036)
|
2022-06-23 21:30:01 +02:00 |
|
Sven Mika
|
130b7eeaba
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
|
Eric Liang
|
905258dbc1
|
Clean up docstyle in python modules and add LINT rule (#25272)
|
2022-06-01 11:27:54 -07:00 |
|
Artur Niederfahrenhorst
|
fb2915d26a
|
[RLlib] Replay Buffer API and Ape-X. (#24506)
|
2022-05-17 13:43:49 +02:00 |
|
Sven Mika
|
25001f6d8d
|
[RLlib] APPO Training iteration fn. (#24545)
|
2022-05-17 10:31:07 +02:00 |
|
Avnish Narayan
|
f2bb6f6806
|
[RLlib] Impala training iteration fn (#23454)
|
2022-05-05 16:11:08 +02:00 |
|
Balaji Veeramani
|
7f1bacc7dc
|
[CI] Format Python code with Black (#21975)
See #21316 and #21311 for the motivation behind these changes.
|
2022-01-29 18:41:57 -08:00 |
|
Jun Gong
|
8ebc50f844
|
[RLlib] Issue 21334: Fix APPO when kl_loss is enabled. (#21855)
|
2022-01-27 20:08:58 +01:00 |
|
Sven Mika
|
49cd7ea6f9
|
[RLlib] Trainer sub-class PPO/DDPPO (instead of build_trainer() ). (#20571)
|
2021-11-23 23:01:05 +01:00 |
|
Artur Niederfahrenhorst
|
d07e50e957
|
[RLlib] Replay buffer API (cleanups; docstrings; renames; move into rllib/execution/buffers dir) (#20552)
|
2021-11-19 11:57:37 +01:00 |
|
Sven Mika
|
0b308719f8
|
[RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829)
|
2021-11-01 21:46:02 +01:00 |
|
Sven Mika
|
ed85f59194
|
[RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879)
|
2021-09-30 16:39:05 +02:00 |
|
Chris Bamford
|
58a73821fb
|
[RLlib] IMPALA sample throughput calculation and full queue slowdown fixes (#17822)
|
2021-08-17 14:01:41 +02:00 |
|
Sven Mika
|
5a313ba3d6
|
[RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169)
|
2021-07-20 14:58:13 -04:00 |
|
Sven Mika
|
4b8dadccbd
|
[RLlib] Fix PR 16162: Having added sleep to _NextValueNotReady causes TD3 tests to become flakey. (#16309)
|
2021-06-08 07:27:02 -07:00 |
|
Chris Bamford
|
1e3721ef4a
|
[RLlib] Remove bad spinlocks to allow pytorch GPU scheduler to interrupt. (#16162)
|
2021-06-01 16:40:28 +02:00 |
|
Sven Mika
|
d001af3e59
|
[RLlib] Allow rllib rollout to run distributed via evaluation workers. (#13718)
|
2021-02-08 12:05:16 +01:00 |
|
Michael Luo
|
a2d1215200
|
[RLlib] Execution Annotation (#13036)
|
2020-12-24 09:30:33 -05:00 |
|
Edward Oakes
|
cde711aaf1
|
Revert "[RLLib] Execution-Folder Type Annotations (#12760)" (#12886)
This reverts commit becca1424d .
|
2020-12-15 11:03:02 -08:00 |
|
Michael Luo
|
becca1424d
|
[RLLib] Execution-Folder Type Annotations (#12760)
|
2020-12-14 19:16:44 +01:00 |
|
Sven Mika
|
805dad3bc4
|
[RLlib] SAC algo cleanup. (#10825)
|
2020-09-20 11:27:02 +02:00 |
|
Sven Mika
|
fcdf410ae1
|
[RLlib] Tf2.x native. (#8752)
|
2020-07-11 22:06:35 +02:00 |
|
Eric Liang
|
9a83908c46
|
[rllib] Deprecate policy optimizers (#8345)
|
2020-05-21 10:16:18 -07:00 |
|