Sven Mika
|
59a967a3a0
|
[RLlib] Cleanup some deprecated metric keys and classes. (#26036)
|
2022-06-23 21:30:01 +02:00 |
|
Yi Cheng
|
7b8b0f8e03
|
Revert "[RLlib] Remove execution plan code no longer used by RLlib. (#25624)" (#25776)
This reverts commit 804719876b .
|
2022-06-14 13:59:15 -07:00 |
|
Avnish Narayan
|
804719876b
|
[RLlib] Remove execution plan code no longer used by RLlib. (#25624)
|
2022-06-14 10:57:27 +02:00 |
|
Sven Mika
|
130b7eeaba
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
|
Balaji Veeramani
|
7f1bacc7dc
|
[CI] Format Python code with Black (#21975)
See #21316 and #21311 for the motivation behind these changes.
|
2022-01-29 18:41:57 -08:00 |
|
Sven Mika
|
62dbf26394
|
[RLlib] POC: Run PGTrainer w/o the distr. exec API (Trainer's new training_iteration method). (#20984)
|
2021-12-21 08:39:05 +01:00 |
|
gjoliver
|
9226f9bddc
|
[RLlib] Report timesteps_this_iter to Tune, so it can track/checkpoint/restore total timesteps trained. (#19264)
* Report timesteps_this_iter to Tune, so it can track/checkpoint/restore
total timesteps trained.
* Trigger Build
* lint
|
2021-10-12 16:03:41 +02:00 |
|
Sven Mika
|
ed85f59194
|
[RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879)
|
2021-09-30 16:39:05 +02:00 |
|
Sven Mika
|
c3a15ecc0f
|
[RLlib] Issue #13802: Enhance metrics for multiagent->count_steps_by=agent_steps setting. (#14033)
|
2021-03-18 20:27:41 +01:00 |
|
Michael Luo
|
a2d1215200
|
[RLlib] Execution Annotation (#13036)
|
2020-12-24 09:30:33 -05:00 |
|
Edward Oakes
|
cde711aaf1
|
Revert "[RLLib] Execution-Folder Type Annotations (#12760)" (#12886)
This reverts commit becca1424d .
|
2020-12-15 11:03:02 -08:00 |
|
Michael Luo
|
becca1424d
|
[RLLib] Execution-Folder Type Annotations (#12760)
|
2020-12-14 19:16:44 +01:00 |
|
Sven Mika
|
b0b0463161
|
[RLlib] Trajectory View API (preparatory cleanup and enhancements). (#9678)
|
2020-07-29 21:15:09 +02:00 |
|
Eric Liang
|
1e0e1a45e6
|
[rllib] Add type annotations for evaluation/, env/ packages (#9003)
|
2020-06-19 13:09:05 -07:00 |
|
Eric Liang
|
9f04a65922
|
[rllib] Add PPO+DQN two trainer multiagent workflow example (#8334)
|
2020-05-07 23:40:29 -07:00 |
|
Eric Liang
|
baadbdf8d4
|
[rllib] Execute PPO using training workflow (#8206)
* wip
* add kl
* kl
* works now
* doc update
* reorg
* add ddppo
* add stats
* fix fetch
* comment
* fix learner stat regression
* test fixes
* fix test
|
2020-04-30 01:18:09 -07:00 |
|
Eric Liang
|
31b40b00f6
|
[rllib] Pull out experimental dsl into rllib.execution module, add initial unit tests (#7958)
|
2020-04-10 00:56:08 -07:00 |
|