1
0
Fork 0
mirror of https://github.com/vale981/ray synced 2025-03-12 22:26:39 -04:00
Commit graph

20 commits

Author SHA1 Message Date
Sven Mika
a8494742a3
[RLlib] Memory leak finding toolset using tracemalloc + CI memory leak tests. () 2022-04-12 07:50:09 +02:00
Balaji Veeramani
7f1bacc7dc
[CI] Format Python code with Black ()
See  and  for the motivation behind these changes.
2022-01-29 18:41:57 -08:00
Sven Mika
daa4304a91
[RLlib] Switch off preprocessors by default for PGTrainer. () 2021-12-13 12:04:23 +01:00
Sven Mika
61a1274619
[RLlib] No Preprocessors (part 2). () 2021-09-23 12:56:45 +02:00
Sven Mika
a96dbd885b
[RLlib] Reinstate trajectory view API tests. () 2021-09-23 08:31:51 +02:00
simonsays1980
60aee4a330
[RLlib] Add example script for bare metal Policy with custom view_requirements. () 2021-08-20 12:17:13 +02:00
Richard Liaw
a78a2263e5
[RLlib] Fix reverted RockPaperScissors Pettingzoo example () 2021-07-22 10:55:07 -04:00
Amog Kamsetty
ecb632140f
Revert "RockPaperScissors Pettingzoo" ()
This reverts commit bf3e3225b6.
2021-07-06 09:43:47 -07:00
Rodrigo de Lazcano
bf3e3225b6
RockPaperScissors Pettingzoo () 2021-07-05 09:52:08 -07:00
Sven Mika
391cdfae8c
[RLlib] Trajectory view API docs. () 2020-12-30 17:32:21 -08:00
Sven Mika
99c81c6795
[RLlib] Attention Net prep PR . () 2020-12-07 13:08:17 +01:00
Sven Mika
3ad9365e1d
[RLlib] Attention Net prep PR : Smaller cleanups. () 2020-12-01 08:21:45 +01:00
Sven Mika
0df55a139c
[RLlib] Attention Net prep PR : Smaller cleanups. ()
* WIP.

* Fix.

* Fix.

* Fix.
2020-11-27 16:25:47 -08:00
Sven Mika
62c7ab5182
[RLlib] Trajectory view API: Enable by default for PPO, IMPALA, PG, A3C (tf and torch). () 2020-11-12 16:27:34 +01:00
desktable
5af745c90d
[RLlib] Implement the SlateQ algorithm () 2020-11-03 09:52:04 +01:00
Sven Mika
36bda8432b
[RLlib] Trajectory view API: Simple List Collector (on by default for PPO); LSTM-agnostic () 2020-10-01 16:57:10 +02:00
Sven Mika
e968b52cb7
[RLlib] Trajectory view API - 03 Fast LSTM + prev actions/rewards () 2020-08-21 12:35:16 +02:00
Barak Michener
8e76796fd0
ci: Redo format.sh --all script & backfill lint fixes () 2020-08-07 16:49:49 -07:00
Sven Mika
e6ea33a03c
[RLlib] Enhance reward clipping test; add action_clipping tests. () 2020-07-28 10:44:54 +02:00
Sven Mika
5f278c6411
[RLlib] Examples folder restructuring (models) part 1 () 2020-05-08 08:20:18 +02:00