1
0
Fork 0
mirror of https://github.com/vale981/ray synced 2025-03-16 16:16:38 -04:00
Commit graph

29 commits

Author SHA1 Message Date
Christy Bergman
7c925fe99f
[RLlib; docs] Re-organize algorithms so TOC matches README. () 2022-07-13 10:46:36 +02:00
kourosh hakhamaneshi
25940cb95b
[RLlib] CRR documentation. () 2022-06-14 12:45:36 +02:00
Sven Mika
130b7eeaba
[RLlib] Trainer to Algorithm renaming. () 2022-06-11 15:10:39 +02:00
Sven Mika
a559efb7e4
[CI; LinkCheck] 3 RLlib fixes. () 2022-06-04 11:54:56 +02:00
Sven Mika
b5bc2b93c3
[RLlib] Move all remaining algos into algorithms directory. () 2022-06-04 07:35:24 +02:00
Yi Cheng
fd0f967d2e
Revert "[RLlib] Move (A/DD)?PPO and IMPALA algos to algorithms dir and rename policy and trainer classes. ()" ()
This reverts commit e4ceae19ef.

Reverts 

linux://python/ray/tests:test_client_library_integration never fail before this PR.

In the CI of the reverted PR, it also fails (https://buildkite.com/ray-project/ray-builders-pr/builds/34079#01812442-c541-4145-af22-2a012655c128). So high likely it's because of this PR.

And test output failure seems related as well (https://buildkite.com/ray-project/ray-builders-branch/builds/7923#018125c2-4812-4ead-a42f-7fddb344105b)
2022-06-02 20:38:44 -07:00
Sven Mika
e4ceae19ef
[RLlib] Move (A/DD)?PPO and IMPALA algos to algorithms dir and rename policy and trainer classes. () 2022-06-02 16:47:05 +02:00
Sven Mika
18c03f8d93
[RLlib] A2C + A3C move to algorithms folder and re-name into A2C/A3C (from ...Trainer). () 2022-06-01 09:29:16 +02:00
Sven Mika
e73c37cc17
[RLlib] MADDPG: Move into main algorithms folder and add proper unit and learning tests. () 2022-05-24 12:53:53 +02:00
Steven Morad
501d932449
[RLlib] SAC, RNNSAC, and CQL TrainerConfig objects () 2022-05-22 19:58:47 +02:00
Lucas Alava Peña
2a7ebd4dcf
[RLlib] Fix minor typos in docs () 2022-05-20 12:19:49 -07:00
kourosh hakhamaneshi
3815e52a61
[RLlib] Agents to algos: DQN w/o Apex and R2D2, DDPG/TD3, SAC, SlateQ, QMIX, PG, Bandits () 2022-05-19 18:30:42 +02:00
Sven Mika
8f50087908
[RLlib] AlphaZero uses training_iteration API. () 2022-05-18 09:58:25 +02:00
Kai Fricke
96da5dc776
[rllib] Fix some missing agent->algorithm doc changes ()
 missed some doc changes that popped up in broken linkcheck. Note that there could be others that were not caught by this.
2022-05-16 11:52:49 +01:00
Jun Gong
68a9a33386
[RLlib] Retry agents -> algorithms. with proper doc changes this time. () 2022-05-16 09:45:32 +02:00
kourosh hakhamaneshi
69055f556d
[RLlib] Move agents.ars to algorithms.ars. () 2022-05-06 19:11:15 +02:00
kourosh hakhamaneshi
f48f1b252c
[RLlib] Moved agents.es to algorithms.es () 2022-05-06 14:54:22 +02:00
Sven Mika
7ab19ddc32
[RLlib] MADDPG: Move into agents folder (from contrib) and use training_iteration method. () 2022-05-06 12:35:21 +02:00
Christy Bergman
76eb47e226
[RLlib; docs] Rename UCB -> LinUCB. () 2022-05-05 10:20:16 +02:00
Sven Mika
ba14f0a41b
[RLlib] PGTrainer config object class (PGConfig). () 2022-04-28 22:25:16 +02:00
Eric Liang
1ff874e8e8
[spelling] Add linter rule for mis-capitalizations of RLLib -> RLlib () 2022-04-10 16:12:53 -07:00
Philipp Moritz
886cc4d674
Fix broken links in documentation and put linkcheck linter in place on CI () 2022-03-18 21:02:52 -07:00
Max Pumperla
11c40e363d
[docs] external promo content () 2022-03-10 11:39:44 -08:00
Sven Mika
8e00537b65
[RLlib] SlateQ: framework=tf fixes and SlateQ documentation update () 2022-02-23 13:03:45 +01:00
Max Pumperla
9482f03134
[docs] RLlib concepts consolidation, user guide, RL conf prep () 2022-02-18 09:35:20 -08:00
Sven Mika
e03606f0b3
[RLlib] Bandit documentation enhancements. () 2022-02-17 13:25:50 +01:00
Max Pumperla
5cc9355303
[Docs ] Tune docs overhaul (first part) ()
Continuing docs overhaul, tune now has:

- [x] better landing page
- [x] a getting started guide
- [x] user guide was cut down, partially merged with FAQ, and partially integrated with tutorials
- [x] the new user guide contains guides to tune features and practical integrations
- [x] we rewrote some of the feature guides for clarity 
- [x] we got rid of sphinx-gallery for this sub-project (only data and core left), as it looks bad and is unnecessarily complicated anyway (plus, makes the build slower)
- [x] sphinx-gallery examples are now moved to markdown notebook, as started in .
- [x] Examples are tested in the new framework, of course.

There's still a lot one can do, but this is already getting too large. Will follow up with more fine-tuning next week.

Co-authored-by: Antoni Baum <antoni.baum@protonmail.com>
Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>
2022-02-07 15:47:03 +00:00
Sven Mika
893536ebd9
[RLlib] Move bandits into main agents folder; Make RecSim adapter more accessible; () 2022-01-27 13:58:12 +01:00
Max Pumperla
f9b71a8bf6
[docs] new structure ()
This PR consolidates both  and  (look there for features), but improves on them in the following way:

- [x] we reverted renaming of existing projects `tune`, `rllib`, `train`, `cluster`, `serve`, `raysgd` and `data` so that links won't break. I think my consolidation efforts with the `ray-` prefix were a little overeager in that regard. It's better like this. Only the creation of `ray-core` was a necessity, and some files moved into the `rllib` folder, so that should be relatively benign.
- [x] Additionally, we added Algolia `docsearch`, screenshot below. This is _much_ better than our current search. Caveat: there's a sphinx dependency that needs to be replaced (`sphinx-tabs`) by another, newer one (`sphinx-panels`), as the former prevents loading of the `algolia.js` library. Will follow-up in the next PR (hoping this one doesn't get re-re-re-re-reverted).
2022-01-21 15:42:05 -08:00
Renamed from doc/source/rllib-algorithms.rst (Browse further)