Commit graph

  • 5567a38a70
    Adding unique id to azure template to enable multiple clusters per resource group. Using unique id to set subnet random seed, change msi and vnet names, logging unique id, and adding it to filter vms in cluster. Example template files updated with comments. (#26392) Scott Graham 2022-08-17 12:24:26 -04:00
  • 5e4aad9fed
    [requirements/docker] Update xgboost-ray and lightgbm-ray versions (#27943) Kai Fricke 2022-08-17 18:03:37 +02:00
  • f7b4c5a7ec
    [RLlib] Remove unneeded args from offline learning examples. (#26666) Artur Niederfahrenhorst 2022-08-17 17:59:27 +02:00
  • 4a55f18a22
    [docs][serve] Fix linkcheck for production guide (#27941) Kai Fricke 2022-08-17 16:46:53 +02:00
  • 59b4454f72 cu Clarence Ng 2022-08-17 02:40:13 -07:00
  • c91be346f4 fix Clarence Ng 2022-08-17 02:11:45 -07:00
  • d449f8db27
    [CI] Update upstream requirements for XGB/LGBM-Ray (#27908) Antoni Baum 2022-08-17 10:55:02 +02:00
  • 68b5d4302c
    [Core] Suppress gRPC server alerting on too many keep-alive pings (#27769) Ricky Xu 2022-08-17 04:53:47 -04:00
  • 3f8a4311c3 fix Clarence Ng 2022-08-17 00:53:37 -07:00
  • a64675f23e log Clarence Ng 2022-08-17 00:43:15 -07:00
  • 9330d8f244
    [RLlib] Add DTTorchPolicy (#27889) Charles Sun 2022-08-17 00:28:00 -07:00
  • 2cc1021533 lint Clarence Ng 2022-08-16 23:32:11 -07:00
  • e03df6074c Merge branch 'master' into oomon Clarence Ng 2022-08-16 23:25:35 -07:00
  • dd9814a324 topn Clarence Ng 2022-08-16 23:25:12 -07:00
  • 287dc7002c
    [core] Don't override external dashboard URL in internal KV store (#27933) Nikita Vemuri 2022-08-16 22:59:46 -07:00
  • 42686a9c60
    [pick] AIR doc changes and benchmark updates (#27924) Richard Liaw 2022-08-16 22:53:28 -07:00
  • 4692e8d802
    [core] Don't override external dashboard URL in internal KV store (#27901) Nikita Vemuri 2022-08-16 22:48:05 -07:00
  • 86d4bf5b0a Fix nyc_taxi_basic_processing.ipynb end-to-end (#27927) Cheng Su 2022-08-16 21:30:19 -07:00
  • 4ad1b4c712
    Fix nyc_taxi_basic_processing.ipynb end-to-end (#27927) Cheng Su 2022-08-16 21:30:19 -07:00
  • a012296544 Replace robot image with emoji and replace word Trainer with Algorithm (#27928) Christy Bergman 2022-08-16 21:27:21 -07:00
  • 3f313d74ad
    Replace robot image with emoji and replace word Trainer with Algorithm (#27928) Christy Bergman 2022-08-16 21:27:21 -07:00
  • 0a8299f9d7
    [workflow][doc] First pass of workflow doc. (#27331) (#27937) Yi Cheng 2022-08-17 04:26:24 +00:00
  • 65f92a44e3
    [serve][docs] Consolidate production guides, add kuberay docs to it (#27747) Edward Oakes 2022-08-16 21:29:56 -05:00
  • 2262ac02f3
    [workflow][doc] First pass of workflow doc. (#27331) Yi Cheng 2022-08-17 01:48:05 +00:00
  • 61880591e9
    [RLlib] Add DTTorchModel (#27872) Charles Sun 2022-08-16 18:18:29 -07:00
  • d0891e8b16
    [cherry-pick] Simplify Ray start guide and move PI tutorial to examples page (#27930) Eric Liang 2022-08-16 17:24:19 -07:00
  • 87ce8480ff
    [core] Add stats for the gcs backend for telemetry. (#27876) Yi Cheng 2022-08-17 00:02:04 +00:00
  • 7ff914b06e
    [AIR][Docs] Set logging_strategy="epoch" for HF (#27917) Antoni Baum 2022-08-17 01:45:46 +02:00
  • aba2ddd646
    [AIR][Docs] Remove the excessive printing from Torch examples (#27903) (#27929) matthewdeng 2022-08-16 16:15:56 -07:00
  • 753fad9cad
    [RLlib] Add Segmentation Buffer for DT (#27829) Charles Sun 2022-08-16 15:20:41 -07:00
  • 70207c02e9
    [workflow] Documentation of http events (#27166) (#27884) Yi Cheng 2022-08-16 21:48:30 +00:00
  • 8a7be15b72
    [docs] Simplify Ray start guide and move PI tutorial to examples page (#27885) Eric Liang 2022-08-16 14:28:45 -07:00
  • 75051278d7
    Fix the undocumented ray log error (#27887) SangBin Cho 2022-08-17 06:28:09 +09:00
  • 759fbd9502
    [air][minor] Use drop_columns in docs (#27852) Richard Liaw 2022-08-16 14:01:25 -07:00
  • b2f1c84f47 cu Clarence Ng 2022-08-16 13:51:54 -07:00
  • e7d9425a9b error log Clarence Ng 2022-08-16 13:51:13 -07:00
  • 78648e3583
    [Serve][Docs] Mark metrics served for HTTP vs Python calls (#27858) Zoltan Fedor 2022-08-16 16:23:29 -04:00
  • 24508db920
    [Docs][GCP] Configuring ServiceAccounts for worker (#27915) Ian Rodney 2022-08-16 13:13:27 -07:00
  • c5a4605030
    Fix grammer of error message (#27900) Jiajun Yao 2022-08-16 11:26:03 -07:00
  • 5757909cd2
    [AIR] load_best_model_at_end validation for HF (#27875) Antoni Baum 2022-08-16 19:52:05 +02:00
  • b91246a093
    [air/benchmarks] Measure local training time in torch/tf benchmarks (#27902) Kai Fricke 2022-08-16 19:16:08 +02:00
  • 7c892092da
    Minor fix for nyc_taxi_basic_processing.ipynb (#27886) Cheng Su 2022-08-16 09:11:16 -07:00
  • b9a2fb79b6
    [AIR][Docs] Remove the excessive printing from Torch examples (#27903) Simon Mo 2022-08-16 09:09:54 -07:00
  • 4d19c0222b
    [AIR] Add rich notebook repr for DataParallelTrainer (#26335) Peyton Murray 2022-08-16 08:51:14 -07:00
  • bceef503b2
    [Kubernetes][docs] Restore legacy Ray operator migration discussion (#27841) Dmitri Gekhtman 2022-08-16 08:46:31 -07:00
  • 91f506304d
    [air] [checkpoint manager] handle nested metrics properly as scoring attribute. (#27715) xwjiang2010 2022-08-16 08:43:58 -07:00
  • 436c89ba1a
    [RLlib] Eval workers use async req manager. (#27390) Sven Mika 2022-08-16 12:05:55 +02:00
  • 1c4b3879a1
    [Serve]Fix classloader bug in Java Deployment (#27899) liuyang-my 2022-08-16 15:22:00 +08:00
  • 83cc9c0e3d
    [State Observability] Promote the API to alpha (#27788) (#27857) SangBin Cho 2022-08-16 15:10:42 +09:00
  • d48fa5c972 mod rtoom Clarence Ng 2022-08-15 20:47:17 -07:00
  • c2abfdb2f7
    [autoscaler][observability] Observability into when/why nodes fail to launch (#27697) Alex Wu 2022-08-15 18:14:29 -07:00
  • f05c744a65
    [Doc] minor fix on accessing AWS/S3 Chen Shen 2022-08-15 16:53:31 -07:00
  • a3236b6225
    [air] fix ptl release test (#27773) xwjiang2010 2022-08-15 14:47:33 -07:00
  • 34c494260f
    [workflow] Documentation of http events (#27166) Yuan-Chi Chang 2022-08-15 17:23:04 -04:00
  • be4e7a7d89 disable Clarence Ng 2022-08-15 13:48:15 -07:00
  • d1a86fc597 Fix broken links in the code (#27873) Jiajun Yao 2022-08-15 13:11:42 -07:00
  • 06ef4ab94e
    Fix broken links in the code (#27873) Jiajun Yao 2022-08-15 13:11:42 -07:00
  • 7f6578b81e [release test] increase air tf gpu benchmark non smoke test timeout from 3600 to 4800. (#27869) xwjiang2010 2022-08-15 10:03:40 -07:00
  • b90867c301 [Test] Fix broken test_base_trainer (#27855) SangBin Cho 2022-08-15 23:50:18 +09:00
  • 68cc544da6
    [release test] increase air tf gpu benchmark non smoke test timeout from 3600 to 4800. (#27869) xwjiang2010 2022-08-15 10:03:40 -07:00
  • 058c239cf1
    [runtime env] Test common failure scenarios (#25977) Archit Kulkarni 2022-08-15 09:35:56 -07:00
  • eb37bb857c
    Revamp ray core design patterns doc [1/n]: generators (#27823) Jiajun Yao 2022-08-15 09:24:34 -07:00
  • b88064dbb6 [release test] remove dask/modin_xgboost test completely. (#27865) xwjiang2010 2022-08-15 07:55:33 -07:00
  • 52440f1489
    [Docs] Fix a typo in index.md (#27859) Myeongju Kim 2022-08-15 08:26:40 -07:00
  • eac8d8f8da
    [Link Check] Fix the broken link check from the AIR doc (#27632) (#27856) SangBin Cho 2022-08-16 00:07:56 +09:00
  • f77ec350fa
    [release test] remove dask/modin_xgboost test completely. (#27865) xwjiang2010 2022-08-15 07:55:33 -07:00
  • d654636bfc
    [Test] Fix broken test_base_trainer (#27855) SangBin Cho 2022-08-15 23:50:18 +09:00
  • d4da334feb Merge branch 'master' into oomon Clarence Ng 2022-08-14 20:43:16 -07:00
  • 36953646af add test Clarence Ng 2022-08-14 20:42:27 -07:00
  • a2c168cd6d
    [Datasets][docs] Minor fix for nyc_taxi_basic_processing.ipynb (#27828) Cheng Su 2022-08-14 12:34:33 -07:00
  • 9ece110d27
    [State Observability] Promote the API to alpha (#27788) SangBin Cho 2022-08-14 15:43:01 +09:00
  • 1bae3c905d
    Improve text around ecosystem map (#27839) Eric Liang 2022-08-12 18:59:09 -07:00
  • 50547ffb18 [Core][Placement Group] Handling edge cases of max_cpu_fraction argument (#27035) SangBin Cho 2022-08-13 09:40:11 +09:00
  • 999715ebec
    [Core][Placement Group] Handling edge cases of max_cpu_fraction argument (#27035) SangBin Cho 2022-08-13 09:40:11 +09:00
  • 3901e66488
    [cherry-pick][data] update datasets API structure (#27836) matthewdeng 2022-08-12 17:20:25 -07:00
  • 795767a231
    [Serve] Fix memory leak issue in serve inference (#27815) (#27844) Simon Mo 2022-08-12 17:17:07 -07:00
  • 7e7c93f6ba
    [Serve] Fix memory leak issue in serve inference (#27815) Sihan Wang 2022-08-12 17:11:37 -07:00
  • f9d5d6df12
    [Serve] [Docs] Revise Java API documentation (#27831) shrekris-anyscale 2022-08-12 17:09:40 -07:00
  • 0a3c1de08b
    [Serve] [Docs] Replace references to dag.execute() with handle.predict.remote() (#27784) shrekris-anyscale 2022-08-12 17:09:28 -07:00
  • 79d2cd4499
    [AIR] [Docs] Revise "Which preprocessor should you use?" (#27835) (#27842) Balaji Veeramani 2022-08-12 16:26:24 -07:00
  • 62c123b352 config oomdoc Clarence Ng 2022-08-12 16:16:32 -07:00
  • 5c63e106df config Clarence Ng 2022-08-12 15:50:21 -07:00
  • dff0385ed5 update Clarence Ng 2022-08-12 15:45:07 -07:00
  • 0b37fcee1c oom doc Clarence Ng 2022-08-12 15:39:57 -07:00
  • 8614b264c5 cleanup Clarence Ng 2022-08-12 15:39:30 -07:00
  • 9ca83a7c4c cleanup Clarence Ng 2022-08-12 15:37:45 -07:00
  • 8308b7c78b cleanup Clarence Ng 2022-08-12 15:34:45 -07:00
  • 1e1022d065 [RLlib] CRR framework torch by default. (#27161) Sven Mika 2022-08-09 16:53:00 +02:00
  • 8cb09a9fc5
    Revert "Revert "[serve] Integrate and Document Bring-Your-Own Gradio Applications"" (#27662) zcin 2022-08-12 15:12:20 -07:00
  • 42c1e0dc2b cleanup Clarence Ng 2022-08-12 15:10:21 -07:00
  • 55e57d4f92
    [AIR] [Docs] Revise "Which preprocessor should you use?" (#27835) Balaji Veeramani 2022-08-12 14:43:36 -07:00
  • 00dbc46f3c
    [RLlib] pin gym-minigrid @ 1.0.3 (#27761) (#27838) matthewdeng 2022-08-12 13:58:54 -07:00
  • fa37ddc584
    [Serve][docs] Add type annotations to code samples (#27795) zcin 2022-08-12 13:41:08 -07:00
  • 4699fb0fd7
    [Pick] [AIR] Improve preprocessor documentation (#27809) Balaji Veeramani 2022-08-12 13:11:03 -07:00
  • 2cc565dea0 cleanup Clarence Ng 2022-08-12 12:35:03 -07:00
  • ccdc78ad58 cleanup Clarence Ng 2022-08-12 12:33:13 -07:00
  • 192d92bb77
    [Serve] [Doc] Update intro page (#27735) Simon Mo 2022-08-12 11:37:18 -07:00
  • ba36365c32
    [Doc] [Serve] Fix LINT by fixing outdated Ray Client doc link (#27826) Archit Kulkarni 2022-08-12 11:35:15 -07:00
  • f0404e00cd
    [Core] [Hotfix] Change "task failed with unretryable exception" log statement to debug-level. (#27714) Clark Zinzow 2022-08-12 12:28:49 -06:00
  • 7c7828f818
    [Datasets] Improve size estimation of image folder data source (#27219) Cheng Su 2022-08-12 11:26:03 -07:00