Commit graph

8433 commits

Author SHA1 Message Date
SangBin Cho
5efeb5334b
Revert "Same worker id in python and c++ (#16568)" (#16600)
This reverts commit 9b5c0c32da.
2021-06-21 18:58:31 -07:00
Tao Wang
2affe97f1a
[Core][Minor]Remove the hard check when disconnect GCS client (#16572) 2021-06-22 09:29:25 +08:00
SangBin Cho
497f6cee38
[Docs] [Dask on Ray] Specify version compatibility (#16595)
* Dask compat

* Update common.py

* Create common.py

Co-authored-by: Amog Kamsetty <amogkam@users.noreply.github.com>
2021-06-21 18:15:20 -07:00
Ian Rodney
d3832ab2e1
[Client] Fix gRPC Timeout Options (#16554) 2021-06-21 14:25:41 -07:00
Alex Wu
9b5c0c32da
Same worker id in python and c++ (#16568)
* .

* .

* test

Co-authored-by: Alex Wu <alex@anyscale.com>
2021-06-21 13:22:52 -07:00
mwtian
f5f23448fc
Support downloading and testing wheels for Python 3.9. (#16586) 2021-06-21 12:02:22 -07:00
Siyuan (Ryans) Zhuang
b7995f66a4
[Workflow] Sync mode fault tolerance (#16282) 2021-06-21 10:05:27 -07:00
mvindiola1
82a3ff795c
[RLlib] ensure curiosity exploration actions are passed in as tf tens… (#15704) 2021-06-21 10:03:17 -07:00
Benjamin D. Killeen
50049f86d0
[rllib] check if self.env is not None explicitly (#15634)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-06-21 10:02:13 -07:00
jenhaoyang
aabd507ec7
[docs] Add docker run gpu note (#15566)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-06-21 09:31:09 -07:00
Sven Mika
be6db06485
[RLlib] Re-do: Trainer: Support add and delete Policies. (#16569) 2021-06-21 13:46:01 +02:00
qicosmos
4da69174c8
[C++ Worker]Remove unused boost sub libs for the generated template project (#16526) 2021-06-21 14:46:48 +08:00
Qinghao Hu
d922a79385
[sgd] DataParallel after Apex init. (#15645)
* [FIX] DataParallel after Apex init.

* lint

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-06-20 22:44:15 -07:00
lanlin
e5b50fcc9d
[tune] allow to read trial results from json files in Analysis (#15915)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-06-20 20:41:48 -07:00
Dmitri Gekhtman
cb878b6514
[doc][kubernetes] K8s doc updates (#16570) 2021-06-20 19:38:34 -07:00
Brandon
2ab1c74032
[docs] Add link for launching ray manually in quickstart (#15384)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-06-20 17:47:12 -07:00
Eric Liang
a0da009645
Allocate inbound object chunks using CreateRequestQueue instead of immediate allocation (#16523) 2021-06-20 09:22:12 -07:00
Yorick van Zweeden
db7e2c8f21
Remove outdated code from PopulationBasedTrainingReplay (#16564)
Co-authored-by: Yorick van Zweeden <git@yorickvanzweeden.nl>
2021-06-20 15:22:52 +02:00
Amog Kamsetty
e6d9f0b393
[Dask] Support Dask 2021.06.1 (#16547) 2021-06-19 18:22:23 -07:00
Sven Mika
169ddabae7
[RLlib] Issue 15973: Trainer.with_updates(validate_config=...) behaves confusingly. (#16429) 2021-06-19 22:42:00 +02:00
Alex Wu
197dab0e2f
[docs] Deploying Ray (#16538)
Co-authored-by: Alex Wu <alex@anyscale.com>
2021-06-19 10:07:15 -07:00
Ian Rodney
16d762aed0
[DocSprint] Ray Client Docs (#16497)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-06-19 10:05:37 -07:00
Amog Kamsetty
33d798f8fc
[Docs] Add e2e guide on using Pytorch Lightning with Ray (#16484)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-06-19 10:04:58 -07:00
Richard Liaw
669b7a2e8c
[docs] Update community libraries (#16557) 2021-06-19 09:01:40 -07:00
Sven Mika
79a9d6d517
[RLlib] Issues 16287 and 16200: RLlib not rendering custom multi-agent Envs. (#16428) 2021-06-19 08:57:53 +02:00
Chen Shen
853caea146
[tests]migrate test-many-tasks/test-dead-actors to nightly tests (#16469)
* init commit

* Update release/nightly_tests/nightly_tests.yaml

Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>

* Update release/nightly_tests/nightly_tests.yaml

Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>

Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>
2021-06-18 18:43:25 -07:00
Achal Shah
eadee8aba7
[docs] Update API docs for ray.init (#16533)
The incorrect indentation caused the docs render weirdly: 

https://docs.ray.io/en/master/package-ref.html
2021-06-18 18:02:44 -07:00
Alex Wu
319d4fb164
Job timestamp should always be in milliseconds (fixed) (#16548)
* .

* Revert "Revert "Job timestamp should always be in milliseconds (#16455)" (#16545)"

This reverts commit 5030ed8588.

* .

* .

* .

Co-authored-by: Alex Wu <alex@anyscale.com>
2021-06-18 17:07:21 -07:00
Amog Kamsetty
416cf3a2e7
Revert "Revert "Enable TryCreateImmediately to use the fallback allocation" (#16542)" (#16544)
This reverts commit 36fd741e6f.
2021-06-18 15:39:37 -07:00
Jiao
39cc81c633
[serve] Fix ray serve shutdown to properly go through controller (#16524) 2021-06-18 17:18:04 -05:00
architkulkarni
3ba1cb851e
[Core] [runtime env] Print message on driver when installing conda or pip (#16516) 2021-06-18 16:02:46 -05:00
Amog Kamsetty
e6fa8c0015
[Hotfix] [Dask] Fix Dask Pin (#16552)
* dask-pin-36

* fix
2021-06-18 13:31:50 -07:00
Amog Kamsetty
904232b4f8
[Dask] Pin dask version to 2021.06.0 (#16546) 2021-06-18 12:40:14 -07:00
Alex Wu
5030ed8588
Revert "Job timestamp should always be in milliseconds (#16455)" (#16545)
This reverts commit 1df19a04fe.
2021-06-18 12:37:05 -07:00
Amog Kamsetty
36fd741e6f
Revert "Enable TryCreateImmediately to use the fallback allocation" (#16542)
This reverts commit 41cf2e3d50.
2021-06-18 12:22:18 -07:00
Amog Kamsetty
bd3cbfc56a
Revert "[RLlib] Allow policies to be added/deleted on the fly. (#16359)" (#16543)
This reverts commit e78ec370a9.
2021-06-18 12:21:49 -07:00
architkulkarni
54d66ac637
[Core] iterate over entire dispatch queue instead of returning when worker unavailable (#16535) 2021-06-18 13:25:45 -05:00
Frank Luan
7588938e3c
Sorting benchmark (#16327)
* [WIP] Sorting benchmark

* Separate num_mappers and num_reducers

* Add tests

* Fix tests

* flake8

* flake8

* yapf

* Skip test on Windows

* Fix OS X test failure; test Windows again

* oops
2021-06-18 10:54:18 -07:00
Eric Liang
41cf2e3d50
Enable TryCreateImmediately to use the fallback allocation 2021-06-18 10:49:34 -07:00
Simon Mo
38b5fe7e51
[Buildkite] Add rest of the Python tests (#16517) 2021-06-18 11:18:05 -05:00
Sven Mika
2900a06dd7
[RLlib] Issue 14503: SAC not allowing custom action distributions. (#16427) 2021-06-18 17:27:29 +02:00
architkulkarni
6498ca3995
[Core] [runtime env] Don't delete working_dir from runtime env (#16475) 2021-06-18 10:15:20 -05:00
Chris K. W
a2c842ee3c
[Client] Add separate error message if dataclient has disconnected before a request is sent (#16508)
* Add earlier error message

* Adjust error message
2021-06-18 08:06:25 -07:00
Kai Fricke
172d33be02
[tune] Use unbuffered training when checkpoint_at_end is used. (#16504) 2021-06-18 14:19:14 +01:00
Kai Fricke
aecc4c8d28
[release] fix sgd base image, microbenchmark timeout, revert xgboost train_small to not use connect (#16532) 2021-06-18 11:40:04 +01:00
Kai Fricke
e13f0a4d91
[tune] Add option to keep random values constant over grid search (#16501) 2021-06-18 11:30:27 +01:00
Chris K. W
c91a1b1f92
[Client] Add warnings when user schedules many tasks with ray client (#16454)
* Add warnings when user schedules many tasks with ray client

* add test_client_warnings to BUILD

* better variable names

* use util.debug.log_once()

* batching -> explanation of batching

* Switch to warnings.warn

* Add links to Ray Design Pattern doc with code snippets

* Cleaner linking and refer to sections directly

* Better testNoWarning

* add sys.exit(pytest.main(...))

* Update python/ray/util/client/worker.py

Co-authored-by: Ameer Haj Ali <ameerh@berkeley.edu>

* Update python/ray/util/client/worker.py

Co-authored-by: Ameer Haj Ali <ameerh@berkeley.edu>

* better error messages

* Switch links to new readthedocs sections

* Revert "Switch links to new readthedocs sections"

This reverts commit d3785bf50459d89fb3f13966a030e954799309a8.

Co-authored-by: Ameer Haj Ali <ameerh@berkeley.edu>
2021-06-18 13:17:37 +03:00
Sven Mika
e78ec370a9
[RLlib] Allow policies to be added/deleted on the fly. (#16359) 2021-06-18 10:31:30 +02:00
Antoni Baum
d71ec6e874
[docs] Add examples of new features to contribute (#16477) 2021-06-18 00:07:03 -07:00
Stephanie Wang
5eb51c8b26
[core] Make object directory robust to out-of-order updates (#16314)
* Sequence ops

* id

* fix

* lint
2021-06-17 20:40:35 -07:00