Commit graph

4826 commits

Author SHA1 Message Date
Edward Oakes
421b3c9d8b
Fix serve long running test (#8268) 2020-05-01 11:54:27 -05:00
Eric Liang
2a0ad0b8ce
[rllib] [hotfix] Remove assert that trips on pytorch multiagent (#8241) 2020-05-01 06:32:54 +02:00
Edward Oakes
6373c70661
[serve] Refactor BackendConfig (#8202) 2020-04-30 22:31:07 -05:00
Edward Oakes
95d187e556
[serve] Add delete_endpoint call (#8256) 2020-04-30 20:59:07 -05:00
Edward Oakes
484f68765c
Fix resource_ids_ data race (#8253) 2020-04-30 18:55:54 -05:00
Edward Oakes
43be73e4cf
[serve] Add delete_backend call (#8252) 2020-04-30 13:10:39 -05:00
Sven Mika
c593fb09b7
[RLlib] Remove all f-strings to keep py3.5 compatibility. 2020-04-30 11:10:16 -07:00
Sven Mika
eea75ac623
[RLlib] Beta distribution. (#8229) 2020-04-30 11:09:33 -07:00
Sven Mika
b23b6addfc
[RLlib] Stabilize Pendulum-v0 regression test cases. (#8232)
Stabilize Pendulum regression test cases.
2020-04-30 15:48:11 +02:00
Richard Liaw
05df80afad
Extend timeout for test_tune_server (#8233) 2020-04-30 08:39:46 -05:00
Eric Liang
baadbdf8d4
[rllib] Execute PPO using training workflow (#8206)
* wip

* add kl

* kl

* works now

* doc update

* reorg

* add ddppo

* add stats

* fix fetch

* comment

* fix learner stat regression

* test fixes

* fix test
2020-04-30 01:18:09 -07:00
Richard Liaw
35eac2671e
[sgd] Resource limit lift for GPU test (#8238) 2020-04-30 00:24:48 -07:00
mehrdadn
254b1ec370
Set up testing and wheels for Windows on GitHub Actions (#8131)
* Move some Java tests into ci.sh

* Move C++ worker tests into ci.sh

* Define run()

* Prepare to move Python tests into ci.sh

* Fix issues in install-dependencies.sh

* Reload environment for GitHub Actions

* Move wheels to ci.sh and fix related issues

* Don't bypass failures in install-ray.sh anymore

* Make CI a little quieter

* Move linting into ci.sh

* Add vitals test right after build

* Fix os.uname() unavailability on Windows

Co-authored-by: Mehrdad <noreply@github.com>
2020-04-29 21:19:02 -07:00
Eric Liang
ae54e0dc0a
[rllib] Copy plasma memory before adding data to replay buffer 2020-04-29 14:17:54 -07:00
Edward Oakes
17f0d50f1a
[serve] Temporarily disable test_master_crashes (#8230) 2020-04-29 14:36:09 -05:00
Xianyang Liu
fbf23eb6ff
[SGD] Fix IterableDataset errors (#8208) 2020-04-29 10:51:31 -07:00
Simon Mo
1b1fe0cc5b
Fix Serve long running test (#8223) 2020-04-29 09:32:39 -07:00
ijrsvt
c393b6d165
Remove logging (#8211) 2020-04-29 09:15:43 -07:00
Sven Mika
bf25aee392
[RLlib] Deprecate all Model(v1) usage. (#8146)
Deprecate all Model(v1) usage.
2020-04-29 12:12:59 +02:00
Sven Mika
eb91619175
Fix release 0.8.5 tests for PPO torch Breakout. (#8226) 2020-04-29 10:36:41 +02:00
chaokunyang
91f630f709
[Streaming] Streaming Cross-Lang API (#7464) 2020-04-29 13:42:08 +08:00
Simon Mo
101255f782
[Serve] RayServe TF, PyTorch, Sklearn Examples (#8156) 2020-04-28 22:24:55 -07:00
Simon Mo
af3d3e778e
[RayServe] Specify installation instruction in doc (#8220) 2020-04-28 14:38:10 -07:00
Richard Liaw
4d639354cd
[tune] Hotfix for test_ls (#8215) 2020-04-28 14:06:12 -07:00
Edward Oakes
7c0200c93b
[serve] Master actor fault tolerance (#8116) 2020-04-28 15:52:29 -05:00
Edward Oakes
ebdccde030
Fetch internal config from raylet (#8195) 2020-04-28 13:12:11 -05:00
Sven Mika
1775e89f26
[RLlib] Remove TupleActions and support arbitrarily nested action spaces. (#8143)
Deprecate TupleActions and support arbitrarily nested action spaces.
Closes issue #8143.
2020-04-28 14:59:16 +02:00
fangfengbin
deffc340ea
[GCS]Add in-memory gcs table storage (#8184) 2020-04-28 17:19:46 +08:00
aannadi
eb790bf3a3
[Dashboard] Set logdir in Tune Dashboard and TensorBoard Opt-in (#8074) 2020-04-27 20:17:52 -07:00
WuTao
32c2055c99
Streaming state (#7348) 2020-04-28 10:36:32 +08:00
Richard Liaw
be5235d982
[tune] Clarify Intro Tune Documentation (#8201) 2020-04-27 18:01:00 -07:00
ijrsvt
a77e5a8cbf
[Doc] Fix Docstring for Task Cancellation (#8198) 2020-04-27 17:06:08 -07:00
Neil Lugovoy
8cf598deab
[sgd] Fix GPU Reservations in LocalDistributedRunner (#8157) 2020-04-27 16:03:33 -07:00
Sven Mika
4e713152e9
[RLlib] Fix for issue https://github.com/ray-project/ray/issues/8191 (#8200)
Fix attribute error when missing exploration in Policy.
Issue #8191
2020-04-27 23:19:26 +02:00
Robert Nishihara
48250217ac
Fix API documentation formatting. (#8197) 2020-04-27 10:48:42 -07:00
Philipp Moritz
d7da25eee1
Use RAY_ADDRESS to connect to an existing Ray cluster if present (#7977) 2020-04-27 09:59:37 -07:00
Robert Zangnan Yu
a77b19e4f2
[docs] Comments on potential srun orders during Slurm Deployment (#8183) 2020-04-27 09:30:16 -07:00
Richard Liaw
87557a00fa
[tune] Refactor search algorithms (#7037)
* start refactoring of search algorithms

* format

* needs tests

* fix

* suggestions

* Fix PBT

* lint

* refactoring

* hyperopt_working

* dragonfly

* hyperopt

* change_half_of_algs

* save

* code-removed

* remove_lots_of_unneccessary

* changes

* formatting

* suggest

* reset

* rm

* tests

* search-change

* exception

* refactor-doc

* search

* py

* moredocs

* Update doc/source/tune-searchalg.rst

* concurrency

* max

* tune

* betterwarning

* bohb

* tests

* test-change

Co-authored-by: ujvl <misraujval@gmail.com>
2020-04-27 08:51:13 -07:00
Kai Yang
1d5bceddf0
fix java UT about multi-threading (#8014) 2020-04-27 15:11:22 +08:00
Sven Mika
7ec2223c84
[RLlib] DDPG PyTorch actor-model was missing sigmoid layer (#8188)
Fix DDPG PyTorch (missing sigmoid layer (to squash action outputs) after deterministic action outputs).
2020-04-26 23:08:13 +02:00
mehrdadn
b9de9dadd7
Fix Windows build (#8186)
Co-authored-by: Mehrdad <noreply@github.com>
2020-04-26 13:07:25 -07:00
chaokunyang
5cf49d5edd
Fix streaming ci (#8159) 2020-04-26 20:56:58 +08:00
fangfengbin
5bff707d20
[GCS]Add in-memory store client (#8144) 2020-04-26 19:09:26 +08:00
ZhuSenlin
9255fcd516
[GCS] Add node failure detector (#8119) 2020-04-26 19:08:27 +08:00
fangfengbin
c5d181e3d9
gcs adapts to worker table pub sub (#8182)
Co-authored-by: 灵洵 <fengbin.ffb@antfin.com>
2020-04-26 17:58:55 +08:00
Richard Liaw
5bc6e32c0a
[autoscaler] latest_dlami update (#8178) 2020-04-26 00:25:46 -07:00
fangfengbin
f17bea2de5
Fix get gcs server address block bug (#8126) 2020-04-26 10:01:06 +08:00
Tomasz Wrona
b508166419
Copy initial state of an RNN to a CPU before converting it to a NumPy array (#8097) 2020-04-25 18:49:09 -07:00
Richard Liaw
b506f87117
[tune] New Doc edits, add Concepts page (#8083)
Co-Authored-By: Sven Mika <sven@anyscale.io>
2020-04-25 18:25:56 -07:00
ijrsvt
69ff7e3e35
TaskCancellation (#7669)
* Smol comment

* WIP, not passing ray.init

* Fixed small problem

* wip

* Pseudo interrupt things

* Basic prototype operational

* correct proc title

* Mostly done

* Cleanup

* cleaner raylet error

* Cleaning up a few loose ends

* Fixing Race Conds

* Prelim testing

* Fixing comments and adding second_check for kill

* Working_new_impl

* demo_ready

* Fixing my english

* Fixing a few problems

* Small problems

* Cleaning up

* Response to changes

* Fixing error passing

* Merged to master

* fixing lock

* Cleaning up print statements

* Format

* Fixing Unit test build failure

* mock_worker fix

* java_fix

* Canel

* Switching to Cancel

* Responding to Review

* FixFormatting

* Lease cancellation

* FInal comments?

* Moving exist check to CoreWorker

* Fix Actor Transport Test

* Fixing task manager test

* chaning clock repr

* Fix build

* fix white space

* lint fix

* Updating to medium size

* Fixing Java test compilation issue

* lengthen bad timeouts
2020-04-25 16:04:52 -07:00