Kai Fricke
63b85df828
[xgb] update docs ( #12549 )
2020-12-01 23:17:23 -08:00
Simon Mo
e428134137
[Hotfix] Pin llvmlite for windows build ( #12559 )
2020-12-01 19:43:08 -08:00
Siyuan (Ryans) Zhuang
615f974313
Add context for "test_buffer_alignment" ( #12519 )
2020-12-01 19:27:14 -08:00
Sven Mika
19c8033df2
[RLlib] Fix most remaining RLlib algos for running with trajectory view API. ( #12366 )
...
* WIP.
* LINT and fixes.
MB-MPO and MAML not working yet.
* wip
* update
* update
* remove
* remove dep
* higher
* Update requirements_rllib.txt
* Update requirements_rllib.txt
* relpos
* no mbmpo
Co-authored-by: Eric Liang <ekhliang@gmail.com>
2020-12-01 17:41:10 -08:00
Richard Liaw
4dc16730a7
[tune] with-params fix ( #12522 )
2020-12-01 16:47:03 -08:00
Simon Mo
7022278ce9
Deflake Serve tests ( #12542 )
2020-12-01 13:42:21 -08:00
Barak Michener
6412dfaf38
[ray_client] actors v0 ( #12388 )
2020-12-01 13:12:08 -08:00
SangBin Cho
0e892908f7
[Object Spilling] Delete spilled objects when references have gone out of scope. ( #12341 )
2020-12-01 13:10:39 -08:00
Simon Mo
ef1b0c13c3
Async Future Throws RayError as well ( #12419 )
2020-12-01 13:07:43 -08:00
Richard Liaw
bdf8ad3b5a
fix ( #12528 )
...
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
2020-12-01 09:58:12 -08:00
Simon Mo
f596113fc7
[Core] Actor Retries Out of Order Tasks on Restart ( #12338 )
2020-12-01 09:35:54 -08:00
SangBin Cho
f6f3cc9af1
[Core] Remove checkpoint table ( #12235 )
...
* Delete an actor entry from node manager.
* Remove checkpoint table
* remote checkpoint interface
* remove checkpoint interface
* fix ExitActorTest
Co-authored-by: chaokunyang <shawn.ck.yang@gmail.com>
2020-12-01 08:58:36 -08:00
Sven Mika
9021f15b2a
[RLlib] Fix setup-dev.py error when creating a softlink for new_dashboard. ( #12442 )
2020-12-01 11:46:59 +01:00
Edward Oakes
e72147de38
Fix Serve typo ( #12524 )
2020-11-30 23:15:42 -08:00
Eric Liang
fd8ae0697b
[autoscaler] Fix test heartbeats single test ( #12513 )
...
* update
* update
* update
2020-11-30 21:24:45 -08:00
Amog Kamsetty
f9a99f20dd
Revert "Re-Revert "[Core] zero-copy serializer for pytorch ( #12344 )" ( #12478 )" ( #12515 )
...
This reverts commit 3f22448834.
2020-11-30 19:05:55 -08:00
SangBin Cho
8223a33bff
[Logging] Log rotation on all components ( #12101 )
...
* In Progress.
* Done.
* Fix the issue.
* Add wait for condition because logs are not written right away now.
* debug string.
* lint.
* Fix flaky test.
* Fix issues.
* Fix test.
* lint.
2020-11-30 19:03:55 -08:00
Ian Rodney
e422ace053
[serve] Create CurrentState & GoalState ( #12369 )
2020-11-30 17:34:30 -08:00
Eric Liang
234df9091e
[autoscaler] Try to improve the request_resources() documentation ( #12465 )
2020-11-30 16:03:30 -08:00
Richard Liaw
9ce7ad17fd
[tune] remove some bottlenecks in trialrunner ( #12476 )
2020-11-30 14:54:25 -08:00
Siyuan (Ryans) Zhuang
3f22448834
Re-Revert "[Core] zero-copy serializer for pytorch ( #12344 )" ( #12478 )
...
* [Core] zero-copy serializer for pytorch (#12344 )
* zero-copy serializer for pytorch
* address possible bottleneck
* add tests & device support
(cherry picked from commit 0a505ca83d)
* add environmental variables
* update doc
2020-11-30 11:43:03 -08:00
Sven Mika
bb03e2499b
[RLlib] PyBullet Env native support via env str-specifier (if installed). ( #12209 )
2020-11-30 12:41:24 +01:00
Tao Wang
b85c6abc3e
Rename fields/variables from client id to node id ( #12457 )
2020-11-30 14:33:36 +08:00
SangBin Cho
3964defbe1
[Logging] Fix tensorflow logging issue. ( #12225 )
...
* in progress.
* ip
* In Progress
* done.
* fix lint.
* Addressed code review
* Addressed code review.
2020-11-29 22:16:52 -08:00
SangBin Cho
91d54ef621
[Core] Remove actor arg from executor to allow users to specify actor… ( #12239 )
...
* [Core] Remove actor arg from executor to allow users to specify actor arg in their Actor.remote.
* Addressed code review.
2020-11-29 22:15:48 -08:00
chaokunyang
17a6b9bbe7
Fix not cp jars ( #12456 )
2020-11-30 13:53:09 +08:00
Philipp Moritz
cf73ccddae
Allow more fields for object metadata ( #12484 )
2020-11-29 21:50:18 -08:00
Alex Wu
f1cc33a6a6
Actor resource backlog hotfix ( #12471 )
...
* prepare implemented
* works?
* deflek
* git
* deflek round 2
* .
* improve the test
Co-authored-by: Alex <alex@anyscale.com>
Co-authored-by: Eric Liang <ekhliang@gmail.com>
2020-11-29 20:55:50 -08:00
Amog Kamsetty
8a406e1f9a
[SGD] Add PTL Docs ( #12440 )
...
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-11-28 10:09:38 -08:00
Kai Fricke
1d0ade1b93
Revert "[Core] zero-copy serializer for pytorch ( #12344 )" ( #12469 )
...
This reverts commit 0a505ca8
2020-11-28 10:06:02 -08:00
Eric Liang
569eee5e71
Enable more new scheduler tests ( #12421 )
2020-11-27 16:10:38 -08:00
Richard Liaw
7c009d22cf
[docs] Add xgboost_ray to docs ( #12184 )
...
Co-authored-by: Amog Kamsetty <amogkamsetty@yahoo.com>
2020-11-27 11:36:56 -08:00
Siyuan (Ryans) Zhuang
0a505ca83d
[Core] zero-copy serializer for pytorch ( #12344 )
...
* zero-copy serializer for pytorch
* address possible bottleneck
* add tests & device support
2020-11-26 16:09:54 -08:00
Amog Kamsetty
e0573df337
[CI] Fix windows build ( #12415 )
...
* attempt to fix windows
* fix syntax
* try again
* try again
* try again
* Revert "[ray_client] Support calling functions from other functions and correct the tests (#12141 )"
This reverts commit 4066056a0d.
* Revert
* Revert "Revert "[ray_client] Support calling functions from other functions and correct the tests (#12141 )""
This reverts commit bb27b87b6c8d780ad796f4d4aeaa20113c8eca79.
* please work
* works
* fix
2020-11-26 10:52:11 -08:00
Sven Mika
c1d7826bb7
[RLlib] Move pettingzoo from requirements.txt into requirements_rllib.txt ( #12400 )
2020-11-26 19:30:35 +01:00
Ameer Haj Ali
9ccf5f6ccc
[ray client] add metadata and secure options to Worker. ( #12409 )
2020-11-25 17:48:13 -08:00
Richard Liaw
323941c745
[tune] fix pbt flakey test ( #12418 )
2020-11-25 16:58:37 -08:00
Eric Liang
f6a5b733d5
Remove flaky object manager test that's no longer needed
2020-11-25 12:45:47 -08:00
Ian Rodney
679492a235
[serve] Use Long Polling in Backend Worker ( #12093 )
2020-11-25 12:11:38 -08:00
SangBin Cho
753cda2f28
[Dashboard] Delete old dashboard ( #12144 )
...
* Delete old dashboard from repo.
* Delete old dashboard from repo. 2
2020-11-25 11:31:02 -08:00
ZhuSenlin
dc55f6ba3a
skip gcs fault tolerance test for the time being when new scheduler is enabled ( #12393 )
...
Co-authored-by: senlin.zsl <senlin.zsl@antfin.com>
2020-11-25 10:40:47 -08:00
SangBin Cho
2e4e285ef0
[Object Spilling] Fusion small objects ( #12087 )
2020-11-25 10:13:32 -08:00
Ian Rodney
c5845c3a4e
[docker] Docker stop on each node ( #12357 )
2020-11-24 23:15:53 -08:00
Barak Michener
4066056a0d
[ray_client] Support calling functions from other functions and correct the tests ( #12141 )
...
* Add test mode and fix f calling g
* formatting
* remove unused functions
* fix tests -- which will be better in actor PR
2020-11-24 22:19:20 -08:00
Tao Wang
e1075c0a82
[GCS] Fill resource fields when re-reporting heartbeat after GCS restart ( #12097 )
2020-11-25 11:07:02 +08:00
Edward Oakes
dae137b919
Don't allow 'optional' files in setup.py ( #12359 )
2020-11-24 17:41:58 -06:00
Eric Liang
5895554555
[autoscaler] Raise node "start" deadline to 900s, make configurable ( #12316 )
2020-11-24 12:16:01 -08:00
Edward Oakes
4ada3e4c99
[serve] Incremental change towards async control loop for replica startup ( #12281 )
2020-11-24 13:06:08 -06:00
roireshef
888357d251
added address resolution fix for running in docker containers ( #11944 )
...
* added address resolution fix for running in docker containers
* added address resolution fix for running in docker containers (java)
* Update RayNativeRuntime.java
Co-authored-by: Eric Liang <ekhliang@gmail.com>
2020-11-24 10:34:56 -08:00
Edward Oakes
be0fa7b8b4
Properly specify kubectl-rsync.sh in setup.py ( #12356 )
2020-11-24 12:13:29 -06:00