Commit graph

5776 commits

Author SHA1 Message Date
Amog Kamsetty
0aec4cbccb
[Tune] Update PBT Transformers Example (#10289)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
Co-authored-by: krfricke <krfricke@users.noreply.github.com>
2020-08-27 08:25:05 -07:00
krfricke
53ab228b75
[tune] Fix log to file on actor reuse (#10363) 2020-08-27 08:22:19 -07:00
Alex Wu
6d2af33a01
[Autoscaler] Proper resource demand plumbing (#10329) 2020-08-26 23:36:01 -07:00
Ian Rodney
9056854c06
drop keep alive (#10347) 2020-08-26 21:15:48 -07:00
Edward Oakes
60665fc936
Clean up task dependency and scheduler metrics (#10340) 2020-08-26 22:56:03 -05:00
Lixin Wei
fe6daef85e
[Core]Add runtime context for python worker (#10309)
* add runtime context for python

* fixed

* code fixed

* test added

* lint

* lint
2020-08-26 20:11:42 -07:00
Ian Rodney
2526c06b5e
[WIP] [docker] Cleanup Docker Base-Deps (#9988)
* cleanup-base deps

* only build base-deps a bit

* remove parens

* formatting

* add ray-deps

* gpu enabled

* always include wheel

* fix script

* log new variables

* run tests for docker

* try to include env variables

* source files

* remove bash when sourcing

* add new lines

* use wget

* dual build autoscaler

* switch to gnupg

* add gcc cmake

* remove blist

* clarify build-docker-images
2020-08-26 19:36:11 -07:00
Ian Rodney
e2eef6469b
Deprecate Jenkins (#10314) 2020-08-26 15:43:27 -07:00
Ameer Haj Ali
17c8c63e7e
Metadata schema (#10328)
* metadata

* Eric

Co-authored-by: Ameer Haj Ali <ameerhajali@Ameers-MacBook-Pro.local>
2020-08-26 15:43:03 -07:00
Richard Liaw
29e8a664c4
[cli] make sure old-style works (#10344) 2020-08-26 15:26:24 -07:00
Lixin Wei
4b856fa416
[Core]Async updating issue fixed for actor's num_restart (#10176)
* bug fixed for num_restart updating

* add log

* log updated

* lint

* fixed

* Update src/ray/gcs/gcs_server/gcs_actor_manager.cc

Co-authored-by: Stephanie Wang <swang@cs.berkeley.edu>

* bug fixed

* bug fixed

* test passed

Co-authored-by: Stephanie Wang <swang@cs.berkeley.edu>
2020-08-26 11:49:26 -07:00
Edward Oakes
c35ad8237d
[metrics] Clean up object manager stats (#10316) 2020-08-26 13:43:06 -05:00
Ian Rodney
dc378a80b7
[autoscaler/docker] Docker Initialization Revamp (#9515)
* Basic idea

* Small fixes

* dockerize start commands in Command Runner

* Remove run_init from CommandRunnerInterface

* Add Parens

Co-authored-by: Simon Mo <simon.mo@hey.com>

* Cleaning up

* Response to richards comments

* Further small fixes

* Fix Json

* schema format fix

* cleanup

* run more often

* fix indent

* Fix richards responses

* fix ups

* remove docker_commands from schema

* default to list

* fix docker cmd runner test

* lint fix

Co-authored-by: Simon Mo <simon.mo@hey.com>
2020-08-26 10:29:06 -07:00
Edward Oakes
916a19363f
Clean up actor metrics (#10317) 2020-08-26 10:21:15 -05:00
Sven Mika
93120e0347
Unity3D API Fixes (recent changes in Unity's MLAgents API caused errors on RLlib side). (#10285) 2020-08-26 14:16:08 +02:00
Michael Luo
4e9888ce2f
[RLlib] Dreamer (#10172) 2020-08-26 13:24:05 +02:00
Alex Wu
9ca159aa0b
[Autoscaler] Multi node commands (#10236) 2020-08-25 23:35:38 -07:00
Olli Huotari
0dae50b5eb
Fixed num_atoms>1 in pytorch (#10330) 2020-08-25 23:10:20 -07:00
Amog Kamsetty
8c0503ddd3
[Tune] Convert PBT DCGAN Example to Function API (#10246)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-08-25 22:34:19 -07:00
Antoni Baum
87ed20738e
[tune] Add on_pause, on_unpause to ConcurrencyLimiter (#10320)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-08-25 22:33:17 -07:00
Simon Mo
ed3fdd2c0b
[Serve] Remove register_custom_serializer (#10331) 2020-08-25 21:20:43 -07:00
Edward Oakes
cbd9632f3a
Fix wait timeout logic (#10199) 2020-08-25 22:41:39 -05:00
fyrestone
08adbb371f
Cross language exception (#10023) 2020-08-26 10:46:05 +08:00
Edward Oakes
1e99b814f0
Remove unused scheduler states (#10318)
* remove unused state

* remove unused states
2020-08-25 18:56:21 -07:00
Eric Liang
deea1861ab
[rllib] Try fixing torch GPU and masking errors (#10168) 2020-08-25 18:34:19 -07:00
Eric Liang
6fcb816fdd
Ray operator deprecation message (#10334) 2020-08-25 18:26:02 -07:00
Robert Nishihara
79eefbf357
Better checking that ray.init() has been called. (#10261) 2020-08-25 17:13:11 -07:00
Stephanie Wang
d4537ac1ce
[core] Try to schedule tasks locally before spilling over to remote nodes (#10302)
* Regression test

* Spillback

* Remove check for actor tasks
2020-08-25 15:01:59 -07:00
Richard Liaw
146d91385c
[tune] custom trial directory name (#10214) 2020-08-25 12:52:54 -07:00
kisuke95
24a7a8a04d
[Streaming] Build fix (#10233) 2020-08-25 11:37:21 -07:00
Matthew Strawbridge
7a5af7e744
Fix links to ddpg tuned examples (#9713) 2020-08-25 11:30:13 -07:00
Ian Rodney
b14c56e599
fix lint (#10315) 2020-08-25 10:07:20 -07:00
Benjamin Black
2689fb439c
Fixed pettingzoo env example (#9973) 2020-08-25 13:22:25 +02:00
wanxing
e816e3aefb
[Streaming]Streaming queue support failover (#8161) 2020-08-25 14:19:45 +08:00
krfricke
5a787a8253
[tune] added FAQ to docs (#10222)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-08-24 21:51:02 -07:00
SangBin Cho
7ea4bcc1ab
Add a basic rule to contributors / PR template. (#10277)
* Add a basic rule to contributors / PR template.

* Fix.

* Addressed code review.
2020-08-24 20:15:06 -07:00
SangBin Cho
3b3ca96a4e
[Placement Group] Wait (#10259)
* Initial progress done.

* Fix wrong test.

* Improve tests.

* Update code.

* Addressed code review and merge conflict.

* Addressed code review.
2020-08-24 20:14:48 -07:00
Richard Liaw
6dc22a6d68
[autoscaler] Fix logging regression (#10280) 2020-08-24 14:25:12 -07:00
fyrestone
05c103af94
[Dashboard] Start the new dashboard (#10131)
* Use new dashboard if environment var RAY_USE_NEW_DASHBOARD exists; new dashboard startup

* Make fake client/build/static directory for dashboard

* Add test_dashboard.py for new dashboard

* Travis CI enable new dashboard test

* Update new dashboard

* Agent manager service

* Add agent manager

* Register agent to agent manager

* Add a new line to the end of agent_manager.cc

* Fix merge; Fix lint

* Update dashboard/agent.py

Co-authored-by: SangBin Cho <rkooo567@gmail.com>

* Update dashboard/head.py

Co-authored-by: SangBin Cho <rkooo567@gmail.com>

* Fix bug

* Add tests for dashboard

* Fix

* Remove const from Process::Kill() & Fix bugs

* Revert error check of execute_after

* Raise exception from DashboardAgent.run

* Add more tests.

* Fix compile on Linux

* Use dict comprehension instead of dict(generator)

* Fix lint

* Fix windows compile

* Fix lint

* Test Windows CI

* Revert "Test Windows CI"

This reverts commit 945e01051ec95cff5fcc1c0bc37045b46e7ad9a6.

* Fix ParseWindowsCommandLine bug

* Update src/ray/util/util.cc

Co-authored-by: Robert Nishihara <robertnishihara@gmail.com>

Co-authored-by: 刘宝 <po.lb@antfin.com>
Co-authored-by: SangBin Cho <rkooo567@gmail.com>
Co-authored-by: Robert Nishihara <robertnishihara@gmail.com>
2020-08-24 13:24:23 -07:00
Max Fitton
832f5cdccb
[Dashboard] Memory View Group by Stack Trace and UI Overhaul (#10227) 2020-08-24 14:54:42 -05:00
raoul-khour-ts
c8c4832794
Prevent Local Worker creation from blocking remote worker creation by creating remote workers before local worker (#10245)
* create remote workers before local worker

* reformatted
2020-08-24 12:29:55 -07:00
PidgeyBE
a82124d304
Update memory_monitor.py (#9212) 2020-08-24 10:29:01 -07:00
Eric Liang
4761eacc3e
[autoscaler] Also account for head node resources in multi node type autoscaling (#10230) 2020-08-24 10:26:22 -07:00
Ian Rodney
f051c2852e
[docker] docker cp correctly into container (#10253) 2020-08-24 09:18:34 -07:00
Kai Yang
07f6cb17e4
[Core] Multi-tenancy: Refine worker env variable passing (#10191)
* Resolve issues with environment variable handling

* fix

* fix warning

* lint

Co-authored-by: Mehrdad <noreply@github.com>
2020-08-24 09:04:22 -07:00
SangBin Cho
1f54acd274
[Tech Debt] Use f-string for python/ray/*.py (#10268)
* In progress.

* Done with critical path.

* Modified cluster_utils.py and log_monitor.py

* Addressed code review.
2020-08-23 22:01:31 -07:00
fangfengbin
b61a79efd7
[Placement Group]Fix SigSegv bug (#10262)
* fix SigSegv bug

* fix review comments

* fix ut bug

Co-authored-by: 灵洵 <fengbin.ffb@antfin.com>
2020-08-23 11:33:40 -07:00
Richard Liaw
73c4246332
[Core] fix-bad-stack (#10266) 2020-08-23 10:33:29 -07:00
Michael Luo
48a39d7cb9
[RLlib] Deepmind Control Suite Examples (#9751) 2020-08-23 12:53:08 +02:00
Yu Shan
5264f888e4
fix iterable dataset (issue 9899) (#9952) 2020-08-22 19:40:38 -07:00