Commit graph

1545 commits

Author SHA1 Message Date
jichan3751
de95117e96 [sgd] Tune interface for Pytorch MultiNode SGD (#5350) 2019-08-10 13:51:44 -07:00
Eric Liang
df47bdf6c9
Allow address instead of redis_address (#5412)
* addr

* wip

* fix typo

* add to start

* switch to ray address for train

* say address

* disambiguate help

* comments 2
2019-08-10 00:18:41 -07:00
Juan Redondo Pizarro
7e8a4a62ea [tune] Add hyperopt warm start feature (#5372) 2019-08-09 12:27:57 -07:00
Simon Mo
18f1e904de Bump 0.8.0.dev2 -> 0.8.0.dev3 (#5409) 2019-08-09 11:37:19 -07:00
Simon Mo
d9b45cceec [Project] Implementing Project CLI (#5397) 2019-08-08 21:28:25 -07:00
Richard Liaw
ed89897a31
[tune,autoscaler] Test yaml, add better distributed docs (#5403) 2019-08-08 00:59:23 -07:00
Eric Liang
4a6ebe69d4
Fix setup (#5400) 2019-08-07 16:17:29 -07:00
Philipp Moritz
8d6c50c821
Fix compiler warnings and make warnings fatal (#5375) 2019-08-07 14:04:05 -07:00
Qing Wang
d372f24e3c
[ID Refactor] Refactor ActorID, TaskID and ObjectID (#5286)
* Refactor ActorID, TaskID on the Java side.

Left a TODO comment

WIP for ObjectID

ADD test

Fix

Add java part

Fix Java test

Fix

Refine test.

Enable test in CI

* Extra a helper function.

* Resolve TODOs

* Fix Python CI

* Fix Java lint

* Update .travis.yml

Co-Authored-By: Stephanie Wang <swang@cs.berkeley.edu>

* Address some comments.

Address some comments.

Add id_specification.rst

Reanme id_specification.rst to id_specification.md

typo

Address zhijun's comments.

Fix test

Address comments.

Fix lint

Address comments

* Fix test

* Address comments.

* Fix build error

* Update src/ray/design_docs/id_specification.md

Co-Authored-By: Stephanie Wang <swang@cs.berkeley.edu>

* Update src/ray/design_docs/id_specification.md

Co-Authored-By: Stephanie Wang <swang@cs.berkeley.edu>

* Update src/ray/design_docs/id_specification.md

Co-Authored-By: Stephanie Wang <swang@cs.berkeley.edu>

* Update src/ray/design_docs/id_specification.md

Co-Authored-By: Stephanie Wang <swang@cs.berkeley.edu>

* Update src/ray/design_docs/id_specification.md

Co-Authored-By: Stephanie Wang <swang@cs.berkeley.edu>

* Address comments

* Update src/ray/common/id.h

Co-Authored-By: Stephanie Wang <swang@cs.berkeley.edu>

* Update src/ray/common/id.h

Co-Authored-By: Stephanie Wang <swang@cs.berkeley.edu>

* Update src/ray/common/id.h

Co-Authored-By: Stephanie Wang <swang@cs.berkeley.edu>

* Update src/ray/design_docs/id_specification.md

Co-Authored-By: Hao Chen <chenh1024@gmail.com>

* Update src/ray/design_docs/id_specification.md

Co-Authored-By: Hao Chen <chenh1024@gmail.com>

* Address comments.

* Address comments.

* Address comments.

* Update C++ part to make sure task id is generated determantic

* WIP

* Fix core worker

* Fix Java part

* Fix comments.

* Add Python side

* Fix python

* Address comments

* Fix linting

* Fix

* Fix C++ linting

* Add JobId() method to TaskID

* Fix linting

* Update src/ray/common/id.h

Co-Authored-By: Hao Chen <chenh1024@gmail.com>

* Update java/api/src/main/java/org/ray/api/id/TaskId.java

Co-Authored-By: Hao Chen <chenh1024@gmail.com>

* Update java/api/src/main/java/org/ray/api/id/TaskId.java

Co-Authored-By: Hao Chen <chenh1024@gmail.com>

* Update java/api/src/main/java/org/ray/api/id/ActorId.java

Co-Authored-By: Hao Chen <chenh1024@gmail.com>

* Address comments

* Add DriverTaskId embeding job id

* Fix tests

* Add python dor_fake_driver_id

* Address comments and fix linting

* Fix CI
2019-08-07 11:04:51 +08:00
Richard Liaw
094ec7adbc
[tune] Allow nested values in trial runner (#5346) 2019-08-06 14:36:17 -07:00
Philipp Moritz
e8d9cfc1f1
Ray projects schema and validation (#5329) 2019-08-06 14:36:04 -07:00
Matthew A. Wright
e3c9f7e83a Custom action distributions (#5164)
* custom action dist wip

* Test case for custom action dist

* ActionDistribution.get_parameter_shape_for_action_space pattern

* Edit exception message to also suggest using a custom action distribution

* Clean up ModelCatalog.get_action_dist

* Pass model config to ActionDistribution constructors

* Update custom action distribution test case

* Name fix

* Autoformatter

* parameter shape static methods for torch distributions

* Fix docstring

* Generalize fake array for graph initialization

* Fix action dist constructors

* Correct parameter shape static methods for multicategorical and gaussian

* Make suggested changes to custom action dist's

* Correct instances of not passing model config to action dist

* Autoformatter

* fix tuple distribution constructor

* bugfix
2019-08-06 11:13:16 -07:00
Eric Liang
0a3ff489fa
Send raylet error logs through the log monitor (#5351) 2019-08-05 23:35:09 -07:00
Eric Liang
5d7afe8092
[rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00
Richard Liaw
32f275344b [tune] Pandas as soft dep 2019-08-05 18:59:08 -07:00
Eric Liang
955154a19d
Reduce Ray / RLlib startup messages (#5368) 2019-08-05 13:23:54 -07:00
Michael Gump
67f9e225cf [tune] Fix small bug in experiment_analysis (#5365)
* [tune] Fix small bug in experiment_analysis

_experiment_state was not set as an attribute of ExperimentAnalysis

* Update python/ray/tune/analysis/experiment_analysis.py

Co-Authored-By: Richard Liaw <rliaw@berkeley.edu>
2019-08-05 13:19:30 -07:00
Simon Mo
25b5bd1530 ray stop sends SIGKILL instead of SIGTERM (#5354) 2019-08-02 14:46:03 -07:00
Richard Liaw
1eaa57c98f
[tune] Distributed example + walkthrough (#5157) 2019-08-02 09:17:20 -07:00
Kristian Hartikainen
13fb9fe3db [rllib] Feature/soft actor critic v2 (#5328)
* Add base for Soft Actor-Critic

* Pick changes from old SAC branch

* Update sac.py

* First implementation of sac model

* Remove unnecessary SAC imports

* Prune unnecessary noise and exploration code

* Implement SAC model and use that in SAC policy

* runs but doesn't learn

* clear state

* fix batch size

* Add missing alpha grads and vars

* -200 by 2k timesteps

* doc

* lazy squash

* one file

* ignore tfp

* revert done
2019-08-01 23:37:36 -07:00
Xianyang Liu
3ae54a2b20 Fix log monitor read error (#5221) 2019-08-01 15:47:10 -07:00
Eric Liang
20450a4e82
[rllib] Add rock paper scissors multi-agent example (#5336) 2019-08-01 13:03:59 -07:00
jichan3751
bd6dfc994f [sgd] Replaced class Resources in sgd with use_gpu (#5252) 2019-08-01 01:03:10 -07:00
Lukas Folle
0391050daf Fixed link in tune that was not working (#5331) 2019-07-31 21:43:46 -07:00
Simon Mo
d762379904 [Asyncio] Allow Async_API to init when loop is running (#5323) 2019-07-31 15:29:42 -07:00
Kristian Hartikainen
1345802c39 [autoscaler] Change sys.exit(1) in update ssh_cmd (#5266) 2019-07-31 13:45:05 -07:00
Jaroslaw Rzepecki
b3c8091a35 Fix Tuple spaces in rollout.py (#5332)
Make sure that the initial action is also properly flattened.
2019-07-31 11:38:49 -07:00
Robert Nishihara
63a6b0e710 Fix bug in passing large arguments to tasks. (#5325) 2019-07-30 22:28:35 -07:00
Hao Chen
991e71dde6 Submit task asynchronously from raylet client (#5313) 2019-07-30 12:58:57 -07:00
Simon Mo
196495a4de Fix Redis Test (#5302) 2019-07-30 00:22:16 -07:00
micafan
b3bcf59148 Rename ClientTableData to GcsNodeInfo (#5251) 2019-07-30 11:22:47 +08:00
Simon Mo
3ba8680963 Bump version to 0.8.0.dev3 (#5308) 2019-07-29 18:28:38 -07:00
Michael Luo
1337c98f02 [rllib] Importance Sampling and KL Loss for APPO (#5051) 2019-07-29 15:02:32 -07:00
Simon Mo
3b00144e7d Bump version to 0.7.3 (#5301) 2019-07-29 10:25:32 -07:00
Eric Liang
3bdd114282
[rllib] Better example rnn envs (#5300) 2019-07-28 14:07:18 -07:00
Qing Wang
1465a30ea9
Fix releasing CPUs incorrectly when actor creation task blocked. (#5271)
* Fix

* Remove useless log

* Address

* Fix typo

* sleep
2019-07-28 15:46:17 +08:00
Richard Liaw
5ea859dc73
[sgd] hotfix example failure (#5297)
* hotfix

* Update train_example.py
2019-07-27 18:13:22 -07:00
Eric Liang
6f2c5b2819
Revert "[autoscaler] Clean up error messages on setup failure (#5210)" (#5299)
This reverts commit 7fc15dbf7f.
2019-07-27 16:53:47 -07:00
lanlin
341dbf6c45 [tune] support nested dictionaries for CSVLogger (#5295) 2019-07-27 14:44:34 -07:00
Richard Liaw
b4823d63c6
[autoscaler] Local YAML readability (#5290) 2019-07-27 12:51:50 -07:00
Eric Liang
a62c5f40f6
[rllib] Document ModelV2 and clean up the models/ directory (#5277) 2019-07-27 02:08:16 -07:00
Richard Liaw
9c00616cdc
Retry and exception for hang on memory store full (#5143) 2019-07-27 01:20:13 -07:00
Richard Liaw
5e15b36d6e
[tune] experiment_analysis split to Analysis (#5115) 2019-07-27 01:10:52 -07:00
Richard Liaw
7e715520e5
[sgd] Example for Training (#5292) 2019-07-27 01:10:25 -07:00
Daniel Edgecumbe
06fec63c87 [autoscaler] Add a 'request_cores' function for manual autoscaling (#4754) 2019-07-26 17:14:45 -07:00
lanlin
d9e81da3b8 [tune] configurable maximum length of trial identifier (#5287) 2019-07-26 17:09:54 -07:00
Antoine Galataud
827618254a [rllib] Configure learner queue timeout (#5270)
* configure learner queue timeout

* lint

* use config

* fix method args order, add unit test

* fix wrong param name
2019-07-25 21:18:05 -07:00
Stephanie Wang
3321555975
Increase timeout for ray.wait test (#5273)
* Increase test timeout for ray.wait

* make sure the actor is scheduled
2019-07-25 14:23:46 -07:00
Eric Liang
bf9199ad77
[rllib] ModelV2 support for pytorch (#5249) 2019-07-25 11:02:53 -07:00
Joey Jiang
40395acadf [gRPC] Migrate raylet client implementation to grpc (#5120) 2019-07-25 14:48:56 +08:00