Commit graph

1557 commits

Author SHA1 Message Date
Robert Nishihara
a82e8118a0 Fix resource bookkeeping bug with acquiring unknown resource. (#4945) 2019-06-07 21:07:27 -07:00
Eric Liang
77689d1116
[rllib] Port remainder of algorithms to build_trainer() pattern (#4920) 2019-06-07 16:45:36 -07:00
Eric Liang
9e328fbe6f
[rllib] Add docs on how to use TF eager execution (#4927) 2019-06-07 16:42:37 -07:00
Yuhong Guo
5eff47b657 [C++] Add hash table to Redis-Module (#4911) 2019-06-07 16:11:37 +08:00
Robert Nishihara
c3f8fc1c44
Update version number in documentation after release 0.7.0 -> 0.7.1 and 0.8.0.dev0 -> 0.8.0.dev1. (#4941) 2019-06-06 17:22:45 -07:00
Robert Nishihara
a0f14e9e6c Bump version from 0.7.1 to 0.8.0.dev1. (#4937) 2019-06-06 11:20:05 -07:00
Timon Ruban
2702b15b04 [tune] Add requirements-dev.txt and update docs for contributing (#4925)
* Add requirements-dev.txt and update docs.

* Update doc/source/tune-contrib.rst

Co-Authored-By: Richard Liaw <rliaw@berkeley.edu>

* Unpin everything except for yapf.
2019-06-05 09:04:36 -07:00
Hao Chen
d106283769 Better organize ray_common module (#4898) 2019-06-04 23:19:09 -07:00
Timon Ruban
c2253d2313 [tune] Make PBT Quantile fraction configurable (#4912) 2019-06-03 18:45:15 -07:00
Hersh Godse
89722ff003 [tune] Directional metrics for components (#4120) (#4915) 2019-06-02 22:13:40 -07:00
Richard Liaw
084b22181e Fix local cluster yaml (#4918) 2019-06-03 08:45:57 +08:00
Eric Liang
7501ee51db
[rllib] Rename PolicyEvaluator => RolloutWorker (#4820) 2019-06-03 06:49:24 +08:00
Eric Liang
99eae05cf6
[tune] Disallow setting resources_per_trial when it is already configured (#4880)
* disallow it

* import fix

* fix example

* fix test

* fix tests

* Update mock.py

* fix

* make less convoluted

* fix tests
2019-06-03 06:47:39 +08:00
Akshat Gokhale
d86ee8c83e fetching objects in parallel in _get_arguments_for_execution (#4775) 2019-06-01 23:35:48 -07:00
Eric Liang
665d081fe9
[rllib] Rough port of DQN to build_tf_policy() pattern (#4823) 2019-06-02 14:14:31 +08:00
Peter Schafhalter
c2ade075a3 [sgd] Distributed Training via PyTorch (#4797)
Implements distributed SGD using distributed PyTorch.
2019-06-01 21:39:22 -07:00
Kristian Hartikainen
88bab5d3c4 [tune] Pretty print params json in logger.py (#4903) 2019-06-01 12:38:59 -07:00
Eric Liang
9aa1cd613d
[rllib] Allow Torch policies access to full action input dict in extra_action_out_fn (#4894)
* fix torch extra out

* preserve setitem

* fix docs
2019-06-01 16:58:49 +08:00
Eric Liang
1c073e92e4
[rllib] Fix documentation on custom policies (#4910)
* wip

* add docs

* lint

* todo sections

* fix doc
2019-06-01 16:13:21 +08:00
Yuhong Guo
1f0809e2b4 Refactor ID Serial 2: change all ID functions to CamelCase (#4896) 2019-05-31 11:31:18 +08:00
Si-Yuan
4e0be8b450
Drop duplicated string format (#4897)
This string format is unnecessary. java_worker_options has been appended to the commandline later.
2019-05-30 19:43:27 +08:00
Eric Liang
3f4d37cd0e
[rllib] Fix Multidiscrete support (#4869) 2019-05-29 20:41:02 -07:00
Eric Liang
2dd0beb5bd
[rllib] Allow access to batches prior to postprocessing (#4871) 2019-05-29 18:17:14 -07:00
Robert Nishihara
a218a14c92 Bump version from 0.8.0.dev0 to 0.7.1. (#4890) 2019-05-29 16:57:28 -07:00
Richard Liaw
acee89b1f6
[tune] Auto-init Ray + default SearchAlg (#4815) 2019-05-29 12:09:34 -07:00
Philipp Moritz
64eb7b322c Upgrade arrow to latest master (#4858) 2019-05-28 16:04:16 -07:00
Eric Liang
d7be5a5d36
[rllib] Fix error getting kl when simple_optimizer: True in multi-agent PPO 2019-05-27 17:24:45 -07:00
Eric Liang
a45c61e19b
[rllib] Update concepts docs and add "Building Policies in Torch/TensorFlow" section (#4821)
* wip

* fix index

* fix bugs

* todo

* add imports

* note on get ph

* note on get ph

* rename to building custom algs

* add rnn state info
2019-05-27 14:17:32 -07:00
Richard Liaw
574e1c7695
[tune] Fix up Ax Search and Examples (#4851)
* update Ax for cleaner API

* docs update
2019-05-27 13:23:17 -07:00
Robert Nishihara
7a78e1e320 Install bazel in autoscaler development configs. (#4874) 2019-05-26 16:13:50 -07:00
Robert Nishihara
6703519144 Move global state API out of global_state object. (#4857) 2019-05-26 11:27:53 -07:00
Eric Liang
7237ea70c4 [rllib] [RFC] Deprecate Python 2 / RLlib (#4832) 2019-05-25 10:45:26 -07:00
Richard Liaw
0ce0ecbe9c
[tune] Later expansion of local_dir (#4806) 2019-05-25 02:19:28 -07:00
Devin Petersohn
a7d01aba9b Update wheel versions in documentation to 0.8.0.dev0 and 0.7.0. (#4847) 2019-05-24 16:49:13 -07:00
Robert Nishihara
49fe894e22 Export remote functions when first used and also fix bug in which rem… (#4844)
* Export remote functions when first used and also fix bug in which remote functions and actor classes are not exported from workers during subsequent ray sessions.

* Documentation update

* Fix tests.

* Fix grammar
2019-05-24 13:44:39 -07:00
Devin Petersohn
ba6c595094 Bump Ray master version to 0.8.0.dev0 (#4845) 2019-05-23 17:02:20 -07:00
Robert Nishihara
2015085192
Fix bug in which actor classes are not exported multiple times. (#4838) 2019-05-23 09:22:46 -07:00
Yuhong Guo
1a39fee9c6
Refactor ID Serial 1: Separate ObjectID and TaskID from UniqueID (#4776)
* Enable BaseId.

* Change TaskID and make python test pass

* Remove unnecessary functions and fix test failure and change TaskID to
16 bytes.

* Java code change draft

* Refine

* Lint

* Update java/api/src/main/java/org/ray/api/id/TaskId.java

Co-Authored-By: Hao Chen <chenh1024@gmail.com>

* Update java/api/src/main/java/org/ray/api/id/BaseId.java

Co-Authored-By: Hao Chen <chenh1024@gmail.com>

* Update java/api/src/main/java/org/ray/api/id/BaseId.java

Co-Authored-By: Hao Chen <chenh1024@gmail.com>

* Update java/api/src/main/java/org/ray/api/id/ObjectId.java

Co-Authored-By: Hao Chen <chenh1024@gmail.com>

* Address comment

* Lint

* Fix SINGLE_PROCESS

* Fix comments

* Refine code

* Refine test

* Resolve conflict
2019-05-22 14:46:30 +08:00
Qing Wang
259cdfa0de Fix issue when starting raylet_monitor (#4829) 2019-05-22 11:08:24 +08:00
Eric Liang
02583a8598 [rllib] Rename PolicyGraph => Policy, move from evaluation/ to policy/ (#4819)
This implements some of the renames proposed in #4813
We leave behind backwards-compatibility aliases for *PolicyGraph and SampleBatch.
2019-05-20 16:46:05 -07:00
Eric Liang
6cb5b90bd6
[rllib] [RFC] Dynamic definition of loss functions and modularization support (#4795)
* dynamic graph

* wip

* clean up

* fix

* document trainer

* wip

* initialize the graph using a fake batch

* clean up dynamic init

* wip

* spelling

* use builder for ppo pol graph

* add ppo graph

* fix naming

* order

* docs

* set class name correctly

* add torch builder

* add custom model support in builder

* cleanup

* remove underscores

* fix py2 compat

* Update dynamic_tf_policy_graph.py

* Update tracking_dict.py

* wip

* rename

* debug level

* rename policy_graph -> policy in new classes

* fix test

* rename ppo tf policy

* port appo too

* forgot grads

* default policy optimizer

* make default config optional

* add config to optimizer

* use lr by default in optimizer

* update

* comments

* remove optimizer

* fix tuple actions support in dynamic tf graph
2019-05-18 00:23:11 -07:00
Noah Golmant
1ef9c0729d [tune] Initial track integration (#4362)
Introduces a minimally invasive utility for logging experiment results. A broad requirement for this tool is that it should integrate seamlessly with Tune execution.
2019-05-17 11:34:05 -07:00
Qing Wang
dcd6d4949c Fix Java worker log dir (#4781) 2019-05-17 16:13:28 +08:00
Richard Liaw
e20855ccae
[tune] Remove extra parsing functionality (#4804) 2019-05-16 23:11:35 -07:00
Richard Liaw
88b45a53d6
[autoscaler] rsync cluster (#4785) 2019-05-16 23:11:06 -07:00
Richard Liaw
ffe61fcc70
[tune] Support non-arg submit (#4803) 2019-05-16 23:10:07 -07:00
Eric Liang
3807fb505b
[rllib] TensorFlow 2 compatibility (#4802) 2019-05-16 22:12:07 -07:00
Eric Liang
7d5ef6d99c
[rllib] Support continuous action distributions in IMPALA/APPO (#4771) 2019-05-16 22:05:07 -07:00
Richard Liaw
9f2645d6ea [tune] Fix CLI test (#4801) 2019-05-16 13:50:03 -07:00
Devin Petersohn
1490a98a71 Bump version to 0.7.0 (#4791) 2019-05-15 22:55:21 -07:00