Commit graph

259 commits

Author SHA1 Message Date
gehring
b520f6141e [rllib] Adds eager support with a generic TFEagerPolicy class (#5436) 2019-08-23 14:21:11 +08:00
Richard Liaw
cdc9227f1b
[tune] ASHA xgboost and lightgbm examples (#5500) 2019-08-22 10:37:59 -07:00
Philipp Moritz
e065f55998
Fix impala stress test (#5491) 2019-08-20 19:49:46 -07:00
Robert Nishihara
851c5b2dae Add a script for benchmarking performance for Ray developers. (#5472) 2019-08-19 23:41:23 -07:00
Richard Liaw
d7b309223b
[tune] MLFlow Logger (#5438) 2019-08-14 15:58:18 -07:00
Lisa Dunlap
b7d0733362 [tune] Implement BOHB (#5382) 2019-08-13 12:32:07 -07:00
Eric Liang
a1d2e17623
[rllib] Autoregressive action distributions (#5304) 2019-08-10 14:05:12 -07:00
jichan3751
de95117e96 [sgd] Tune interface for Pytorch MultiNode SGD (#5350) 2019-08-10 13:51:44 -07:00
Simon Mo
18f1e904de Bump 0.8.0.dev2 -> 0.8.0.dev3 (#5409) 2019-08-09 11:37:19 -07:00
Eric Liang
1a8fa5d2fa
Clean up top level Ray dir (#5404) 2019-08-08 23:35:55 -07:00
Eric Liang
592f313210
[rllib] Centralized critic / PPO example on TwoStepGame (#5392) 2019-08-08 14:03:28 -07:00
Simon Mo
50b93bf179 Check upstream with git remote (#5377) 2019-08-06 17:30:33 -07:00
Wonseok Jeon
281829e712 MADDPG implementation in RLlib (#5348) 2019-08-06 16:22:06 -07:00
Eric Liang
5d7afe8092
[rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00
Richard Liaw
1eaa57c98f
[tune] Distributed example + walkthrough (#5157) 2019-08-02 09:17:20 -07:00
Eric Liang
3bdd114282
[rllib] Better example rnn envs (#5300) 2019-07-28 14:07:18 -07:00
Eric Liang
a62c5f40f6
[rllib] Document ModelV2 and clean up the models/ directory (#5277) 2019-07-27 02:08:16 -07:00
Richard Liaw
7e715520e5
[sgd] Example for Training (#5292) 2019-07-27 01:10:25 -07:00
Eric Liang
f9043cc49a
[rllib] Remove experimental eager support 2019-07-21 12:27:17 -07:00
Eric Liang
d58b986858
[rllib] MultiCategorical shouldn't return array for kl or entropy (#5215)
* wip

* fix
2019-07-19 12:12:04 -07:00
Jones Wong
0af07bd493 Enable seeding actors for reproducible experiments (#5197)
*  enable graph-level worker-specific seed

*  lint checked

*  revised according to eric's suggestions

*  revised accordingly and added a test case

*  formated

* Update test_reproducibility.py

* Update trainer.py

* Update rollout_worker.py

* Update run_rllib_tests.sh

* Update worker_set.py
2019-07-17 23:31:34 -07:00
Eric Liang
047f4ccd61
[rllib] Fix rollout.py with tuple action space (#5201)
* fix it

* update doc too

* fix rollout
2019-07-16 10:52:35 -07:00
Simon Mo
fd71ffde2f Improve release process 0.7.2 (#5187) 2019-07-15 14:46:54 -07:00
Richard Liaw
b6509f46b0
Update wheels to 0.8.0dev2 (#5186) 2019-07-12 17:27:03 -07:00
Philipp Moritz
ccee77aafd fix node_failures.py (#5167) 2019-07-11 11:40:13 -07:00
Hao Chen
0c34749779
Use bazel disk cache for all CI jobs (#5144) 2019-07-10 22:03:45 +08:00
Richard Liaw
0b540ab492
[tune] Test example checkpointing (#4728) 2019-07-10 01:58:26 -07:00
Eric Liang
5ab5017c67
[rllib] Fix impala stress test (#5101)
* add copy

* upgrade to tf 1.14

* update

* reduce count to workaround https://github.com/ray-project/ray/issues/5125

* Update impala.py

* placeholder

* comments

* update
2019-07-09 20:22:30 -07:00
Joey Jiang
b4e51c8aa1 Support clang-format whose version is not 7.0 (#5139) 2019-07-08 17:15:09 +08:00
Robert Nishihara
9cc4cc6a52
Fail format.sh if yapf/flake8 versions are incorrect. (#5083) 2019-07-04 23:22:01 -07:00
Eric Liang
34d054ff19
[rllib] ModelV2 API (#4926) 2019-07-03 15:59:47 -07:00
Eric Liang
904dcf081d
Switch cluster longevity tests to DLAMI, fix ray up verbosity (#5084)
* fix

* add branch commit

* comments

* Update ci/long_running_tests/.gitignore

Co-Authored-By: Robert Nishihara <robertnishihara@gmail.com>
2019-07-02 00:19:05 -07:00
Philipp Moritz
a6a02fccd0 Do not compile redis twice (#5074) 2019-07-01 15:42:54 -07:00
Simon Mo
0c4dd3c401 Use bazel disk cache with travis (#5068) 2019-06-30 17:57:48 -07:00
Robert Nishihara
bcc379556b Make some fixes to long running stress tests. (#5056) 2019-06-28 15:42:54 -07:00
Richard Liaw
b1827d5fbe
[tune] Update MNIST Example (#4991) 2019-06-25 22:50:15 -07:00
Richard Liaw
bd8aceb896 [ci] Change Jenkins to py3 (#5022)
* conda3

* integration

* add nevergrad, remotedata

* pytest 0.3.1

* otherdockers

* setup

* tune
2019-06-24 21:50:37 -07:00
Hao Chen
2bf92e02e2
[gRPC] Use gRPC for inter-node-manager communication (#4968) 2019-06-17 19:00:50 +08:00
Hao Chen
3c92b2ee4d
Upgrade CI clang-format to 6.0 (#4976) 2019-06-14 14:52:32 +08:00
Robert Nishihara
6f48992322 Make release stress tests work and improve them. (#4955) 2019-06-10 23:04:01 -07:00
Eric Liang
9e328fbe6f
[rllib] Add docs on how to use TF eager execution (#4927) 2019-06-07 16:42:37 -07:00
Robert Nishihara
c3f8fc1c44
Update version number in documentation after release 0.7.0 -> 0.7.1 and 0.8.0.dev0 -> 0.8.0.dev1. (#4941) 2019-06-06 17:22:45 -07:00
Hersh Godse
89722ff003 [tune] Directional metrics for components (#4120) (#4915) 2019-06-02 22:13:40 -07:00
Eric Liang
7501ee51db
[rllib] Rename PolicyEvaluator => RolloutWorker (#4820) 2019-06-03 06:49:24 +08:00
Peter Schafhalter
c2ade075a3 [sgd] Distributed Training via PyTorch (#4797)
Implements distributed SGD using distributed PyTorch.
2019-06-01 21:39:22 -07:00
Eric Liang
1c073e92e4
[rllib] Fix documentation on custom policies (#4910)
* wip

* add docs

* lint

* todo sections

* fix doc
2019-06-01 16:13:21 +08:00
Eric Liang
d7be5a5d36
[rllib] Fix error getting kl when simple_optimizer: True in multi-agent PPO 2019-05-27 17:24:45 -07:00
Robert Nishihara
7a78e1e320 Install bazel in autoscaler development configs. (#4874) 2019-05-26 16:13:50 -07:00
Robert Nishihara
6703519144 Move global state API out of global_state object. (#4857) 2019-05-26 11:27:53 -07:00
Devin Petersohn
a7d01aba9b Update wheel versions in documentation to 0.8.0.dev0 and 0.7.0. (#4847) 2019-05-24 16:49:13 -07:00