Commit graph

91 commits

Author SHA1 Message Date
Eric Liang
f9043cc49a
[rllib] Remove experimental eager support 2019-07-21 12:27:17 -07:00
Eric Liang
d58b986858
[rllib] MultiCategorical shouldn't return array for kl or entropy (#5215)
* wip

* fix
2019-07-19 12:12:04 -07:00
Jones Wong
0af07bd493 Enable seeding actors for reproducible experiments (#5197)
*  enable graph-level worker-specific seed

*  lint checked

*  revised according to eric's suggestions

*  revised accordingly and added a test case

*  formated

* Update test_reproducibility.py

* Update trainer.py

* Update rollout_worker.py

* Update run_rllib_tests.sh

* Update worker_set.py
2019-07-17 23:31:34 -07:00
Eric Liang
047f4ccd61
[rllib] Fix rollout.py with tuple action space (#5201)
* fix it

* update doc too

* fix rollout
2019-07-16 10:52:35 -07:00
Simon Mo
fd71ffde2f Improve release process 0.7.2 (#5187) 2019-07-15 14:46:54 -07:00
Richard Liaw
b6509f46b0
Update wheels to 0.8.0dev2 (#5186) 2019-07-12 17:27:03 -07:00
Philipp Moritz
ccee77aafd fix node_failures.py (#5167) 2019-07-11 11:40:13 -07:00
Hao Chen
0c34749779
Use bazel disk cache for all CI jobs (#5144) 2019-07-10 22:03:45 +08:00
Richard Liaw
0b540ab492
[tune] Test example checkpointing (#4728) 2019-07-10 01:58:26 -07:00
Eric Liang
5ab5017c67
[rllib] Fix impala stress test (#5101)
* add copy

* upgrade to tf 1.14

* update

* reduce count to workaround https://github.com/ray-project/ray/issues/5125

* Update impala.py

* placeholder

* comments

* update
2019-07-09 20:22:30 -07:00
Joey Jiang
b4e51c8aa1 Support clang-format whose version is not 7.0 (#5139) 2019-07-08 17:15:09 +08:00
Robert Nishihara
9cc4cc6a52
Fail format.sh if yapf/flake8 versions are incorrect. (#5083) 2019-07-04 23:22:01 -07:00
Eric Liang
34d054ff19
[rllib] ModelV2 API (#4926) 2019-07-03 15:59:47 -07:00
Eric Liang
904dcf081d
Switch cluster longevity tests to DLAMI, fix ray up verbosity (#5084)
* fix

* add branch commit

* comments

* Update ci/long_running_tests/.gitignore

Co-Authored-By: Robert Nishihara <robertnishihara@gmail.com>
2019-07-02 00:19:05 -07:00
Philipp Moritz
a6a02fccd0 Do not compile redis twice (#5074) 2019-07-01 15:42:54 -07:00
Simon Mo
0c4dd3c401 Use bazel disk cache with travis (#5068) 2019-06-30 17:57:48 -07:00
Robert Nishihara
bcc379556b Make some fixes to long running stress tests. (#5056) 2019-06-28 15:42:54 -07:00
Richard Liaw
b1827d5fbe
[tune] Update MNIST Example (#4991) 2019-06-25 22:50:15 -07:00
Richard Liaw
bd8aceb896 [ci] Change Jenkins to py3 (#5022)
* conda3

* integration

* add nevergrad, remotedata

* pytest 0.3.1

* otherdockers

* setup

* tune
2019-06-24 21:50:37 -07:00
Hao Chen
2bf92e02e2
[gRPC] Use gRPC for inter-node-manager communication (#4968) 2019-06-17 19:00:50 +08:00
Hao Chen
3c92b2ee4d
Upgrade CI clang-format to 6.0 (#4976) 2019-06-14 14:52:32 +08:00
Robert Nishihara
6f48992322 Make release stress tests work and improve them. (#4955) 2019-06-10 23:04:01 -07:00
Eric Liang
9e328fbe6f
[rllib] Add docs on how to use TF eager execution (#4927) 2019-06-07 16:42:37 -07:00
Robert Nishihara
c3f8fc1c44
Update version number in documentation after release 0.7.0 -> 0.7.1 and 0.8.0.dev0 -> 0.8.0.dev1. (#4941) 2019-06-06 17:22:45 -07:00
Hersh Godse
89722ff003 [tune] Directional metrics for components (#4120) (#4915) 2019-06-02 22:13:40 -07:00
Eric Liang
7501ee51db
[rllib] Rename PolicyEvaluator => RolloutWorker (#4820) 2019-06-03 06:49:24 +08:00
Peter Schafhalter
c2ade075a3 [sgd] Distributed Training via PyTorch (#4797)
Implements distributed SGD using distributed PyTorch.
2019-06-01 21:39:22 -07:00
Eric Liang
1c073e92e4
[rllib] Fix documentation on custom policies (#4910)
* wip

* add docs

* lint

* todo sections

* fix doc
2019-06-01 16:13:21 +08:00
Eric Liang
d7be5a5d36
[rllib] Fix error getting kl when simple_optimizer: True in multi-agent PPO 2019-05-27 17:24:45 -07:00
Robert Nishihara
7a78e1e320 Install bazel in autoscaler development configs. (#4874) 2019-05-26 16:13:50 -07:00
Robert Nishihara
6703519144 Move global state API out of global_state object. (#4857) 2019-05-26 11:27:53 -07:00
Devin Petersohn
a7d01aba9b Update wheel versions in documentation to 0.8.0.dev0 and 0.7.0. (#4847) 2019-05-24 16:49:13 -07:00
Eric Liang
351753aae5
[rllib] Remove dependency on TensorFlow (#4764)
* remove hard tf dep

* add test

* comment fix

* fix test
2019-05-10 20:36:18 -07:00
Devin Petersohn
edb8465910 [ray-core] Initial addition of performance integration testing files (#4325) 2019-05-08 13:40:54 -07:00
Devin Petersohn
3a72430e87 Bump Modin version to 0.5 (#4752)
* Bump Modin version

* Update Travis pandas version
2019-05-07 01:12:52 -07:00
Si-Yuan
9ce3039390
Fix webui api (#4686)
* fix webui

* Apply suggestions from code review

lint

Co-Authored-By: suquark <suquark@gmail.com>

* add dependencies for this unittest

* move dependencies to the script file
2019-04-27 15:23:56 +08:00
Devin Petersohn
fb2655fa93 Update Release Process documentation (#4670) 2019-04-25 00:05:19 -07:00
justinwyang
2f4b784d0e Fixe flakequotes to allow escaping quotes (#4666) 2019-04-19 13:55:20 -07:00
Eric Liang
ce66a552bf
Move large mem test to end (#4664) 2019-04-19 11:43:22 -07:00
Wang Qing
0d433b220c [hotfix] Hot fix for CI linting error 2019-04-18 21:48:55 -07:00
Eric Liang
3fd9dea721
[rllib] Fix tune.run(Agent class) (#4630)
* update

* Update __init__.py
2019-04-15 09:12:23 -07:00
cfan
bb207a205b [rllib] Support torch device and distributions. (#4553) 2019-04-12 11:39:14 -07:00
justinwyang
e88e706fcc Enforce quoting style in Travis. (#4589) 2019-04-11 14:24:26 -07:00
Eric Liang
4f46d3e9bf
[rllib] Add multi-agent examples for hand-coded policy, centralized VF (#4554) 2019-04-09 00:36:49 -07:00
ctombumila37
7746d20d30 [rllib] ExternalMultiAgentEnv (#4200) 2019-04-06 19:58:14 -07:00
Philipp Moritz
b0f6ddf6d1 Remove CMake files (#4493) 2019-04-02 22:17:33 -07:00
Eric Liang
0d94f3eeef
[rllib] Improve datapath throughput of IMPALA / APPO (#4324) 2019-03-31 12:25:52 -07:00
bjg2
77005d1814 [rllib] Make batch timeout for remote workers tunable (#4435) 2019-03-29 13:19:42 -07:00
Eric Liang
2ffe67c5c3
[rllib] Minor cleanups to TFPolicyGraph: add init args, constants for loss inputs (#4478) 2019-03-29 12:44:23 -07:00
Robert Nishihara
c6f12e5219 Update documentation from 0.7.0.dev1 to 0.7.0.dev2. (#4485) 2019-03-26 17:32:53 -07:00