Philipp Moritz
ccee77aafd
fix node_failures.py ( #5167 )
2019-07-11 11:40:13 -07:00
Eric Liang
5ab5017c67
[rllib] Fix impala stress test ( #5101 )
...
* add copy
* upgrade to tf 1.14
* update
* reduce count to workaround https://github.com/ray-project/ray/issues/5125
* Update impala.py
* placeholder
* comments
* update
2019-07-09 20:22:30 -07:00
Eric Liang
904dcf081d
Switch cluster longevity tests to DLAMI, fix ray up verbosity ( #5084 )
...
* fix
* add branch commit
* comments
* Update ci/long_running_tests/.gitignore
Co-Authored-By: Robert Nishihara <robertnishihara@gmail.com>
2019-07-02 00:19:05 -07:00
Robert Nishihara
bcc379556b
Make some fixes to long running stress tests. ( #5056 )
2019-06-28 15:42:54 -07:00
Hersh Godse
89722ff003
[tune] Directional metrics for components ( #4120 ) ( #4915 )
2019-06-02 22:13:40 -07:00
Robert Nishihara
7a78e1e320
Install bazel in autoscaler development configs. ( #4874 )
2019-05-26 16:13:50 -07:00
Devin Petersohn
fb2655fa93
Update Release Process documentation ( #4670 )
2019-04-25 00:05:19 -07:00
Philipp Moritz
b0f6ddf6d1
Remove CMake files ( #4493 )
2019-04-02 22:17:33 -07:00
bjg2
77005d1814
[rllib] Make batch timeout for remote workers tunable ( #4435 )
2019-03-29 13:19:42 -07:00
Robert Nishihara
c6f12e5219
Update documentation from 0.7.0.dev1 to 0.7.0.dev2. ( #4485 )
2019-03-26 17:32:53 -07:00
William Ma
11580fb7dc
Changes where actor resources are assigned ( #4323 )
2019-03-24 15:49:36 -07:00
William Ma
f423909aec
Temporary fix for many_actor_task.py ( #4315 )
2019-03-09 00:07:45 -08:00
Robert Nishihara
fd2d8c2c06
Remove Jenkins backend tests and add new long running stress test. ( #4288 )
2019-03-08 15:29:39 -08:00
Philipp Moritz
39eed24d47
update version from 0.7.0.dev0 to 0.7.0.dev1 ( #4282 )
2019-03-06 14:43:09 -08:00
Robert Nishihara
f151aa8723
Update long running stress tests and add actor death test. ( #4275 )
2019-03-06 14:26:45 -08:00
Eric Liang
6e3384a719
[rllib] Add three new long-running stress tests {APEX, IMPALA, PBT} ( #4215 )
2019-03-04 14:05:42 -08:00
Robert Nishihara
c4aa90314d
Add script for shutting down tests. ( #4203 )
2019-03-01 19:56:30 -08:00
Robert Nishihara
75504b9586
Add script for running infinitely long stress tests. ( #4163 )
...
Running `./ci/long_running_tests/start_workloads.sh` will start several workloads running (each in their own EC2 instance).
- The workloads run forever.
- The workloads all simulate multiple nodes but use a single machine.
- You can get the tail of each workload by running `./ci/long_running_tests/check_workloads.sh`.
- You have to manually shut down the instances.
As discussed with @ericl @richardliaw, the idea here is to optimize for the debuggability of the tests. If one of them fails, you can ssh to the relevant instance and see all of the logs.
2019-02-27 14:33:06 -08:00