Robert Nishihara
c92a867c8b
Fix log monitor CPU utilization. ( #4091 )
2019-02-19 12:19:21 -08:00
Wang Qing
794a093249
Add runtime_context to get some runtime fields in worker ( #4065 )
2019-02-19 15:57:30 +08:00
Wang Qing
7574757391
Fix crash for Java task's task.argument()
in state. ( #4063 )
2019-02-19 12:46:07 +08:00
Philipp Moritz
cfc7e2c5a9
Fix modin test ( #4069 )
2019-02-18 12:17:36 -08:00
Eric Liang
6e46d75554
[tune] Remove slow gzip of checkpoints; ignore jupyter stop errors ( #4076 )
...
* fix gzip
* ignore jupyter
2019-02-18 01:30:13 -08:00
Eric Liang
f8bef004da
[rllib] Improve error message for bad envs, add remote env docs ( #4044 )
...
* commit
* fix up rew
2019-02-18 01:28:19 -08:00
Robert Nishihara
b78d77257b
Speed up test/component_failures_test.py::test_actor_creation_node_failure. ( #4056 )
2019-02-17 15:35:54 -08:00
Robert Nishihara
5a9098891f
Add serialization test for more collection types. ( #3982 )
...
* Add serialization test for more collection types.
* Reorganize serialization tests a little.
* Update
2019-02-17 13:57:33 -08:00
Philipp Moritz
f51969964d
Fix linting on master ( #4077 )
2019-02-17 13:55:40 -08:00
Megan Kawakami
346885068c
[rllib] add torch pg ( #3857 )
...
* add torch pg
* add torch imports
* added torch pg
* working torch pg implementation
* add pg pytorch
* Update a3c.py
* Update a3c.py
* Update torch_policy_graph.py
* Update torch_policy_graph.py
2019-02-16 19:54:14 -08:00
Zekun Shi
a708ab66f5
Add simplex action space and dirichlet action distribution ( #4070 )
...
* add simplex action space and dirichlet action distribution
* Update and rename spaces.py to extra_spaces.py
* Update __init__.py
* Update catalog.py
* Fix python 2
* Update extra_spaces.py
* change Simplex.contains() to return False
2019-02-16 12:44:59 -08:00
Kristian Hartikainen
0cc5c88075
[tune] Add number of trials to the trial runner logger ( #4068 )
2019-02-16 01:12:59 -08:00
Yu Kobayashi
d2d66c576e
Support non ascii characters in the source code ( #4047 )
2019-02-16 11:45:44 +08:00
Hao Chen
de17443dc2
Propagate backend error to worker ( #4039 )
2019-02-16 11:39:15 +08:00
William Ma
4be3d0c5d3
Update shipped modin to 0.3.1 ( #4058 )
2019-02-15 15:49:38 -08:00
Robert Nishihara
2d07df7f3f
Replace '__main__' with "__main__". ( #4055 )
2019-02-15 13:32:43 -08:00
Robert Nishihara
5f71751891
API cleanups. Remove worker argument. Remove some deprecated arguments. ( #4025 )
...
* Remove worker argument from API methods.
* Remove deprecated arguments and deprecate redirect_output and redirect_worker_output.
* Fix
2019-02-15 10:49:16 -08:00
Hao Chen
042ad84573
Simplify Cython ID types and fix bug of ActorCheckpointID ( #4045 )
2019-02-15 20:15:16 +08:00
Richard Liaw
bb7c4ce9c4
[tune] Improve error message when Ray crashes ( #3795 )
2019-02-15 01:04:17 -08:00
Richard Liaw
7cf62a10cd
[tune] Fix TF checkpointing example ( #4043 )
...
Closes #3912 , closes #3963 .
2019-02-15 00:30:27 -08:00
Stephanie Wang
3684e5bc0d
Fix memory leak in Redis by using auto memory management ( #4054 )
...
* Table appends should always succeed
* Use Redis auto memory management
* Remove unneeded namespace
2019-02-14 19:51:18 -08:00
Eric Liang
0c0bd4d41c
[rllib] Use model.value_function() in MARWIL ( #4036 )
...
* fix marwil
* add ph
* fix
2019-02-14 19:35:21 -08:00
William Ma
8ee53297b1
Add documentation on how to use debug tools ( #4000 )
2019-02-14 13:50:21 -08:00
Philipp Moritz
077ffd99bf
Bump version from 0.6.3 to 0.7.0.dev0 in docs and .yaml ( #4042 )
2019-02-14 12:08:48 -08:00
Yuhong Guo
4b0db437ee
Linting Bazel scripts ( #4032 )
...
* Use buildifier as bazel script linter
* Checkout golang version in travis
* Using golang-1.8-go in travis
* Add golang apt-repository
* Fix the bazel lint failure example.
* Address comment
2019-02-14 22:16:19 +08:00
Philipp Moritz
810cc17062
Fix LRU eviction of client notification datastructure ( #4021 )
...
* convert notification_key map to C++ datastructure
* fix crash and add debug string
* clean notification map up (this was a bug before)
* remove checks
* add jenkins test
* linting
* fixes
* properly erase
* clean up
* linting
* Update test_wait_hanging.py
* Update run_multi_node_tests.sh
* increase redis_max_memory
* fix dat jenkins
* update
* Update run_multi_node_tests.sh
2019-02-13 22:20:27 -08:00
Stephanie Wang
fd5b58a827
Increase timeout for object manager valgrind tests ( #4027 )
...
* Avoid second copy of data for inlined objects
* Increase Wait timeout for valgrind tests
* Run object manager tests with and without inlined objects
* Fix test
2019-02-13 18:29:03 -08:00
Wang Qing
1fb56a4316
Remove deprecated module ( #4038 )
2019-02-14 10:04:09 +08:00
Si-Yuan
2de31eb489
minor fix ( #4040 )
2019-02-13 17:22:45 -08:00
Eric Liang
2dccf383dd
[rllib] Basic infrastructure for off-policy estimation (IS, WIS) ( #3941 )
2019-02-13 16:25:05 -08:00
Kristian Hartikainen
729d0b2825
[autoscaler] docker run options ( #3921 )
...
Adds support for docker options, allowing for use of nvidia-docker.
Closes #2657 .
2019-02-13 12:26:28 -08:00
Stephanie Wang
4347ab644e
Use Redis lists in the GCS instead of zset ( #4023 )
...
* Convert zset to list
* Remove object evictions map from the object directory, yay
* comments
* Fix tests
2019-02-13 10:32:57 -08:00
bjg2
0e37ac6d1d
[wingman -> rllib] Remote and entangled environments ( #3968 )
...
* added all our environment changes
* fixed merge request comments and remote env
* fixed remote check
* moved remote_worker_envs to correct config section
* lint
* auto wrap impl
* fix
* fixed the tests
2019-02-13 10:08:26 -08:00
Philipp Moritz
b3f72e8a75
Add regression tests for dataclass serialization ( #3984 )
2019-02-13 09:07:03 -08:00
Hao Chen
f31a79f3f7
Implement actor checkpointing ( #3839 )
...
* Implement Actor checkpointing
* docs
* fix
* fix
* fix
* move restore-from-checkpoint to HandleActorStateTransition
* Revert "move restore-from-checkpoint to HandleActorStateTransition"
This reverts commit 9aa4447c1e3e321f42a1d895d72f17098b72de12.
* resubmit waiting tasks when actor frontier restored
* add doc about num_actor_checkpoints_to_keep=1
* add num_actor_checkpoints_to_keep to Cython
* add checkpoint_expired api
* check if actor class is abstract
* change checkpoint_ids to long string
* implement java
* Refactor to delay actor creation publish until checkpoint is resumed
* debug, lint
* Erase from checkpoints to restore if task fails
* fix lint
* update comments
* avoid duplicated actor notification log
* fix unintended change
* add actor_id to checkpoint_expired
* small java updates
* make checkpoint info per actor
* lint
* Remove logging
* Remove old actor checkpointing Python code, move new checkpointing code to FunctionActionManager
* Replace old actor checkpointing tests
* Fix test and lint
* address comments
* consolidate kill_actor
* Remove __ray_checkpoint__
* fix non-ascii char
* Loosen test checks
* fix java
* fix sphinx-build
2019-02-13 19:39:02 +08:00
Andrew Tan
57dcd3033e
[tune] Trial reporter fix ( #3951 )
...
Fixes #3949 .
2019-02-13 01:03:54 -08:00
Wang Qing
3a7fb182cc
Change the num of parallel jobs when building
2019-02-13 00:33:05 -08:00
William Ma
e1a479b137
Add teardown_module to test_queue.py ( #4012 )
2019-02-12 22:43:09 -08:00
Si-Yuan
21472b890a
Integrate "tempfile_service" into "ray.node.Node" ( #3953 )
2019-02-12 17:34:04 -08:00
Adi Zimmerman
dac1969647
[tune] Add Nevergrad to Tune ( #3985 )
2019-02-12 11:00:04 -08:00
Wang Qing
c523bc04ad
Enable redis password in Java worker ( #3943 )
...
* Support Java redis password
* Fix
* Refine
* Fix lint.
2019-02-12 13:11:25 +08:00
Adi Zimmerman
9797028a91
[tune] Add scikit-optimize to Tune ( #3924 )
2019-02-11 17:06:02 -08:00
Eric Liang
8df772867c
[rllib] rename compute_apply to learn_on_batch
2019-02-11 15:22:15 -08:00
Eric Liang
c4182463f6
[rllib] Add helper to iterate over envs in a vectorized environment ( #4001 )
...
* add foreach env func
* fix
* add test
2019-02-11 10:40:47 -08:00
Daniel Edgecumbe
a70ae1687b
.gitignore: Add Vim swap files ( #4016 )
2019-02-11 10:27:10 -08:00
Ion
3c32343c63
Ray signal ( #3624 )
2019-02-11 10:14:48 -08:00
ebrevdo
52dfde1cbb
Update flatbuffer bazel rule to work with flatbuffer master branch. ( #4008 )
2019-02-11 10:00:06 -08:00
Zhijun Fu
7097ba393b
protect raylet against bad messages ( #4003 )
...
* protect raylet against bad messages
* address comments
* linting and regression test
2019-02-12 00:39:38 +08:00
Wang Qing
bc438ca73b
[Java] Refine Java config item ( #4014 )
...
* Refine
* Address comment.
2019-02-11 23:55:40 +08:00
Philipp Moritz
ab809bd927
update ray version to 0.7.0dev ( #3995 )
2019-02-10 19:56:42 -08:00