Commit graph

1855 commits

Author SHA1 Message Date
Eric Liang
f31a6ca965
[rllib] Count actual sample batch size instead of configured batch size in A3C. (#2399)
This fixes a metrics accounting bug where the sample count is not reported correctly.
2018-07-18 08:59:52 +02:00
Richard Liaw
8e8c733696
[tune] Fix Categorical Space + Add Keras Example (#2401)
Previously did not properly resolve categorical variables for HyperOpt.
2018-07-17 23:52:52 +02:00
Yuhong Guo
e3badb9b09 Fix that parquet and arrow will build every time. (#2405)
* Fix the bug that parquet and arrow will build every time.

* Update build_arrow.sh

* Update build_arrow.sh
2018-07-16 22:56:14 -07:00
Eric Liang
0cecf6b79c
[rllib] Cleanup RNN support and make it work with multi-GPU optimizer (#2394)
Cleanup: TFPolicyGraph now automatically adds loss input entries for state_in_*, so that graph sub-classes don't need to worry about it.

Multi-GPU support:

Allow setting up model tower replicas with existing state input tensors

Truncate the per-device minibatch slices so that they are always a multiple of max_seq_len.
2018-07-17 06:55:46 +02:00
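The truncation mentioned in the commit above (per-device minibatch slices kept at a multiple of max_seq_len) can be illustrated with a minimal sketch. The function names here are hypothetical and are not taken from RLlib; this only shows the arithmetic, assuming a slice is a plain Python list of rows.

```python
def truncate_to_multiple(batch_size, max_seq_len):
    """Return the largest multiple of max_seq_len that is <= batch_size."""
    if max_seq_len <= 0:
        raise ValueError("max_seq_len must be positive")
    return (batch_size // max_seq_len) * max_seq_len


def slice_minibatch(rows, max_seq_len):
    """Drop trailing rows so the slice length is a multiple of max_seq_len.

    Hypothetical helper for illustration only, not RLlib's implementation.
    """
    return rows[:truncate_to_multiple(len(rows), max_seq_len)]
```

With a slice of 103 rows and max_seq_len of 20, the sketch keeps 100 rows, so no sequence is split across a slice boundary.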
Robert Nishihara
1b645fcc8b Add parameter server blog post. (#2398)
* Saving work on parameter server blog post.

* Updates

* Updates to blog post.

* Add notes about tasks and actors.

* Updates

* Add RLlib paper link

* Update intro

* Address comments.

* More fixes.

* Clarify ray.get

* Change date

* Add @ray.remote clarification.

* Update site deployment instructions.

* Minor wording
2018-07-16 21:51:39 -07:00
Peter Schafhalter
f5c46c7765 Add queue data structures (#2261) 2018-07-16 16:26:20 -07:00
Yuhong Guo
404bfc5da2 Add const to to_plasma_id function to make it usable by const ObjectID (#2404)
* Add const to to_plasma_id to make it usable by const ObjectID

* Separate the building script to another PR.
2018-07-16 11:05:51 -07:00
Yuhong Guo
ded260b1b7 Add const to to_plasma_id function to make it usable by const ObjectID (#2404)
* Add const to to_plasma_id to make it usable by const ObjectID

* Separate the building script to another PR.
2018-07-16 11:05:37 -07:00
Yuhong Guo
206254bcf3 Add const to to_plasma_id function to make it usable by const ObjectID (#2404)
* Add const to to_plasma_id to make it usable by const ObjectID

* Separate the building script to another PR.
2018-07-16 11:05:29 -07:00
Hao Chen
8a3e180156 Move profiling code to a new file and fix thread safety (#2397) 2018-07-15 18:09:52 -07:00
Yuhong Guo
bbea73155a Fix parquet missing error and improve arrow commit id changing (#2319)
* Fix parquet missing error and improve arrow commit id changing

* Remove build cache for arrow.

* Update build_parquet.sh

* Update build_ui.sh

* Update build_arrow.sh
2018-07-14 16:08:13 -07:00
Eric Liang
7865dbab84 [tune] Raise error if incorrect key used in config (#2400) 2018-07-15 00:25:19 +02:00
Hao Chen
c1575e98c1 Make local scheduler client thread-safe (#2386)
* Make local scheduler client thread-safe for python

* lock write_messages

* remove allow-threads

* fix linter

* rename _write_message to do_write_message
2018-07-13 16:19:00 -07:00
Eric Liang
62f84d2f07 [rllib] Restore TF soft placement config to fix multi-GPU optimizer (#2395) 2018-07-13 10:34:37 +02:00
Hao Chen
d6af50785e move import_thread to a separate file (#2349)
* move import_thread to a separate file

* sort imports

* group imports regardless of `from`

* re-organize imports based on Google style

* Update import_thread.py

* fix event_type names in profile statement

* unify duplicate code
2018-07-12 21:26:24 -07:00
Crystal
ebf4070d88 Documentation- Basic Profiling for Ray Users (#2326)
* Ray documentation - created new section 'Profiling for Ray Users', as opposed to the current Profiling section for Ray developers. Completed three sections: 'A Basic Profiling Example', 'Timing Performance Using Python's Timestamps', and 'Profiling Using An External Profiler (Line_Profiler)'. Two sections are left to do: cProfile and Ray Timeline Visualization.

* Ray documentation - Fixed rst codeblock linebreaks in 'User Profiling'

* Ray documentation - For User Profiling, added section on cProfile

* Ray documentation - For User Profiling, completed Ray Timeline Visualization section, including graphical images

* Ray documentation - made User Profiling timeline image larger, minor wording edits

* Ray documentation - minor wording edits to User Profiling

* Ray documentation - User Profiling- fixed broken link

* Minor wording changes requested by Philipp Moritz addressed. Still need to address (1) compressing the image files, (2) correcting ex 3 to not be remote, and (3) using cProfile on an actor

* Ray documentation - For user-profiling.rst, revised example 3 to show a semi-parallelized example. Compressed timeline example image to be under 50 KB, removed view timeline GUI image. Updated timeline example image to reflect revised example 3. cProfile actor example left

* Ray documentation - in user-profiling.rst, added a new example including actors in the cProfile section

* Ray documentation - For user-profiling.rst, added section header for the Ray actor cProfile example

* Update user-profiling.rst

* Update user-profiling.rst

* 4 space indentation

* Update user-profiling.rst

* Update user-profiling.rst

* Update user-profiling.rst

* corrections
2018-07-12 16:57:39 -07:00
Robert Nishihara
515da7721a Change ray.worker.cleanup -> ray.shutdown and improve API documentation. (#2374)
* Change ray.worker.cleanup -> ray.shutdown and improve API documentation.

* Deprecate ray.worker.cleanup() gracefully.

* Fix linting
2018-07-12 12:00:00 -07:00
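A graceful deprecation like the one described in the commit above is commonly done by keeping the old name as a thin wrapper that warns and forwards to the new one. The sketch below uses stand-in functions named cleanup and shutdown; it is an assumption about the general pattern, not a copy of Ray's code.

```python
import warnings


def shutdown():
    """Stand-in for the new entry point; returns a marker for illustration."""
    return "shutdown"


def cleanup():
    """Deprecated alias: emit a DeprecationWarning, then call shutdown()."""
    warnings.warn(
        "cleanup() is deprecated; use shutdown() instead.",
        DeprecationWarning,
        stacklevel=2,  # attribute the warning to the caller, not this wrapper
    )
    return shutdown()
```

Existing callers of the old name keep working while seeing a warning that points them at the replacement.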
Eric Liang
b316afeb43 [rllib] Add debug info back to PPO and fix optimizer compatibility (#2366) 2018-07-12 19:22:46 +02:00
Eric Liang
8ea926c266
[rllib] _init renamed to _build_layers in example 2018-07-12 19:21:58 +02:00
Richard Liaw
5188b1d080
[autoscaler] Bug for file mounts for tilde (#2382) 2018-07-12 19:18:47 +02:00
Richard Liaw
0048e77093
[rllib] RLlib CLI (#2375) 2018-07-12 19:12:04 +02:00
Robert Nishihara
54487b1d7f Pin the number of CPUs in failing actor test. (#2368)
* Pin the number of CPUs in failing actor test.

* Pin number of CPUs in multi_node_test.py.

* Fix linting.
2018-07-11 18:34:19 -07:00
Philipp Moritz
4dadc60968 Update arrow to include uninitialized memory fixes (#2371) 2018-07-11 07:52:02 -05:00
Hanwei Jin
450b11f1d6 update to slf4j, remove DynamicLog (#2384) 2018-07-09 23:33:59 -07:00
Richard Liaw
55d5e28872 [core] Better Actor Representation (#2369) 2018-07-09 11:20:21 -07:00
Zhijun Fu
fa33ea5283 [Java] Java worker cluster support (#2359) 2018-07-09 10:20:41 -07:00
Eric Liang
4ef9d15315
[rllib] Add concepts section of docs (#2373)
This fills in the rllib concepts documentation.
2018-07-08 18:46:52 -07:00
Richard Liaw
4d7da9f668
[rllib] Remove "Common", cleanup some code (#2348) 2018-07-08 13:03:53 -07:00
Zongheng Yang
1d05cd7077 Minor fix to 0.5 release blog post (#2377) 2018-07-08 00:38:31 -07:00
Robert Nishihara
2788376d8f Add 0.5 release notes. (#2365)
* Add 0.5 release notes.

* Add note.

* Add note about Modin.

* Add vectorized environments note.

* Link to PR instead of commit.

* Add autoscaler link.

* syntax
2018-07-07 17:31:13 -07:00
Robert Nishihara
35f4a3070c Update 0.4.0 to 0.5.0 in autoscaler and installation examples. (#2352) 2018-07-07 14:34:20 -07:00
Eric Liang
d24f19fd1e
[rllib] Fix stats collection and some docs bugs since the refactoring (#2361)
* fix

* fix pbt example

* fix

* fix

* single thread by default

* vec

* fix

* fix
2018-07-07 13:29:20 -07:00
Eric Liang
9a6e329325 [rllib] Move repeat field to asv script (#2367) 2018-07-07 12:10:06 -07:00
Richard Liaw
e32aed8717
[rllib] more user-friendly Optimizer signature + compute_apply (#2335)
* Move signature of optimizers

* fix

* expose compute_apply for policy_graphs

* dictionaries and such

* test for multiagent
2018-07-07 12:08:49 -07:00
Robert Nishihara
e3534c46df [xray] Re-enable some stress tests and convert stress_tests to pytest. (#2285)
* Fix one of the stress tests, fix ray.global_state.client_table when called early on.

* Re-enable testWait.

* Convert stress_tests.py to pytest.

* Fix
2018-07-06 23:21:00 -07:00
Robert Nishihara
3a972893ee Bump version to 0.5.0. (#2351) 2018-07-06 22:31:33 -07:00
Philipp Moritz
e45bec1e51 Fix import error when tensorflow-gpu and tensorflow are both installed (#2364) 2018-07-06 22:29:54 -07:00
Philipp Moritz
fbde8cad74 Update apache arrow to include TensorFlow fix (#2345) 2018-07-06 13:18:56 -07:00
Devin Petersohn
4185aaed10 Dataframe deprecation (#2353) 2018-07-06 00:16:22 -07:00
Philipp Moritz
4f1d14e176 improve error message when flex or bison is not installed (#2355) 2018-07-05 23:46:55 -07:00
Zongheng Yang
23a98a223f Doc: redis memory management / automatic flushing. (#2344)
* Doc: redis memory management / automatic flushing.

* Address comments

* Update redis-memory-management.rst

* Change cross ref style
2018-07-05 23:44:37 -07:00
Stephanie Wang
5b7475a2e0
[xray] Unsubscribe to task dependencies when task starts execution (#2354)
* Add back call to unsubscribe to task dependencies

* fix
2018-07-05 21:08:58 -07:00
Stephanie Wang
c50f1966e0 Publish a notification for empty keys in the GCS (#2347)
* Publish an empty notification for empty keys

* Add failure callback to Table::Subscribe, add unit test for new behavior
2018-07-05 13:39:07 -07:00
Wang Qing
b7088c1010 Clean the pom files (#2350) 2018-07-05 13:36:01 -07:00
Robert Nishihara
1cf1645e04 Require Cython 0.27 for setup. (#2313)
* Require Cython 0.27 for setup.

* Make Cython version even more specific.

* Restrict to any 0.27 version
2018-07-05 00:26:49 -07:00
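The "any 0.27 version" restriction in the commit above amounts to accepting every release in the 0.27 series and rejecting neighbours such as 0.26 or 0.28. The log does not show how setup actually expresses this, so the helper below is a hypothetical sketch that handles plain dotted version strings only.

```python
def is_cython_027(version):
    """Return True if a dotted version string belongs to the 0.27 series.

    Hypothetical helper for illustration; pre-release suffixes such as
    "0.27a1" are not handled.
    """
    parts = version.split(".")
    return len(parts) >= 2 and parts[0] == "0" and parts[1] == "27"
```

Comparing the split components rather than using a string prefix avoids accepting a version such as "0.270".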
Robert Nishihara
b90e551b41 [xray] Implement timeline and profiling API. (#2306)
* Add profile table and store profiling information there.

* Code for dumping timeline.

* Improve color scheme.

* Push timeline events on driver only for raylet.

* Improvements to profiling and timeline visualization

* Some linting

* Small fix.

* Linting

* Propagate node IP address through profiling events.

* Fix test.

* object_id.hex() should return byte string in python 2.

* Include gcs.fbs in node_manager.fbs.

* Remove flatbuffer definition duplication.

* Decode to unicode in Python 3 and bytes in Python 2.

* Minor

* Submit profile events in a batch. Revert some CMake changes.

* Fix

* Workaround test failure.

* Fix linting

* Linting

* Don't return anything from chrome_tracing_dump when filename is provided.

* Remove some redundancy from profile table.

* Linting

* Move TODOs out of docstring.

* Minor
2018-07-04 23:23:48 -07:00
Shuo
8e687cbc98 Unify the identity of a process while logging. (#2325) 2018-07-04 14:26:19 -07:00
Yuhong Guo
1d51e57b6e Fix Plasma startup failure when the memory is specified as a float value. (#2337) 2018-07-04 13:35:51 -07:00
Robert Nishihara
1ede458519 Stop building wheels for Python 3.3 on Linux. (#2342)
* Stop building wheels for Python 3.3 on Linux.

* Fix test.
2018-07-04 12:22:33 -07:00
Zongheng Yang
ba28dddf6f Make xray object table credis-managed and hence flushable. (#2338)
* monitor.py: issue flushes to data shard

* ResultTableAdd & ObjectTableAdd: add credis-managed versions

* Fix return codes

* Credis-manage xray object table & associated ray.table_append cmd

* Fix incorrect return code from TableAppend_DoWrite()

* Revert "ResultTableAdd & ObjectTableAdd: add credis-managed versions"

This reverts commit 628c2ea190df4c861dda0c284fab7ca6faa1ea24.

* Address comments

* Lint: fix indent

* Address comment
2018-07-03 17:32:44 -07:00