hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-10 13:26:39 -04:00

Author	SHA1	Message	Date
Eric Liang	38d00986a5	[rllib] Cleanups: deep merge configs properly; enforce min iter time on APEX (#2500) The dict merge prevents crashes when tune is trying to get resource requests for agents and you override a config subkey. The min iter time prevents iterations from getting too small, incurring high overhead. This is easy to run into on Ape-X since throughput can get very high.	2018-07-30 13:25:35 -07:00
Eric Liang	62a52ee989	[rllib] Fix corner case in rnn episode handling We should use episode ids instead of the timestep to determine when sequences should be cut, since when batches are concatenated, increasing t does not guarantee we are part of the same episode.	2018-07-30 13:24:43 -07:00
Philipp Moritz	696a229ece	Fix text verbosity in python 2.7 by running tests with pytest (#2470)	2018-07-30 11:04:06 -07:00
Hao Chen	fe65f9fbbc	improve java api doc (#2508)	2018-07-29 20:41:11 -07:00
Robert Nishihara	3f3514c2b3	Deprecate PYTHON_MODE more gracefully. (#2487)	2018-07-29 16:25:46 -07:00
Steve Severance	f1b4ea69a3	Prevent hasher from running out of memory on large files (#2451) * Prevent hasher from running out of memory on large files * dump out keys * only print if failed * remove debugging * Fix lint error. Reverse adding newline.	2018-07-28 23:29:09 -07:00
Ion	80db69d245	State transition diagram documentation. (#2502) * Added description of transition diagram and a few name changes for improved clarity. * rename some methods and update task_states.rst	2018-07-28 22:28:45 -07:00
Hao Chen	0ea7a6abf0	add java tutorial (#2491)	2018-07-28 17:09:30 -07:00
Eric Liang	90a3ea9443	[xray] Fix heartbeat subscription for autoscaler (#2498)	2018-07-28 13:34:55 -07:00
Peter Schafhalter	e10377567c	Add benchmark for ray.get (#2499)	2018-07-28 09:09:21 -07:00
Philipp Moritz	ecc100cb3b	Upgrade arrow to include pytorch fix (#2496)	2018-07-28 01:28:44 -07:00
Peter Schafhalter	ccb9a27393	Add benchmarks for ray.put (#2489)	2018-07-27 17:49:21 -07:00
Peter Schafhalter	302510ada0	[asv] Add actor benchmarks (#2469) * Add actor benchmarks * Fix bug * Address comments and refactor * Update benchmark_actor.py	2018-07-27 17:40:02 -07:00
Robert Nishihara	2be1ccbd8f	Raise application-level exceptions for some failure scenarios. (#2429) * Raise application level exception for actor methods that can't be executed and failed tasks. * Retry task forwarding for actor tasks. * Small cleanups * Move constant to ray_config. * Create ForwardTaskOrResubmit method. * Minor * Clean up queued tasks for dead actors. * Some cleanups. * Linting * Notify task_dependency_manager_ about failed tasks. * Manage timer lifetime better. * Use smart pointers to deallocate the timer. * Fix * add comment	2018-07-27 19:53:30 -04:00
Eric Liang	24649726dc	[rllib] Use batch.count in async samples optimizer (#2488) Using the actual batch size reduces the risk of mis-accounting. Here, we under-counted samples since in truncate_episodes mode we were doubling the batch size by accident in policy_evaluator.	2018-07-27 16:44:21 -07:00
Peter Schafhalter	1e6b130b90	[asv] Add benchmarks for tasks (#2471) * Add benchmarks for tasks * Fixes * Add timeout	2018-07-27 13:59:55 -07:00
Stephanie Wang	6675361684	[xray] Track `ray.get` calls as task dependencies (#2362)	2018-07-27 11:59:17 -07:00
Hao Chen	5b015f9a79	Remove the check of java primitive types (#2495)	2018-07-27 11:44:19 -07:00
Shuo	29451cca82	Add test: running a driver twice. (#2464)	2018-07-27 00:57:52 -07:00
Zhijun Fu	9ad6a973a0	[xray] lineage optimization: avoid unnecessary lineage entry allocation & free (#2463) * merge from ray * Revert "merge from ray" This reverts commit 32b181ebbb1fa184026631e1a7368112c4c3118d. * [xray] avoid unnecessary lineage entry allocation & free * address comments * address review comments * address comments	2018-07-26 10:44:38 -04:00
Yuhong Guo	46351957bb	Fix MAC_WHEELS=1 (#2477)	2018-07-25 14:57:28 -07:00
Yuhong Guo	b35ce5dbf1	Update Arrow Package with breaking changes (#2440) * Merge the breaking change of Arrow Package. * Fix typo * Fix lint. * put forward declarations into header * fix * add protocol.h * fix linting	2018-07-25 14:28:33 -07:00
Richard Liaw	7edc677304	[rllib] Extra Changes for Usability (#2363)	2018-07-24 20:51:22 -07:00
Sergey Kolesnikov	05490b8cb9	[rllib] dqn/ddpg policy customization (#2445) * dqn policy update - more customization * docs for custom DQN graph * Update rllib-training.rst * Update rllib-models.rst * Update rllib.rst * Update rllib-training.rst * Update rllib-concepts.rst * yapf codestyle	2018-07-22 14:47:14 -07:00
Eric Liang	68660453e4	[rllib] Better support and add two-trainer example for multiagent (#2443) This adds a simple DQN+PPO example for multi-agent. We don't do anything fancy here, just syncing weights between two separate trainers. This potentially is wasting some compute, but is very simple to set up. It might be nice to share experience collection between the top-level trainers in the future.	2018-07-22 05:09:25 -07:00
Shuo	99d0d96aef	Use different serialization context for each driver. (#2406)	2018-07-20 23:42:49 -07:00
Hao Chen	05f485e274	Allow Ray API to be used from multiple threads (#2422)	2018-07-20 15:39:01 -07:00
Robert Nishihara	4b6157ed09	Remove link to install Linux Python 3.3 wheel. (#2434)	2018-07-20 15:15:43 -07:00
nam-cern	c0b4c3b6cf	Use absolute path to get to thirdparty dir (#2442) * Use absolute path to get to thirdparty dir In case this script is executed from a different directory than the Ray's directory, the `pushd` will fail. This commit uses absolute path to `thirdparty` directory. * Update setup_thirdparty.sh	2018-07-20 15:12:25 -07:00
Eric Liang	807f309b3a	[test] Fix broken rllib test (#2446) This fixes the broken build.	2018-07-20 13:47:41 -07:00
Philipp Moritz	e821f852ef	[xray] Silence some object manager logging (#2437)	2018-07-20 13:10:03 -07:00
Peter Schafhalter	2a3b02649a	Add queue test to xray tests (#2433)	2018-07-19 17:18:13 -07:00
Peter Schafhalter	400a3e5705	Add queue size and __len__ methods (#2432)	2018-07-19 17:04:42 -07:00
Peter Schafhalter	4225ac5081	Add benchmark using queue (#2431)	2018-07-19 16:43:22 -07:00
Eric Liang	8e75d150f7	[rllib] Apex crash when compress_observations: False (#2426) We shouldn't try to decompress uncompressed data. Also, fix resource requests for ddpg + GPU.	2018-07-19 15:58:09 -07:00
Eric Liang	d01dc9e22d	[rllib] format with yapf (#2427) * initial yapf * manual fix yapf bugs	2018-07-19 15:30:36 -07:00
Robert Nishihara	24eb140e07	Remove redundant reconstruct call. (#2421)	2018-07-19 11:22:02 -07:00
Robert Nishihara	eed39163f9	Add callback to node manager for client removed event. (#2417) * Add callback to node manager for client removed event. * Fix linting.	2018-07-18 16:59:04 -07:00
Robert Nishihara	991d0911d1	Move profile data flushing to background thread on workers. (#2415) * Move profile data flushing to background thread on workers. * Remove outdated comment.	2018-07-18 12:34:53 -07:00
Philipp Moritz	4c82ac72df	Upgrade arrow to include the plasma TensorFlow op (#2412)	2018-07-18 12:33:02 -07:00
Wang Qing	344e3d2c05	Fix bug: Init RayLog before using it. (#2408)	2018-07-18 00:44:37 -07:00
Eric Liang	f31a6ca965	[rllib] Count actual sample batch size instead of configured batch size in A3C. (#2399) This fixes a metrics accounting bug where the sample count is not reported correctly.	2018-07-18 08:59:52 +02:00
Richard Liaw	8e8c733696	[tune] Fix Categorical Space + Add Keras Example (#2401) Previously did not properly resolve categorical variables for HyperOpt.	2018-07-17 23:52:52 +02:00
Yuhong Guo	e3badb9b09	Fix that parquet and arrow will build every time. (#2405) * Fix the bug that parquet and arrow will build every time. * Update build_arrow.sh * Update build_arrow.sh	2018-07-16 22:56:14 -07:00
Eric Liang	0cecf6b79c	[rllib] Cleanup RNN support and make it work with multi-GPU optimizer (#2394) Cleanup: TFPolicyGraph now automatically adds loss input entries for state_in_*, so that graph sub-classes don't need to worry about it. Multi-GPU support: Allow setting up model tower replicas with existing state input tensors Truncate the per-device minibatch slices so that they are always a multiple of max_seq_len.	2018-07-17 06:55:46 +02:00
Robert Nishihara	1b645fcc8b	Add parameter server blog post. (#2398) * Saving work on parameter server blog post. * Updates * Updates to blog post. * Add notes about tasks and actors. * Updates * Add RLlib paper link * Update intro * Address comments. * More fixes. * Clarify ray.get * Change date * Add @ray.remote clarification. * Update site deployment instructions. * Minor wording	2018-07-16 21:51:39 -07:00
Peter Schafhalter	f5c46c7765	Add queue data structures (#2261)	2018-07-16 16:26:20 -07:00
Yuhong Guo	404bfc5da2	Add const to to_plasma_id function to make it usable by const ObjectID (#2404) * Add const to to_plasma_id to make it usable by const ObjectID * Separate the building script to another PR.	2018-07-16 11:05:51 -07:00
Yuhong Guo	ded260b1b7	Add const to to_plasma_id function to make it usable by const ObjectID (#2404) * Add const to to_plasma_id to make it usable by const ObjectID * Separate the building script to another PR.	2018-07-16 11:05:37 -07:00
Yuhong Guo	206254bcf3	Add const to to_plasma_id function to make it usable by const ObjectID (#2404) * Add const to to_plasma_id to make it usable by const ObjectID * Separate the building script to another PR.	2018-07-16 11:05:29 -07:00

2446 commits