hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

Author	SHA1	Message	Date
Peter Schafhalter	400a3e5705	Add queue size and __len__ methods (#2432 )	2018-07-19 17:04:42 -07:00
Peter Schafhalter	4225ac5081	Add benchmark using queue (#2431 )	2018-07-19 16:43:22 -07:00
Eric Liang	8e75d150f7	[rllib] Apex crash when compress_observations: False (#2426 ) We shouldn't try to decompress uncompressed data. Also, fix resource requests for ddpg + GPU.	2018-07-19 15:58:09 -07:00
Eric Liang	d01dc9e22d	[rllib] format with yapf (#2427 ) * initial yapf * manual fix yapf bugs	2018-07-19 15:30:36 -07:00
Robert Nishihara	24eb140e07	Remove redundant reconstruct call. (#2421 )	2018-07-19 11:22:02 -07:00
Robert Nishihara	991d0911d1	Move profile data flushing to background thread on workers. (#2415 ) * Move profile data flushing to background thread on workers. * Remove outdated comment.	2018-07-18 12:34:53 -07:00
Eric Liang	f31a6ca965	[rllib] Count actual sample batch size instead of configured batch size in A3C. (#2399 ) This fixes a metrics accounting bug where the sample count is not reported correctly.	2018-07-18 08:59:52 +02:00
Richard Liaw	8e8c733696	[tune] Fix Categorical Space + Add Keras Example (#2401 ) Previously did not properly resolve categorical variables for HyperOpt.	2018-07-17 23:52:52 +02:00
Eric Liang	0cecf6b79c	[rllib] Cleanup RNN support and make it work with multi-GPU optimizer (#2394 ) Cleanup: TFPolicyGraph now automatically adds loss input entries for state_in_*, so that graph sub-classes don't need to worry about it. Multi-GPU support: Allow setting up model tower replicas with existing state input tensors Truncate the per-device minibatch slices so that they are always a multiple of max_seq_len.	2018-07-17 06:55:46 +02:00
Peter Schafhalter	f5c46c7765	Add queue data structures (#2261 )	2018-07-16 16:26:20 -07:00
Hao Chen	8a3e180156	Move profiling code to a new file and fix thread safety (#2397 )	2018-07-15 18:09:52 -07:00
Eric Liang	7865dbab84	[tune] Raise error if incorrect key used in config (#2400 )	2018-07-15 00:25:19 +02:00
Eric Liang	62f84d2f07	[rllib] Restore TF soft placement config to fix multi-GPU optimizer (#2395 )	2018-07-13 10:34:37 +02:00
Hao Chen	d6af50785e	move import_thread to a separate file (#2349 ) * move import_thread to a separate file * sort imports * group imports regardless of `from` * re-organize imoprts based on google style * Update import_thread.py * fix event_type names in profile statement * unify duplicate code	2018-07-12 21:26:24 -07:00
Robert Nishihara	515da7721a	Change ray.worker.cleanup -> ray.shutdown and improve API documentation. (#2374 ) * Change ray.worker.cleanup -> ray.shutdown and improve API documentation. * Deprecate ray.worker.cleanup() gracefully. * Fix linting	2018-07-12 12:00:00 -07:00
Eric Liang	b316afeb43	[rllib] Add debug info back to PPO and fix optimizer compatibility (#2366 )	2018-07-12 19:22:46 +02:00
Richard Liaw	5188b1d080	[autoscaler] Bug for file mounts for tilde (#2382 )	2018-07-12 19:18:47 +02:00
Richard Liaw	0048e77093	[rllib] RLlib CLI (#2375 )	2018-07-12 19:12:04 +02:00
Richard Liaw	55d5e28872	[core] Better Actor Representation (#2369 )	2018-07-09 11:20:21 -07:00
Richard Liaw	4d7da9f668	[rllib] Remove "Common", cleanup some code (#2348 )	2018-07-08 13:03:53 -07:00
Robert Nishihara	35f4a3070c	Update 0.4.0 to 0.5.0 in autoscaler and installation examples. (#2352 )	2018-07-07 14:34:20 -07:00
Eric Liang	d24f19fd1e	[rllib] Fix stats collection and some docs bugs since the refactoring (#2361 ) * fix * fix pbt example * fix * fix * single thread by default * vec * fix * fix	2018-07-07 13:29:20 -07:00
Eric Liang	9a6e329325	[rllib] Move repeat field to asv script (#2367 )	2018-07-07 12:10:06 -07:00
Richard Liaw	e32aed8717	[rllib] more user-friendly Optimizer signature + compute_apply (#2335 ) * Move signature of optimizers * fix * expose compute_apply for policy_graphs * dictionaries and such * test for multiagent	2018-07-07 12:08:49 -07:00
Robert Nishihara	e3534c46df	[xray] Re-enable some stress tests and convert stress_tests to pytest. (#2285 ) * Fix one of the stress tests, fix ray.global_state.client_table when called early on. * Re-enable testWait. * Convert stress_tests.py to pytest. * Fix	2018-07-06 23:21:00 -07:00
Robert Nishihara	3a972893ee	Bump version to 0.5.0. (#2351 )	2018-07-06 22:31:33 -07:00
Devin Petersohn	4185aaed10	Dataframe deprecation (#2353 )	2018-07-06 00:16:22 -07:00
Robert Nishihara	1cf1645e04	Require Cython 0.27 for setup. (#2313 ) * Require Cython 0.27 for setup. * Make Cython version even more specific. * Restrict to any 0.27 version	2018-07-05 00:26:49 -07:00
Robert Nishihara	b90e551b41	[xray] Implement timeline and profiling API. (#2306 ) * Add profile table and store profiling information there. * Code for dumping timeline. * Improve color scheme. * Push timeline events on driver only for raylet. * Improvements to profiling and timeline visualization * Some linting * Small fix. * Linting * Propagate node IP address through profiling events. * Fix test. * object_id.hex() should return byte string in python 2. * Include gcs.fbs in node_manager.fbs. * Remove flatbuffer definition duplication. * Decode to unicode in Python 3 and bytes in Python 2. * Minor * Submit profile events in a batch. Revert some CMake changes. * Fix * Workaround test failure. * Fix linting * Linting * Don't return anything from chrome_tracing_dump when filename is provided. * Remove some redundancy from profile table. * Linting * Move TODOs out of docstring. * Minor	2018-07-04 23:23:48 -07:00
Yuhong Guo	1d51e57b6e	Fix Plasma starting failure when specify the memory in float value. (#2337 )	2018-07-04 13:35:51 -07:00
Robert Nishihara	1ede458519	Stop building wheels for Python 3.3 on Linux. (#2342 ) * Stop building wheels for Python 3.3 on Linux. * Fix test.	2018-07-04 12:22:33 -07:00
Zongheng Yang	ba28dddf6f	Make xray object table credis-managed and hence flushable. (#2338 ) * monitor.py: issue flushes to data shard * ResultTableAdd & ObjectTableAdd: add credis-managed versions * Fix return codes * Credis-manage xray object table & associated ray.table_append cmd * Fix incorrect return code from TableAppend_DoWrite() * Revert "ResultTableAdd & ObjectTableAdd: add credis-managed versions" This reverts commit 628c2ea190df4c861dda0c284fab7ca6faa1ea24. * Address comments * Lint: fix indent * Address comment	2018-07-03 17:32:44 -07:00
Richard Liaw	178346fa16	Printing messages to stderr (#2312 ) Move core python code onto logging module. Addressing #1884.	2018-07-02 16:10:57 -07:00
Richard Liaw	f0ed1c1674	[rllib] Add more regression tests and autogenerate (#2324 )	2018-07-02 08:20:53 -07:00
Eric Liang	8aa56c12e6	[rllib] Document "v2" APIs (#2316 ) * re * wip * wip * a3c working * torch support * pg works * lint * rm v2 * consumer id * clean up pg * clean up more * fix python 2.7 * tf session management * docs * dqn wip * fix compile * dqn * apex runs * up * impotrs * ddpg * quotes * fix tests * fix last r * fix tests * lint * pass checkpoint restore * kwar * nits * policy graph * fix yapf * com * class * pyt * vectorization * update * test cpe * unit test * fix ddpg2 * changes * wip * args * faster test * common * fix * add alg option * batch mode and policy serving * multi serving test * todo * wip * serving test * doc async env * num envs * comments * thread * remove init hook * update * fix ppo * comments1 * fix * updates * add jenkins tests * fix * fix pytorch * fix * fixes * fix a3c policy * fix squeeze * fix trunc on apex * fix squeezing for real * update * remove horizon test for now * multiagent wip * update * fix race condition * fix ma * t * doc * st * wip * example * wip * working * cartpole * wip * batch wip * fix bug * make other_batches None default * working * debug * nit * warn * comments * fix ppo * fix obs filter * update * wip * tf * update * fix * cleanup * cleanup * spacing * model * fix * dqn * fix ddpg * doc * keep names * update * fix * com * docs * clarify model outputs * Update torch_policy_graph.py * fix obs filter * pass thru worker index * fix * rename * vlad torch comments * fix log action * debug name * fix lstm * remove unused ddpg net * remove conv net * revert lstm * wip * wip * cast * wip * works * fix a3c * works * lstm util test * doc * clean up * update * fix lstm check * move to end * fix sphinx * fix cmd * remove bad doc * envs * vec * doc prep * models * rl * alg * up * clarify * copy * async sa * fix * comments * fix a3c conf * tune lstm * fix reshape * fix * back to 16 * tuned a3c update * update * tuned * optional * merge * wip * fix up * move pg class * rename env * wip * update * tip * alg * readme * fix catalog * readme * doc * context * remove prep * comma * add env * link to paper * paper * update * rnn * update * wip * clean up ev creation * fix * fix * fix * fix lint * up * no comma * ma * Update run_multi_node_tests.sh * fix * sphinx is stupid * sphinx is stupid * clarify torch graph * no horizon * fix config * sb * Update test_optimizers.py	2018-07-01 00:05:08 -07:00
Philipp Moritz	762bdf646e	[xray] Put GCS data into the redis data shard (#2298 )	2018-06-30 15:42:10 -10:00
Richard Liaw	d75b39f6df	[tune] Return error trials(#2292 )	2018-06-28 20:23:38 -07:00
Hao Chen	20c0ecb522	Reuse code of checking large pickles (#2291 )	2018-06-28 16:51:23 -10:00
Sergey Kolesnikov	cd63804768	[rllib] Different Activation Support (#2311 )	2018-06-28 18:41:04 -07:00
Richard Liaw	3cc27d2840	[rllib][asv] Support ASV for RLlib (#2304 )	2018-06-28 17:20:09 -07:00
Richard Liaw	92ab7e56ec	[rllib] Fix PPO regression	2018-06-28 16:00:53 -07:00
Adam Gleave	89460b8d11	autoscaler: count head node, don't kill below target (fixes #2317 ) (#2320 ) Specifically, subtracts 1 from the target number of workers, taking into account that the head node has some computational resources. Do not kill an idle node if it would drop us below the target number of nodes (in which case we just immediately relaunch).	2018-06-28 15:33:51 -07:00
Richard Liaw	b4dff9f933	[rllib] PPO onto new RLlib APIs (#2270 )	2018-06-28 09:49:08 -07:00
Eric Liang	b197c0c404	[rllib] General RNN support (#2299 ) * wip * cls * re * wip * wip * a3c working * torch support * pg works * lint * rm v2 * consumer id * clean up pg * clean up more * fix python 2.7 * tf session management * docs * dqn wip * fix compile * dqn * apex runs * up * impotrs * ddpg * quotes * fix tests * fix last r * fix tests * lint * pass checkpoint restore * kwar * nits * policy graph * fix yapf * com * class * pyt * vectorization * update * test cpe * unit test * fix ddpg2 * changes * wip * args * faster test * common * fix * add alg option * batch mode and policy serving * multi serving test * todo * wip * serving test * doc async env * num envs * comments * thread * remove init hook * update * fix ppo * comments1 * fix * updates * add jenkins tests * fix * fix pytorch * fix * fixes * fix a3c policy * fix squeeze * fix trunc on apex * fix squeezing for real * update * remove horizon test for now * multiagent wip * update * fix race condition * fix ma * t * doc * st * wip * example * wip * working * cartpole * wip * batch wip * fix bug * make other_batches None default * working * debug * nit * warn * comments * fix ppo * fix obs filter * update * wip * tf * update * fix * cleanup * cleanup * spacing * model * fix * dqn * fix ddpg * doc * keep names * update * fix * com * docs * clarify model outputs * Update torch_policy_graph.py * fix obs filter * pass thru worker index * fix * rename * vlad torch comments * fix log action * debug name * fix lstm * remove unused ddpg net * remove conv net * revert lstm * wip * wip * cast * wip * works * fix a3c * works * lstm util test * doc * clean up * update * fix lstm check * move to end * fix sphinx * fix cmd * remove bad doc * clarify * copy * async sa * fix * comments * fix a3c conf * tune lstm * fix reshape * fix * back to 16 * tuned a3c update * update * tuned * optional * fix catalog * remove prep	2018-06-27 22:51:04 -07:00
Richard Liaw	d3f81d5aad	[rllib] Add stats for A3C (#2315 ) * add stats for a3c again * fix multigpu too	2018-06-27 22:41:34 -07:00
Eric Liang	737f3e3cf2	[tune] Fix registering trainable twice (#2293 ) * register twice * isolate * Update registry.py * Update registry.py	2018-06-27 16:29:39 -07:00
Eric Liang	44f5f0520b	[rllib] Rename optimizers for clarity (#2303 ) * rename * fix * update * mgpu * Update a3c.py * Update bc.py * Update a3c.py * Update test_optimizers.py * Update a3c.py	2018-06-27 02:30:15 -07:00
Richard Liaw	e657497225	[xray] Fix tune tests (#2305 ) * fix xray tests * yapf * unleash tests	2018-06-26 23:56:23 -07:00
Eric Liang	1251abf0d1	[rllib] Modularize Torch and TF policy graphs (#2294 ) * wip * cls * re * wip * wip * a3c working * torch support * pg works * lint * rm v2 * consumer id * clean up pg * clean up more * fix python 2.7 * tf session management * docs * dqn wip * fix compile * dqn * apex runs * up * impotrs * ddpg * quotes * fix tests * fix last r * fix tests * lint * pass checkpoint restore * kwar * nits * policy graph * fix yapf * com * class * pyt * vectorization * update * test cpe * unit test * fix ddpg2 * changes * wip * args * faster test * common * fix * add alg option * batch mode and policy serving * multi serving test * todo * wip * serving test * doc async env * num envs * comments * thread * remove init hook * update * fix ppo * comments1 * fix * updates * add jenkins tests * fix * fix pytorch * fix * fixes * fix a3c policy * fix squeeze * fix trunc on apex * fix squeezing for real * update * remove horizon test for now * multiagent wip * update * fix race condition * fix ma * t * doc * st * wip * example * wip * working * cartpole * wip * batch wip * fix bug * make other_batches None default * working * debug * nit * warn * comments * fix ppo * fix obs filter * update * wip * tf * update * fix * cleanup * cleanup * spacing * model * fix * dqn * fix ddpg * doc * keep names * update * fix * com * docs * clarify model outputs * Update torch_policy_graph.py * fix obs filter * pass thru worker index * fix * rename * vlad torch comments * fix log action * debug name * fix lstm * remove unused ddpg net * remove conv net * revert lstm * cast * clean up * fix lstm check * move to end * fix sphinx * fix cmd * remove bad doc * clarify * copy * async sa * fix	2018-06-26 13:17:15 -07:00
Eric Liang	a9a26b7560	[rllib] Part 2 of multiagent support (#2286 ) * wip * cls * re * wip * wip * a3c working * torch support * pg works * lint * rm v2 * consumer id * clean up pg * clean up more * fix python 2.7 * tf session management * docs * dqn wip * fix compile * dqn * apex runs * up * impotrs * ddpg * quotes * fix tests * fix last r * fix tests * lint * pass checkpoint restore * kwar * nits * policy graph * fix yapf * com * class * pyt * vectorization * update * test cpe * unit test * fix ddpg2 * changes * wip * args * faster test * common * fix * add alg option * batch mode and policy serving * multi serving test * todo * wip * serving test * doc async env * num envs * comments * thread * remove init hook * update * fix ppo * comments1 * fix * updates * add jenkins tests * fix * fix pytorch * fix * fixes * fix a3c policy * fix squeeze * fix trunc on apex * fix squeezing for real * update * remove horizon test for now * multiagent wip * update * fix race condition * fix ma * t * doc * st * wip * example * wip * working * cartpole * wip * batch wip * fix bug * make other_batches None default * working * debug * nit * warn * comments * fix ppo * fix obs filter * update * fix obs filter * pass thru worker index * fix * fix log action * debug name * fix sphinx	2018-06-25 22:33:57 -07:00

1 2 3 4 5 ...

673 commits