hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-08 19:41:38 -05:00

Author	SHA1	Message	Date
Eric Liang	93a9d32288	[docs] Switch docs to use rllib train instead of train.py	2018-12-04 17:36:06 -08:00
Eric Liang	ce355d13d4	[rllib] Allow envs to be auto-registered; add on_train_result callback with curriculum example (#3451 ) * train step and docs * debug * doc * doc * fix examples * fix code * integration test * fix * ... * space * instance * Update .travis.yml * fix test	2018-12-03 23:15:43 -08:00
Eric Liang	d8205976e8	[rllib] Auto clip actions to Box space range; deprecate squash_to_range (#3426 ) * fix clip * tweak wording * remove squash entirely * Update rllib-models.rst * fix argument order * Apply suggestions from code review Co-Authored-By: ericl <ekhliang@gmail.com>	2018-12-03 19:55:25 -08:00
Eric Liang	13c8ce4d84	Update README.rst with 0.6.0 version number. (#3453 )	2018-12-01 19:16:45 -08:00
Devin Petersohn	57512616e1	Update readme to contain logo (#3443 ) * Adding logo to readme * Updating link * Add badge * Addressing comments * Moving logo * Change align * Move image	2018-11-30 18:28:35 -08:00
GiliR4t1qbit	454d3aa07d	[docs] Snippet did not have a code-block tag above it (#3442 )	2018-11-30 16:39:40 -08:00
Eric Liang	07d8cbf414	[rllib] Support batch norm layers (#3369 ) * batch norm * lint * fix dqn/ddpg update ops * bn model * Update tf_policy_graph.py * Update multi_gpu_impl.py * Apply suggestions from code review Co-Authored-By: ericl <ekhliang@gmail.com>	2018-11-29 13:33:39 -08:00
Robert Nishihara	82863b5251	[autoscaler] Update autoscaler to use heartbeat batches. (#3409 )	2018-11-27 23:46:27 -08:00
Eric Liang	f0df97db6f	[rllib] example and docs on how to use parametric actions with DQN / PG algorithms (#3384 )	2018-11-27 23:35:19 -08:00
Eric Liang	0d56fc10cc	Move setproctitle to ray[debug] package (#3415 )	2018-11-27 09:50:59 -08:00
Eric Liang	8b76bab25c	[rllib] docs for td3 (#3381 ) * td3 doc * Update rllib-env.rst	2018-11-22 13:36:47 -08:00
Richard Liaw	c24d87b4d1	[autoscaler] Submit command (#3312 )	2018-11-20 14:03:34 -08:00
Eric Liang	abdc3b592e	[rllib] Update multi-gpu impala numbers (#3327 )	2018-11-19 20:55:27 -08:00
Eric Liang	61e3bbbfee	Update stale example links	2018-11-17 15:40:38 -08:00
Robert Nishihara	98edf752a9	Note requirement cython==0.27.3 in installation instructions. (#3322 )	2018-11-15 15:27:19 -08:00
Eric Liang	706dc1d473	[rllib] Add test for multi-agent support and fix IMPALA multi-agent (#3289 ) IMPALA support for multiagent was broken since IMPALA has a requirement that batch sizes be of a certain length. However multi-agent envs can create variable-length batches. Fix this by adding zero-padding as needed (similar to the RNN case).	2018-11-14 14:14:07 -08:00
Eric Liang	65c27c70cf	[rllib] Clean up agent resource configurations (#3296 ) Closes #3284	2018-11-13 18:00:03 -08:00
Philipp Moritz	d4fad222e1	Update profiling instructions for raylet (#3311 )	2018-11-13 17:48:33 -05:00
Richard Liaw	c3a2c7ebed	[tune] Doc: Autofilled, StatusReporter (#3294 ) * autofill and revise doc page for things * lint * comments	2018-11-13 13:15:56 -08:00
Eric Liang	d90f365394	[rllib] Add self-supervised loss to model (#3291 ) # What do these changes do? Allow self-supervised losses to be easily defined in custom models. Add this to the reference policy graphs.	2018-11-12 18:55:24 -08:00
Eric Liang	bd0dbde149	[rllib] Rename ServingEnv => ExternalEnv (#3302 )	2018-11-12 16:31:27 -08:00
Eric Liang	53489d2f85	[sgd] Document and add simple MNIST example (#3236 )	2018-11-10 21:52:20 -08:00
Eric Liang	9dd3eedbac	[rllib] rollout.py should reduce num workers (#3263 ) ## What do these changes do? Don't create an excessive amount of workers for rollout.py, and also fix up the env wrapping to be consistent with the internal agent wrapper. ## Related issue number Closes #3260.	2018-11-09 12:29:16 -08:00
Richard Liaw	22113be04c	[tune] Annotated Example Page and showcase Tutorials (#3267 ) Adds an example page and link in codebase. Closes #2728.	2018-11-08 23:45:05 -08:00
eugenevinitsky	344b4ef0ff	[rllib] Fix filter sync for ES and ARS (#2918 )	2018-11-06 19:09:34 -08:00
Eric Liang	725df3a485	Set the process title in workers and actors (#3219 )	2018-11-06 14:59:22 -08:00
Eric Liang	369cb833fe	[rllib] Implement custom metrics (#3144 )	2018-11-03 18:48:32 -07:00
Eric Liang	9a0f0db070	Add `ray stack` tool for debugging (#3213 )	2018-11-03 13:13:02 -07:00
Si-Yuan	5ce7ed7dad	Fix 'tempfile' docs (#3180 ) * Fix docs. * Update doc/source/tempfile.rst Co-Authored-By: suquark <suquark@gmail.com> * Remove doc for raylet socket.	2018-11-02 16:50:55 -07:00
Eric Liang	8c03683573	Add warning about using latest wheels (#3207 )	2018-11-02 15:41:10 -07:00
Robert Nishihara	e495ab5e7c	Fix some paths /tmp/raylogs -> /tmp/ray. (#3189 )	2018-11-02 12:10:53 -07:00
Robert Nishihara	57d6e98302	Update actor fault tolerance documentation. (#3175 )	2018-11-01 11:52:05 -07:00
Robert Nishihara	60f28040ea	Document fractional resources. (#3174 )	2018-11-01 10:50:56 -07:00
Eric Liang	cd284bb487	[rllib] Document env compatibility, Ape-X support for multi-agent (#3147 )	2018-10-31 21:59:34 -07:00
Richard Liaw	2086a57e61	[tune] Add Fractional GPU example/docs (#3169 ) * Add example for fractional GPU support * Update tune_mnist_keras.py * Update doc/source/tune-usage.rst	2018-10-31 18:53:16 -07:00
Eric Liang	a221f55b0d	[rllib] Add custom value functions, fix up and document multi-agent variable sharing (#3151 )	2018-10-29 19:37:27 -07:00
Eric Liang	af0c1174cd	[sgd] Merge sharded param server based SGD implementation (#3033 ) This includes most of the TF code used for the OSDI experiment. Perf sanity check on p3.16xl instances: Overall scaling looks ok, with the multi-node results within 5% of OSDI final numbers. This seems reasonable given that hugepages are not enabled here, and the param server shards are placed randomly. $ RAY_USE_XRAY=1 ./test_sgd.py --gpu --batch-size=64 --num-workers=N \ --devices-per-worker=M --strategy=<simple\|ps> \ --warmup --object-store-memory=10000000000 Images per second total gpus total \| simple \| ps ======================================== 1 \| 218 2 (1 worker) \| 388 4 (1 worker) \| 759 4 (2 workers) \| 176 \| 623 8 (1 worker) \| 985 8 (2 workers) \| 349 \| 1031 16 (2 nodes, 2 workers) \| 600 \| 1661 16 (2 nodes, 4 workers) \| 468 \| 1712 <--- OSDI perf was 1817	2018-10-27 21:25:02 -07:00
Robert Nishihara	658c14282c	Remove legacy Ray code. (#3121 ) * Remove legacy Ray code. * Fix cmake and simplify monitor. * Fix linting * Updates * Fix * Implement some methods. * Remove more plasma manager references. * Fix * Linting * Fix * Fix * Make sure class IDs are strings. * Some path fixes * Fix * Path fixes and update arrow * Fixes. * linting * Fixes * Java fixes * Some java fixes * TaskLanguage -> Language * Minor * Fix python test and remove unused method signature. * Fix java tests * Fix jenkins tests * Remove commented out code.	2018-10-26 13:36:58 -07:00
Richard Liaw	eff7cb4458	[tune] Fix SearchAlg finishing early (#3081 ) * Fix trial search alg finishing early * Fix lint * fix lint * nit fix	2018-10-22 12:17:13 -07:00
Eric Liang	59901a88a0	[rllib] Native support for Dict and Tuple spaces; fix Tuple action spaces; add prev a, r to LSTM (#3051 )	2018-10-20 15:21:22 -07:00
Robert Nishihara	9a2b5333ef	Add links for latest Python 3.7 wheels to documentation. (#3091 )	2018-10-19 12:15:22 -07:00
Peter Schafhalter	a41bbc10ef	Add password authentication to Redis ports (#2952 ) * Implement Redis authentication * Throw exception for legacy Ray * Add test * Formatting * Fix bugs in CLI * Fix bugs in Raylet * Move default password to constants.h * Use pytest.fixture * Fix bug * Authenticate using formatted strings * Add missing passwords * Add test * Improve authentication of async contexts * Disable Redis authentication for credis * Update test for credis * Fix rebase artifacts * Fix formatting * Add workaround for issue #3045 * Increase timeout for test * Improve C++ readability * Fixes for CLI * Add security docs * Address comments * Address comments * Adress comments * Use ray.get * Fix lint	2018-10-16 22:48:30 -07:00
Eric Liang	a9e454f6fd	[rllib] Include config dicts in the sphinx docs (#3064 )	2018-10-16 15:55:11 -07:00
Eric Liang	3c891c6ece	[rllib] Parallel-data loading and multi-gpu support for IMPALA (#2766 )	2018-10-15 11:02:50 -07:00
Richard Liaw	f9b58d7b02	[tune] Tweaks to Trainable and Verbosity (#2889 )	2018-10-11 23:42:13 -07:00
Robert Nishihara	d73ee36e60	Update links to use latest 0.5.3 wheels instead of 0.5.2. (#3018 )	2018-10-03 13:43:40 -07:00
Si-Yuan	cc7e2ecdd5	Change logfile names and also allow plasma store socket to be passed in. (#2862 )	2018-10-03 10:03:53 -07:00
Eric Liang	b45bed4bce	[rllib] Propagate model options correctly in ARS / ES, to action dist of PPO (#2974 ) * fix * fix * fix it * propagate conf to action dist * move carla example too * rr * Update policies.py * wip * lint	2018-10-01 12:49:39 -07:00
Eric Liang	814c35b7d7	[rllib] Simplify sample batch size and num envs config, n_step adjustment (#2995 ) * simplify vec batch requirements * Update rllib-training.rst * Update rllib-training.rst * Update rllib-training.rst * Update rllib-training.rst * Update rllib-training.rst * Update rllib-models.rst	2018-09-30 18:36:22 -07:00
Eric Liang	b06c604a51	[rllib] Add some more tuned atari results to documentation (#2991 ) * dqn results ++ * add scale * hour * fix * small dqn table * update * steps * upd * apex * up * add apex results * tip	2018-09-29 23:13:36 -07:00

1 2 3 4 5 ...

343 commits