hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

Author	SHA1	Message	Date
Eric Liang	1d2a28ab07	[rllib] test all combinations of {obs_space} x {action_space} (#1449 )	2018-01-24 11:03:43 -08:00
Robert Nishihara	5acc98e629	Update arrow with better dataframe serialization and get rid of custo… (#1413 ) * Update arrow with better dataframe serialization and get rid of custom dataframe serializers. * Update plasma client API. * Fix potential bug. * Bug fix. * Update arrow to use deduplicated file descriptors and mutable buffers. * Fix tests. * Update commit. * Update commit. * Update commit. * Update commit. * Update commit * Update commit back to arrow codebase.'	2018-01-24 10:03:29 -08:00
Alexey Tumanov	f1303291b4	Ray scheduler spillback plumbing + mechanism (#1362 ) * spillback mechanism and plumbing : adding spillback counter + timestamp * linting fix * documentation * Fix argument name.	2018-01-23 20:18:12 -08:00
Roy Fox	4b0ef5eb2c	[rllib] Behavior Cloning (#1400 ) * Behavior Cloning * episode_reward_mean -> mean_loss * removing vestigial code * punctuation * unnecessary * Behavior Cloning * Behavior Cloning * Update __init__.py	2018-01-23 10:50:45 -08:00
Eric Liang	ee36effd8e	[rllib] Add n-step Q learning for DQN (#1439 ) * n-step * add sample adjustm * Oops * fix nstep * metric adjustment * Sat Jan 20 23:30:34 PST 2018 * Sun Jan 21 16:40:46 PST 2018 * Mon Jan 22 22:24:57 PST 2018	2018-01-23 10:31:19 -08:00
Devin Petersohn	4aca016bff	Adding series and a way to validate our API. (#1435 ) * Adding series and a way to validate our API. * Moving partitions into protected status	2018-01-21 19:20:54 -08:00
Peter Schafhalter	83949a533b	[autoscaler] Increased head and worker storage to 25 GiB (#1401 ) * Increased head and worker storage to 25 GiB * Update example.yaml	2018-01-21 13:09:29 -08:00
Eric Liang	a2b190e65b	Fix occasional task timeline failure to get task ids (#1442 )	2018-01-21 12:04:44 -08:00
Eric Liang	424bd7f74d	[rllib] improve custom env docs (#1447 ) * env docs * add env * update env * Fri Jan 19 18:55:34 PST 2018	2018-01-19 21:36:18 -08:00
Eric Liang	e216766bbc	[rllib] Update docs with api and components overview figures (#1443 )	2018-01-19 10:08:45 -08:00
eugenevinitsky	37076a9ff8	Multiagent model using concatenated observations (#1416 ) * working multi action distribution and multiagent model * currently working but the splits arent done in the right place * added shared models * added categorical support and mountain car example * now compatible with generalized advantage estimation * working multiagent code with discrete and continuous example * moved reshaper to utils * code review changes made, ppo action placeholder moved to model catalog, all multiagent code moved out of fcnet * added examples in * added PEP8 compliance * examples are mostly pep8 compliant * removed all flake errors * added examples to jenkins tests * fixed custom options bug * added lines to let docker file find multiagent tests * shortened example run length * corrected nits * fixed flake errors	2018-01-18 19:51:31 -08:00
Peter Schafhalter	215d526e0d	Load evaluation configuration from checkpoint (#1392 )	2018-01-17 10:51:33 -08:00
Eric Liang	b8811cbe34	[autoscaling] increase connect timeout, boto retries, and check subnet conf (#1422 ) * some autoscaling config tweaks * Sun Jan 14 13:56:55 PST 2018 * Mon Jan 15 14:21:09 PST 2018 * increase backoff * Mon Jan 15 14:40:47 PST 2018 * check boto version	2018-01-16 16:11:09 -08:00
Robert Nishihara	eac11c252c	Update wheel in autoscaler example. (#1408 )	2018-01-13 01:06:23 -08:00
Yaroslav Bulatov	78fb3c5ed9	[autoscaler] Fix ValueError: Missing required config keyavailability_zoneof type str	2018-01-13 00:59:15 -08:00
Richard Liaw	d4592382a4	[tune][minor] Fixes (#1383 )	2018-01-11 18:14:20 -08:00
Philipp Moritz	1290072764	[rllib] Expose PPO evaluator resource requirements (#1391 )	2018-01-11 11:09:01 -08:00
Peter Schafhalter	a59a9e20af	Added option for availability zone (#1393 )	2018-01-09 13:49:47 -08:00
Devin Petersohn	112ef07563	Adding all DataFrame methods with NotImplementedErrors (#1403 ) * Adding all DataFrame methods with NotImplementedErrors * Moving dataframe creation into function call	2018-01-07 12:00:16 -08:00
Robert Nishihara	1e0dfca2dc	Remove pyarrow version check. (#1394 )	2018-01-06 22:42:55 -08:00
Eric Liang	c60ccbad46	[carla] [rllib] Add support for carla nav planner and scenarios from paper (#1382 ) * wip * Sat Dec 30 15:07:28 PST 2017 * log video * video doesn't work well * scenario integration * Sat Dec 30 17:30:22 PST 2017 * Sat Dec 30 17:31:05 PST 2017 * Sat Dec 30 17:31:32 PST 2017 * Sat Dec 30 17:32:16 PST 2017 * Sat Dec 30 17:34:11 PST 2017 * Sat Dec 30 17:34:50 PST 2017 * Sat Dec 30 17:35:34 PST 2017 * Sat Dec 30 17:38:49 PST 2017 * Sat Dec 30 17:40:39 PST 2017 * Sat Dec 30 17:43:00 PST 2017 * Sat Dec 30 17:43:04 PST 2017 * Sat Dec 30 17:45:56 PST 2017 * Sat Dec 30 17:46:26 PST 2017 * Sat Dec 30 17:47:02 PST 2017 * Sat Dec 30 17:51:53 PST 2017 * Sat Dec 30 17:52:54 PST 2017 * Sat Dec 30 17:56:43 PST 2017 * Sat Dec 30 18:27:07 PST 2017 * Sat Dec 30 18:27:52 PST 2017 * fix train * Sat Dec 30 18:41:51 PST 2017 * Sat Dec 30 18:54:11 PST 2017 * Sat Dec 30 18:56:22 PST 2017 * Sat Dec 30 19:05:04 PST 2017 * Sat Dec 30 19:05:23 PST 2017 * Sat Dec 30 19:11:53 PST 2017 * Sat Dec 30 19:14:31 PST 2017 * Sat Dec 30 19:16:20 PST 2017 * Sat Dec 30 19:18:05 PST 2017 * Sat Dec 30 19:18:45 PST 2017 * Sat Dec 30 19:22:44 PST 2017 * Sat Dec 30 19:24:41 PST 2017 * Sat Dec 30 19:26:57 PST 2017 * Sat Dec 30 19:40:37 PST 2017 * wip models * reward bonus * test prep * Sun Dec 31 18:45:25 PST 2017 * Sun Dec 31 18:58:28 PST 2017 * Sun Dec 31 18:59:34 PST 2017 * Sun Dec 31 19:03:33 PST 2017 * Sun Dec 31 19:05:05 PST 2017 * Sun Dec 31 19:09:25 PST 2017 * fix train * kill * add tuple preprocessor * Sun Dec 31 20:38:33 PST 2017 * Sun Dec 31 22:51:24 PST 2017 * Sun Dec 31 23:14:13 PST 2017 * Sun Dec 31 23:16:04 PST 2017 * Mon Jan 1 00:08:35 PST 2018 * Mon Jan 1 00:10:48 PST 2018 * Mon Jan 1 01:08:31 PST 2018 * Mon Jan 1 14:45:44 PST 2018 * Mon Jan 1 14:54:56 PST 2018 * Mon Jan 1 17:29:29 PST 2018 * switch to euclidean dists * Mon Jan 1 17:39:27 PST 2018 * Mon Jan 1 17:41:47 PST 2018 * Mon Jan 1 17:44:18 PST 2018 * Mon Jan 1 17:47:09 PST 2018 * Mon Jan 1 20:31:02 PST 2018 * Mon Jan 1 20:39:33 PST 2018 * Mon Jan 1 20:40:55 PST 2018 * Mon Jan 1 20:55:06 PST 2018 * Mon Jan 1 21:05:52 PST 2018 * fix env path * merge richards fix * fix hash * Mon Jan 1 22:04:00 PST 2018 * Mon Jan 1 22:25:29 PST 2018 * Mon Jan 1 22:30:42 PST 2018 * simplified reward function * add framestack * add env configs * simplify speed reward * Tue Jan 2 17:36:15 PST 2018 * Tue Jan 2 17:49:16 PST 2018 * Tue Jan 2 18:10:38 PST 2018 * add lane keeping simple mode * Tue Jan 2 20:25:26 PST 2018 * Tue Jan 2 20:30:30 PST 2018 * Tue Jan 2 20:33:26 PST 2018 * Tue Jan 2 20:41:42 PST 2018 * ppo lane keep * simplify discrete actions * Tue Jan 2 21:41:05 PST 2018 * Tue Jan 2 21:49:03 PST 2018 * Tue Jan 2 22:12:23 PST 2018 * Tue Jan 2 22:14:42 PST 2018 * Tue Jan 2 22:20:59 PST 2018 * Tue Jan 2 22:23:43 PST 2018 * Tue Jan 2 22:26:27 PST 2018 * Tue Jan 2 22:27:20 PST 2018 * Tue Jan 2 22:44:00 PST 2018 * Tue Jan 2 22:57:58 PST 2018 * Tue Jan 2 23:08:51 PST 2018 * Tue Jan 2 23:11:32 PST 2018 * update dqn reward * Thu Jan 4 12:29:40 PST 2018 * Thu Jan 4 12:30:26 PST 2018 * Update train_dqn.py * fix	2018-01-05 21:32:41 -08:00
Eric Liang	77af2b5516	[autoscaler] Sometimes instances are restarted even when they don't need to be (#1385 ) * fix hash * Update autoscaler.py	2018-01-02 16:34:46 -08:00
Eric Liang	1bc55e182d	Update the pip wheel in example.yaml and add docs (#1381 )	2018-01-01 13:02:05 -08:00
Eric Liang	6e6674a824	[rllib] Split docs into user and development guide (#1377 ) * docs * Update README.rst * Sat Dec 30 15:23:49 PST 2017 * comments * Sun Dec 31 23:33:30 PST 2017 * Sun Dec 31 23:33:38 PST 2017 * Sun Dec 31 23:37:46 PST 2017 * Sun Dec 31 23:39:28 PST 2017 * Sun Dec 31 23:43:05 PST 2017 * Sun Dec 31 23:51:55 PST 2017 * Sun Dec 31 23:52:51 PST 2017	2018-01-01 11:10:44 -08:00
Eric Liang	b6c42f96be	Auto-scale ray clusters based on GCS load metrics (#1348 ) This adds (experimental) auto-scaling support for Ray clusters based on GCS load metrics. The auto-scaling algorithm is as follows: Based on current (instantaneous) load information, we compute the approximate number of "used workers". This is based on the bottleneck resource, e.g. if 8/8 GPUs are used in a 8-node cluster but all the CPUs are idle, the number of used nodes is still counted as 8. This number can also be fractional. We scale that number by 1 / target_utilization_fraction and round up to determine the target cluster size (subject to the max_workers constraint). The autoscaler control loop takes care of launching new nodes until the target cluster size is met. When a node is idle for more than idle_timeout_minutes, we remove it from the cluster if that would not drop the cluster size below min_workers. Note that we'll need to update the wheel in the example yaml file after this PR is merged.	2017-12-31 14:39:57 -08:00
Robert Nishihara	e970e24ea5	Update arrow, and pass memcopy_threads into put. (#1374 )	2017-12-31 13:32:06 -08:00
Richard Liaw	3304099cc4	[rllib] Evaluators and Optimizers Refactoring (#1339 )	2017-12-30 00:24:54 -08:00
Eric Liang	22c7c87e14	[rllib] [tune] Custom preprocessors and models, various fixes (#1372 )	2017-12-28 13:19:04 -08:00
Philipp Moritz	3d224c4edf	Second Part of Internal API Refactor (#1326 )	2017-12-26 16:22:04 -08:00
Richard Liaw	4bb5b6bd5b	[rllib] A3C Configurations (#1370 ) * initial introduction of a3c configs * fix sample batch * flake but need to check save * save,resotre * fix * pickles * entropy * fix * moving ppo * results * jenkins	2017-12-24 12:25:13 -08:00
Richard Liaw	b217a5ef14	[rllib] Fix Pong-PPO tuned example Config (#1369 )	2017-12-23 01:36:33 -08:00
Eric Liang	43e78217f8	Thu Dec 21 23:19:24 PST 2017 (#1367 )	2017-12-22 17:29:45 -08:00
Robert Nishihara	22460ff7af	Use Anaconda for autoscaling example and add example config for devel… (#1361 ) * Use Anaconda for autoscaling example and add example config for development. * Install Python2 for building the web ui.	2017-12-22 01:59:02 -08:00
Eric Liang	0ae660ce4e	[carla] In carla example, save all images and measurements to local disk (#1350 ) * revamp saving * smaller jpgs * hide verbose * Tue Dec 19 22:25:01 PST 2017 * make sure temp dirs sort lexiographically * save total reward too * zero pad i * 160x160 dqn * ever higher res dqn	2017-12-21 15:19:55 -08:00
Philipp Moritz	3a301c3d56	Fix pyarrow version check (#1360 )	2017-12-21 13:00:36 -08:00
Devin Petersohn	a75a473d7f	Add a distributed Dataframe API to Ray (#1330 ) * Adding dataframe object and minor APIs * Adding reduce functionality * Adding some print and making reduce work on current Ray * Cleanup * Added new functionality and docs. * Adding more functionality. * New functionality with older cleanup * Complying with flake8 formatting * Added tests and addressed reviewer comments * Complying with flake8. * Adding pandas to travis and requirements doc * Fixing flake8 failures * Fixing flake8 errors from imports * Fixing import error * Fixing import errors * Addressing reviewer comments * Addressing lint error	2017-12-20 09:31:22 -08:00
Cathy Wu	772527caa4	[rllib] Support 1-dimensional action spaces (PPO) (#1347 ) * Small fix for supporting custom preprocessors * PEP8 * Remove squeeze from actions	2017-12-19 14:17:06 -08:00
Eric Liang	6724f57b03	[Examples] Add Carla test env (#1343 ) * add carla example * add reward * set obs * Sun Dec 17 16:06:00 PST 2017 * add spec * fix measurement * add train script * resize to 80x80 * null * initial small training run * robustify env, clean up action space * clean up vars * switch to town2 which is faster * tunify train.py * add discrete mode * update * fix excessive brakinG * fix the weather * rename * redirect output and from future import * doc * update * fix rebase * allow dqn gpu growht * adjust dqn hyperparams * better ppo parameters	2017-12-19 12:57:58 -08:00
Melih Elibol	24b93b1123	fixes default type for product of empty shape. (#1341 )	2017-12-18 17:41:44 -08:00
Eric Liang	47b1f02d3e	[rllib] Pull out multi-gpu optimizer as a generic class (#1313 )	2017-12-17 15:59:57 -08:00
Cathy Wu	53e736fe01	[rllib] Small fix for supporting custom preprocessors (#1334 ) * Small fix for supporting custom preprocessors * PEP8 * fix test	2017-12-17 04:37:29 -08:00
Eric Liang	bab44837e0	[tune] Tensorboard logger incorrectly reports training iteration as cur timestep value	2017-12-16 23:30:15 -08:00
Eric Liang	d21ea0ca45	Switch EC2 example config to use AWS deep learning AMI + latest Ray wheel (#1331 ) * update * install --user	2017-12-16 17:39:46 -08:00
Eric Liang	f5ea44338e	EC2 cluster setup scripts and initial version of auto-scaler (#1311 )	2017-12-15 23:56:39 -08:00
Eric Liang	fbf1806b8a	[tune] Clean up result logging: move out of /tmp, add timestamp (#1297 )	2017-12-15 14:19:08 -08:00
Stephanie Wang	12fdb3f53a	Convert actor dummy objects to task execution edges. (#1281 ) * Define execution dependencies flatbuffer and add to Redis commands * Convert TaskSpec to TaskExecutionSpec * Add execution dependencies to Python bindings * Submitting actor tasks uses execution dependency API instead of dummy argument * Fix dependency getters and some cleanup for fetching missing dependencies * C++ convention * Make TaskExecutionSpec a C++ class * Convert local scheduler to use TaskExecutionSpec class * Convert some pointers to references * Finish conversion to TaskExecutionSpec class * fix * Fix * Fix memory errors? * Cast flatbuffers GetSize to size_t * Fixes * add more retries in global scheduler unit test * fix linting and cast fbb.GetSize to size_t * Style and doc * Fix linting and simplify from_flatbuf.	2017-12-14 20:47:54 -08:00
Richard Liaw	c5c83a4465	[rllib] PPO and A3C unification (#1253 )	2017-12-14 01:08:23 -08:00
Richard Liaw	cabbd27c56	[rllib] Support Nested Configuration Merging (#1268 )	2017-12-13 14:39:01 -08:00
Robert Nishihara	f75b51d178	Register Common.error with local scheduler extension module. (#1316 ) * Register Common.error with local scheduler extension module. * Add test.	2017-12-13 11:55:54 -08:00
Richard Liaw	b6a35e0395	[rllib] Introduce pip install rllib (#1310 ) * update setup * more dependencies	2017-12-12 13:58:28 -08:00

... 103 104 105 106 107 ...

5545 commits