hiro/ray

Mirror of https://github.com/vale981/ray, synced 2025-03-06 10:31:39 -05:00

Author	SHA1	Message	Date
Eric Liang	52888e4c6f	[tune] Improve the tune Python API and variant generation (#1154 ) * new variant gen * wip * Sat Oct 21 18:21:34 PDT 2017 * update * comment * fix * update * update readme * fix * Update README.rst * Update README.rst * fix repeat * update * note on restore	2017-11-06 23:41:17 -08:00
Richard Liaw	6222ec3bd7	[tune] hyperband (#1156 ) * trial scheduler interface * remove * wip median stopping * remove * median stopping rule * update * docs * update * Revrt * update * hyperband untested * small changes before moving on * added endpoints * good changes * init tests * smore tests * unfinished tests * testing * testing code * morbugs * fixes * end * tests and typo * nit * try this * tests * testing * lint * lint * lint * comments and docs * almost screwed up * lint	2017-11-06 22:30:25 -08:00
Eric Liang	d06beacd84	[tune] Implement median stopping rule (#1170 ) * trial scheduler interface * remove * wip median stopping * remove * median stopping rule * update * docs * update * Revrt * update * comments * fix tesT	2017-11-03 11:25:02 -07:00
Philipp Moritz	fdf069bd1d	update version to 0.2.2 (#1178 )	2017-11-01 20:41:24 -07:00
Robert Nishihara	3317d38278	Replace hostnames with numerical IP addresses in redis address. (#1177 ) * Replace hostnames with numerical IP addresses in redis address. * Also do conversion for node_ip_address. Add test. * Simplifications.	2017-11-01 17:13:22 -07:00
Eric Liang	202e7bf19a	fix (#1174 )	2017-11-01 13:45:39 -07:00
Richard Liaw	dc66a2d7d5	[rllib] A3C Refactoring (#1166 ) * fixing policy * Compute Action is singular, fixed weird issue with arrays * remove vestige * extraneous ipdb * Can Drop in Pytorch Model * lint * naming * finish comments	2017-10-29 11:12:17 -07:00
Eric Liang	4cace0976d	[rllib] Fix DQN inefficiency, and cleanup for different modes of parallelism (#1151 ) * initial checkin * flake * dqn * docs * add tuned pong * remove * upd * add both * better gamma * update * Last nit	2017-10-29 10:52:30 -07:00
Richard Liaw	304c3cade4	[tune] 10 second timeout for stopping (#1169 ) * 10 second timeout for stopping * prints for travis * lint * try better returning mechanism * lint	2017-10-29 00:49:29 -07:00
Robert Nishihara	6852e8839e	Expose custom serializers through the API. (#1147 ) * Expose custom serializers through the API. * minor renaming * Add test. * Remove comment. * Clean up assertions.	2017-10-29 00:08:55 -07:00
Eric Liang	3b157ab933	[tune] Allow resources to not all be assigned to the driver (#1150 ) * dgpu * update * update * update * also support cmdline * limit * Update README.rst * documentation * typo * small coverage for driver_gpu_limit * lint * fix lint	2017-10-28 22:16:05 -07:00
Robert Nishihara	f59867850e	Upgrade to cloudpickle 0.4.1. (#1164 ) * Upgrade to cloudpickle 0.4.1. * Catch more general exceptions thrown by cloudpickle.	2017-10-28 01:35:35 -07:00
Eric Liang	2b6c7af8ad	[tune] Trial scheduler interface (#1160 ) * trial scheduler interface * remove * update	2017-10-27 13:29:15 -07:00
Richard Liaw	797f4fcbf3	Fixing Lint after flake upgrade (#1162 ) * Fixing Lint after flake upgrade * more lint fixes	2017-10-26 21:02:07 -05:00
Eric Liang	cd9dc398ff	[rllib] Support discrete observation spaces such as FrozenLake-v0 (#1140 ) * add * remove transform_shape * fix test * fix	2017-10-23 23:16:52 -07:00
Richard Liaw	0c9817fa76	[tune] Tune Pausing (#1136 ) * fix yaml bug * add ext agent * gpus * update * tuning * docs * Sun Oct 15 21:09:25 PDT 2017 * lint * update * Sun Oct 15 22:39:55 PDT 2017 * Sun Oct 15 22:40:17 PDT 2017 * Sun Oct 15 22:43:06 PDT 2017 * Sun Oct 15 22:46:06 PDT 2017 * Sun Oct 15 22:46:21 PDT 2017 * Sun Oct 15 22:48:11 PDT 2017 * Sun Oct 15 22:48:44 PDT 2017 * Sun Oct 15 22:49:23 PDT 2017 * Sun Oct 15 22:50:21 PDT 2017 * Sun Oct 15 22:53:00 PDT 2017 * Sun Oct 15 22:53:34 PDT 2017 * Sun Oct 15 22:54:33 PDT 2017 * Sun Oct 15 22:54:50 PDT 2017 * Sun Oct 15 22:55:20 PDT 2017 * Sun Oct 15 22:56:56 PDT 2017 * Sun Oct 15 22:59:03 PDT 2017 * fix * Update tune_mnist_ray.py * remove script trial * fix * reorder * fix ex * py2 support * upd * comments * comments * cleanup readme * fix trial * annotate * Update rllib.rst * init pausing * Docs, Lint * fix danglings and restore endpoint moved to trialrunner * renaming * nit * start always starts from checkpoint * smalls * nits * lint * last change	2017-10-22 23:04:15 -07:00
Eric Liang	81ca27dc08	[rllib] [minor] Rename agent_id to experiment_tag (#1143 ) * tagstr * doc * rename * fix test	2017-10-22 18:44:18 -07:00
Robert Nishihara	97c6369b49	Update arrow to include custom serializer for pytorch and register default serialization handlers. (#1152 ) * Update arrow to include custom serializer for pytorch. * Call pyarrow function for registering default custom serialization handlers. * Change class ID used in serialization context for object IDs.	2017-10-21 21:24:10 -07:00
Stephanie Wang	af47737bd5	Prototype distributed actor handles (#1137 ) * Add actor handle ID to the task spec * Local scheduler dispatches actor tasks according to a task counter per handle * Fix python test * Allow passing actor handles into tasks. Not completely working yet. Also this is very messy. * Fixes, should be roughly working now. * Refactor actor handle wrapper * Fix __init__ tests * Terminate actor when the original handle goes out of scope * TODO and a couple test cases * Make tests for unsupported cases * Fix Python mode tests * Linting. * Cache actor definitions that occur before ray.init() is called. * Fix export actor class * Deterministically compute actor handle ID * Fix __getattribute__ * Fix string encoding for python3 * doc * Add comment and assertion.	2017-10-19 23:49:59 -07:00
Philipp Moritz	2f45ac9e95	Make travis runs less verbose. (#1145 ) * make travis runs less verbose * update * more -q flags	2017-10-19 22:25:56 -07:00
Robert Nishihara	8ab56b5906	Always redirect redis stdout/stderr. (#1142 )	2017-10-19 17:09:09 -07:00
Eric Liang	782125ef3f	warn if agent failed (#1141 )	2017-10-19 11:39:25 -07:00
Eric Liang	5a50e0e1d7	[rllib] Add the ability to run arbitrary Python scripts with ray.tune (#1132 ) * fix yaml bug * add ext agent * gpus * update * tuning * docs * Sun Oct 15 21:09:25 PDT 2017 * lint * update * Sun Oct 15 22:39:55 PDT 2017 * Sun Oct 15 22:40:17 PDT 2017 * Sun Oct 15 22:43:06 PDT 2017 * Sun Oct 15 22:46:06 PDT 2017 * Sun Oct 15 22:46:21 PDT 2017 * Sun Oct 15 22:48:11 PDT 2017 * Sun Oct 15 22:48:44 PDT 2017 * Sun Oct 15 22:49:23 PDT 2017 * Sun Oct 15 22:50:21 PDT 2017 * Sun Oct 15 22:53:00 PDT 2017 * Sun Oct 15 22:53:34 PDT 2017 * Sun Oct 15 22:54:33 PDT 2017 * Sun Oct 15 22:54:50 PDT 2017 * Sun Oct 15 22:55:20 PDT 2017 * Sun Oct 15 22:56:56 PDT 2017 * Sun Oct 15 22:59:03 PDT 2017 * fix * Update tune_mnist_ray.py * remove script trial * fix * reorder * fix ex * py2 support * upd * comments * comments * cleanup readme * fix trial * annotate * Update rllib.rst	2017-10-18 11:49:28 -07:00
Robert Nishihara	f3e3c7ec71	Add is_actor_checkpoint_method to TaskSpec. (#1117 ) * Add is_actor_checkpoint_method to TaskSpec. * Fix linting. * Fix rebase error. * Fix errors from rebase.	2017-10-15 16:52:10 -07:00
Eric Liang	802941994d	[rllib] Use RLlib preprocessors in DQN (fixes PongDeterministic-v4) (#1124 ) * fix pong * rename * update	2017-10-14 20:16:36 -07:00
Stephanie Wang	15486a14a0	Refactor actor task queues (#1118 ) * Refactor add_task_to_actor_queue into queue_actor_task and insert_actor_task_queue * Refactor actor task queue to share the waiting task queue * Fix	2017-10-13 20:52:11 -07:00
Eric Liang	79ea205b3e	[rllib] Initial work on integrating hyperparameter search tool (#1107 ) * clean up train * update * update train script * add tuned examples * add agent catalog * add tune lib * update * fix * testS * remove * train docs * comments * todo * fix resource parsing * fix cr test * add test * try to fix travis test	2017-10-13 16:18:16 -07:00
Stephanie Wang	3764f2f2e1	Actor checkpointing with object lineage reconstruction (#1004 ) * Worker reports error in previous task, actor task counter is incremented after task is successful * Refactor actor task execution - Return new task counter in GetTaskRequest - Update worker state for actor tasks inside of the actor method executor * Manually invoked checkpoint method * Scheduling for actor checkpoint methods * Fix python bugs in checkpointing * Return task success from worker to local scheduler instead of actor counter * Kill local schedulers halfway through actor execution instead of waiting for all tasks to execute once * Remove redundant actor tasks during dispatch, reconstruct missing dependencies for actor tasks * Make executor for temporary actor methods * doc * Set default argument for whether the previous task was a success * Refactor actor method call * Simplify checkpoint task submission * lint * fix philipp's comments * Add missing line * Make actor reconstruction tests run faster * Unimportant whitespace. * Unimportant whitespace. * Update checkpoint method signature * Documentation and handle exceptions during checkpoint save/resume * Rename get_task message field to actor_checkpoint_failed * Fix bug. * Remove debugging check, redirect test output	2017-10-12 09:53:32 -07:00
Richard Liaw	379b0604b4	Fixing MeanSTDFilter (#1101 ) * Fixing MeanSTDFilter * documentation	2017-10-11 18:16:08 -07:00
Philipp Moritz	0684258d2e	Update arrow to include pandas serialization (#1102 ) * update arrow to include pandas serialization * update	2017-10-10 22:16:35 -07:00
Eric Liang	b1660c4edf	[rllib] Refactor to support passing custom env_creator function (#1096 ) * refactor to use env creator * doc * lint	2017-10-10 12:49:42 -07:00
Robert Nishihara	8f1a73f041	Allow Ray to be built without UI by setting INCLUDE_UI=0. (#1094 ) * Allow building Ray without UI by setting INCLUDE_UI=0. * Fix bash. * Fix linting.	2017-10-09 23:32:38 -07:00
Robert Nishihara	a52a1e893f	Automatically set CUDA_VISIBLE_DEVICES when worker gets task. (#1044 ) * Automatically set CUDA_VISIBLE_DEVICES when worker gets task. * Add test.	2017-10-06 18:38:08 -07:00
Robert Nishihara	4669c59fa8	Release GPU resources as soon as an actor exits. (#1088 ) * Release GPU resources as soon as an actor exits. * Add a test. * Store local_scheduler_id and driver_id in the worker object instead of the actor object.	2017-10-06 17:58:19 -07:00
Stephanie Wang	aebe9f9374	Fix actor garbage collection by breaking cyclic references (#1064 ) * Fix bug in wait_for_pid_to_exit, add test for actor deletion. * Fix actor garbage collection by breaking cyclic references * Add test for calling actor method immediately after actor creation. * Fix bug, must dispatch tasks when workers are killed. * Fix python test * Fix cyclic reference problem by creating ActorMethod objects on the fly. * Try simply increasing the time allowed for many_drivers_test.py.	2017-10-05 00:55:33 -07:00
Robert Nishihara	971becc905	[rllib] Use NoFilter instead of MeanStdFilter for PPO. (#1082 ) * Make NoFilter the default observation filter for PPO. * Make reward filter NoFilter for PPO.	2017-10-04 21:31:17 -07:00
Eric Liang	6ecc899cf2	[rllib] Fix DQN checkpoint/restore and enable test in jenkins (#1063 ) * fix dqn restore and add test * Update .gitignore * Update test_checkpoint_restore.py * add checkpoint restore	2017-10-03 23:17:54 -07:00
Richard Liaw	cb6dea94bc	[rllib] Fix Preprocessor for ATARI (#1066 ) * Removing squeeze, fix atari preprocessing * nit comment * comments * jenkins * Lint	2017-10-03 18:45:02 -07:00
Philipp Moritz	57bd1d6ff5	Specialize Serialization for OrderedDict (#1035 ) Specialize Serialization for OrderedDict and defaultdict	2017-10-02 17:33:10 -07:00
Philipp Moritz	b94d85fb5d	[rllib] Fix logging to Athena (#1058 ) * Fix logging to Athena * fixes	2017-10-02 17:16:52 -07:00
Richard Liaw	54aaeb25ab	fixing (#1048 )	2017-10-01 22:34:55 -07:00
Robert Nishihara	2892955119	Pass string into json.loads, not bytes object. (#991 )	2017-10-01 14:56:39 -07:00
Robert Nishihara	6d393828db	Bump version number to 0.2.1. (#1026 )	2017-10-01 12:33:13 -07:00
Robert Nishihara	ad61af7333	Workaround for passing empty list to ray.wait. (#1043 ) * Workaround for passing empty list to ray.wait. * Add test for passing empty list to wait.	2017-10-01 11:45:02 -07:00
Augusto Yao	0d90a17426	Pass cleanup argument to start_monitor. (#1040 )	2017-09-30 15:35:25 -07:00
Wapaul1	97b3355adc	Register Class Only Creates Entry in Redis Once (#1038 ) Don't export the same custom class definition multiple times.	2017-09-30 15:30:27 -07:00
Richard Liaw	16e82b43d1	[rllib] Changes for preprocessors (#1033 ) * Changes for preprocessors * removed comments * Changes + push for lint * linted * adding dependency for travis * linting won't pass * reordering * needed for testing * added comments * pip it * pip dependencies	2017-09-30 13:11:20 -07:00
Alexey Tumanov	2d0f439b7b	hugepage + plasma directory support plumbing + documentation (#1030 ) * hugepage + plasma directory support plumbing + documentation * Indentation fix. * huge_pages_enabled --> huge_pages * One more change	2017-09-30 09:56:52 -07:00
Robert Nishihara	b991dc8900	Add flag for ignoring the UI, don't start UI in jenkins tests. (#1021 )	2017-09-29 15:22:51 -07:00
Eric Liang	9f3a4fce50	[rllib] Parallelize sample collection and gradient computation in DQN (#746 ) * wip * works with cartpole * lint * fix pg * comment * action dist rename * preprocessor * fix test * typo * fix the action[0] nonsense * revert * satisfy the lint * wip * wip * works with cartpole * lint * fix pg * comment * action dist rename * preprocessor * fix test * typo * fix the action[0] nonsense * revert * satisfy the lint * Minor indentation changes. * fix merge * add humanoid * initial dqn refactor * remove tfutil * fix calls * fix tf errors 1 * closer * runs now * lint * tensorboard graph * fix linting * more 4 space * fix * fix linT * more lint * oops * es parity * remove example.py * fix training bug * add cartpole demo * try fixing cartpole * allow model options, configure cartpole * debug * simplify * no dueling * avoid out of file handles * Test dqn in jenkins. * Minor formatting. * lint * fix py3 * fix issue * remove chekcpoint * revert * Fixit * sanity check configs * update cuda * fix * parallel gradient computation * update * upd * bug * upd * always record training stats * fix * comments * revert assert * add gpu mask * fofset * a tie * Merge * fix * fix * fix examples * A3C -> DQN * fix dqn test * remove submodule * fix linting	2017-09-29 00:06:51 -07:00

5545 commits