hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-09 12:56:46 -04:00

Author	SHA1	Message	Date
Eric Liang	8b5827b9da	[rllib] Better document which methods are abstract and which ones are overrides (#3480 )	2018-12-08 16:28:58 -08:00
Eric Liang	462e6ef066	[rllib] Use smoothed version of collect metrics for DQN (#3491 ) * fix * lint	2018-12-07 18:36:23 -08:00
Tianming Xu	f6490f9bef	Resolve no handlers could be found for logger 'ray.worker' when importing ray (#3483 )	2018-12-06 20:46:53 -08:00
Eric Liang	8395523f81	[rllib] Copy data before passing to Ape-X learner thread (fixes transient plasma crashes) (#3484 )	2018-12-06 18:01:11 -08:00
Si-Yuan	c2c501bbe6	Experimental asyncio support (#2015 ) * Init commit for async plasma client * Create an eventloop model for ray/plasma * Implement a poll-like selector base on `ray.wait`. Huge improvements. * Allow choosing workers & selectors * remove original design * initial implementation of epoll-like selector for plasma * Add a param for `worker` used in `PlasmaSelectorEventLoop` * Allow accepting a `Future` which returns object_id * Do not need `io.py` anymore * Create a basic testing model * fix: `ray.wait` returns tuple of lists * fix a few bugs * improving performance & bug fixing * add test * several improvements & fixing * fix relative import * [async] change code format, remove old files * [async] Create context wrapper for the eventloop * [async] fix: context should return a value * [async] Implement futures grouping * [async] Fix bugs & replace old functions * [async] Fix bugs found in tests * [async] Implement `PlasmaEpoll` * [async] Make test faster, add tests for epoll * [async] Fix code format * [async] Add comments for main code. * [async] Fix import path. * [async] Fix test. * [async] Compatibility. * [async] less verbose to not annoy the CI. * [async] Add test for new API * [async] Allow showing debug info in some of the test. * [async] Fix test. * [async] Proper shutdown. * [async] Lint~ * [async] Move files to experimental and create API * [async] Use async/await syntax * [async] Fix names & styles * [async] comments * [async] bug fixing & use pytest * [async] bug fixing & change tests * [async] use logger * [async] add tests * [async] lint * [async] type checking * [async] add more tests * [async] fix bugs on waiting a future while timeout. Add more docs. * [async] Formal docs. * [async] Add typing info since these codes are compatible with py3.5+. * [async] Documents. * [async] Lint. * [async] Fix deprecated call. * [async] Fix deprecated call. * [async] Implement a more reasonable way for dealing with pending inputs. * [async] Fix docs * [async] Lint * [async] Fix bug: Type for time * [async] Set our eventloop as the default eventloop so that we can get it through `asyncio.get_event_loop()`. * [async] Update test & docs. * [async] Lint. * [async] Temporarily print more debug info. * [async] Use `Poll` as a default option. * [async] Limit resources. * new async implementation for Ray * implement linked list * bug fix * update * support seamless async operations * update * update API * fix tests * lint * bug fix * refactor names * improve doc * properly shutdown async_api * doc * Change the table on the index page. * Adjust table size. * Only keeps `as_future`. * change how we init connection * init connection in `ray.worker.connect` * doc * fix * Move initialization code into the module. * Fix docs & code * Update pyarrow version. * lint * Restore index.rst * Add known issues. * Apply suggestions from code review Co-Authored-By: suquark <suquark@gmail.com> * rename * Update async_api.rst * Update async_api.py * Update async_api.rst * Update async_api.py * Update worker.py * Update async_api.rst * fix tests * lint * lint * replace the magic number	2018-12-06 17:39:05 -08:00
Devin Petersohn	970babf31a	Removing the check about the size re: ray-project/ray#3450 (#3464 ) * Removing the check about the size re: ray-project/ray#3450 * Addressing comments * Update services.py	2018-12-06 16:59:24 -08:00
Eugene Vinitsky	7a7c6e53c8	[tune/rllib] Use cloudpickle to dump config (#3462 ) <!-- Thank you for your contribution! Please review https://github.com/ray-project/ray/blob/master/CONTRIBUTING.rst before opening a pull request. --> ## What do these changes do? JSON Logger now uses cloudpickle to dump the configs as welll, which pkls the functions needed for multi-agent replay. ## Related issue number <!-- Are there any issues opened that will be resolved by merging this change? -->	2018-12-06 15:52:44 -08:00
Eric Liang	412aaa5195	[tune] Deprecate ambiguous function values (use tune.function / tune.sample_from instead) (#3457 ) * wip * exclude	2018-12-06 11:35:20 -08:00
Eric Liang	d864f299d7	[rllib] fixes from dogfooding multi-agent (#3456 ) auto wrap multi-agent dict and tuple spaces by keeping a policy -> preprocessor in the sampler add some Q-learning debug stats report min, max of custom metrics better errors	2018-12-05 23:31:45 -08:00
Si-Yuan	2e6f9bedf2	Add the extra fallback for serialization (#3468 ) * Add the extra fallback for serialization. * Better comments & warnings. quotes. * Update test/runtest.py Co-Authored-By: suquark <suquark@gmail.com> * Update test/runtest.py Co-Authored-By: suquark <suquark@gmail.com> * linting * Don't hijack too much errors. * simplify the test * Update runtest.py * simplify	2018-12-05 13:09:08 -08:00
Eric Liang	93a9d32288	[docs] Switch docs to use rllib train instead of train.py	2018-12-04 17:36:06 -08:00
Richard Liaw	9d0bd50e78	[tune] Component notification on node failure + Tests (#3414 ) Changes include: - Notify Components on Requeue - Slight refactoring of Node Failure handling - Better tests	2018-12-04 14:47:31 -08:00
Eric Liang	ce355d13d4	[rllib] Allow envs to be auto-registered; add on_train_result callback with curriculum example (#3451 ) * train step and docs * debug * doc * doc * fix examples * fix code * integration test * fix * ... * space * instance * Update .travis.yml * fix test	2018-12-03 23:15:43 -08:00
Kristian Hartikainen	be6567e6fd	Tweak/exec attach info (#3447 ) * Add custom cluster name to exec info * Update submit info to match exec info	2018-12-03 21:39:43 -08:00
Eric Liang	d8205976e8	[rllib] Auto clip actions to Box space range; deprecate squash_to_range (#3426 ) * fix clip * tweak wording * remove squash entirely * Update rllib-models.rst * fix argument order * Apply suggestions from code review Co-Authored-By: ericl <ekhliang@gmail.com>	2018-12-03 19:55:25 -08:00
Eric Liang	7abfbfd2f7	[rllib] Better error message for unsupported non-atari image observation sizes (#3444 )	2018-12-03 01:24:36 -08:00
Eric Liang	13c8ce4d84	Update README.rst with 0.6.0 version number. (#3453 )	2018-12-01 19:16:45 -08:00
Robert Nishihara	0603e0b73a	Bump version from 0.5.3 to 0.6.0. (#3420 )	2018-12-01 11:39:36 -08:00
Eric Liang	07d8cbf414	[rllib] Support batch norm layers (#3369 ) * batch norm * lint * fix dqn/ddpg update ops * bn model * Update tf_policy_graph.py * Update multi_gpu_impl.py * Apply suggestions from code review Co-Authored-By: ericl <ekhliang@gmail.com>	2018-11-29 13:33:39 -08:00
Devin Petersohn	4d2010a852	Ship Modin with Ray. (#3109 )	2018-11-29 20:05:24 +01:00
Chunyang Wen	fd7e494344	Remove: duplicate feed_dict constructing (#3431 )	2018-11-29 10:21:46 -08:00
Kristian Hartikainen	7e319dbf0c	Automatically indent tune logger params (#3399 )	2018-11-29 00:15:50 -08:00
Eric Liang	c46ea2ff4b	Click 0.7 changes the naming convention for commands; fix this	2018-11-28 14:59:58 -08:00
Robert Nishihara	82863b5251	[autoscaler] Update autoscaler to use heartbeat batches. (#3409 )	2018-11-27 23:46:27 -08:00
Eric Liang	f0df97db6f	[rllib] example and docs on how to use parametric actions with DQN / PG algorithms (#3384 )	2018-11-27 23:35:19 -08:00
Eric Liang	0d56fc10cc	Move setproctitle to ray[debug] package (#3415 )	2018-11-27 09:50:59 -08:00
Eric Liang	e3c088fa1e	[rllib] PPO doesn't work with fractional num gpus (#3396 ) * frac ppo * gpu test	2018-11-27 01:14:10 -08:00
Eric Liang	aa94d3dd50	[autoscaler] Allow more than 5s from node creation to first heartbeat (#3385 )	2018-11-26 17:25:05 -08:00
Robert Nishihara	0f0099fb90	UI changes, fix the task timeline and add the object transfer timeline to UI. (#3397 ) * Saving * Fix cmake and remove object/task search boxes. * Add comment	2018-11-25 10:16:49 -08:00
Eric Liang	b85e7b43f3	[rllib] Refactor the sampler (#3387 ) * refactor * fix test * add perf test * Update sampler.py	2018-11-24 18:16:54 -08:00
Robert Nishihara	3856533065	Fix incompatibility with most recent version of Redis. (#3379 ) * Fix incompatibility with most recent version of Redis. * Fix * Fixes.	2018-11-24 16:36:38 -08:00
Eric Liang	18a8dbfcfb	[rllib] Clip DDPG ou-noise to avoid exceeding action bounds (#3386 ) Closes #2965	2018-11-24 00:56:50 -08:00
Eric Liang	55fca828ce	[rllib] Fix use_lstm option when using custom model with dict space (#3368 ) ## What do these changes do? This passes in the right obs space to the lstm model wrapper, so that it doesn't attempt to un-flatten the already processed dict observation. ## Related issue number Closes https://github.com/ray-project/ray/issues/3367	2018-11-23 22:51:08 -08:00
Eric Liang	8b76bab25c	[rllib] docs for td3 (#3381 ) * td3 doc * Update rllib-env.rst	2018-11-22 13:36:47 -08:00
Eric Liang	41b6b50d09	fix py3 (#3382 )	2018-11-22 11:43:52 -08:00
GiliR4t1qbit	b9ae5edf74	When getting a role/profile, catch only exception that indicates the role/profile already exists, allow others to be raised (#3383 )	2018-11-22 09:42:58 -08:00
Jones Wong	24bfe8ab76	Enable Twin Delayed DDPG for RLlib DDPG agent (#3353 )	2018-11-21 20:03:20 -08:00
Richard Liaw	784a6399b0	[tune] Node Fault Tolerance (#3238 ) This PR introduces single-node fault tolerance for Tune. ## Previous behavior: - Actors will be restarted without checking if resources are available. This can lead to problems if we lose resources. ## New behavior: - RUNNING trials will be resumed on another node on a best effort basis (meaning they will run if resources available). - If the cluster is saturated, RUNNING trials on that failed node will become PENDING and queued. - During recovery, TrialSchedulers and SearchAlgorithms should receive notification of this (via `trial_runner.stop_trial`) so that they don’t wait/block for a trial that isn’t running. Remaining questions: - Should `last_result` be consistent during restore? Yes; but not for earlier trials (trials that are yet to be checkpointed). - Waiting for some PRs to merge first (#3239) Closes #2851.	2018-11-21 12:38:16 -08:00
Richard Liaw	c24d87b4d1	[autoscaler] Submit command (#3312 )	2018-11-20 14:03:34 -08:00
Eric Liang	abdc3b592e	[rllib] Update multi-gpu impala numbers (#3327 )	2018-11-19 20:55:27 -08:00
Eric Liang	5972c29d28	[rllib] Set ape-x local exploration to 0, also load explorations before training steps (#3349 ) ## What do these changes do? This should fix high explorations being used after restore / for rollouts. ## Related issue number (dev list issue)	2018-11-19 20:36:25 -08:00
Eric Liang	afc48d7b77	Don't setpgid() on actors (#3347 )	2018-11-19 17:35:26 -08:00
Eric Liang	e4bb5d8d16	Fix logging when ray cluster utils is used	2018-11-18 21:49:27 -08:00
Wenting Shen	ab1e0f5c2f	support home path and relative path for temp-dir (#3329 )	2018-11-16 17:41:10 -08:00
Eric Liang	e0bf9d7305	Add debug string to raylet (#3317 ) * initial debug string * format * wip debug string * fix compile * fix * update * finished * to file * logs dir * use temp root * fix * override	2018-11-15 21:47:50 -08:00
Robert Nishihara	d10cb570ab	Rename _submit -> _remote. (#3321 )	2018-11-15 15:30:18 -08:00
Eric Liang	5723291db6	Raise exception if the node is nearly out of memory (#3323 ) * wip * add * comment * escape hatch * update * object store too * .2	2018-11-15 12:55:25 -08:00
Lewis Belcher	5319fd044c	Update redis version in setup.py (#3333 ) * `redis` has released a new version (https://github.com/andymccurdy/redis-py/releases/tag/3.0.0) * `ray` is not compatible with this version * This PR adds the "compatible release" operator for `redis` version 2.10.6.	2018-11-15 10:40:08 -08:00
Eric Liang	706dc1d473	[rllib] Add test for multi-agent support and fix IMPALA multi-agent (#3289 ) IMPALA support for multiagent was broken since IMPALA has a requirement that batch sizes be of a certain length. However multi-agent envs can create variable-length batches. Fix this by adding zero-padding as needed (similar to the RNN case).	2018-11-14 14:14:07 -08:00
andrewztan	57c7b4238e	KL Divergence Metrics (#3300 ) * added KL divergence metrics * fix	2018-11-13 23:12:35 -08:00

1 2 3 4 5 ...

995 commits