hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-08 11:31:40 -05:00

Author	SHA1	Message	Date
Jones Wong	3ac8fd7ee8	Exploration with Parameter Space Noise (#4048) * enable parameter space noise for exploration * enable parameter space noise for exploration * yapf formatted * remove the usage of scipy softmax available in the latest version only * enable subclass that has no parameter_noise in the config * run user specified callbacks and test parameter space noise in multi node setting * formatted by yapf * Update dqn.py * lint	2019-02-20 22:35:18 -08:00
Philipp Moritz	bcd5af78c7	Lint Cython files (#4097)	2019-02-20 22:29:25 -08:00
Richard Liaw	acf4d53b55	[autoscaler] Fix redirects, fix submit (#4085)	2019-02-20 21:35:33 -08:00
Yuhong Guo	3549cd8195	Add the Delete function in GCS (#4081) * Add the Delete function in GCS * Unify BatchDelete and Delete * Fix comment * Lint * Refine according to comments * Unify test. * Address comment * C++ lint * Update ray_redis_module.cc	2019-02-21 13:33:37 +08:00
Yuhong Guo	1f864a02bc	Add option of load_code_from_local which is required in cross-language ray call. (#3675)	2019-02-21 12:37:17 +08:00
Eric Liang	e3066d1fa5	[autoscaler] Try making GCP node provider thread-safe	2019-02-20 16:35:20 -08:00
Hao Chen	a99676e39b	[Java] lint unused imports (#4100)	2019-02-20 12:37:04 -08:00
Csordás Róbert	b2677fabc0	[tune] Fix not saving a checkpoint in certain cases (issue #4041) (#4053) ## What do these changes do? It saves the checkpoint if needed, regardless of what the scheduler has returned. Until now, it did not save the checkpoint when the scheduler returned TrialScheduler.PAUSE, which prevented PopulationBasedTraining from saving any checkpoints in certain cases. See issue #4041 for more details. ## Related issue number #4041	2019-02-20 11:54:28 -08:00
mika	64c95aea85	[rllib] Update README.md for qmix (#4101) ## What do these changes do? Fixed PyMARL repository path. ## Related issue number N/A	2019-02-20 10:21:08 -08:00
alegithub111	67fa0b5c25	Refine JNI bazel script to make it suitable for more systems (#4060) * Refine JNI bazel script to make it suitable for more systems * Update BUILD.bazel the script format has changed * Update BUILD.plasma the script format has changed * Lint bazel/BUILD.plasma BUILD.bazel	2019-02-20 22:37:41 +08:00
Robert Nishihara	e7651b1117	Fix excessive buffering of worker stdout/stderr. (#4094) * Start workers with 'python -u' to prevent buffering of prints. * Set sys.stdout and sys.stderr. * Add comment.	2019-02-19 20:20:47 -08:00
Robert Nishihara	5fe7b1c618	Make object_manager_test::test_object_transfer_retry less flaky. (#4057) * Make object_manager_test::test_object_transfer_retry less flaky. * Make the test pass.	2019-02-19 20:03:11 -08:00
Eric Liang	e9ee38ace2	More compact format for worker logs (#4092)	2019-02-19 19:53:43 -08:00
Robert Nishihara	c92a867c8b	Fix log monitor CPU utilization. (#4091)	2019-02-19 12:19:21 -08:00
Wang Qing	794a093249	Add runtime_context to get some runtime fields in worker (#4065)	2019-02-19 15:57:30 +08:00
Wang Qing	7574757391	Fix crash for Java task's `task.argument()` in state. (#4063)	2019-02-19 12:46:07 +08:00
Philipp Moritz	cfc7e2c5a9	Fix modin test (#4069)	2019-02-18 12:17:36 -08:00
Eric Liang	6e46d75554	[tune] Remove slow gzip of checkpoints; ignore jupyter stop errors (#4076) * fix gzip * ignore jupyter	2019-02-18 01:30:13 -08:00
Eric Liang	f8bef004da	[rllib] Improve error message for bad envs, add remote env docs (#4044) * commit * fix up rew	2019-02-18 01:28:19 -08:00
Robert Nishihara	b78d77257b	Speed up test/component_failures_test.py::test_actor_creation_node_failure. (#4056)	2019-02-17 15:35:54 -08:00
Robert Nishihara	5a9098891f	Add serialization test for more collection types. (#3982) * Add serialization test for more collection types. * Reorganize serialization tests a little. * Update	2019-02-17 13:57:33 -08:00
Philipp Moritz	f51969964d	Fix linting on master (#4077)	2019-02-17 13:55:40 -08:00
Megan Kawakami	346885068c	[rllib] add torch pg (#3857) * add torch pg * add torch imports * added torch pg * working torch pg implementation * add pg pytorch * Update a3c.py * Update a3c.py * Update torch_policy_graph.py * Update torch_policy_graph.py	2019-02-16 19:54:14 -08:00
Zekun Shi	a708ab66f5	Add simplex action space and dirichlet action distribution (#4070) * add simplex action space and dirichlet action distribution * Update and rename spaces.py to extra_spaces.py * Update __init__.py * Update catalog.py * Fix python 2 * Update extra_spaces.py * change Simplex.contains() to return False	2019-02-16 12:44:59 -08:00
Kristian Hartikainen	0cc5c88075	[tune] Add number of trials to the trial runner logger (#4068)	2019-02-16 01:12:59 -08:00
Yu Kobayashi	d2d66c576e	Support non ascii characters in the source code (#4047)	2019-02-16 11:45:44 +08:00
Hao Chen	de17443dc2	Propagate backend error to worker (#4039)	2019-02-16 11:39:15 +08:00
William Ma	4be3d0c5d3	Update shipped modin to 0.3.1 (#4058)	2019-02-15 15:49:38 -08:00
Robert Nishihara	2d07df7f3f	Replace '__main__' with "__main__". (#4055)	2019-02-15 13:32:43 -08:00
Robert Nishihara	5f71751891	API cleanups. Remove worker argument. Remove some deprecated arguments. (#4025) * Remove worker argument from API methods. * Remove deprecated arguments and deprecate redirect_output and redirect_worker_output. * Fix	2019-02-15 10:49:16 -08:00
Hao Chen	042ad84573	Simplify Cython ID types and fix bug of ActorCheckpointID (#4045)	2019-02-15 20:15:16 +08:00
Richard Liaw	bb7c4ce9c4	[tune] Improve error message when Ray crashes (#3795)	2019-02-15 01:04:17 -08:00
Richard Liaw	7cf62a10cd	[tune] Fix TF checkpointing example (#4043) Closes #3912, closes #3963.	2019-02-15 00:30:27 -08:00
Stephanie Wang	3684e5bc0d	Fix memory leak in Redis by using auto memory management (#4054) * Table appends should always succeed * Use Redis auto memory management * Remove unneeded namespace	2019-02-14 19:51:18 -08:00
Eric Liang	0c0bd4d41c	[rllib] Use model.value_function() in MARWIL (#4036) * fix marwil * add ph * fix	2019-02-14 19:35:21 -08:00
William Ma	8ee53297b1	Add documentation on how to use debug tools (#4000)	2019-02-14 13:50:21 -08:00
Philipp Moritz	077ffd99bf	Bump version from 0.6.3 to 0.7.0.dev0 in docs and .yaml (#4042)	2019-02-14 12:08:48 -08:00
Yuhong Guo	4b0db437ee	Linting Bazel scripts (#4032) * Use buildifier as bazel script linter * Checkout golang version in travis * Using golang-1.8-go in travis * Add golang apt-repository * Fix the bazel lint failure example. * Address comment	2019-02-14 22:16:19 +08:00
Philipp Moritz	810cc17062	Fix LRU eviction of client notification datastructure (#4021) * convert notification_key map to C++ datastructure * fix crash and add debug string * clean notification map up (this was a bug before) * remove checks * add jenkins test * linting * fixes * properly erase * clean up * linting * Update test_wait_hanging.py * Update run_multi_node_tests.sh * increase redis_max_memory * fix dat jenkins * update * Update run_multi_node_tests.sh	2019-02-13 22:20:27 -08:00
Stephanie Wang	fd5b58a827	Increase timeout for object manager valgrind tests (#4027) * Avoid second copy of data for inlined objects * Increase Wait timeout for valgrind tests * Run object manager tests with and without inlined objects * Fix test	2019-02-13 18:29:03 -08:00
Wang Qing	1fb56a4316	Remove deprecated module (#4038)	2019-02-14 10:04:09 +08:00
Si-Yuan	2de31eb489	minor fix (#4040)	2019-02-13 17:22:45 -08:00
Eric Liang	2dccf383dd	[rllib] Basic infrastructure for off-policy estimation (IS, WIS) (#3941)	2019-02-13 16:25:05 -08:00
Kristian Hartikainen	729d0b2825	[autoscaler] docker run options (#3921) Adds support for docker options, allowing for use of nvidia-docker. Closes #2657.	2019-02-13 12:26:28 -08:00
Stephanie Wang	4347ab644e	Use Redis lists in the GCS instead of zset (#4023) * Convert zset to list * Remove object evictions map from the object directory, yay * comments * Fix tests	2019-02-13 10:32:57 -08:00
bjg2	0e37ac6d1d	[wingman -> rllib] Remote and entangled environments (#3968) * added all our environment changes * fixed merge request comments and remote env * fixed remote check * moved remote_worker_envs to correct config section * lint * auto wrap impl * fix * fixed the tests	2019-02-13 10:08:26 -08:00
Philipp Moritz	b3f72e8a75	Add regression tests for dataclass serialization (#3984)	2019-02-13 09:07:03 -08:00
Hao Chen	f31a79f3f7	Implement actor checkpointing (#3839) * Implement Actor checkpointing * docs * fix * fix * fix * move restore-from-checkpoint to HandleActorStateTransition * Revert "move restore-from-checkpoint to HandleActorStateTransition" This reverts commit 9aa4447c1e3e321f42a1d895d72f17098b72de12. * resubmit waiting tasks when actor frontier restored * add doc about num_actor_checkpoints_to_keep=1 * add num_actor_checkpoints_to_keep to Cython * add checkpoint_expired api * check if actor class is abstract * change checkpoint_ids to long string * implement java * Refactor to delay actor creation publish until checkpoint is resumed * debug, lint * Erase from checkpoints to restore if task fails * fix lint * update comments * avoid duplicated actor notification log * fix unintended change * add actor_id to checkpoint_expired * small java updates * make checkpoint info per actor * lint * Remove logging * Remove old actor checkpointing Python code, move new checkpointing code to FunctionActionManager * Replace old actor checkpointing tests * Fix test and lint * address comments * consolidate kill_actor * Remove __ray_checkpoint__ * fix non-ascii char * Loosen test checks * fix java * fix sphinx-build	2019-02-13 19:39:02 +08:00
Andrew Tan	57dcd3033e	[tune] Trial reporter fix (#3951) Fixes #3949.	2019-02-13 01:03:54 -08:00
Wang Qing	3a7fb182cc	Change the num of parallel jobs when building	2019-02-13 00:33:05 -08:00

2600 commits