hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

Author	SHA1	Message	Date
Eric Liang	955154a19d	Reduce Ray / RLlib startup messages (#5368)	2019-08-05 13:23:54 -07:00
Richard Liaw	1eaa57c98f	[tune] Distributed example + walkthrough (#5157)	2019-08-02 09:17:20 -07:00
Kristian Hartikainen	13fb9fe3db	[rllib] Feature/soft actor critic v2 (#5328) * Add base for Soft Actor-Critic * Pick changes from old SAC branch * Update sac.py * First implementation of sac model * Remove unnecessary SAC imports * Prune unnecessary noise and exploration code * Implement SAC model and use that in SAC policy * runs but doesn't learn * clear state * fix batch size * Add missing alpha grads and vars * -200 by 2k timesteps * doc * lazy squash * one file * ignore tfp * revert done	2019-08-01 23:37:36 -07:00
Eric Liang	20450a4e82	[rllib] Add rock paper scissors multi-agent example (#5336)	2019-08-01 13:03:59 -07:00
Samir Al-Stouhi	51b8915c0a	Added CARLA Community Example (#5333)	2019-07-31 18:10:50 -07:00
Eric Liang	a62c5f40f6	[rllib] Document ModelV2 and clean up the models/ directory (#5277)	2019-07-27 02:08:16 -07:00
Eric Liang	bf9199ad77	[rllib] ModelV2 support for pytorch (#5249)	2019-07-25 11:02:53 -07:00
Peter Schafhalter	fc589050c9	[sgd] Deprecate old distributed SGD implementation (#5160) * Deprecate old distributed SGD implementation * Update README	2019-07-22 15:47:10 -07:00
Vince Jankovics	80b976efcb	Ray namespace added for k8s (#4111) * Ray namespace added for k8s * Submit.yaml update with k8s namespace * K8s deployment doc update with namespace	2019-07-22 15:45:05 -07:00
Eric Liang	f9043cc49a	[rllib] Remove experimental eager support	2019-07-21 12:27:17 -07:00
Eric Liang	047f4ccd61	[rllib] Fix rollout.py with tuple action space (#5201) * fix it * update doc too * fix rollout	2019-07-16 10:52:35 -07:00
Richard Liaw	b6509f46b0	Update wheels to 0.8.0dev2 (#5186)	2019-07-12 17:27:03 -07:00
Kai Yang	43b6513d19	[GCS] Move node resource info from client table to resource table (#5050)	2019-07-11 13:17:19 +08:00
Richard Liaw	0b540ab492	[tune] Test example checkpointing (#4728)	2019-07-10 01:58:26 -07:00
Eric Liang	34d054ff19	[rllib] ModelV2 API (#4926)	2019-07-03 15:59:47 -07:00
Kristian Hartikainen	9e0192bc0b	[tune] Change the log syncing behavior (#4450 ) * Change the log syncing behavior * fix up abstractions for syncer * Finished checkpoint syncing * Code * Set of changes to get things running * Fixes for log syncing * Fix parts * Lint and other fixes * fix some test * Remove extra parsing functionality * some test fixes * Fix up cloud syncing * Another thing to do * Fix up tests and local sync Changes LogSync into a mixin, and adds tests for different functionalities. * Fix up tests, start on local migration * fix distributed migrations * comments * formatting * Better checkpoint directory handling * fix tests * fix tests * fix click * comments * formatting comments * formatting and comments * sync function deprecations * syncfunction * Add documentation for Syncing and Uploading * nit * BaseSyncer as base for Mixin in edge case * more docs * clean up assertions * validate * nit * Update test_cluster.py * betterdoc * Update tune-usage.rst * cleanup * nit	2019-07-02 20:46:00 -07:00
Hao Chen	0131353d42	[gRPC] Migrate gcs data structures to protobuf (#5024)	2019-06-25 14:31:19 -07:00
Ashwinee Panda	11ccf66346	[docs] docs for running Tensorboard without sudo (#5015) * Instructions for running Tensorboard without sudo When we run Tensorboard to visualize the results of Ray outputs on multi-user clusters where we don't have sudo access, such as RISE clusters, a few commands need to first be run to make sure tensorboard can edit the tmp directory. This is a pretty common usecase so I figured we may as well put it in the documentation for Tune. * Update tune-usage.rst	2019-06-24 11:26:53 -07:00
Tianhong Dai	1b86e551fb	Fix bugs in the a3c code template. (#4984)	2019-06-14 17:22:36 -07:00
Eric Liang	77689d1116	[rllib] Port remainder of algorithms to build_trainer() pattern (#4920)	2019-06-07 16:45:36 -07:00
Eric Liang	9e328fbe6f	[rllib] Add docs on how to use TF eager execution (#4927)	2019-06-07 16:42:37 -07:00
Yuhong Guo	5eff47b657	[C++] Add hash table to Redis-Module (#4911)	2019-06-07 16:11:37 +08:00
Stephanie Wang	cbc67fc750	[doc] Update developer docs with bazel instructions (#4944)	2019-06-06 18:18:24 -07:00
Robert Nishihara	c3f8fc1c44	Update version number in documentation after release 0.7.0 -> 0.7.1 and 0.8.0.dev0 -> 0.8.0.dev1. (#4941)	2019-06-06 17:22:45 -07:00
Timon Ruban	2702b15b04	[tune] Add requirements-dev.txt and update docs for contributing (#4925) * Add requirements-dev.txt and update docs. * Update doc/source/tune-contrib.rst Co-Authored-By: Richard Liaw <rliaw@berkeley.edu> * Unpin everything except for yapf.	2019-06-05 09:04:36 -07:00
Hersh Godse	89722ff003	[tune] Directional metrics for components (#4120) (#4915)	2019-06-02 22:13:40 -07:00
Eric Liang	7501ee51db	[rllib] Rename PolicyEvaluator => RolloutWorker (#4820)	2019-06-03 06:49:24 +08:00
Peter Schafhalter	c2ade075a3	[sgd] Distributed Training via PyTorch (#4797) Implements distributed SGD using distributed PyTorch.	2019-06-01 21:39:22 -07:00
Eric Liang	9aa1cd613d	[rllib] Allow Torch policies access to full action input dict in extra_action_out_fn (#4894) * fix torch extra out * preserve setitem * fix docs	2019-06-01 16:58:49 +08:00
Eric Liang	1c073e92e4	[rllib] Fix documentation on custom policies (#4910) * wip * add docs * lint * todo sections * fix doc	2019-06-01 16:13:21 +08:00
Eric Liang	2dd0beb5bd	[rllib] Allow access to batches prior to postprocessing (#4871)	2019-05-29 18:17:14 -07:00
Eric Liang	a45c61e19b	[rllib] Update concepts docs and add "Building Policies in Torch/TensorFlow" section (#4821) * wip * fix index * fix bugs * todo * add imports * note on get ph * note on get ph * rename to building custom algs * add rnn state info	2019-05-27 14:17:32 -07:00
Richard Liaw	574e1c7695	[tune] Fix up Ax Search and Examples (#4851) * update Ax for cleaner API * docs update	2019-05-27 13:23:17 -07:00
Robert Nishihara	6703519144	Move global state API out of global_state object. (#4857)	2019-05-26 11:27:53 -07:00
Devin Petersohn	a7d01aba9b	Update wheel versions in documentation to 0.8.0.dev0 and 0.7.0. (#4847)	2019-05-24 16:49:13 -07:00
Robert Nishihara	49fe894e22	Export remote functions when first used and also fix bug in which rem… (#4844) * Export remote functions when first used and also fix bug in which remote functions and actor classes are not exported from workers during subsequent ray sessions. * Documentation update * Fix tests. * Fix grammar	2019-05-24 13:44:39 -07:00
Eric Liang	02583a8598	[rllib] Rename PolicyGraph => Policy, move from evaluation/ to policy/ (#4819) This implements some of the renames proposed in #4813 We leave behind backwards-compatibility aliases for *PolicyGraph and SampleBatch.	2019-05-20 16:46:05 -07:00
Philipp Moritz	84cf474abc	Change tagline in documentation and README. (#4807) * Update README.rst, index.rst, tutorial.rst and _config.yml	2019-05-17 13:34:14 +08:00
Eric Liang	3807fb505b	[rllib] TensorFlow 2 compatibility (#4802)	2019-05-16 22:12:07 -07:00
Eric Liang	7d5ef6d99c	[rllib] Support continuous action distributions in IMPALA/APPO (#4771)	2019-05-16 22:05:07 -07:00
Adi Zimmerman	f3b8b9093d	Update tutorial link in doc (#4777)	2019-05-12 15:08:47 -07:00
Romil Bhardwaj	004440f526	Dynamic Custom Resources - create and delete resources (#3742)	2019-05-11 20:06:04 +08:00
Adi Zimmerman	28d381373d	[tune] Add Ax to Tune (#4731)	2019-05-08 15:54:29 -07:00
Eric Liang	71b2dec3b4	[rllib] Fix bounds of space returned by preprocessor.observation_space (#4736)	2019-05-05 18:25:38 -07:00
Si-Yuan	bd00735fe8	Fix tempfile issues (#4605)	2019-05-05 16:06:15 -07:00
Richard Liaw	f2faf5ce75	[tune] Contributor Guide and Design Page (#4716) * Move setup script out * some changes * Finished Contributor guide * some comments to the design * move * Apply suggestions from code review Co-Authored-By: richardliaw <rliaw@berkeley.edu> * sourcecode * comments	2019-05-05 00:04:13 -07:00
Adi Zimmerman	36b71d1446	[Tune] Post-Experiment Tools (#4351)	2019-05-04 02:51:26 -04:00
William Ma	406c429384	[doc] Remove instructions for installing Python2 for webui. (#4740)	2019-05-02 22:35:23 -07:00
Andrew Tan	f87235f232	[tune] Example for Tune blog post (#4673)	2019-05-02 13:16:48 -04:00
Sam Toyer	663e92ab3f	[rllib] TD3/DDPG improvements and MuJoCo benchmarks (#4694 ) * [rllib] Separate optimisers for DDPG actor & crit. * [rllib] Better names for DDPG variables & options Config changes: - noise_scale -> exploration_ou_noise_scale - exploration_theta -> exploration_ou_theta - exploration_sigma -> exploration_ou_sigma - act_noise -> exploration_gaussian_sigma - noise_clip -> target_noise_clip * [rllib] Make DDPG less class-y Used functions to replace three classes with only an __init__ method & a handful of unrelated attributes. * [rllib] Refactor DDPG noise * [rllib] Unify DDPG exploration annealing Added option "exploration_should_anneal" to enable linear annealing of exploration noise. By default this is off, for consistency with DDPG & TD3 papers. Also renamed "exploration_final_eps" to "exploration_final_scale" (that name seems to have been carried over from DQN, and doesn't really make sense here). Finally, tried to rename "eps" to "noise_scale" wherever possible.	2019-04-26 17:49:53 -07:00
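
The config renames listed in commit 663e92ab3f (#4694) can be applied to an existing DDPG/TD3 config dict with a small helper. This is only a sketch: `RENAMED_KEYS` copies the old and new key names from that commit message, while `migrate_ddpg_config` is a hypothetical function written for illustration and is not part of RLlib.

```python
# Old -> new DDPG/TD3 config keys, copied from the message of commit
# 663e92ab3f (#4694). Nothing here is an RLlib API.
RENAMED_KEYS = {
    "noise_scale": "exploration_ou_noise_scale",
    "exploration_theta": "exploration_ou_theta",
    "exploration_sigma": "exploration_ou_sigma",
    "act_noise": "exploration_gaussian_sigma",
    "noise_clip": "target_noise_clip",
    "exploration_final_eps": "exploration_final_scale",
}


def migrate_ddpg_config(old_config):
    """Return a copy of old_config with renamed keys (hypothetical helper)."""
    new_config = {}
    for key, value in old_config.items():
        # Keys that were not renamed pass through unchanged.
        new_key = RENAMED_KEYS.get(key, key)
        if new_key in new_config:
            # Both the old and the new spelling were supplied; refuse to guess.
            raise ValueError("conflicting values for config key %r" % new_key)
        new_config[new_key] = value
    return new_config
```

For example, `migrate_ddpg_config({"noise_clip": 0.5, "gamma": 0.99})` returns a dict with `target_noise_clip` in place of `noise_clip` and leaves `gamma` untouched. Only key names are translated; whether a value needs adjusting after the rename is not something the commit message states.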

497 commits