Rehan Sohail Durrani
d2e8331d9a
[docs] remove table from walkthrough ( #5389 )
2019-08-06 17:29:48 -07:00
Wonseok Jeon
281829e712
MADDPG implementation in RLlib ( #5348 )
2019-08-06 16:22:06 -07:00
Richard Liaw
94bff244e4
[docs] Hotfix for removing unneeded files ( #5383 )
2019-08-06 08:46:59 -07:00
Richard Liaw
a08ea09760
[docs] rewrite ( #5175 )
2019-08-05 23:33:14 -07:00
Eric Liang
5d7afe8092
[rllib] Try moving RLlib to top level dir ( #5324 )
2019-08-05 23:25:49 -07:00
Eric Liang
955154a19d
Reduce Ray / RLlib startup messages ( #5368 )
2019-08-05 13:23:54 -07:00
Richard Liaw
1eaa57c98f
[tune] Distributed example + walkthrough ( #5157 )
2019-08-02 09:17:20 -07:00
Kristian Hartikainen
13fb9fe3db
[rllib] Feature/soft actor critic v2 ( #5328 )
...
* Add base for Soft Actor-Critic
* Pick changes from old SAC branch
* Update sac.py
* First implementation of sac model
* Remove unnecessary SAC imports
* Prune unnecessary noise and exploration code
* Implement SAC model and use that in SAC policy
* runs but doesn't learn
* clear state
* fix batch size
* Add missing alpha grads and vars
* -200 by 2k timesteps
* doc
* lazy squash
* one file
* ignore tfp
* revert done
2019-08-01 23:37:36 -07:00
Eric Liang
20450a4e82
[rllib] Add rock paper scissors multi-agent example ( #5336 )
2019-08-01 13:03:59 -07:00
Samir Al-Stouhi
51b8915c0a
Added CARLA Community Example ( #5333 )
2019-07-31 18:10:50 -07:00
Eric Liang
a62c5f40f6
[rllib] Document ModelV2 and clean up the models/ directory ( #5277 )
2019-07-27 02:08:16 -07:00
Eric Liang
bf9199ad77
[rllib] ModelV2 support for pytorch ( #5249 )
2019-07-25 11:02:53 -07:00
Peter Schafhalter
fc589050c9
[sgd] Deprecate old distributed SGD implementation ( #5160 )
...
* Deprecate old distributed SGD implementation
* Update README
2019-07-22 15:47:10 -07:00
Vince Jankovics
80b976efcb
Ray namespace added for k8s ( #4111 )
...
* Ray namespace added for k8s
* Submit.yaml update with k8s namespace
* K8s deployment doc update with namespace
2019-07-22 15:45:05 -07:00
Eric Liang
f9043cc49a
[rllib] Remove experimental eager support
2019-07-21 12:27:17 -07:00
Eric Liang
047f4ccd61
[rllib] Fix rollout.py with tuple action space ( #5201 )
...
* fix it
* update doc too
* fix rollout
2019-07-16 10:52:35 -07:00
Richard Liaw
b6509f46b0
Update wheels to 0.8.0dev2 ( #5186 )
2019-07-12 17:27:03 -07:00
Kai Yang
43b6513d19
[GCS] Move node resource info from client table to resource table ( #5050 )
2019-07-11 13:17:19 +08:00
Richard Liaw
0b540ab492
[tune] Test example checkpointing ( #4728 )
2019-07-10 01:58:26 -07:00
Eric Liang
34d054ff19
[rllib] ModelV2 API ( #4926 )
2019-07-03 15:59:47 -07:00
Kristian Hartikainen
9e0192bc0b
[tune] Change the log syncing behavior ( #4450 )
...
* Change the log syncing behavior
* fix up abstractions for syncer
* Finished checkpoint syncing
* Code
* Set of changes to get things running
* Fixes for log syncing
* Fix parts
* Lint and other fixes
* fix some test
* Remove extra parsing functionality
* some test fixes
* Fix up cloud syncing
* Another thing to do
* Fix up tests and local sync
Changes LogSync into a mixin, and adds tests for different
functionalities.
* Fix up tests, start on local migration
* fix distributed migrations
* comments
* formatting
* Better checkpoint directory handling
* fix tests
* fix tests
* fix click
* comments
* formatting comments
* formatting and comments
* sync function deprecations
* syncfunction
* Add documentation for Syncing and Uploading
* nit
* BaseSyncer as base for Mixin in edge case
* more docs
* clean up assertions
* validate
* nit
* Update test_cluster.py
* betterdoc
* Update tune-usage.rst
* cleanup
* nit
2019-07-02 20:46:00 -07:00
Hao Chen
0131353d42
[gRPC] Migrate gcs data structures to protobuf ( #5024 )
2019-06-25 14:31:19 -07:00
Ashwinee Panda
11ccf66346
[docs] docs for running Tensorboard without sudo ( #5015 )
...
* Instructions for running Tensorboard without sudo
When we run Tensorboard to visualize the results of Ray outputs on multi-user clusters where we don't have sudo access, such as RISE clusters, a few commands need to first be run to make sure tensorboard can edit the tmp directory. This is a pretty common usecase so I figured we may as well put it in the documentation for Tune.
* Update tune-usage.rst
2019-06-24 11:26:53 -07:00
Tianhong Dai
1b86e551fb
Fix bugs in the a3c code template. ( #4984 )
2019-06-14 17:22:36 -07:00
Eric Liang
77689d1116
[rllib] Port remainder of algorithms to build_trainer() pattern ( #4920 )
2019-06-07 16:45:36 -07:00
Eric Liang
9e328fbe6f
[rllib] Add docs on how to use TF eager execution ( #4927 )
2019-06-07 16:42:37 -07:00
Yuhong Guo
5eff47b657
[C++] Add hash table to Redis-Module ( #4911 )
2019-06-07 16:11:37 +08:00
Stephanie Wang
cbc67fc750
[doc] Update developer docs with bazel instructions ( #4944 )
2019-06-06 18:18:24 -07:00
Robert Nishihara
c3f8fc1c44
Update version number in documentation after release 0.7.0 -> 0.7.1 and 0.8.0.dev0 -> 0.8.0.dev1. ( #4941 )
2019-06-06 17:22:45 -07:00
Timon Ruban
2702b15b04
[tune] Add requirements-dev.txt and update docs for contributing ( #4925 )
...
* Add requirements-dev.txt and update docs.
* Update doc/source/tune-contrib.rst
Co-Authored-By: Richard Liaw <rliaw@berkeley.edu>
* Unpin everything except for yapf.
2019-06-05 09:04:36 -07:00
Hersh Godse
89722ff003
[tune] Directional metrics for components ( #4120 ) ( #4915 )
2019-06-02 22:13:40 -07:00
Eric Liang
7501ee51db
[rllib] Rename PolicyEvaluator => RolloutWorker ( #4820 )
2019-06-03 06:49:24 +08:00
Peter Schafhalter
c2ade075a3
[sgd] Distributed Training via PyTorch ( #4797 )
...
Implements distributed SGD using distributed PyTorch.
2019-06-01 21:39:22 -07:00
Eric Liang
9aa1cd613d
[rllib] Allow Torch policies access to full action input dict in extra_action_out_fn ( #4894 )
...
* fix torch extra out
* preserve setitem
* fix docs
2019-06-01 16:58:49 +08:00
Eric Liang
1c073e92e4
[rllib] Fix documentation on custom policies ( #4910 )
...
* wip
* add docs
* lint
* todo sections
* fix doc
2019-06-01 16:13:21 +08:00
Eric Liang
2dd0beb5bd
[rllib] Allow access to batches prior to postprocessing ( #4871 )
2019-05-29 18:17:14 -07:00
Eric Liang
a45c61e19b
[rllib] Update concepts docs and add "Building Policies in Torch/TensorFlow" section ( #4821 )
...
* wip
* fix index
* fix bugs
* todo
* add imports
* note on get ph
* note on get ph
* rename to building custom algs
* add rnn state info
2019-05-27 14:17:32 -07:00
Richard Liaw
574e1c7695
[tune] Fix up Ax Search and Examples ( #4851 )
...
* update Ax for cleaner API
* docs update
2019-05-27 13:23:17 -07:00
Robert Nishihara
6703519144
Move global state API out of global_state object. ( #4857 )
2019-05-26 11:27:53 -07:00
Devin Petersohn
a7d01aba9b
Update wheel versions in documentation to 0.8.0.dev0 and 0.7.0. ( #4847 )
2019-05-24 16:49:13 -07:00
Robert Nishihara
49fe894e22
Export remote functions when first used and also fix bug in which rem… ( #4844 )
...
* Export remote functions when first used and also fix bug in which remote functions and actor classes are not exported from workers during subsequent ray sessions.
* Documentation update
* Fix tests.
* Fix grammar
2019-05-24 13:44:39 -07:00
Eric Liang
02583a8598
[rllib] Rename PolicyGraph => Policy, move from evaluation/ to policy/ ( #4819 )
...
This implements some of the renames proposed in #4813
We leave behind backwards-compatibility aliases for *PolicyGraph and SampleBatch.
2019-05-20 16:46:05 -07:00
Philipp Moritz
84cf474abc
Change tagline in documentation and README. ( #4807 )
...
* Update README.rst, index.rst, tutorial.rst and _config.yml
2019-05-17 13:34:14 +08:00
Eric Liang
3807fb505b
[rllib] TensorFlow 2 compatibility ( #4802 )
2019-05-16 22:12:07 -07:00
Eric Liang
7d5ef6d99c
[rllib] Support continuous action distributions in IMPALA/APPO ( #4771 )
2019-05-16 22:05:07 -07:00
Adi Zimmerman
f3b8b9093d
Update tutorial link in doc ( #4777 )
2019-05-12 15:08:47 -07:00
Romil Bhardwaj
004440f526
Dynamic Custom Resources - create and delete resources ( #3742 )
2019-05-11 20:06:04 +08:00
Adi Zimmerman
28d381373d
[tune] Add Ax to Tune ( #4731 )
2019-05-08 15:54:29 -07:00
Eric Liang
71b2dec3b4
[rllib] Fix bounds of space returned by preprocessor.observation_space ( #4736 )
2019-05-05 18:25:38 -07:00
Si-Yuan
bd00735fe8
Fix tempfile issues ( #4605 )
2019-05-05 16:06:15 -07:00