Philipp Moritz
a64e32c53b
[docs] Fix broken links in documentation and add linkcheck to documentation ( #20030 )
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-11-04 13:19:43 -07:00
Will Drevo
97f04b118d
[RLlib; Docs] Added fixes to CartPole example. ( #19908 )
* Added fixes to CartPole example
* Apply suggestions from code review
Co-authored-by: will <will@anyscale.com>
Co-authored-by: Sven Mika <sven@anyscale.io>
2021-11-02 10:06:39 +01:00
Philipp Moritz
0a5942d8b0
[Documentation] Fix quotes for windows installations ( #19859 )
* [Documentation] Fix quotes for windows installations
* update
* formatting
2021-10-29 10:54:38 -07:00
gjoliver
1dbe7fc26a
[RLlib] Config dict should use true instead of True in docs/examples. ( #17889 )
2021-08-17 11:46:10 +02:00
Sven Mika
e61922c4ac
[RLlib] Add one-liner to docs on internship/RL-engineer position. ( #16050 )
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-05-25 12:58:54 -07:00
Sven Mika
4e9555cad3
[RLlib] Issue 15724: Breaking example script in docs due to outdated eager config flag (use framework='tf2|tfe' instead). ( #15736 )
2021-05-18 11:34:46 +02:00
Richard Liaw
6c77aeb98a
[docs] ray slack remove banners ( #13898 )
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
2021-02-04 01:14:34 -08:00
Sven Mika
391cdfae8c
[RLlib] Trajectory view API docs. ( #12718 )
2020-12-30 17:32:21 -08:00
Eric Squires
9f70293700
Remove debug extras from setup.py ( #12751 )
2020-12-10 16:23:11 -06:00
Simon Mo
5a38a76c83
[Doc] Use sphinx_book_theme ( #10379 )
2020-09-08 16:25:23 -07:00
architkulkarni
a3a9421787
added single quotes in pip install 'ray[rllib]'
2020-08-19 15:34:49 -07:00
Eric Liang
668f555755
[rllib] Clean up outdated docs ( #9915 )
2020-08-06 18:29:04 -07:00
Sven Mika
4da0e542d5
[RLlib] DDPG and SAC eager support (preparation for tf2.x) ( #9204 )
2020-07-08 16:12:20 +02:00
Sven Mika
2746fc0476
[RLlib] Auto-framework, retire use_pytorch in favor of framework=... ( #8520 )
2020-05-27 16:19:13 +02:00
Bill Chambers
b3d686b78f
[docs] Add Overview Section & Gentle Introduction ( #8517 )
2020-05-26 10:39:34 -05:00
Eric Liang
9a83908c46
[rllib] Deprecate policy optimizers ( #8345 )
2020-05-21 10:16:18 -07:00
Eric Liang
eabb801a40
less important ( #8439 )
2020-05-13 22:52:38 -07:00
Jason McGhee
24ced808cd
Fix config key in docs for using PyTorch ( #8300 )
Docs improperly suggest using "torch" when the actual flag is called "use_pytorch"
2020-05-11 12:41:21 -07:00
Richard Liaw
314250d072
[docs] Make Ray slack more prominent ( #7870 )
2020-04-02 11:14:02 -07:00
Eric Liang
5cebee68d6
[rllib] Add scaling guide to documentation, improve bandit docs ( #7780 )
* update
* reword
* update
* ms
* multi node sgd
* reorder
* improve bandit docs
* contrib
* update
* ref
* improve refs
* fix build
* add pillow dep
* add pil
* update pil
* pillow
* remove false
2020-03-27 22:05:43 -07:00
Eric Liang
9392cdbf74
[rllib] Add high-performance external application connector ( #7641 )
2020-03-20 12:43:57 -07:00
Eric Liang
dd70720578
[rllib] Rename sample_batch_size => rollout_fragment_length ( #7503 )
* bulk rename
* deprecation warn
* update doc
* update fig
* line length
* rename
* make pytest compatible
* fix test
* fix sys
* rename
* wip
* fix more
* lint
* update svg
* comments
* lint
* fix use of batch steps
2020-03-14 12:05:04 -07:00
Eric Liang
026f6884b5
[rllib] Add Decentralized DDPPO trainer and documentation ( #7088 )
2020-02-10 15:28:27 -08:00
Eric Liang
dc7e78deb4
[rllib] Add -v and --torch flags to first page of docs ( #7032 )
* add verbose doc
* torch
2020-02-04 10:17:51 -08:00
Eric Liang
6bb30c9f1b
fix links ( #6883 )
2020-01-22 01:06:07 -08:00
Eric Liang
14016535a5
[rllib] Add TF and Torch icons to show which are available for each algo ( #6869 )
2020-01-20 15:22:21 -08:00
Eric Liang
5ecb02fb80
Release 0.7.5 updates ( #5727 )
2019-09-26 10:30:37 -07:00
Eric Liang
249ca2cf9e
[rllib] add blog posts to examples list ( #5762 )
* add blog post
* remove
* link
2019-09-23 10:42:21 -07:00
gehring
8903bcd0c3
[rllib] Tracing for eager tensorflow policies with tf.function ( #5705 )
* Added tracing of eager policies with `tf.function`
* lint
* add config option
* add docs
* wip
* tracing now works with a3c
* typo
* none
* file doc
* returns
* syntax error
* syntax error
2019-09-17 01:44:20 -07:00
Eric Liang
fe5bd09b46
Fix rllib image in readme and doc typo ( #5579 )
* fix
* rllib
2019-08-29 16:02:16 -07:00
gehring
b520f6141e
[rllib] Adds eager support with a generic TFEagerPolicy class ( #5436 )
2019-08-23 14:21:11 +08:00
Eric Liang
79949fb8a0
[rllib] RLlib in 60 seconds documentation ( #5430 )
2019-08-12 17:39:02 -07:00
Eric Liang
a1d2e17623
[rllib] Autoregressive action distributions ( #5304 )
2019-08-10 14:05:12 -07:00
Wonseok Jeon
281829e712
MADDPG implementation in RLlib ( #5348 )
2019-08-06 16:22:06 -07:00
Eric Liang
5d7afe8092
[rllib] Try moving RLlib to top level dir ( #5324 )
2019-08-05 23:25:49 -07:00
Kristian Hartikainen
13fb9fe3db
[rllib] Feature/soft actor critic v2 ( #5328 )
* Add base for Soft Actor-Critic
* Pick changes from old SAC branch
* Update sac.py
* First implementation of sac model
* Remove unnecessary SAC imports
* Prune unnecessary noise and exploration code
* Implement SAC model and use that in SAC policy
* runs but doesn't learn
* clear state
* fix batch size
* Add missing alpha grads and vars
* -200 by 2k timesteps
* doc
* lazy squash
* one file
* ignore tfp
* revert done
2019-08-01 23:37:36 -07:00
Eric Liang
20450a4e82
[rllib] Add rock paper scissors multi-agent example ( #5336 )
2019-08-01 13:03:59 -07:00
Eric Liang
a62c5f40f6
[rllib] Document ModelV2 and clean up the models/ directory ( #5277 )
2019-07-27 02:08:16 -07:00
Eric Liang
9e328fbe6f
[rllib] Add docs on how to use TF eager execution ( #4927 )
2019-06-07 16:42:37 -07:00
Eric Liang
1c073e92e4
[rllib] Fix documentation on custom policies ( #4910 )
* wip
* add docs
* lint
* todo sections
* fix doc
2019-06-01 16:13:21 +08:00
Eric Liang
a45c61e19b
[rllib] Update concepts docs and add "Building Policies in Torch/TensorFlow" section ( #4821 )
* wip
* fix index
* fix bugs
* todo
* add imports
* note on get ph
* note on get ph
* rename to building custom algs
* add rnn state info
2019-05-27 14:17:32 -07:00
Eric Liang
02583a8598
[rllib] Rename PolicyGraph => Policy, move from evaluation/ to policy/ ( #4819 )
This implements some of the renames proposed in #4813.
We leave behind backwards-compatibility aliases for *PolicyGraph and SampleBatch.
2019-05-20 16:46:05 -07:00
Eric Liang
6e7680bf21
[rllib] Clean up concepts documentation and policy optimizer creation ( #4592 )
2019-04-12 21:03:26 -07:00
Eric Liang
37208216ae
[rllib] Rename Agent to Trainer ( #4556 )
2019-04-07 00:36:18 -07:00
Eric Liang
2871609296
[rllib] Report sampler performance metrics ( #4427 )
2019-03-27 13:24:23 -07:00
Eric Liang
4b8b703561
[rllib] Some API cleanups and documentation improvements ( #4409 )
2019-03-21 21:34:22 -07:00
Eric Liang
c7f74dbdc7
[rllib] Add async remote workers ( #4253 )
2019-03-08 15:39:48 -08:00
Eric Liang
78ad9c4cbb
Add "ray timeline" command to auto-dump Chrome trace for the current Ray instance ( #4239 )
2019-03-05 16:28:00 -08:00
Eric Liang
3896b726dd
Dynamically adjust redis memory usage ( #4152 )
* f
* Update services.py
2019-02-25 16:21:37 -08:00
Eric Liang
d9da183c7d
[rllib] Custom supervised loss API ( #4083 )
2019-02-24 15:36:13 -08:00