Ujval Misra
a4659a8f8b
[tune] Add support for function-based stopping condition ( #5754 )
2019-09-23 18:39:00 -07:00
Eric Liang
249ca2cf9e
[rllib] add blog posts to examples list ( #5762 )
...
* add blog post
* remove
* link
2019-09-23 10:42:21 -07:00
Mitchell Stern
98dcc1d440
[Dashboard] Add initial version of new dashboard ( #5730 )
2019-09-23 08:50:40 -07:00
Robert Nishihara
c91a37f622
Set redis password in slurm deployment documentation. ( #5747 )
2019-09-21 15:33:15 -07:00
Vince Jankovics
7e214fd95e
[tune] TensorBoard HParams for TF2.0 ( #5678 )
2019-09-21 11:06:34 -07:00
gehring
8903bcd0c3
[rllib] Tracing for eager tensorflow policies with tf.function
( #5705 )
...
* Added tracing of eager policies with `tf.function`
* lint
* add config option
* add docs
* wip
* tracing now works with a3c
* typo
* none
* file doc
* returns
* syntax error
* syntax error
2019-09-17 01:44:20 -07:00
Robert Nishihara
f74aaf2619
Add more links for getting involved.git status ( #5708 )
2019-09-16 20:26:03 -07:00
Edward Oakes
ee5db5b67f
Raise error if space in redis password ( #5673 )
2019-09-11 20:58:39 -07:00
Eric Liang
bc6a95deb0
[rllib] Eager execution for centralized critic example, fix simple optimizer for multiagent ( #5683 )
2019-09-11 12:15:34 -07:00
Eric Liang
2fdefe19b7
Take into account queue length in autoscaling ( #5684 )
2019-09-11 11:31:35 -07:00
Eric Liang
74abeab057
[rllib] Improve accessing model state docs ( #5656 )
...
* [rllib] better model docs
* fix
* s
2019-09-08 23:01:26 -07:00
Eric Liang
1455a19c85
Consolidate and clean up documentation ( #5645 )
2019-09-07 11:50:18 -07:00
Stephanie Wang
edcc56ea01
Project fixes and cleanups ( #5632 )
2019-09-05 11:55:42 -07:00
Richard Liaw
34f6d2fc5c
[tune] Update trainable docs and support hparams ( #5558 )
2019-09-04 12:44:42 -07:00
jichan3751
1711e202a3
[training] Tensorflow interface for MultiNode SGD ( #5440 )
2019-09-03 15:35:42 -07:00
Richard Liaw
d37c09aac0
[docs] Add a feedback form ( #5610 )
...
* Create a feedback form
* feedback
* fix
* fix
* fix
* wording
2019-09-02 01:28:28 -07:00
Si-Yuan
4cccfcc967
Fix the rllib-stack image display problem ( #5612 )
2019-09-01 18:29:58 -07:00
Eric Liang
a101812b9f
Replace --redis-address with --address in test, docs, tune, rllib ( #5602 )
...
* wip
* add tests and tune
* add ci
* test fix
* lint
* fix tests
* wip
* sugar dep
2019-09-01 16:53:02 -07:00
Eric Liang
daf38c8723
[tune] Deprecate tune.function ( #5601 )
...
* remove tune function
* remove examples
* Update tune-usage.rst
2019-08-31 16:00:10 -07:00
Eric Liang
550c96b965
[rllib] Add docs on policy.model ( #5597 )
2019-08-30 21:10:42 -07:00
Robert Nishihara
93e103135b
Update doc versions from 0.8.0.dev3 to 0.8.0.dev4. ( #5585 )
2019-08-29 22:42:57 -07:00
Eric Liang
fe5bd09b46
Fix rllib image in readme and doc typo ( #5579 )
...
* fix
* rlllig
2019-08-29 16:02:16 -07:00
Richard Liaw
fb40787603
[docs] Distributed Training Quickfix ( #5571 )
2019-08-29 15:38:43 -07:00
Richard Liaw
411f30c125
[docs] Second push of changes ( #5391 )
2019-08-28 17:54:15 -07:00
Philipp Moritz
f1dcce5a47
[projects] Add named commands to sessions ( #5525 )
2019-08-26 14:16:17 -07:00
Eric Liang
7d28bbbdbb
[rllib] Document on traj postprocess ( #5532 )
...
* document on traj postprocess
* shorten it
2019-08-24 20:37:45 -07:00
Eric Liang
28623d2eac
Add docs for memory quota settings ( #5441 )
2019-08-24 18:15:16 -07:00
gehring
b520f6141e
[rllib] Adds eager support with a generic TFEagerPolicy
class ( #5436 )
2019-08-23 14:21:11 +08:00
Richard Liaw
cdc9227f1b
[tune] ASHA xgboost and lightgbm examples ( #5500 )
2019-08-22 10:37:59 -07:00
Philipp Moritz
c852213b83
[projects] Project examples and documentation ( #5407 )
2019-08-20 20:49:15 -07:00
Olli Huotari
0916603e61
Fixed few broken links in docs ( #5477 )
...
* hyperband link changed
* tuned_examples link fix
* doc lstm link fix
* kubernetes example link fix
2019-08-19 14:22:25 -07:00
Greg Schwartz
9d7e8c1173
[docs] Added Instructions for Slurm ( #5467 )
...
* Added Instructions for Slurm
Made in response to #826
2019-08-18 21:46:26 -07:00
Richard Liaw
d7b309223b
[tune] MLFlow Logger ( #5438 )
2019-08-14 15:58:18 -07:00
Lisa Dunlap
b7d0733362
[tune] Implement BOHB ( #5382 )
2019-08-13 12:32:07 -07:00
Eric Liang
79949fb8a0
[rllib] RLlib in 60 seconds documentation ( #5430 )
2019-08-12 17:39:02 -07:00
Adam Ochayon
158567b952
Rename function to make actor example correct ( #5432 )
2019-08-12 01:16:16 -07:00
Eric Liang
a1d2e17623
[rllib] Autoregressive action distributions ( #5304 )
2019-08-10 14:05:12 -07:00
Simon Mo
18f1e904de
Bump 0.8.0.dev2 -> 0.8.0.dev3 ( #5409 )
2019-08-09 11:37:19 -07:00
Eric Liang
1a8fa5d2fa
Clean up top level Ray dir ( #5404 )
2019-08-08 23:35:55 -07:00
Eric Liang
592f313210
[rllib] Centralized critic / PPO example on TwoStepGame ( #5392 )
2019-08-08 14:03:28 -07:00
Richard Liaw
ed89897a31
[tune,autoscaler] Test yaml, add better distributed docs ( #5403 )
2019-08-08 00:59:23 -07:00
Eric Liang
7d747da420
[rllib] [docs] Add some architecture diagrams ( #5390 )
2019-08-06 20:14:57 -07:00
Rehan Sohail Durrani
d2e8331d9a
[docs] remove table from walkthrough ( #5389 )
2019-08-06 17:29:48 -07:00
Wonseok Jeon
281829e712
MADDPG implementation in RLlib ( #5348 )
2019-08-06 16:22:06 -07:00
Richard Liaw
94bff244e4
[docs] Hotfix for removing unneeded files ( #5383 )
2019-08-06 08:46:59 -07:00
Richard Liaw
a08ea09760
[docs] rewrite ( #5175 )
2019-08-05 23:33:14 -07:00
Eric Liang
5d7afe8092
[rllib] Try moving RLlib to top level dir ( #5324 )
2019-08-05 23:25:49 -07:00
Eric Liang
955154a19d
Reduce Ray / RLlib startup messages ( #5368 )
2019-08-05 13:23:54 -07:00
Richard Liaw
1eaa57c98f
[tune] Distributed example + walkthrough ( #5157 )
2019-08-02 09:17:20 -07:00
Kristian Hartikainen
13fb9fe3db
[rllib] Feature/soft actor critic v2 ( #5328 )
...
* Add base for Soft Actor-Critic
* Pick changes from old SAC branch
* Update sac.py
* First implementation of sac model
* Remove unnecessary SAC imports
* Prune unnecessary noise and exploration code
* Implement SAC model and use that in SAC policy
* runs but doesn't learn
* clear state
* fix batch size
* Add missing alpha grads and vars
* -200 by 2k timesteps
* doc
* lazy squash
* one file
* ignore tfp
* revert done
2019-08-01 23:37:36 -07:00