Timon Ruban
2702b15b04
[tune] Add requirements-dev.txt and update docs for contributing ( #4925 )
...
* Add requirements-dev.txt and update docs.
* Update doc/source/tune-contrib.rst
Co-Authored-By: Richard Liaw <rliaw@berkeley.edu>
* Unpin everything except for yapf.
2019-06-05 09:04:36 -07:00
Hersh Godse
89722ff003
[tune] Directional metrics for components ( #4120 ) ( #4915 )
2019-06-02 22:13:40 -07:00
Eric Liang
7501ee51db
[rllib] Rename PolicyEvaluator => RolloutWorker ( #4820 )
2019-06-03 06:49:24 +08:00
Peter Schafhalter
c2ade075a3
[sgd] Distributed Training via PyTorch ( #4797 )
...
Implements distributed SGD using distributed PyTorch.
2019-06-01 21:39:22 -07:00
Eric Liang
9aa1cd613d
[rllib] Allow Torch policies access to full action input dict in extra_action_out_fn ( #4894 )
...
* fix torch extra out
* preserve setitem
* fix docs
2019-06-01 16:58:49 +08:00
Eric Liang
1c073e92e4
[rllib] Fix documentation on custom policies ( #4910 )
...
* wip
* add docs
* lint
* todo sections
* fix doc
2019-06-01 16:13:21 +08:00
Eric Liang
2dd0beb5bd
[rllib] Allow access to batches prior to postprocessing ( #4871 )
2019-05-29 18:17:14 -07:00
Eric Liang
a45c61e19b
[rllib] Update concepts docs and add "Building Policies in Torch/TensorFlow" section ( #4821 )
...
* wip
* fix index
* fix bugs
* todo
* add imports
* note on get ph
* note on get ph
* rename to building custom algs
* add rnn state info
2019-05-27 14:17:32 -07:00
Richard Liaw
574e1c7695
[tune] Fix up Ax Search and Examples ( #4851 )
...
* update Ax for cleaner API
* docs update
2019-05-27 13:23:17 -07:00
Robert Nishihara
6703519144
Move global state API out of global_state object. ( #4857 )
2019-05-26 11:27:53 -07:00
Devin Petersohn
a7d01aba9b
Update wheel versions in documentation to 0.8.0.dev0 and 0.7.0. ( #4847 )
2019-05-24 16:49:13 -07:00
Robert Nishihara
49fe894e22
Export remote functions when first used and also fix bug in which rem… ( #4844 )
...
* Export remote functions when first used and also fix bug in which remote functions and actor classes are not exported from workers during subsequent ray sessions.
* Documentation update
* Fix tests.
* Fix grammar
2019-05-24 13:44:39 -07:00
Eric Liang
02583a8598
[rllib] Rename PolicyGraph => Policy, move from evaluation/ to policy/ ( #4819 )
...
This implements some of the renames proposed in #4813
We leave behind backwards-compatibility aliases for *PolicyGraph and SampleBatch.
2019-05-20 16:46:05 -07:00
Philipp Moritz
84cf474abc
Change tagline in documentation and README. ( #4807 )
...
* Update README.rst, index.rst, tutorial.rst and _config.yml
2019-05-17 13:34:14 +08:00
Eric Liang
3807fb505b
[rllib] TensorFlow 2 compatibility ( #4802 )
2019-05-16 22:12:07 -07:00
Eric Liang
7d5ef6d99c
[rllib] Support continuous action distributions in IMPALA/APPO ( #4771 )
2019-05-16 22:05:07 -07:00
Adi Zimmerman
f3b8b9093d
Update tutorial link in doc ( #4777 )
2019-05-12 15:08:47 -07:00
Romil Bhardwaj
004440f526
Dynamic Custom Resources - create and delete resources ( #3742 )
2019-05-11 20:06:04 +08:00
Adi Zimmerman
28d381373d
[tune] Add Ax to Tune ( #4731 )
2019-05-08 15:54:29 -07:00
Eric Liang
71b2dec3b4
[rllib] Fix bounds of space returned by preprocessor.observation_space ( #4736 )
2019-05-05 18:25:38 -07:00
Si-Yuan
bd00735fe8
Fix tempfile issues ( #4605 )
2019-05-05 16:06:15 -07:00
Richard Liaw
f2faf5ce75
[tune] Contributor Guide and Design Page ( #4716 )
...
* Move setup script out
* some changes
* Finished Contributor guide
* some comments to the design
* move
* Apply suggestions from code review
Co-Authored-By: richardliaw <rliaw@berkeley.edu>
* sourcecode
* comments
2019-05-05 00:04:13 -07:00
Adi Zimmerman
36b71d1446
[Tune] Post-Experiment Tools ( #4351 )
2019-05-04 02:51:26 -04:00
William Ma
406c429384
[doc] Remove instructions for installing Python2 for webui. ( #4740 )
2019-05-02 22:35:23 -07:00
Andrew Tan
f87235f232
[tune] Example for Tune blog post ( #4673 )
2019-05-02 13:16:48 -04:00
Sam Toyer
663e92ab3f
[rllib] TD3/DDPG improvements and MuJoCo benchmarks ( #4694 )
...
* [rllib] Separate optimisers for DDPG actor & crit.
* [rllib] Better names for DDPG variables & options
Config changes:
- noise_scale -> exploration_ou_noise_scale
- exploration_theta -> exploration_ou_theta
- exploration_sigma -> exploration_ou_sigma
- act_noise -> exploration_gaussian_sigma
- noise_clip -> target_noise_clip
* [rllib] Make DDPG less class-y
Used functions to replace three classes with only an __init__ method & a
handful of unrelated attributes.
* [rllib] Refactor DDPG noise
* [rllib] Unify DDPG exploration annealing
Added option "exploration_should_anneal" to enable linear annealing of
exploration noise. By default this is off, for consistency with DDPG &
TD3 papers. Also renamed "exploration_final_eps" to
"exploration_final_scale" (that name seems to have been carried over
from DQN, and doesn't really make sense here). Finally, tried to rename
"eps" to "noise_scale" wherever possible.
2019-04-26 17:49:53 -07:00
Andy Jones
05c896d6f7
Adds a note on how to avoid contention when using PyTorch. ( #4692 )
2019-04-26 13:22:26 -07:00
Eric Liang
6848dfd179
[rllib] Replace ray.get() with ray_get_and_free() to optimize memory usage ( #4586 )
2019-04-17 20:30:03 -04:00
Eric Liang
6e7680bf21
[rllib] Clean up concepts documentation and policy optimizer creation ( #4592 )
2019-04-12 21:03:26 -07:00
Eric Liang
f8e8743347
[tune] Improve PBT example ( #4575 )
2019-04-09 20:59:17 -07:00
Eric Liang
4f46d3e9bf
[rllib] Add multi-agent examples for hand-coded policy, centralized VF ( #4554 )
2019-04-09 00:36:49 -07:00
Eric Liang
37208216ae
[rllib] Rename Agent to Trainer ( #4556 )
2019-04-07 00:36:18 -07:00
Andrew Tan
991b911e1d
[tune] Add --columns
flag for CLI ( #4564 )
2019-04-05 19:49:01 -07:00
Andrew Tan
bfd0af52bc
[tune] Add documentation to --output flag ( #4518 )
...
## What do these changes do?
Add documentation for the `--output` flag for ls / lsx in the Tune CLI.
## Related issue number
Closes #4511
## Linter
- [x] I've run `scripts/format.sh` to lint the changes in this PR.
2019-04-05 00:16:35 -07:00
Tasha Chin
5693cd1344
[docs] Show source code ( #3281 )
2019-04-03 21:30:20 -07:00
Yuhong Guo
c2349cf12d
Remove local/global_scheduler from code and doc. ( #4549 )
2019-04-03 17:05:09 -07:00
Adi Zimmerman
51dae23d5c
[tune] Search Alg delay import + CLI timing test ( #4230 )
2019-04-03 08:52:45 -07:00
Philipp Moritz
b0f6ddf6d1
Remove CMake files ( #4493 )
2019-04-02 22:17:33 -07:00
Eric Liang
fce0062380
[rllib] Switch to tune.run() instead of run_experiments() ( #4515 )
2019-03-30 14:07:50 -07:00
bjg2
77005d1814
[rllib] Make batch timeout for remote workers tunable ( #4435 )
2019-03-29 13:19:42 -07:00
Eric Liang
2871609296
[rllib] Report sampler performance metrics ( #4427 )
2019-03-27 13:24:23 -07:00
Andrew Tan
12db684f72
[tune] add filter flag for Tune CLI ( #4337 )
...
## What do these changes do?
Adds filter flag (--filter) to ls / lsx commands for Tune CLI.
Usage: `tune ls [path] --filter [column] [operator] [value]`
e.g. `tune lsx ~/ray_results/my_project --filter total_trials == 1`
2019-03-27 11:19:25 -07:00
Robert Nishihara
c6f12e5219
Update documentation from 0.7.0.dev1 to 0.7.0.dev2. ( #4485 )
2019-03-26 17:32:53 -07:00
Eric Liang
cff08e19ff
[rllib] Print out intermediate data shapes on the first iteration ( #4426 )
2019-03-26 00:27:59 -07:00
Eric Liang
5b8eb475ce
[rllib] Allow None to be specified in multi-agent envs ( #4464 )
...
* wip
* check
* doc update
* Update hierarchical_training.py
2019-03-25 11:38:17 -07:00
Eric Liang
4b8b703561
[rllib] Some API cleanups and documentation improvements ( #4409 )
2019-03-21 21:34:22 -07:00
Philipp Moritz
80ef8c19aa
Add initial news reader example ( #4348 )
2019-03-20 18:47:12 -07:00
Richard Liaw
ea5a6f8455
[tune] Simplify API ( #4234 )
...
Uses `tune.run` to execute experiments as preferred API.
@noahgolmant
This does not break backwards compat, but will slowly internalize `Experiment`.
In a separate PR, Tune schedulers should only support 1 running experiment at a time.
2019-03-17 13:03:32 -07:00
Philipp Moritz
c5e2c9af4d
Build wheels for macOS with Bazel ( #4280 )
2019-03-15 10:37:57 -07:00
Richard Liaw
6630a35353
[tune] Initial Commit for Tune CLI ( #3983 )
...
This introduces a light CLI for Tune.
2019-03-08 16:46:05 -08:00