Eric Liang
4f46d3e9bf
[rllib] Add multi-agent examples for hand-coded policy, centralized VF ( #4554 )
2019-04-09 00:36:49 -07:00
Eric Liang
37208216ae
[rllib] Rename Agent to Trainer ( #4556 )
2019-04-07 00:36:18 -07:00
Andrew Tan
991b911e1d
[tune] Add --columns
flag for CLI ( #4564 )
2019-04-05 19:49:01 -07:00
Andrew Tan
bfd0af52bc
[tune] Add documentation to --output flag ( #4518 )
...
## What do these changes do?
Add documentation for the `--output` flag for ls / lsx in the Tune CLI.
## Related issue number
Closes #4511
## Linter
- [x] I've run `scripts/format.sh` to lint the changes in this PR.
2019-04-05 00:16:35 -07:00
Tasha Chin
5693cd1344
[docs] Show source code ( #3281 )
2019-04-03 21:30:20 -07:00
Yuhong Guo
c2349cf12d
Remove local/global_scheduler from code and doc. ( #4549 )
2019-04-03 17:05:09 -07:00
Adi Zimmerman
51dae23d5c
[tune] Search Alg delay import + CLI timing test ( #4230 )
2019-04-03 08:52:45 -07:00
Philipp Moritz
b0f6ddf6d1
Remove CMake files ( #4493 )
2019-04-02 22:17:33 -07:00
Eric Liang
fce0062380
[rllib] Switch to tune.run() instead of run_experiments() ( #4515 )
2019-03-30 14:07:50 -07:00
bjg2
77005d1814
[rllib] Make batch timeout for remote workers tunable ( #4435 )
2019-03-29 13:19:42 -07:00
Eric Liang
2871609296
[rllib] Report sampler performance metrics ( #4427 )
2019-03-27 13:24:23 -07:00
Andrew Tan
12db684f72
[tune] add filter flag for Tune CLI ( #4337 )
...
## What do these changes do?
Adds filter flag (--filter) to ls / lsx commands for Tune CLI.
Usage: `tune ls [path] --filter [column] [operator] [value]`
e.g. `tune lsx ~/ray_results/my_project --filter total_trials == 1`
2019-03-27 11:19:25 -07:00
Robert Nishihara
c6f12e5219
Update documentation from 0.7.0.dev1 to 0.7.0.dev2. ( #4485 )
2019-03-26 17:32:53 -07:00
Eric Liang
cff08e19ff
[rllib] Print out intermediate data shapes on the first iteration ( #4426 )
2019-03-26 00:27:59 -07:00
Eric Liang
5b8eb475ce
[rllib] Allow None to be specified in multi-agent envs ( #4464 )
...
* wip
* check
* doc update
* Update hierarchical_training.py
2019-03-25 11:38:17 -07:00
Eric Liang
4b8b703561
[rllib] Some API cleanups and documentation improvements ( #4409 )
2019-03-21 21:34:22 -07:00
Philipp Moritz
80ef8c19aa
Add initial news reader example ( #4348 )
2019-03-20 18:47:12 -07:00
Richard Liaw
ea5a6f8455
[tune] Simplify API ( #4234 )
...
Uses `tune.run` to execute experiments as preferred API.
@noahgolmant
This does not break backwards compat, but will slowly internalize `Experiment`.
In a separate PR, Tune schedulers should only support 1 running experiment at a time.
2019-03-17 13:03:32 -07:00
Philipp Moritz
c5e2c9af4d
Build wheels for macOS with Bazel ( #4280 )
2019-03-15 10:37:57 -07:00
Richard Liaw
6630a35353
[tune] Initial Commit for Tune CLI ( #3983 )
...
This introduces a light CLI for Tune.
2019-03-08 16:46:05 -08:00
Eric Liang
c7f74dbdc7
[rllib] Add async remote workers ( #4253 )
2019-03-08 15:39:48 -08:00
Philipp Moritz
39eed24d47
update version from 0.7.0.dev0 to 0.7.0.dev1 ( #4282 )
2019-03-06 14:43:09 -08:00
Eric Liang
78ad9c4cbb
Add "ray timeline" command to auto-dump Chrome trace for the current Ray instance ( #4239 )
2019-03-05 16:28:00 -08:00
Adi Zimmerman
4cf2c9ecb8
[tune] Doc fixes ( #4207 )
...
Co-Authored-By: adizim <adizim@berkeley.edu>
2019-03-05 14:11:53 -08:00
Eric Liang
6e3384a719
[rllib] Add three new long-running stress tests {APEX, IMPALA, PBT} ( #4215 )
2019-03-04 14:05:42 -08:00
Eric Liang
ba03048254
[rllib] TF model custom_loss() should actually allow access to full rollout data ( #4220 )
2019-03-02 22:57:51 -08:00
Robert Nishihara
4b89eebfc7
Move test folders under rllib/tune from test -> tests. ( #4214 )
2019-03-02 13:37:16 -08:00
Robert Nishihara
f21e6a2cff
Update documentation regarding UI and timeline. ( #4189 )
2019-03-01 19:54:33 -08:00
Richard Liaw
c695402dc3
[tune] Introduce ability to turn off default logging. ( #4104 )
2019-02-28 17:02:41 -08:00
Ion
44afcf4fa8
[doc] Document experimental signal API. ( #4019 )
...
* [doc] Document signal API.
* minor
* resolve conflicts
2019-02-28 11:05:28 -08:00
Robert Nishihara
641f703879
Update installation instructions to include bazel and remove outdated… ( #4171 )
2019-02-26 23:07:43 -08:00
Adi Zimmerman
5cf388f29d
[tune] Support RESTful API for the Web Server ( #4080 )
...
Change the client/server API to RESTful design. This includes resource modeling, model URI's, and correct HTTP methods.
2019-02-26 21:56:02 -08:00
justinwyang
19b8793b6a
Updated test script paths in documentation ( #4170 )
2019-02-26 16:14:55 -08:00
Eric Liang
3896b726dd
Dynamically adjust redis memory usage ( #4152 )
...
* f
* Update services.py
2019-02-25 16:21:37 -08:00
Eric Liang
d9da183c7d
[rllib] Custom supervised loss API ( #4083 )
2019-02-24 15:36:13 -08:00
Robert Nishihara
7b04ed059e
Move TensorFlowVariables to ray.experimental.tf_utils. ( #4145 )
2019-02-24 14:26:46 -08:00
Eric Liang
05d96ce81b
[rllib] Raise an error if multi-agent envs terminate without a last observation for agents ( #4139 )
...
* fix it
* lint
* Update rllib-training.rst
2019-02-23 21:23:40 -08:00
Philipp Moritz
ba52caff37
Make Bazel the default build system ( #3898 )
2019-02-23 11:58:59 -08:00
Tianming Xu
692bb336a1
Fix master branch compilation error and lint error ( #4109 )
2019-02-21 11:54:30 -08:00
Eric Liang
f8bef004da
[rllib] Improve error message for bad envs, add remote env docs ( #4044 )
...
* commit
* fix up rew
2019-02-18 01:28:19 -08:00
Megan Kawakami
346885068c
[rllib] add torch pg ( #3857 )
...
* add torch pg
* add torch imports
* added torch pg
* working torch pg implementation
* add pg pytorch
* Update a3c.py
* Update a3c.py
* Update torch_policy_graph.py
* Update torch_policy_graph.py
2019-02-16 19:54:14 -08:00
Hao Chen
de17443dc2
Propagate backend error to worker ( #4039 )
2019-02-16 11:39:15 +08:00
Robert Nishihara
5f71751891
API cleanups. Remove worker argument. Remove some deprecated arguments. ( #4025 )
...
* Remove worker argument from API methods.
* Remove deprecated arguments and deprecate redirect_output and redirect_worker_output.
* Fix
2019-02-15 10:49:16 -08:00
William Ma
8ee53297b1
Add documentation on how to use debug tools ( #4000 )
2019-02-14 13:50:21 -08:00
Philipp Moritz
077ffd99bf
Bump version from 0.6.3 to 0.7.0.dev0 in docs and .yaml ( #4042 )
2019-02-14 12:08:48 -08:00
Eric Liang
2dccf383dd
[rllib] Basic infrastructure for off-policy estimation (IS, WIS) ( #3941 )
2019-02-13 16:25:05 -08:00
Hao Chen
f31a79f3f7
Implement actor checkpointing ( #3839 )
...
* Implement Actor checkpointing
* docs
* fix
* fix
* fix
* move restore-from-checkpoint to HandleActorStateTransition
* Revert "move restore-from-checkpoint to HandleActorStateTransition"
This reverts commit 9aa4447c1e3e321f42a1d895d72f17098b72de12.
* resubmit waiting tasks when actor frontier restored
* add doc about num_actor_checkpoints_to_keep=1
* add num_actor_checkpoints_to_keep to Cython
* add checkpoint_expired api
* check if actor class is abstract
* change checkpoint_ids to long string
* implement java
* Refactor to delay actor creation publish until checkpoint is resumed
* debug, lint
* Erase from checkpoints to restore if task fails
* fix lint
* update comments
* avoid duplicated actor notification log
* fix unintended change
* add actor_id to checkpoint_expired
* small java updates
* make checkpoint info per actor
* lint
* Remove logging
* Remove old actor checkpointing Python code, move new checkpointing code to FunctionActionManager
* Replace old actor checkpointing tests
* Fix test and lint
* address comments
* consolidate kill_actor
* Remove __ray_checkpoint__
* fix non-ascii char
* Loosen test checks
* fix java
* fix sphinx-build
2019-02-13 19:39:02 +08:00
Si-Yuan
21472b890a
Integrate "tempfile_service" into "ray.node.Node" ( #3953 )
2019-02-12 17:34:04 -08:00
Adi Zimmerman
dac1969647
[tune] Add Nevergrad to Tune ( #3985 )
2019-02-12 11:00:04 -08:00
Adi Zimmerman
9797028a91
[tune] Add scikit-optimize to Tune ( #3924 )
2019-02-11 17:06:02 -08:00