Eric Liang
4b8b703561
[rllib] Some API cleanups and documentation improvements ( #4409 )
2019-03-21 21:34:22 -07:00
Philipp Moritz
80ef8c19aa
Add initial news reader example ( #4348 )
2019-03-20 18:47:12 -07:00
Richard Liaw
ea5a6f8455
[tune] Simplify API ( #4234 )
...
Uses `tune.run` as the preferred API for executing experiments.
@noahgolmant
This does not break backwards compat, but will slowly internalize `Experiment`.
In a separate PR, Tune schedulers should only support 1 running experiment at a time.
2019-03-17 13:03:32 -07:00
Philipp Moritz
c5e2c9af4d
Build wheels for macOS with Bazel ( #4280 )
2019-03-15 10:37:57 -07:00
Richard Liaw
6630a35353
[tune] Initial Commit for Tune CLI ( #3983 )
...
This introduces a light CLI for Tune.
2019-03-08 16:46:05 -08:00
Eric Liang
c7f74dbdc7
[rllib] Add async remote workers ( #4253 )
2019-03-08 15:39:48 -08:00
Philipp Moritz
39eed24d47
update version from 0.7.0.dev0 to 0.7.0.dev1 ( #4282 )
2019-03-06 14:43:09 -08:00
Eric Liang
78ad9c4cbb
Add "ray timeline" command to auto-dump Chrome trace for the current Ray instance ( #4239 )
2019-03-05 16:28:00 -08:00
Adi Zimmerman
4cf2c9ecb8
[tune] Doc fixes ( #4207 )
...
Co-Authored-By: adizim <adizim@berkeley.edu>
2019-03-05 14:11:53 -08:00
Eric Liang
6e3384a719
[rllib] Add three new long-running stress tests {APEX, IMPALA, PBT} ( #4215 )
2019-03-04 14:05:42 -08:00
Eric Liang
ba03048254
[rllib] TF model custom_loss() should actually allow access to full rollout data ( #4220 )
2019-03-02 22:57:51 -08:00
Robert Nishihara
4b89eebfc7
Move test folders under rllib/tune from test -> tests. ( #4214 )
2019-03-02 13:37:16 -08:00
Robert Nishihara
f21e6a2cff
Update documentation regarding UI and timeline. ( #4189 )
2019-03-01 19:54:33 -08:00
Richard Liaw
c695402dc3
[tune] Introduce ability to turn off default logging. ( #4104 )
2019-02-28 17:02:41 -08:00
Ion
44afcf4fa8
[doc] Document experimental signal API. ( #4019 )
...
* [doc] Document signal API.
* minor
* resolve conflicts
2019-02-28 11:05:28 -08:00
Robert Nishihara
641f703879
Update installation instructions to include bazel and remove outdated… ( #4171 )
2019-02-26 23:07:43 -08:00
Adi Zimmerman
5cf388f29d
[tune] Support RESTful API for the Web Server ( #4080 )
...
Change the client/server API to a RESTful design. This includes resource modeling, model URIs, and correct HTTP methods.
2019-02-26 21:56:02 -08:00
justinwyang
19b8793b6a
Updated test script paths in documentation ( #4170 )
2019-02-26 16:14:55 -08:00
Eric Liang
3896b726dd
Dynamically adjust redis memory usage ( #4152 )
...
* f
* Update services.py
2019-02-25 16:21:37 -08:00
Eric Liang
d9da183c7d
[rllib] Custom supervised loss API ( #4083 )
2019-02-24 15:36:13 -08:00
Robert Nishihara
7b04ed059e
Move TensorFlowVariables to ray.experimental.tf_utils. ( #4145 )
2019-02-24 14:26:46 -08:00
Eric Liang
05d96ce81b
[rllib] Raise an error if multi-agent envs terminate without a last observation for agents ( #4139 )
...
* fix it
* lint
* Update rllib-training.rst
2019-02-23 21:23:40 -08:00
Philipp Moritz
ba52caff37
Make Bazel the default build system ( #3898 )
2019-02-23 11:58:59 -08:00
Tianming Xu
692bb336a1
Fix master branch compilation error and lint error ( #4109 )
2019-02-21 11:54:30 -08:00
Eric Liang
f8bef004da
[rllib] Improve error message for bad envs, add remote env docs ( #4044 )
...
* commit
* fix up rew
2019-02-18 01:28:19 -08:00
Megan Kawakami
346885068c
[rllib] add torch pg ( #3857 )
...
* add torch pg
* add torch imports
* added torch pg
* working torch pg implementation
* add pg pytorch
* Update a3c.py
* Update a3c.py
* Update torch_policy_graph.py
* Update torch_policy_graph.py
2019-02-16 19:54:14 -08:00
Hao Chen
de17443dc2
Propagate backend error to worker ( #4039 )
2019-02-16 11:39:15 +08:00
Robert Nishihara
5f71751891
API cleanups. Remove worker argument. Remove some deprecated arguments. ( #4025 )
...
* Remove worker argument from API methods.
* Remove deprecated arguments and deprecate redirect_output and redirect_worker_output.
* Fix
2019-02-15 10:49:16 -08:00
William Ma
8ee53297b1
Add documentation on how to use debug tools ( #4000 )
2019-02-14 13:50:21 -08:00
Philipp Moritz
077ffd99bf
Bump version from 0.6.3 to 0.7.0.dev0 in docs and .yaml ( #4042 )
2019-02-14 12:08:48 -08:00
Eric Liang
2dccf383dd
[rllib] Basic infrastructure for off-policy estimation (IS, WIS) ( #3941 )
2019-02-13 16:25:05 -08:00
Hao Chen
f31a79f3f7
Implement actor checkpointing ( #3839 )
...
* Implement Actor checkpointing
* docs
* fix
* fix
* fix
* move restore-from-checkpoint to HandleActorStateTransition
* Revert "move restore-from-checkpoint to HandleActorStateTransition"
This reverts commit 9aa4447c1e3e321f42a1d895d72f17098b72de12.
* resubmit waiting tasks when actor frontier restored
* add doc about num_actor_checkpoints_to_keep=1
* add num_actor_checkpoints_to_keep to Cython
* add checkpoint_expired api
* check if actor class is abstract
* change checkpoint_ids to long string
* implement java
* Refactor to delay actor creation publish until checkpoint is resumed
* debug, lint
* Erase from checkpoints to restore if task fails
* fix lint
* update comments
* avoid duplicated actor notification log
* fix unintended change
* add actor_id to checkpoint_expired
* small java updates
* make checkpoint info per actor
* lint
* Remove logging
* Remove old actor checkpointing Python code, move new checkpointing code to FunctionActionManager
* Replace old actor checkpointing tests
* Fix test and lint
* address comments
* consolidate kill_actor
* Remove __ray_checkpoint__
* fix non-ascii char
* Loosen test checks
* fix java
* fix sphinx-build
2019-02-13 19:39:02 +08:00
Si-Yuan
21472b890a
Integrate "tempfile_service" into "ray.node.Node" ( #3953 )
2019-02-12 17:34:04 -08:00
Adi Zimmerman
dac1969647
[tune] Add Nevergrad to Tune ( #3985 )
2019-02-12 11:00:04 -08:00
Adi Zimmerman
9797028a91
[tune] Add scikit-optimize to Tune ( #3924 )
2019-02-11 17:06:02 -08:00
Eric Liang
c4182463f6
[rllib] Add helper to iterate over envs in a vectorized environment ( #4001 )
...
* add foreach env func
* fix
* add test
2019-02-11 10:40:47 -08:00
Robert Nishihara
6a32b410bb
Update versions from 0.6.2 -> 0.6.3 in the documentation. ( #3981 )
2019-02-07 20:57:37 -08:00
Alex LaGrassa
b0fe5af7c8
[doc] Update example-parameter-server.rst ( #3773 )
2019-02-05 22:00:54 -08:00
Andrew Tan
8323419a6d
[tune] Add SigOpt Integration ( #3844 )
2019-02-03 18:23:57 -08:00
Michael Luo
1a015e420b
Optimal PPO Configs (10k reward in 1 hr) + PPO grad clipping implemented ( #3934 )
2019-02-02 22:10:58 -08:00
Peter Schafhalter
62a0a7bdc7
[tune] Add BayesOpt ( #3864 )
...
Adds BayesOpt as a Tune suggestion algorithm.
2019-01-31 16:54:17 -08:00
Philipp Moritz
beb75193da
Fix linting on master ( #3913 )
2019-01-31 01:28:45 -08:00
Rong Ou
8f6bd6cece
change kubernetes examples to use Deployment ( #3909 )
2019-01-30 17:50:37 -08:00
Eric Liang
152375aa8a
[rllib] Add evaluation option to DQN agent ( #3835 )
...
* add eval
* interval
* multiagent minor fix
* Update rllib.rst
* Update ddpg.py
* Update qmix.py
2019-01-29 21:19:53 -08:00
Eric Liang
fb73cedf70
[rllib] Add examples page, add hierarchical training example, delete SC2 examples ( #3815 )
...
* wip
* lint
* wip
* up
* wip
* update examples
* wip
* remove carla
* update
* improve envspec
* link to custom
* Update rllib-env.rst
* update
* fix
* fn
* lint
* ds
* ssd games
* desc
* fix up docs
* fix
2019-01-29 21:06:09 -08:00
Stephanie Wang
eddd60e14e
Improve backend debug logging, refactor scheduling queues ( #3819 )
2019-01-26 16:15:48 +08:00
Si-Yuan
48139cf861
Migrate Python C extension to Cython ( #3541 )
2019-01-24 09:17:14 -08:00
Eric Liang
04ec47cbd4
[rllib] annotate public vs developer vs private APIs ( #3808 )
2019-01-23 21:27:26 -08:00
Robert Nishihara
01e18b47f4
Direct people to stackoverflow for questions about usage. ( #3830 )
...
* Direct people to stackoverflow for questions about usage.
* Improve wording
2019-01-23 13:30:02 -08:00
Robert Nishihara
0b1608a546
Factor out code for starting new processes and test plasma store in valgrind. ( #3824 )
...
* Factor out starting Ray processes.
* Detect flags through environment variables.
* Return ProcessInfo from start_ray_process.
* Print valgrind errors at exit.
* Test valgrind in travis.
* Some valgrind fixes.
* Undo raylet monitor change.
* Only test plasma store in valgrind.
2019-01-22 14:59:11 -08:00