Commit graph

880 commits

Author SHA1 Message Date
Chapman Siu
04cffb7e65
[docs] rllib-models.rst - QMIX +parametric (#8868)
Updating docs to show that QMIX supports parametric action space, as per SMAC environments. 

This is reflected in the code here: https://github.com/ray-project/ray/blob/master/rllib/agents/qmix/qmix_policy.py#L179 and consistent with QMIX being an extension of DQN
2020-06-09 21:56:16 -07:00
mehrdadn
f93bb008bb
Change os.uname()[1] and socket.gethostname() to the portable and faster platform.node_ip() (#8839)
Co-authored-by: Mehrdad <noreply@github.com>
2020-06-08 21:29:46 -07:00
Edward Oakes
5d124489a9
[serve] Require backend when creating endpoint (#8764) 2020-06-06 21:10:42 -05:00
Eric Liang
be26a7b1b0
[rllib] Support for complex / variable-length observation spaces (#8393) 2020-06-06 12:22:19 +02:00
Edward Oakes
7bfce5c027
[serve] Clarify OMP_NUM_THREADS behavior (#8740) 2020-06-05 15:39:37 -05:00
Edward Oakes
c0df913b19
[serve] [docs] Cleanup splitting traffic, add A/B testing and incremental rollout (#8741) 2020-06-05 15:39:09 -05:00
Sven Mika
c74dc58f8b
[RLlib] Fix use_lstm flag for ModelV2 (w/o ModelV1 wrapping) and add it for PyTorch. (#8734) 2020-06-05 15:40:30 +02:00
Amog Kamsetty
9410e5884d
[Tune] Parametrize Cloud Syncing Frequency (#8771) 2020-06-04 18:55:50 -07:00
Edward Oakes
c1a97c8c04
[Doc] clarify delete in serve docs (#8765) 2020-06-04 15:22:30 -07:00
Edward Oakes
0306e4d589
[serve] Refer to serve "instances," not "clusters" (#8746) 2020-06-02 15:16:29 -07:00
Edward Oakes
2e82e05e4b
[serve] Add list_backends and list_endpoints (#8737) 2020-06-02 15:14:10 -07:00
Alex Wu
e9ce47bb6b
[iter] user-tip for better perf (#8712) 2020-06-02 14:23:50 -07:00
Edward Oakes
b8a9be0378
[serve] Specify how to pass init args to actors (#8738) 2020-06-02 14:03:18 -07:00
Alex Wu
dcf58a43dc
[SGD] Dataset API (#7839) 2020-06-01 15:48:15 -07:00
Sven Mika
d8a081a185
[RLlib] Unity3D integration (n Unity3D clients vs learning server). (#8590) 2020-05-30 22:48:34 +02:00
Edward Oakes
5bec951ece
[docs] [serve] Deployment as a service on k8s docs (#8663) 2020-05-29 14:53:42 -07:00
Simon Mo
6b04664645
[Serve] Add Tutorial for Batch Inference (#8490) 2020-05-29 09:55:47 -07:00
SangBin Cho
448011f822
0.8.5 Release change. (#8358) 2020-05-28 09:37:19 -07:00
Bill Chambers
fadd47e44e
[docs] Ray Serve Documentation Overhaul (#8524) 2020-05-27 11:03:28 -05:00
Sven Mika
2746fc0476
[RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520) 2020-05-27 16:19:13 +02:00
Sven Mika
0422e9c5a8
[RLlib] Add 2 Transformer learning test cases on StatelessCartPole (PPO and IMPALA). (#8624) 2020-05-27 10:19:47 +02:00
Bill Chambers
b3d686b78f
[docs] Add Overview Section & Gentle Introduction (#8517) 2020-05-26 10:39:34 -05:00
Edward Oakes
860eb6f13a
Update named actor API (#8559) 2020-05-24 20:08:03 -05:00
Eric Liang
9a83908c46
[rllib] Deprecate policy optimizers (#8345) 2020-05-21 10:16:18 -07:00
Ian Rodney
f56b3be916
[Docs] Add Cancelation to main docs. (#8508)
* Update walkthrough.rst

* Adding example

* Better example

* Better example

* Adding Ray Kill Info
2020-05-20 10:31:57 -07:00
Bill Chambers
f8f7efc24f
[Serve] Rename RayServe -> "Ray Serve" in Documentation (#8504) 2020-05-19 19:13:54 -07:00
Simon Mo
c9c84c87f4
[Serve] Add Instructions for GPU (#8495) 2020-05-19 18:33:58 -07:00
Max Fitton
13231ba63b
Rename redis-port to port and add default (#8406) 2020-05-18 13:25:34 -05:00
Richard Liaw
b6c4f45ae0
[tune] Fix links (#8477) 2020-05-18 10:08:29 -07:00
Edward Oakes
9a721ed71a
Link to serve in tune overview (#8487) 2020-05-18 11:29:38 -05:00
Sven Mika
796a834c48
[RLlib] Attention Net integration into ModelV2 and learning RL example. (#8371) 2020-05-18 17:26:40 +02:00
Richard Liaw
87cbf2aedd
[docs][tune] Make search algorithm, scheduler docs better! (#8179) 2020-05-17 12:19:44 -07:00
SangBin Cho
2f01776d09
Fix ray memory example (#8462) 2020-05-17 11:34:11 -05:00
Tao Wang
acffdb2349
[TEST]use cc_test to run core_worker_test, enforce/reuse RedisServiceManagerForTest (#8443) 2020-05-17 18:43:00 +08:00
Edward Oakes
fb23bd6fc0
[serve] Optionally namespace serve clusters (#8447) 2020-05-17 00:14:42 -05:00
Richard Liaw
67c01455fe
[tune] tune.track -> tune.report (#8388) 2020-05-16 12:55:08 -07:00
Stephanie Wang
bd169749e0
Option to retry failed actor tasks (#8330)
* Python

* Consolidate state in the direct actor transport, set the caller starts at

* todo

* Remove unused

* Update and unit tests

* Doc

* Remove unused

* doc

* Remove debug

* Update src/ray/core_worker/transport/direct_actor_transport.h

Co-authored-by: Eric Liang <ekhliang@gmail.com>

* Update src/ray/core_worker/transport/direct_actor_transport.cc

Co-authored-by: Eric Liang <ekhliang@gmail.com>

* lint and fix build

* Update

* Fix build

* Fix tests

* Unit test for max_task_retries=0

* Fix java?

* Fix bad test

* Cross language fix

* fix java

Co-authored-by: Eric Liang <ekhliang@gmail.com>
2020-05-15 20:15:15 -07:00
Edward Oakes
ef498e8aa5
[serve] Add basic session affinity via shard key (#8449) 2020-05-15 16:18:52 -05:00
Max Fitton
00325eb2b2
Rename max_reconstructions to max_restarts and use -1 for infinite (#8274)
Co-authored-by: Edward Oakes <ed.nmi.oakes@gmail.com>
2020-05-14 10:30:29 -05:00
Eric Liang
eabb801a40
less important (#8439) 2020-05-13 22:52:38 -07:00
Siyuan (Ryans) Zhuang
ab278071ac
Update serialization doc (#8381)
* update serialization doc
2020-05-12 16:47:00 -07:00
Jason McGhee
24ced808cd
Fix config key in docs for using PyTorch (#8300)
Docs improperly suggest using "torch" when the actual flag is called "use_pytorch"
2020-05-11 12:41:21 -07:00
Eric Liang
f48da50e1c
[rllib] observation function api for multi-agent (#8236) 2020-05-04 22:13:49 -07:00
Rüdiger Busche
e93ec3134a
Use kubectl delete pod in example (#8295)
Co-authored-by: rbusche <rbusche@inserve.de>
2020-05-04 21:39:30 -05:00
Sven Mika
b95e28faea
[RLlib] APEX_DDPG (PyTorch) test case and docs. (#8288)
APEX_DDPG (PyTorch) test case and docs.
2020-05-04 09:36:27 +02:00
Sven Mika
166bb5d690
[RLlib] IMPALA PyTorch (#8287)
This PR adds an IMPALA PyTorch implementation.

- adds compilation tests for LSTM and w/o LSTM.
- adds learning test for CartPole.
2020-05-03 13:44:25 +02:00
Sven Mika
42991d723f
[RLlib] rllib/examples folder restructuring (#8250)
Cleans up of the rllib/examples folder by moving all example Envs into rllibexamples/env (so they can be used by other scripts and tests as well).
2020-05-01 22:59:34 +02:00
Edward Oakes
6373c70661
[serve] Refactor BackendConfig (#8202) 2020-04-30 22:31:07 -05:00
Edward Oakes
95d187e556
[serve] Add delete_endpoint call (#8256) 2020-04-30 20:59:07 -05:00
Edward Oakes
43be73e4cf
[serve] Add delete_backend call (#8252) 2020-04-30 13:10:39 -05:00