Commit graph

2443 commits

Author SHA1 Message Date
Jimpachnet
d3551dd8df [tune] Added possibility to execute infinite recovery retries for a trial (#3901)
Allows to let a trial try to do infinite recoveries by setting _max_failures_ to a negative number.
2019-01-31 02:21:16 -08:00
Philipp Moritz
beb75193da Fix linting on master (#3913) 2019-01-31 01:28:45 -08:00
Richard Liaw
d128636bab Ray Logging Configuration (#3691)
* fix logging for autoscaler

* module logging

* try this for logging

* yapf

* fix

* Initial logging setup

* momery

* ok

* remove basicconfig

* catch

* remove package logging

* print

* fix

* try_fix

* fix 1

* revert rllib

* logging level

* flake8

* fix

* fix

* Remove vestigal TODO
2019-01-30 21:01:12 -08:00
Richard Liaw
5f145041ef Update Release Docs (#3693) 2019-01-30 19:37:48 -08:00
Robert Nishihara
93214891b0 Small improvement to kubernetes config files. (#3875) 2019-01-30 18:00:20 -08:00
Rong Ou
8f6bd6cece change kubernetes examples to use Deployment (#3909) 2019-01-30 17:50:37 -08:00
Robert Nishihara
d06d9fc5d7 Fix Python linting errors. (#3905) 2019-01-30 13:43:18 -08:00
Kai Yang
02766adeca Limit maximum starting workers per language (#3852) 2019-01-29 21:43:12 -08:00
Eric Liang
152375aa8a
[rllib] Add evaluation option to DQN agent (#3835)
* add eval

* interval

* multiagent minor fix

* Update rllib.rst

* Update ddpg.py

* Update qmix.py
2019-01-29 21:19:53 -08:00
Yuhong Guo
c45b91dcca Make redis module safe without crashing by removing RAY_CHECK (#3855) 2019-01-29 21:06:31 -08:00
Eric Liang
fb73cedf70
[rllib] Add examples page, add hierarchical training example, delete SC2 examples (#3815)
* wip

* lint

* wip

* up

* wip

* update examples

* wip

* remove carla

* update

* improve envspec

* link to custom

* Update rllib-env.rst

* update

* fix

* fn

* lint

* ds

* ssd games

* desc

* fix up docs

* fix
2019-01-29 21:06:09 -08:00
Bruno Morier
c9819a721d Update tempfile_services.py (#3896)
Fix an invalid reference to os.errno. errno have been removed from os in python 3.7. The fix only replaces it by the already imported errno.
2019-01-29 19:33:02 -08:00
Robert Nishihara
2887dac427 Use Redis version 5.0.3. (#3886) 2019-01-29 19:19:05 -08:00
Philipp Moritz
0aadf11c10 Fix compilation on macOS by adding virtual destructors (#3878) 2019-01-28 13:22:52 -08:00
Philipp Moritz
f7415b37c5 Build Ray with Bazel (#3867) 2019-01-27 18:32:04 -08:00
Eric Liang
c75038b945
[autoscaler] Updating a file in file mounts causes all worker nodes to get restarted 2019-01-27 17:41:37 -08:00
Stephanie Wang
ad9f1721d1 Fix object_manager_test.py::object_transfer_retry test (#3863) 2019-01-27 13:55:38 -08:00
Stephanie Wang
eddd60e14e Improve backend debug logging, refactor scheduling queues (#3819) 2019-01-26 16:15:48 +08:00
Yuhong Guo
066fa8abf3
Fix monitor_test.py by waiting for moniter.py to start working (#3840)
* Wait for moniter.py to start working

* Checkout None result in state.py
2019-01-25 18:07:15 +08:00
Philipp Moritz
20162ce159 Compile raylet cython bindings with bazel (#3842) 2019-01-25 00:57:31 -08:00
Si-Yuan
48139cf861 Migrate Python C extension to Cython (#3541) 2019-01-24 09:17:14 -08:00
Yuhong Guo
c1a52b1c86 Remove duplicated code in RayConfig (#3831) 2019-01-24 17:04:10 +08:00
Hao Chen
bfcf254e52 Fix: do not treat actor task as failed if the actor will be reconstructed (#3736) 2019-01-23 23:28:44 -08:00
Eric Liang
04ec47cbd4
[rllib] annotate public vs developer vs private APIs (#3808) 2019-01-23 21:27:26 -08:00
Robert Nishihara
01e18b47f4 Direct people to stackoverflow for questions about usage. (#3830)
* Direct people to stackoverflow for questions about usage.

* Improve wording
2019-01-23 13:30:02 -08:00
Wang Qing
dcb744518e Implement actor dummy object gc in java (#3822)
* Add dummy object gc in java

* Fix

* Address comments.

* Refine

* Address comments.
2019-01-23 11:56:25 -08:00
Wang Qing
816406ea3d [Java] Fix setCurrentTask() in multi threading (#3821) 2019-01-23 20:45:30 +08:00
Robert Nishihara
0b1608a546 Factor out code for starting new processes and test plasma store in valgrind. (#3824)
* Factor out starting Ray processes.

* Detect flags through environment variables.

* Return ProcessInfo from start_ray_process.

* Print valgrind errors at exit.

* Test valgrind in travis.

* Some valgrind fixes.

* Undo raylet monitor change.

* Only test plasma store in valgrind.
2019-01-22 14:59:11 -08:00
Eric Liang
f0e6523323
[rllib] Don't call reset() unless necessary for multi-agent envs 2019-01-20 15:00:18 -08:00
Philipp Moritz
0dad4e6a25 Build Raylet with Bazel (#3806) 2019-01-20 12:16:47 -08:00
Eric Liang
aad48ee5a5 [tune] Fully deprecate raw function literals in Tune (#3788)
Related: https://github.com/ray-project/ray/issues/3785
2019-01-19 17:09:36 -08:00
Michael Luo
16f7ca45e4 Appo (#3779)
* Deleted old fork, updated new ray and moved PPO-impala to APPO in ppo folder

* Deleted unneccesary vtrace.py file

* Update pong-impala.yaml

* Cleaned PPO Code

* Update pong-impala.yaml

* Update pong-impala.yaml

* wip

* new ifle

* refactor

* add vtrace off option

* revert

* support any space

* docs

* fix comment

* remove kl

* Update cartpole-appo-vtrace.yaml
2019-01-18 13:40:26 -08:00
Philipp Moritz
931e6a2fc3 Fix compilation error on ARM. (#3800) 2019-01-18 00:25:16 -08:00
Robert Nishihara
9af5a62e05 Give better error for old-style actor classes. (#3793) 2019-01-17 19:05:04 -08:00
Richard Liaw
0537508106 Bump strings for 0.6.2 (#3801) 2019-01-17 19:03:27 -08:00
Si-Yuan
16a3b99d8d Get rid of Arrow test utils (#3734)
* convert code to proper C++

* revert changes to "id.h" because #3765 has been merged.

* revert changes to Python bindings because they will be removed in #3541

* remove dependencies of Arrow logging

* revert changes to Arrow logging

* lint
2019-01-17 18:35:41 -08:00
Jones Wong
319c1340cb [rllib] Develop MARWIL (#3635)
*  add marvil policy graph

*  fix typo

*  add offline optimizer and enable running marwil

*  fix loss function

*  add maintaining the moving average of advantage norm

*  use sync replay optimizer for unifying

*  remove offline optimizer and use sync replay optimizer

*  format by yapf

*  add imitation learning objective

*  fix according to eric's review

*  format by yapf

* revise

* add test data

* marwil
2019-01-16 19:00:43 -08:00
Hao Chen
d1840bc7a9 Simplify RayConfig (#3714) 2019-01-16 16:43:26 -08:00
Richard Liaw
75ac016e2b Bump version (#3787) 2019-01-16 11:40:54 -08:00
Richard Liaw
fa99fda2b4
Application Stress Tests (#3612) 2019-01-16 02:05:16 -08:00
Richard Liaw
c28e6d41f5
[tune] Avoid overwriting checkpoint file (#3781) 2019-01-16 02:03:16 -08:00
ggdupont
a237b4a6a1 [Java] Fix package jaxb not exist when JDK11 (#3738) 2019-01-16 14:15:00 +08:00
Philipp Moritz
3b39066c15 Fix pandas 0.22 incompatibility by upgrading Arrow (#3786) 2019-01-15 21:17:32 -08:00
Eric Liang
401e656b95 [rllib] Sync filters at end of iteration not start; hierarchical docs (#3769) 2019-01-15 16:25:25 -08:00
Richard Liaw
3918934dfd
[tune] Cross-Node Recovery (#3725)
Augments trial restore to also check if the runner is at the same
location. If not, the checkpoint files are pushed onto the new location.
2019-01-15 10:37:28 -08:00
Si-Yuan
a5df8e3532 minor fix (#3770) 2019-01-14 13:52:51 -08:00
Tianming Xu
0b8008f41c remove RAY_CHECK around wait_state.remaining.erase (#3745) 2019-01-14 10:32:31 -08:00
Philipp Moritz
02bdaf221d Update arrow to include https://github.com/apache/arrow/pull/3392 (#3765)
* update arrow to include https://github.com/apache/arrow/pull/3392

* add appropriate includes

* update
2019-01-14 19:20:26 +08:00
Wang Qing
3cf59855af [Java] Replace junit with testNG (#3768) 2019-01-14 17:49:17 +08:00
Robert Nishihara
19908c01b8 Use environment markers to only install faulthandler in Python < 3.3. (#3764) 2019-01-14 15:55:59 +08:00