Jimpachnet
d3551dd8df
[tune] Added possibility to execute infinite recovery retries for a trial ( #3901 )
...
Allows a trial to attempt unlimited recoveries by setting max_failures to a negative number.
2019-01-31 02:21:16 -08:00
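Note: the negative max_failures behavior described in the entry above can be sketched roughly as follows. This is a minimal illustration, not Tune's actual implementation. The function name should_recover and the num_failures counter are hypothetical, and the comparison used for non-negative limits is an assumption.

```python
def should_recover(num_failures: int, max_failures: int) -> bool:
    """Decide whether a failed trial gets another recovery attempt.

    Hypothetical helper: a negative max_failures means "retry forever".
    """
    if max_failures < 0:
        # Negative limit: never give up on the trial.
        return True
    # Non-negative limit (assumed semantics): retry while under the limit.
    return num_failures < max_failures
```

Under these assumptions, should_recover(1000, -1) is always true, while a limit of 3 stops recovery once three failures have been recorded.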
Philipp Moritz
beb75193da
Fix linting on master ( #3913 )
2019-01-31 01:28:45 -08:00
Richard Liaw
d128636bab
Ray Logging Configuration ( #3691 )
...
* fix logging for autoscaler
* module logging
* try this for logging
* yapf
* fix
* Initial logging setup
* memory
* ok
* remove basicconfig
* catch
* remove package logging
* print
* fix
* try_fix
* fix 1
* revert rllib
* logging level
* flake8
* fix
* fix
* Remove vestigial TODO
2019-01-30 21:01:12 -08:00
Richard Liaw
5f145041ef
Update Release Docs ( #3693 )
2019-01-30 19:37:48 -08:00
Robert Nishihara
93214891b0
Small improvement to kubernetes config files. ( #3875 )
2019-01-30 18:00:20 -08:00
Rong Ou
8f6bd6cece
change kubernetes examples to use Deployment ( #3909 )
2019-01-30 17:50:37 -08:00
Robert Nishihara
d06d9fc5d7
Fix Python linting errors. ( #3905 )
2019-01-30 13:43:18 -08:00
Kai Yang
02766adeca
Limit maximum starting workers per language ( #3852 )
2019-01-29 21:43:12 -08:00
Eric Liang
152375aa8a
[rllib] Add evaluation option to DQN agent ( #3835 )
...
* add eval
* interval
* multiagent minor fix
* Update rllib.rst
* Update ddpg.py
* Update qmix.py
2019-01-29 21:19:53 -08:00
Yuhong Guo
c45b91dcca
Make redis module safe without crashing by removing RAY_CHECK ( #3855 )
2019-01-29 21:06:31 -08:00
Eric Liang
fb73cedf70
[rllib] Add examples page, add hierarchical training example, delete SC2 examples ( #3815 )
...
* wip
* lint
* wip
* up
* wip
* update examples
* wip
* remove carla
* update
* improve envspec
* link to custom
* Update rllib-env.rst
* update
* fix
* fn
* lint
* ds
* ssd games
* desc
* fix up docs
* fix
2019-01-29 21:06:09 -08:00
Bruno Morier
c9819a721d
Update tempfile_services.py ( #3896 )
...
Fix an invalid reference to os.errno. errno was removed from os in Python 3.7. The fix simply replaces it with the already imported errno.
2019-01-29 19:33:02 -08:00
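Note: the kind of change described in the entry above can be illustrated with a short sketch. This is not the code from tempfile_services.py; the helper name make_dir_if_missing is hypothetical and only shows the pattern of using the errno module directly instead of os.errno, which is unavailable on Python 3.7 and later.

```python
import errno
import os


def make_dir_if_missing(path: str) -> None:
    """Create a directory, tolerating the case where it already exists.

    Hypothetical helper for illustration only.
    """
    try:
        os.makedirs(path)
    except OSError as e:
        # Compare against errno.EEXIST from the errno module,
        # not os.errno.EEXIST, which fails on Python 3.7+.
        if e.errno != errno.EEXIST:
            raise
```

Importing errno explicitly works on both older and newer Python versions, so no version check is needed.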
Robert Nishihara
2887dac427
Use Redis version 5.0.3. ( #3886 )
2019-01-29 19:19:05 -08:00
Philipp Moritz
0aadf11c10
Fix compilation on macOS by adding virtual destructors ( #3878 )
2019-01-28 13:22:52 -08:00
Philipp Moritz
f7415b37c5
Build Ray with Bazel ( #3867 )
2019-01-27 18:32:04 -08:00
Eric Liang
c75038b945
[autoscaler] Updating a file in file mounts causes all worker nodes to get restarted
2019-01-27 17:41:37 -08:00
Stephanie Wang
ad9f1721d1
Fix object_manager_test.py::object_transfer_retry test ( #3863 )
2019-01-27 13:55:38 -08:00
Stephanie Wang
eddd60e14e
Improve backend debug logging, refactor scheduling queues ( #3819 )
2019-01-26 16:15:48 +08:00
Yuhong Guo
066fa8abf3
Fix monitor_test.py by waiting for monitor.py to start working ( #3840 )
...
* Wait for monitor.py to start working
* Checkout None result in state.py
2019-01-25 18:07:15 +08:00
Philipp Moritz
20162ce159
Compile raylet cython bindings with bazel ( #3842 )
2019-01-25 00:57:31 -08:00
Si-Yuan
48139cf861
Migrate Python C extension to Cython ( #3541 )
2019-01-24 09:17:14 -08:00
Yuhong Guo
c1a52b1c86
Remove duplicated code in RayConfig ( #3831 )
2019-01-24 17:04:10 +08:00
Hao Chen
bfcf254e52
Fix: do not treat actor task as failed if the actor will be reconstructed ( #3736 )
2019-01-23 23:28:44 -08:00
Eric Liang
04ec47cbd4
[rllib] annotate public vs developer vs private APIs ( #3808 )
2019-01-23 21:27:26 -08:00
Robert Nishihara
01e18b47f4
Direct people to stackoverflow for questions about usage. ( #3830 )
...
* Direct people to stackoverflow for questions about usage.
* Improve wording
2019-01-23 13:30:02 -08:00
Wang Qing
dcb744518e
Implement actor dummy object gc in java ( #3822 )
...
* Add dummy object gc in java
* Fix
* Address comments.
* Refine
* Address comments.
2019-01-23 11:56:25 -08:00
Wang Qing
816406ea3d
[Java] Fix setCurrentTask() in multi threading ( #3821 )
2019-01-23 20:45:30 +08:00
Robert Nishihara
0b1608a546
Factor out code for starting new processes and test plasma store in valgrind. ( #3824 )
...
* Factor out starting Ray processes.
* Detect flags through environment variables.
* Return ProcessInfo from start_ray_process.
* Print valgrind errors at exit.
* Test valgrind in travis.
* Some valgrind fixes.
* Undo raylet monitor change.
* Only test plasma store in valgrind.
2019-01-22 14:59:11 -08:00
Eric Liang
f0e6523323
[rllib] Don't call reset() unless necessary for multi-agent envs
2019-01-20 15:00:18 -08:00
Philipp Moritz
0dad4e6a25
Build Raylet with Bazel ( #3806 )
2019-01-20 12:16:47 -08:00
Eric Liang
aad48ee5a5
[tune] Fully deprecate raw function literals in Tune ( #3788 )
...
Related: https://github.com/ray-project/ray/issues/3785
2019-01-19 17:09:36 -08:00
Michael Luo
16f7ca45e4
Appo ( #3779 )
...
* Deleted old fork, updated new ray and moved PPO-impala to APPO in ppo folder
* Deleted unnecessary vtrace.py file
* Update pong-impala.yaml
* Cleaned PPO Code
* Update pong-impala.yaml
* Update pong-impala.yaml
* wip
* new file
* refactor
* add vtrace off option
* revert
* support any space
* docs
* fix comment
* remove kl
* Update cartpole-appo-vtrace.yaml
2019-01-18 13:40:26 -08:00
Philipp Moritz
931e6a2fc3
Fix compilation error on ARM. ( #3800 )
2019-01-18 00:25:16 -08:00
Robert Nishihara
9af5a62e05
Give better error for old-style actor classes. ( #3793 )
2019-01-17 19:05:04 -08:00
Richard Liaw
0537508106
Bump strings for 0.6.2 ( #3801 )
2019-01-17 19:03:27 -08:00
Si-Yuan
16a3b99d8d
Get rid of Arrow test utils ( #3734 )
...
* convert code to proper C++
* revert changes to "id.h" because #3765 has been merged.
* revert changes to Python bindings because they will be removed in #3541
* remove dependencies of Arrow logging
* revert changes to Arrow logging
* lint
2019-01-17 18:35:41 -08:00
Jones Wong
319c1340cb
[rllib] Develop MARWIL ( #3635 )
...
* add marwil policy graph
* fix typo
* add offline optimizer and enable running marwil
* fix loss function
* add maintaining the moving average of advantage norm
* use sync replay optimizer for unifying
* remove offline optimizer and use sync replay optimizer
* format by yapf
* add imitation learning objective
* fix according to eric's review
* format by yapf
* revise
* add test data
* marwil
2019-01-16 19:00:43 -08:00
Hao Chen
d1840bc7a9
Simplify RayConfig ( #3714 )
2019-01-16 16:43:26 -08:00
Richard Liaw
75ac016e2b
Bump version ( #3787 )
2019-01-16 11:40:54 -08:00
Richard Liaw
fa99fda2b4
Application Stress Tests ( #3612 )
2019-01-16 02:05:16 -08:00
Richard Liaw
c28e6d41f5
[tune] Avoid overwriting checkpoint file ( #3781 )
2019-01-16 02:03:16 -08:00
ggdupont
a237b4a6a1
[Java] Fix missing jaxb package on JDK 11 ( #3738 )
2019-01-16 14:15:00 +08:00
Philipp Moritz
3b39066c15
Fix pandas 0.22 incompatibility by upgrading Arrow ( #3786 )
2019-01-15 21:17:32 -08:00
Eric Liang
401e656b95
[rllib] Sync filters at end of iteration not start; hierarchical docs ( #3769 )
2019-01-15 16:25:25 -08:00
Richard Liaw
3918934dfd
[tune] Cross-Node Recovery ( #3725 )
...
Augments trial restore to also check if the runner is at the same
location. If not, the checkpoint files are pushed onto the new location.
2019-01-15 10:37:28 -08:00
Si-Yuan
a5df8e3532
minor fix ( #3770 )
2019-01-14 13:52:51 -08:00
Tianming Xu
0b8008f41c
remove RAY_CHECK around wait_state.remaining.erase ( #3745 )
2019-01-14 10:32:31 -08:00
Philipp Moritz
02bdaf221d
Update arrow to include https://github.com/apache/arrow/pull/3392 ( #3765 )
...
* update arrow to include https://github.com/apache/arrow/pull/3392
* add appropriate includes
* update
2019-01-14 19:20:26 +08:00
Wang Qing
3cf59855af
[Java] Replace junit with testNG ( #3768 )
2019-01-14 17:49:17 +08:00
Robert Nishihara
19908c01b8
Use environment markers to only install faulthandler in Python < 3.3. ( #3764 )
2019-01-14 15:55:59 +08:00
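Note: the environment-marker approach named in the entry above can be sketched as a dependency list in the style of a setup.py. This is an illustration under assumptions, not the project's actual setup.py; the variable name install_requires follows the usual setuptools keyword, and the surrounding setup() call is omitted.

```python
# Sketch of a dependency list as it might be passed to setuptools.setup().
install_requires = [
    # PEP 508 environment marker: the faulthandler backport is only
    # installed on interpreters older than 3.3, where the module is
    # not yet part of the standard library.
    'faulthandler; python_version < "3.3"',
]
```

With a marker like this, installers that understand environment markers skip the dependency entirely on newer interpreters instead of relying on a runtime version check in setup.py.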