Ameer Haj Ali
81238945b9
Update index.rst ( #6935 )
2020-01-27 18:35:48 -06:00
Eric Liang
e659699ca9
[tune] Fix directory naming regression ( #6839 )
2020-01-27 15:53:40 -08:00
Alex Wu
d9a2294298
Ssh identities only ( #6931 )
2020-01-27 17:01:21 -06:00
Richard Liaw
e0078a0d78
[autoscaler][minor] default -> latest_dlami ( #6922 )
...
* config
* latest
* Update python/ray/autoscaler/aws/config.py
2020-01-27 14:34:07 -08:00
Ameer Haj Ali
a7ecda6017
Support of scikit-learn with ray joblib backend ( #6925 )
2020-01-27 15:00:00 -06:00
Simon Mo
396d7fafc8
UI improvement for asyncio ( #6905 )
2020-01-27 12:45:51 -08:00
mehrdadn
bde575b8dd
Revert "Use Boost.Process instead of pid_t ( #6510 )" ( #6909 )
...
This reverts commit fb8e3615d5
.
2020-01-26 10:26:44 -06:00
Eric Liang
2fb53396ad
[rllib] [experimental] Decentralized Distributed PPO for torch (DD-PPO) ( #6918 )
2020-01-25 22:36:43 -08:00
hyggan
552156f22d
[tune] Handles nan case for AsyncHyperBand ( #6916 )
2020-01-25 17:26:30 -08:00
Ujval Misra
ed9de8b2fa
[tune] Expose progress reporter to users ( #6915 )
...
* Pluggable progress reporter
* Fix types
* Fix bug, address comments
* lint
* Add convenience function and test
* lint
* Use trials instead of trial_runner
* Add docs
* Update docs
* Fix doc examples
* More doc updates
* Address comments, add configurable frequency
* use reward
2020-01-25 12:28:05 -08:00
Eric Liang
2e88e2e773
Split up bazel test into tune / non tune tests ( #6846 )
...
* fix it
* move
* Update .travis.yml
2020-01-25 12:25:12 -08:00
Yunzhi Zhang
aa5427ca78
[Dashboard] Kill actor ( #6906 )
2020-01-24 17:21:44 -08:00
Mitchell Stern
33423627ca
[Dashboard] Add profiling button to logical view ( #6901 )
2020-01-24 11:52:14 -08:00
Sven Mika
446cbdf2e0
[RLlib] Fix issue (bug): LSTM + non-shared vf + PPO + tuple actions ( #6890 )
...
* Add `RandomEnv` example to examples folder.
Convert warning into Error message when using an LSTM in a non-shared-vf network (after the warning, the program would crash).
* LINT.
* Fix issue #6884 . LSTM + non-shared vf NN + PPO crashes when using a Tuple action space.
* LINT
* Change warning message for Model: shared_vf=False, LSTM=True cases.
* Bug fix.
* Add examples/random_env.py test to Jenkins.
2020-01-24 10:29:35 -08:00
Daniel Edgecumbe
e516c50745
[autoscaler]: Kill workers if the monitor raises an exception ( #3977 )
...
Co-authored-by: CJosephides <cjosephides@gmail.com>
2020-01-23 14:12:52 -06:00
Qing Wang
cfbde39ba8
[Java] Generate head redis port randomly ( #6879 )
...
* Random head port
* address comments.
2020-01-23 23:37:41 +08:00
AnanthHari
aa2a0cb6da
Fixes empty state
argument in compute_single_action method ( #6894 )
...
* Fixes empty `state` parameter in compute_single_action method
* Fixed style
2020-01-23 00:42:52 -08:00
Ujval Misra
1558307ac4
[tune] Prevent MEMORY checkpoints from breaking trial FT ( #6691 )
...
* Prevent MEMORY checkpoints from breaking FT
* Add save/pause/resume/restore test
* change checkpoint return value based on status
* Fix test_checkpoint_manager_tests.
* Fix test + checkpoint manager bug
* lint
* Add docstring
* Add docstring to checkpoint_manager constructor
* Change variable name for clarity
* Revert on_checkpoint docstring wording
* Break after success
* nit: more informative warning
* Quarantine test
2020-01-22 23:17:09 -08:00
Yunzhi Zhang
0834bda8c1
[Dashboard] Display actor task execution info ( #6705 )
...
Co-authored-by: Philipp Moritz <pcmoritz@gmail.com>
2020-01-22 22:33:55 -08:00
Sven Mika
ae9a3a2237
[RLlib] from_config util method for framework agnostic components; start moving RLlib tests into Bazel. ( #6865 )
2020-01-22 17:02:58 -08:00
Simon Mo
5f527816fe
Fix async actor high cpu utilization when idle ( #6877 )
2020-01-22 16:07:08 -08:00
Simon Mo
4dd41844d0
Ignore blocking ray.wait if timeout is zero ( #6891 )
2020-01-22 16:05:34 -08:00
Eric Liang
6bb30c9f1b
fix links ( #6883 )
2020-01-22 01:06:07 -08:00
Richard Liaw
2b0e93586f
[autoscaler] Auto-replace "DEFAULT" with most recent DLAMI ( #6848 )
...
* try_this
* fix
* actual fix
* default
2020-01-21 13:54:04 -08:00
Richard Liaw
4edfaf2f38
[tune] Support callable objects in variant generation ( #6849 )
...
* minorcallable
* format
2020-01-21 10:24:25 -08:00
Frank Röder
dac6268c5b
[tune] Fix broken link in Tune User Guide ( #6866 )
2020-01-21 10:21:14 -08:00
chaokunyang
289e5e8aff
enable maven checkstyle ( #6829 )
2020-01-20 23:41:54 -08:00
Sven Mika
c957ed58ed
[RLlib] Implement PPO torch version. ( #6826 )
2020-01-20 23:06:50 -08:00
Ce Gao
574abe844a
[ray-operator] Remove useless RBAC rules ( #6853 )
...
Signed-off-by: Ce Gao <gaoce@caicloud.io>
2020-01-21 00:31:07 -06:00
Lingxuan Zuo
7e484687d3
Use GET-SET macro to reduce duplicated code. ( #6863 )
2020-01-21 10:57:57 +08:00
mehrdadn
139bf8908e
Replace UNIX sockets with TCP sockets in Ray on Windows ( #6823 )
...
* Replace UNIX sockets with TCP sockets in Ray
2020-01-20 17:28:11 -08:00
Stephanie Wang
815cd0e39a
Task and actor fate sharing with the owner process ( #6818 )
...
* Add test
* Kill workers leased by failed workers
* merge
* shorten test
* Add node failure test case
* Fix FromBinary for nil IDs, add assertions
* Test
* Fate sharing on node removal, fix owner address bug
* lint
* Update src/ray/raylet/node_manager.cc
Co-Authored-By: Zhijun Fu <37800433+zhijunfu@users.noreply.github.com>
* fix
* Remove unneeded test
* fix IDs
Co-authored-by: Zhijun Fu <37800433+zhijunfu@users.noreply.github.com>
2020-01-20 16:44:04 -08:00
Eric Liang
14016535a5
[rllib] Add TF and Torch icons to show which are available for each algo ( #6869 )
2020-01-20 15:22:21 -08:00
Ce Gao
125e26dde5
[ray-operator] Watch the pod resource and remove useless code ( #6852 )
...
Signed-off-by: Ce Gao <gaoce@caicloud.io>
2020-01-20 12:13:30 -06:00
Ce Gao
23f32c5ec8
[ray-operator]: Add ignore file ( #6851 )
...
Signed-off-by: Ce Gao <gaoce@caicloud.io>
2020-01-20 12:13:01 -06:00
Philipp Moritz
96e2c1ae74
[Projects] Add small tutorial for projects ( #6641 )
2020-01-20 09:33:41 -08:00
mehrdadn
10609c3a19
Use standard EditorConfig file for editor settings ( #6861 )
...
https://editorconfig.org/
Co-authored-by: GitHub Web Flow <noreply@github.com>
2020-01-20 08:03:06 -08:00
Robert Nishihara
c2cbb85a43
Fix flaky test test_feature_flag ( #6850 )
2020-01-19 20:59:03 -08:00
Richard Liaw
341ddd0a09
[tune] Default to TensorboardX and include in requirements. ( #6836 )
2020-01-19 01:49:33 -08:00
Eric Liang
a229bdf272
[rllib] Deprecate custom preprocessors ( #6833 )
...
* deprecation warnings
* add log warn
* fix test
2020-01-18 23:30:09 -08:00
Richard Liaw
8a9bd18606
[tune] Remove keras dependency ( #6827 )
2020-01-18 23:24:42 -08:00
Richard Liaw
c9a1810392
[doc] Add meetup link (temporary) ( #6835 )
2020-01-18 17:53:47 -08:00
Sven Mika
7659cae3ba
[RLlib] Add PG torch regression test ( #6828 )
...
* Add PG torch regression test to tuned_examples/regression_tests dir.
* Rename cartpole-pg.yaml into cartpole-pg-tf.yaml
* cartpole-pg-tf.yaml: Change cartpole-pg name of tuned_example to cartpole-pg-tf.
2020-01-18 15:57:12 -08:00
Justin Terry
97bf79917c
[RLlib] Update MADDPG example repo to maintained fork ( #6831 )
2020-01-18 13:08:27 -08:00
Yuhao Yang
9b1d2953de
[tune] set correct path when deleting checkpoint folder ( #6758 )
2020-01-17 23:11:03 -08:00
Sven Mika
303547f119
[RLlib] Policy-classes cleanup and torch/tf unification. ( #6770 )
2020-01-17 22:26:28 -08:00
Mitchell Stern
763818b476
[Dashboard] Add static assets for speedscope v1.5.3 ( #6822 )
2020-01-17 20:53:53 -08:00
Sven Mika
e6227082bd
[RLlib] Add torch
flag to train.py ( #6807 )
2020-01-17 18:48:44 -08:00
Yunzhi Zhang
3acf3c7675
[Dashboard] Add actor task counter ( #6820 )
2020-01-17 15:43:56 -08:00
Simon Mo
8f246c17b5
Initialize async plasma for async actors ( #6813 )
...
* Initialize async plasma for async actors
* Address comment
2020-01-17 14:58:06 -08:00