Eric Liang
3f4d37cd0e
[rllib] Fix Multidiscrete support ( #4869 )
2019-05-29 20:41:02 -07:00
Eric Liang
2dd0beb5bd
[rllib] Allow access to batches prior to postprocessing ( #4871 )
2019-05-29 18:17:14 -07:00
Robert Nishihara
a218a14c92
Bump version from 0.8.0.dev0 to 0.7.1. ( #4890 )
2019-05-29 16:57:28 -07:00
Richard Liaw
acee89b1f6
[tune] Auto-init Ray + default SearchAlg ( #4815 )
2019-05-29 12:09:34 -07:00
Philipp Moritz
64eb7b322c
Upgrade arrow to latest master ( #4858 )
2019-05-28 16:04:16 -07:00
Eric Liang
d7be5a5d36
[rllib] Fix error getting kl when simple_optimizer: True in multi-agent PPO
2019-05-27 17:24:45 -07:00
Eric Liang
a45c61e19b
[rllib] Update concepts docs and add "Building Policies in Torch/TensorFlow" section ( #4821 )
...
* wip
* fix index
* fix bugs
* todo
* add imports
* note on get ph
* note on get ph
* rename to building custom algs
* add rnn state info
2019-05-27 14:17:32 -07:00
Richard Liaw
574e1c7695
[tune] Fix up Ax Search and Examples ( #4851 )
...
* update Ax for cleaner API
* docs update
2019-05-27 13:23:17 -07:00
Robert Nishihara
7a78e1e320
Install bazel in autoscaler development configs. ( #4874 )
2019-05-26 16:13:50 -07:00
Robert Nishihara
6703519144
Move global state API out of global_state object. ( #4857 )
2019-05-26 11:27:53 -07:00
Eric Liang
7237ea70c4
[rllib] [RFC] Deprecate Python 2 / RLlib ( #4832 )
2019-05-25 10:45:26 -07:00
Richard Liaw
0ce0ecbe9c
[tune] Later expansion of local_dir ( #4806 )
2019-05-25 02:19:28 -07:00
Devin Petersohn
a7d01aba9b
Update wheel versions in documentation to 0.8.0.dev0 and 0.7.0. ( #4847 )
2019-05-24 16:49:13 -07:00
Robert Nishihara
49fe894e22
Export remote functions when first used and also fix bug in which rem… ( #4844 )
...
* Export remote functions when first used and also fix bug in which remote functions and actor classes are not exported from workers during subsequent ray sessions.
* Documentation update
* Fix tests.
* Fix grammar
2019-05-24 13:44:39 -07:00
Devin Petersohn
ba6c595094
Bump Ray master version to 0.8.0.dev0 ( #4845 )
2019-05-23 17:02:20 -07:00
Robert Nishihara
2015085192
Fix bug in which actor classes are not exported multiple times. ( #4838 )
2019-05-23 09:22:46 -07:00
Yuhong Guo
1a39fee9c6
Refactor ID Serial 1: Separate ObjectID and TaskID from UniqueID ( #4776 )
...
* Enable BaseId.
* Change TaskID and make python test pass
* Remove unnecessary functions and fix test failure and change TaskID to
16 bytes.
* Java code change draft
* Refine
* Lint
* Update java/api/src/main/java/org/ray/api/id/TaskId.java
Co-Authored-By: Hao Chen <chenh1024@gmail.com>
* Update java/api/src/main/java/org/ray/api/id/BaseId.java
Co-Authored-By: Hao Chen <chenh1024@gmail.com>
* Update java/api/src/main/java/org/ray/api/id/BaseId.java
Co-Authored-By: Hao Chen <chenh1024@gmail.com>
* Update java/api/src/main/java/org/ray/api/id/ObjectId.java
Co-Authored-By: Hao Chen <chenh1024@gmail.com>
* Address comment
* Lint
* Fix SINGLE_PROCESS
* Fix comments
* Refine code
* Refine test
* Resolve conflict
2019-05-22 14:46:30 +08:00
Qing Wang
259cdfa0de
Fix issue when starting raylet_monitor
( #4829 )
2019-05-22 11:08:24 +08:00
Eric Liang
02583a8598
[rllib] Rename PolicyGraph => Policy, move from evaluation/ to policy/ ( #4819 )
...
This implements some of the renames proposed in #4813
We leave behind backwards-compatibility aliases for *PolicyGraph and SampleBatch.
2019-05-20 16:46:05 -07:00
Eric Liang
6cb5b90bd6
[rllib] [RFC] Dynamic definition of loss functions and modularization support ( #4795 )
...
* dynamic graph
* wip
* clean up
* fix
* document trainer
* wip
* initialize the graph using a fake batch
* clean up dynamic init
* wip
* spelling
* use builder for ppo pol graph
* add ppo graph
* fix naming
* order
* docs
* set class name correctly
* add torch builder
* add custom model support in builder
* cleanup
* remove underscores
* fix py2 compat
* Update dynamic_tf_policy_graph.py
* Update tracking_dict.py
* wip
* rename
* debug level
* rename policy_graph -> policy in new classes
* fix test
* rename ppo tf policy
* port appo too
* forgot grads
* default policy optimizer
* make default config optional
* add config to optimizer
* use lr by default in optimizer
* update
* comments
* remove optimizer
* fix tuple actions support in dynamic tf graph
2019-05-18 00:23:11 -07:00
Noah Golmant
1ef9c0729d
[tune] Initial track integration ( #4362 )
...
Introduces a minimally invasive utility for logging experiment results. A broad requirement for this tool is that it should integrate seamlessly with Tune execution.
2019-05-17 11:34:05 -07:00
Qing Wang
dcd6d4949c
Fix Java worker log dir ( #4781 )
2019-05-17 16:13:28 +08:00
Richard Liaw
e20855ccae
[tune] Remove extra parsing functionality ( #4804 )
2019-05-16 23:11:35 -07:00
Richard Liaw
88b45a53d6
[autoscaler] rsync cluster ( #4785 )
2019-05-16 23:11:06 -07:00
Richard Liaw
ffe61fcc70
[tune] Support non-arg submit ( #4803 )
2019-05-16 23:10:07 -07:00
Eric Liang
3807fb505b
[rllib] TensorFlow 2 compatibility ( #4802 )
2019-05-16 22:12:07 -07:00
Eric Liang
7d5ef6d99c
[rllib] Support continuous action distributions in IMPALA/APPO ( #4771 )
2019-05-16 22:05:07 -07:00
Richard Liaw
9f2645d6ea
[tune] Fix CLI test ( #4801 )
2019-05-16 13:50:03 -07:00
Devin Petersohn
1490a98a71
Bump version to 0.7.0 ( #4791 )
2019-05-15 22:55:21 -07:00
Richard Liaw
3bbafc7105
[autoscaler] Fix submit ( #4782 )
2019-05-14 19:52:28 -07:00
Jones Wong
c5161a2c4d
[rllib] fix clip by value issue as TF upgraded ( #4697 )
...
* fix clip_by_value issue
* fix typo
2019-05-13 15:39:25 -07:00
Qing Wang
62c949bbd5
Fix ray stop
by killing raylet before plasma ( #4778 )
2019-05-13 14:53:10 +08:00
Eric Liang
69352e3302
[rllib] Implement learn_on_batch() in torch policy graph
2019-05-12 21:29:58 -07:00
Romil Bhardwaj
004440f526
Dynamic Custom Resources - create and delete resources ( #3742 )
2019-05-11 20:06:04 +08:00
Eric Liang
351753aae5
[rllib] Remove dependency on TensorFlow ( #4764 )
...
* remove hard tf dep
* add test
* comment fix
* fix test
2019-05-10 20:36:18 -07:00
cgraywang
584adb45b8
[tune] Add MXNet Gluon example on CIFAR-10 ( #4683 )
2019-05-08 21:39:07 -07:00
Adi Zimmerman
28d381373d
[tune] Add Ax to Tune ( #4731 )
2019-05-08 15:54:29 -07:00
Romil Bhardwaj
0421cba4e8
Autoscaler hotfix for #4555 . ( #4653 )
2019-05-08 14:50:52 -07:00
Jacob Beck
28496c8b50
[rllib] Qmix padding patch ( #4735 )
...
* Qmix padding patch
* Update qmix_policy_graph.py
* lint errors
* more linting
* Update qmix_policy_graph.py
2019-05-08 14:07:29 -07:00
Devin Petersohn
edb8465910
[ray-core] Initial addition of performance integration testing files ( #4325 )
2019-05-08 13:40:54 -07:00
Richard Liaw
7f50c96adb
[tune] Reduce sampling API clutter ( #4739 )
...
Adds some sugar for tune sampling API (for commonplace sampling idioms).
2019-05-06 17:42:39 -07:00
Eric Liang
71b2dec3b4
[rllib] Fix bounds of space returned by preprocessor.observation_space ( #4736 )
2019-05-05 18:25:38 -07:00
Si-Yuan
bd00735fe8
Fix tempfile issues ( #4605 )
2019-05-05 16:06:15 -07:00
Daniel Ho
dca1c25d88
[tune] Fix setup-dev relative path ( #4747 )
2019-05-05 00:39:07 -07:00
Richard Liaw
f2faf5ce75
[tune] Contributor Guide and Design Page ( #4716 )
...
* Move setup script out
* some changes
* Finished Contributor guide
* some comments to the design
* move
* Apply suggestions from code review
Co-Authored-By: richardliaw <rliaw@berkeley.edu>
* sourcecode
* comments
2019-05-05 00:04:13 -07:00
Robert Nishihara
d81e71e297
Enable actor methods to be decorated on the caller side also and get postprocessors. ( #4732 )
...
* Allow decorating ray actor methods.
* Add test.
* Add get postprocessors.
* Improve documentation.
* Make it work for remote functions.
* Temporary fix.
2019-05-04 11:53:47 -07:00
Peng Zhenghao
897b35ce36
[tune] fix restore error at tune.run() ( #4733 )
2019-05-04 02:56:15 -04:00
Adi Zimmerman
36b71d1446
[Tune] Post-Experiment Tools ( #4351 )
2019-05-04 02:51:26 -04:00
Federico Fontana
78bb26286e
Replaced discontinued rnn_cell.BasicLSTMCell with rnn_cell.LSTMCell ( #4703 )
...
* Fixed bug in Dirichlet (#4440 )
* Replaced deprecated rnn_cell.BasicLSTMCell with rnn_cell.LSTMCell
2019-05-02 13:19:27 -07:00
Andrew Tan
f87235f232
[tune] Example for Tune blog post ( #4673 )
2019-05-02 13:16:48 -04:00