Eric Liang
02583a8598
[rllib] Rename PolicyGraph => Policy, move from evaluation/ to policy/ ( #4819 )
...
This implements some of the renames proposed in #4813
We leave behind backwards-compatibility aliases for *PolicyGraph and SampleBatch.
2019-05-20 16:46:05 -07:00
Eric Liang
6cb5b90bd6
[rllib] [RFC] Dynamic definition of loss functions and modularization support ( #4795 )
...
* dynamic graph
* wip
* clean up
* fix
* document trainer
* wip
* initialize the graph using a fake batch
* clean up dynamic init
* wip
* spelling
* use builder for ppo pol graph
* add ppo graph
* fix naming
* order
* docs
* set class name correctly
* add torch builder
* add custom model support in builder
* cleanup
* remove underscores
* fix py2 compat
* Update dynamic_tf_policy_graph.py
* Update tracking_dict.py
* wip
* rename
* debug level
* rename policy_graph -> policy in new classes
* fix test
* rename ppo tf policy
* port appo too
* forgot grads
* default policy optimizer
* make default config optional
* add config to optimizer
* use lr by default in optimizer
* update
* comments
* remove optimizer
* fix tuple actions support in dynamic tf graph
2019-05-18 00:23:11 -07:00
Noah Golmant
1ef9c0729d
[tune] Initial track integration ( #4362 )
...
Introduces a minimally invasive utility for logging experiment results. A broad requirement for this tool is that it should integrate seamlessly with Tune execution.
2019-05-17 11:34:05 -07:00
Qing Wang
dcd6d4949c
Fix Java worker log dir ( #4781 )
2019-05-17 16:13:28 +08:00
Richard Liaw
e20855ccae
[tune] Remove extra parsing functionality ( #4804 )
2019-05-16 23:11:35 -07:00
Richard Liaw
88b45a53d6
[autoscaler] rsync cluster ( #4785 )
2019-05-16 23:11:06 -07:00
Richard Liaw
ffe61fcc70
[tune] Support non-arg submit ( #4803 )
2019-05-16 23:10:07 -07:00
Eric Liang
3807fb505b
[rllib] TensorFlow 2 compatibility ( #4802 )
2019-05-16 22:12:07 -07:00
Eric Liang
7d5ef6d99c
[rllib] Support continuous action distributions in IMPALA/APPO ( #4771 )
2019-05-16 22:05:07 -07:00
Richard Liaw
9f2645d6ea
[tune] Fix CLI test ( #4801 )
2019-05-16 13:50:03 -07:00
Devin Petersohn
1490a98a71
Bump version to 0.7.0 ( #4791 )
2019-05-15 22:55:21 -07:00
Richard Liaw
3bbafc7105
[autoscaler] Fix submit ( #4782 )
2019-05-14 19:52:28 -07:00
Jones Wong
c5161a2c4d
[rllib] fix clip by value issue as TF upgraded ( #4697 )
...
* fix clip_by_value issue
* fix typo
2019-05-13 15:39:25 -07:00
Qing Wang
62c949bbd5
Fix ray stop by killing raylet before plasma ( #4778 )
2019-05-13 14:53:10 +08:00
Eric Liang
69352e3302
[rllib] Implement learn_on_batch() in torch policy graph
2019-05-12 21:29:58 -07:00
Romil Bhardwaj
004440f526
Dynamic Custom Resources - create and delete resources ( #3742 )
2019-05-11 20:06:04 +08:00
Eric Liang
351753aae5
[rllib] Remove dependency on TensorFlow ( #4764 )
...
* remove hard tf dep
* add test
* comment fix
* fix test
2019-05-10 20:36:18 -07:00
cgraywang
584adb45b8
[tune] Add MXNet Gluon example on CIFAR-10 ( #4683 )
2019-05-08 21:39:07 -07:00
Adi Zimmerman
28d381373d
[tune] Add Ax to Tune ( #4731 )
2019-05-08 15:54:29 -07:00
Romil Bhardwaj
0421cba4e8
Autoscaler hotfix for #4555 . ( #4653 )
2019-05-08 14:50:52 -07:00
Jacob Beck
28496c8b50
[rllib] Qmix padding patch ( #4735 )
...
* Qmix padding patch
* Update qmix_policy_graph.py
* lint errors
* more linting
* Update qmix_policy_graph.py
2019-05-08 14:07:29 -07:00
Devin Petersohn
edb8465910
[ray-core] Initial addition of performance integration testing files ( #4325 )
2019-05-08 13:40:54 -07:00
Richard Liaw
7f50c96adb
[tune] Reduce sampling API clutter ( #4739 )
...
Adds some sugar for tune sampling API (for commonplace sampling idioms).
2019-05-06 17:42:39 -07:00
Eric Liang
71b2dec3b4
[rllib] Fix bounds of space returned by preprocessor.observation_space ( #4736 )
2019-05-05 18:25:38 -07:00
Si-Yuan
bd00735fe8
Fix tempfile issues ( #4605 )
2019-05-05 16:06:15 -07:00
Daniel Ho
dca1c25d88
[tune] Fix setup-dev relative path ( #4747 )
2019-05-05 00:39:07 -07:00
Richard Liaw
f2faf5ce75
[tune] Contributor Guide and Design Page ( #4716 )
...
* Move setup script out
* some changes
* Finished Contributor guide
* some comments to the design
* move
* Apply suggestions from code review
Co-Authored-By: richardliaw <rliaw@berkeley.edu>
* sourcecode
* comments
2019-05-05 00:04:13 -07:00
Robert Nishihara
d81e71e297
Enable actor methods to be decorated on the caller side also and get postprocessors. ( #4732 )
...
* Allow decorating ray actor methods.
* Add test.
* Add get postprocessors.
* Improve documentation.
* Make it work for remote functions.
* Temporary fix.
2019-05-04 11:53:47 -07:00
Peng Zhenghao
897b35ce36
[tune] fix restore error at tune.run() ( #4733 )
2019-05-04 02:56:15 -04:00
Adi Zimmerman
36b71d1446
[Tune] Post-Experiment Tools ( #4351 )
2019-05-04 02:51:26 -04:00
Federico Fontana
78bb26286e
Replaced discontinued rnn_cell.BasicLSTMCell with rnn_cell.LSTMCell ( #4703 )
...
* Fixed bug in Dirichlet (#4440 )
* Replaced deprecated rnn_cell.BasicLSTMCell with rnn_cell.LSTMCell
2019-05-02 13:19:27 -07:00
Andrew Tan
f87235f232
[tune] Example for Tune blog post ( #4673 )
2019-05-02 13:16:48 -04:00
Andrew Tan
23ae73135e
[tune] Tune CLI Fixes ( #4659 )
...
What do these changes do?
* Add --limit flag for ls
* Add ordering functionality to --sort flag
* Remove last_result from the names of columns for ls
* Fix weird double quote error messages (\")
2019-04-30 18:21:33 -07:00
Yuhong Guo
448a7bd08d
Add lock in fetch_and_execute_function_to_run of import_thread.py ( #4718 )
2019-04-30 10:47:16 -07:00
Yuhong Guo
4eade036a0
Separate thread locks for worker and function manager. ( #4499 )
...
* Separate lock for function manager and worker
* Lint
* Add test case
* Remove print in remote function.
* Remove test and add ray.exit_actor
* Update python/ray/worker.py
Co-Authored-By: guoyuhong <guoyuhong1985@outlook.com>
* Move exit_actor from worker.py to actor.py
* Update actor.py
* Update actor.py
2019-04-29 14:55:37 +08:00
Kristian Hartikainen
69da6d0fc8
[autoscaler] Remove unnecessary apt installations in docker commands ( #4577 )
...
* Remove unnecessary apt installations in docker commands
* Add example for different head/worker image in gcp gpu example
* Update gcp gpu example docker image to tf 1.13
* Change the VM sourceImage for gcp/example-full.yaml
* Change the gcp gpu docker VM images for consistency
* Change gcp default project id to be consistent with other examples
2019-04-28 14:58:51 -07:00
Robert Nishihara
e9b351e749
Reduce memory usage of test_simple in test_stress.py. ( #4709 )
2019-04-28 07:50:23 -07:00
Eric Liang
b1c9ea7ffc
Update test_trial_scheduler.py ( #4710 )
2019-04-27 23:11:05 -07:00
Daniel Ho
d7d2694b57
[tune] Add config logging functionality to PBT scheduler ( #4680 )
2019-04-27 19:32:19 -07:00
Romil Bhardwaj
686d4caefe
Updates to scheduling objects to support dynamic custom resources ( #4465 )
2019-04-27 18:45:23 -07:00
Si-Yuan
9ce3039390
Fix webui api ( #4686 )
...
* fix webui
* Apply suggestions from code review
lint
Co-Authored-By: suquark <suquark@gmail.com>
* add dependencies for this unittest
* move dependencies to the script file
2019-04-27 15:23:56 +08:00
Sam Toyer
663e92ab3f
[rllib] TD3/DDPG improvements and MuJoCo benchmarks ( #4694 )
...
* [rllib] Separate optimisers for DDPG actor & crit.
* [rllib] Better names for DDPG variables & options
Config changes:
- noise_scale -> exploration_ou_noise_scale
- exploration_theta -> exploration_ou_theta
- exploration_sigma -> exploration_ou_sigma
- act_noise -> exploration_gaussian_sigma
- noise_clip -> target_noise_clip
* [rllib] Make DDPG less class-y
Used functions to replace three classes with only an __init__ method & a
handful of unrelated attributes.
* [rllib] Refactor DDPG noise
* [rllib] Unify DDPG exploration annealing
Added option "exploration_should_anneal" to enable linear annealing of
exploration noise. By default this is off, for consistency with DDPG &
TD3 papers. Also renamed "exploration_final_eps" to
"exploration_final_scale" (that name seems to have been carried over
from DQN, and doesn't really make sense here). Finally, tried to rename
"eps" to "noise_scale" wherever possible.
2019-04-26 17:49:53 -07:00
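Note on the DDPG option renames in #4694 above: configs written against the old names need their keys updated. A minimal sketch of such a migration, assuming the config is a plain Python dict. The helper `migrate_ddpg_config` and the table `DDPG_RENAMED_KEYS` are hypothetical names used for illustration and are not part of RLlib; only the old and new option names come from the commit message.

```python
# Old DDPG option names mapped to the new names listed in commit 663e92ab3f.
# This table and the helper below are illustrative, not RLlib APIs.
DDPG_RENAMED_KEYS = {
    "noise_scale": "exploration_ou_noise_scale",
    "exploration_theta": "exploration_ou_theta",
    "exploration_sigma": "exploration_ou_sigma",
    "act_noise": "exploration_gaussian_sigma",
    "noise_clip": "target_noise_clip",
    "exploration_final_eps": "exploration_final_scale",
}


def migrate_ddpg_config(old_config):
    """Return a copy of old_config with renamed DDPG keys updated.

    Keys that were not renamed are copied through unchanged. If both the
    old and the new spelling of an option are present, raise ValueError
    rather than silently picking one.
    """
    new_config = {}
    for key, value in old_config.items():
        new_key = DDPG_RENAMED_KEYS.get(key, key)
        if new_key in new_config:
            raise ValueError(
                "Option %r is set twice (old and new name)" % new_key
            )
        new_config[new_key] = value
    return new_config


if __name__ == "__main__":
    old = {"noise_scale": 0.1, "act_noise": 0.2, "gamma": 0.99}
    print(migrate_ddpg_config(old))
```

The input dict is left untouched, so the old config can still be logged or compared after migration.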
Eric Liang
47cca971b5
Don't delete files in rsync up, and also shorten timeout ( #4688 )
2019-04-25 12:18:42 -07:00
Qing Wang
f39b6747e5
Refactor command line argument parsing with gflags ( #4676 )
2019-04-24 14:53:07 +08:00
William Ma
c99e3caaca
Change resource bookkeeping to account for machine precision. ( #4533 )
2019-04-23 11:59:53 -07:00
justinwyang
8dfc833a8b
Change all instances of JobID to DriverID. ( #4431 )
2019-04-22 16:28:09 -07:00
Andrew
06c768823c
[rllib] train-eval loop implementation for rllib.Trainer class ( #4647 )
2019-04-21 12:08:04 -07:00
Devin Petersohn
d5df91b031
Bump version to 0.7.0dev3 ( #4671 )
2019-04-19 17:06:14 -07:00
Vlad Firoiu
39a09fa457
Turn replay into a circular queue. ( #4667 )
2019-04-19 11:42:00 -07:00
Wang Qing
9d481cc2e6
[hotfix] Missing import breaks Travis builds
2019-04-18 23:12:44 -07:00