Philipp Moritz
0225581078
[rllib] Improve performance for small rollouts ( #812 )
...
* batch small rollouts together
* implement minimum number of samples for each task
* add total time
* fix linting
* style
* fix
* factor out parameters and document stuff
* add rollout batchsize
* address comments
* linting
* small fix
2017-08-05 22:13:30 -07:00
alanamarzoev
64eaaaebf0
Show timeline button. ( #809 )
2017-08-03 20:11:50 -07:00
Richard Liaw
c30fdb4ab0
[rllib] Code for Supporting Shared Models ( #775 )
...
* Code for Supporting Shared Models
* Running (with vnet modification) - needs to be tested for performance
* Small fix for jenkins
* Linting
* linting
* Summaries
* Small refactoring + generalized to more domains
* Addressing changes
* Addressing changes
* Update envs.py
* Addressing changes
* convnet
* final touches
* Merge - new model
* final linting
* Changing iterations back
* Policy option removed, fixed small things
* Nits
* nit
* Linting
* Linting
2017-08-03 19:29:01 -07:00
Philipp Moritz
df65e87fc7
[rllib] Tune ppo more on control tasks ( #777 )
...
* tune ppo on control tasks
* introduce free log_std
* fix
* flag for writing logs
* fixes
* fixes
2017-08-03 16:34:06 -07:00
alanamarzoev
99badc7ae4
UI functions in separate file. ( #801 )
...
* UI file.
* Fixed linting.
* Change UI instructions slightly.
2017-08-02 19:32:18 -07:00
Philipp Moritz
054ae4180e
Fix installation instruction for ubuntu 14.04 ( #805 )
...
* fix installation instruction for ubuntu 14.04
* upgrade cmake requirements
* fix
2017-08-02 18:14:14 -07:00
Robert Nishihara
cb84972f6b
Recreate actors when local schedulers die. ( #804 )
...
* Reconstruct actor state when local schedulers fail.
* Simplify construction of arguments to pass into default_worker.py from local scheduler.
* Remove deprecated ray.actor.
* Simplify actor reconstruction method.
* Fix linting.
* Small fixes.
2017-08-02 18:02:52 -07:00
Robert Nishihara
37282330c0
Allow plasma manager to gracefully handle EPROTOTYPE. ( #802 )
...
* Allow plasma manager to gracefully handle EPROTOTYPE.
* Fix linting.
2017-08-01 23:33:25 -07:00
Robert Nishihara
fcd07b10b5
Don't call colorama.init(). ( #799 )
2017-08-01 18:58:48 -07:00
Robert Nishihara
8c8258de20
Move worker methods into Worker class and expose more TaskSpec fields to Python. ( #796 )
...
* Move worker methods inside worker class. Move some helper methods from actor.py into utils.py and state.py.
* Add more methods exposing task spec fields to Python.
* Fix linting.
* Fix error.
* Remove unused code in default worker.
2017-08-01 17:16:57 -07:00
Robert Nishihara
52a27be364
Better logging in tests. ( #790 )
2017-07-31 22:30:46 -07:00
Philipp Moritz
c3b39b4d86
Pull Plasma from Apache Arrow and remove Plasma store from Ray. ( #692 )
...
* Rebase Ray on top of Plasma in Apache Arrow
* add thirdparty building scripts
* use rebased arrow
* fix
* fix build
* fix python visibility
* comment out C tests for now
* fix multithreading
* fix
* reduce logging
* fix plasma manager multithreading
* make sure old and new object IDs can coexist peacefully
* more rebasing
* update
* fixes
* fix
* install pyarrow
* install cython
* fix
* install newer cmake
* fix
* rebase on top of latest arrow
* getting runtest.py run locally (needed to comment out a test for that to work)
* work on plasma tests
* more fixes
* fix local scheduler tests
* fix global scheduler test
* more fixes
* fix python 3 bytes vs string
* fix manager tests valgrind
* fix documentation building
* fix linting
* fix c++ linting
* fix linting
* add tests back in
* Install without sudo.
* Set PKG_CONFIG_PATH in build.sh so that Ray can find plasma.
* Install pkg-config
* Link -lpthread, note that find_package(Threads) doesn't seem to work reliably.
* Comment in testGPUIDs in runtest.py.
* Set PKG_CONFIG_PATH when building pyarrow.
* Pull apache/arrow and not pcmoritz/arrow.
* Fix installation in docker image.
* adapt to changes of the plasma api
* Fix installation of pyarrow module.
* Fix linting.
* Use correct python executable to build pyarrow.
2017-07-31 21:04:15 -07:00
alanamarzoev
dfcd399dbb
Cluster heat map. ( #792 )
2017-07-31 20:49:31 -07:00
alanamarzoev
a2852f2329
Allow multiple web UIs to be open at the same time. ( #784 )
...
* Allow multiple web UIs to be open at the same time.
* Changed formatting.
2017-07-31 18:13:24 -07:00
Robert Nishihara
37dafa4d14
Simplify put test and move it to failure tests. ( #788 )
2017-07-31 17:57:48 -07:00
Robert Nishihara
c394a65ffc
Wait longer when getting redis shards to initialize global state API. ( #786 )
2017-07-31 17:56:11 -07:00
Robert Nishihara
1fe49d7676
Simplify testMultipleLocalSchedulers by having it start only one worker. ( #789 )
2017-07-31 17:44:45 -07:00
Eric Liang
b6a18cb39b
[rllib] Also refactor DQN to use shared RLlib models ( #730 )
...
* wip
* works with cartpole
* lint
* fix pg
* comment
* action dist rename
* preprocessor
* fix test
* typo
* fix the action[0] nonsense
* revert
* satisfy the lint
* wip
* works with cartpole
* lint
* fix pg
* comment
* action dist rename
* preprocessor
* fix test
* typo
* fix the action[0] nonsense
* revert
* satisfy the lint
* Minor indentation changes.
* fix merge
* add humanoid
* initial dqn refactor
* remove tfutil
* fix calls
* fix tf errors 1
* closer
* runs now
* lint
* tensorboard graph
* fix linting
* more 4 space
* fix
* fix linT
* more lint
* oops
* es parity
* remove example.py
* fix training bug
* add cartpole demo
* try fixing cartpole
* allow model options, configure cartpole
* debug
* simplify
* no dueling
* avoid out of file handles
* Test dqn in jenkins.
* Minor formatting.
* fix issue
* fix another
* Fix problem in which we log to a directory that hasn't been created.
2017-07-26 12:29:00 -07:00
Robert Nishihara
8ad9ced99b
Fix task ID hash computation. ( #774 )
2017-07-26 10:08:38 -07:00
alanamarzoev
0f0acb8ac1
CPU Time Series. ( #765 )
...
Add time series of CPU utilization to web UI.
2017-07-26 00:15:50 -07:00
Yeolar
31329d43dd
fixtypo: plasma_protocol ( #764 )
...
Fix typo in plasma_protocol.
2017-07-22 17:52:27 -07:00
Robert Nishihara
ff996330e8
If a worker dies unexpectedly, then let it exit. ( #762 )
2017-07-21 06:36:25 +00:00
Robert Nishihara
13000b7503
Start processes using the same version of Python that was used to start Ray. ( #760 )
...
* Make local scheduler start workers using the same version of Python that was used to start the local scheduler.
* Use current version of python to start new processes instead of hardcoded python executable.
* Fix linting.
2017-07-21 00:05:10 +00:00
alanamarzoev
c31c20ca9c
Code toggling instructions. ( #757 )
2017-07-20 10:51:33 -07:00
alanamarzoev
853b2913b7
Task duration distribution plot. ( #743 )
...
* Task duration distribution plot.
* Fixed bug.
* Changed axis labels.
* Modify task start point.
* Modified task_profiles func to decode in ascii.
* Nvm
* Changed to double quotes and added comments.
* fixed linting
* Fixed linting.
* Fixed bug.
2017-07-19 23:15:17 -07:00
Philipp Moritz
d356dd3ec4
[rllib] Expose algorithm parameters and tune policy gradient parameters for humanoid ( #753 )
...
* parameters for humanoid
* fix
2017-07-19 16:45:05 -07:00
Philipp Moritz
ade6d80820
[rllib] use ray.wait to speed up parallel simulations for policy gradients ( #754 )
...
* use ray.wait to speed up parallel simulations for policy gradients
* linting
2017-07-19 16:09:15 -07:00
alanamarzoev
2b3190ad13
Chrome trace timeline with sliders. ( #731 )
...
* Trace timeline with sliders.
* Trace.
* Switched ujson to json.
* Fixed tests.
* linting fixes
* Fixed bug.
* Cleaned up code.
* Fixes according to comments.
* removed checkpoints.
* Undid accidental delete.
* Fixed linting error.
* Added documentation to notebook.
* Undid accidental deletes.
* Add comments and small formatting fixes.
* Small fix.
2017-07-17 19:59:49 -07:00
Eric Liang
420013774c
[rllib] Pull out shared models for evolution strategies and policy gradient. ( #719 )
...
* wip
* works with cartpole
* lint
* fix pg
* comment
* action dist rename
* preprocessor
* fix test
* typo
* fix the action[0] nonsense
* revert
* satisfy the lint
* wip
* works with cartpole
* lint
* fix pg
* comment
* action dist rename
* preprocessor
* fix test
* typo
* fix the action[0] nonsense
* revert
* satisfy the lint
* Minor indentation changes.
* fix merge
* add humanoid
* fix linting
* more 4 space
* fix
* fix linT
* oops
* es parity
2017-07-17 08:58:54 +00:00
Crystal
8fc7dc3ed4
Change Python examples in documentation to use 4 space indentation. ( #736 )
...
* Ray doc - changed python indentation to 4 spaces in documentation files actors.rst, api.rst, and example-*.rst
* Ray documentation - changed Python to 4 space indentation for files install-*.rst, installation-troubleshooting.rst, internals-overview.rst, serialization.rst, troubleshootin.rst, tutorial.rst, using-ray-*.rst
2017-07-16 22:19:33 -07:00
Eric Liang
86a7909149
make es worker count independent ( #740 )
2017-07-16 16:23:56 -07:00
Robert Nishihara
80e8426b5e
Test example applications and rllib in jenkins tests. ( #707 )
...
* Test example applications in Jenkins.
* Fix default upload_dir argument for Algorithm class.
* Fix evolution strategies.
* Comment out policy gradient example which doesn't seem to work.
* Set --env-name for evolution strategies.
2017-07-16 18:51:33 +00:00
Robert Nishihara
4349f1f966
Fix broken links in example documentation. ( #732 )
2017-07-14 20:31:53 +00:00
Robert Nishihara
e0867c8845
Switch Python indentation from 2 spaces to 4 spaces. ( #726 )
...
* 4 space indentation for actor.py.
* 4 space indentation for worker.py.
* 4 space indentation for more files.
* 4 space indentation for some test files.
* Check indentation in Travis.
* 4 space indentation for some rl files.
* Fix failure test.
* Fix multi_node_test.
* 4 space indentation for more files.
* 4 space indentation for remaining files.
* Fixes.
2017-07-13 21:53:57 +00:00
Robert Nishihara
310ba82131
Use miniconda for all travis tests. ( #728 )
...
* Use miniconda for all travis tests.
* Fix.
* Fix.
2017-07-13 16:23:04 +00:00
Philipp Moritz
c24c07613c
[rllib] unify writing performance metrics and make it queryable ( #708 )
...
* write config to s3
* add train file
* write performance to S3
* writing needs to be fixed, replacing result.json at the moment
* update
* add experiment_id
* more logging and example queries
* update
* add info
* fill in other algorithms
* fix linting
* convert readme to rst
* fixes
* simplejson -> json
* make files executable
* edit README.rst
* unify storing logs in S3 and on local filesystem
* use 'info' entry in TrainingResult for algorithm specific info
* don't install smart_open with ray
* fixes
* linting fixes
2017-07-11 01:36:14 +02:00
alanamarzoev
8464d77c76
Change event logs to store one Redis ZSET per worker. ( #705 )
...
* Changing to zset
* Fixed bug.
* Fixed another bug.
* Modified task_profiles.
* Removed extra file.
* Modified task_profiles test.
* WIP
* WIP
* Undid changes
* Updated
* WIP
* Made changes according to comments.
* Removed unneeded print.
* Removed ujson usage.
* failing test
* tests passing
* Fixed linting errors and modified style.
* Fixed bug.
* Fixed linting
* Fixed according to comments.
* Redis crashing?
* Fixed linting
* Fixed linting
2017-07-09 01:42:29 +02:00
Eric Liang
cd12ea7e09
[rllib] Pull out the GPU-parallel optimizer from policy gradients to common ( #711 )
...
* refactor
* docs
* cleanup
* clean up more
* Update parallel.py
* add imports from future
2017-07-07 22:20:02 +00:00
Robert Nishihara
5b3d0c00f2
Create /tmp/ray directory in services.py. ( #715 )
2017-07-07 18:41:56 +00:00
Eric Liang
f012e597c2
[rllib] Basic port of baselines/deepq to rllib ( #709 )
...
* rllib v0
* fix imports
* lint
* comments
* update docs
* a3c wip
* a3c wip
* report stats
* update doc
* add common logdir attr
* name is too long
* fix small bug
* propagate exception on error
* fetch metrics
* initial port
* fix lint
* add right license
* port to common alg format
* fix lint
* rename dqn
* add imports from future
* fix lint
2017-07-07 18:37:00 +00:00
Robert Nishihara
6c45657280
Reset the SIGCHLD handler after forking a worker to avoid influencing the worker. ( #713 )
2017-07-07 14:50:37 +00:00
Eric Liang
66734847bb
[rllib] Standardize writing output logs and other files to /tmp/ray ( #706 )
...
* rllib v0
* fix imports
* lint
* comments
* update docs
* a3c wip
* a3c wip
* report stats
* update doc
* add common logdir attr
* name is too long
* fix small bug
* propagate exception on error
* fetch metrics
* fix small nits
2017-07-03 16:01:47 +00:00
alanamarzoev
2b11a7bca2
Add task ID and object ID search boxes to web UI. ( #704 )
...
* Task search box.
* Cleaned up.
* Small reformatting.
* Add object table search box.
2017-07-01 17:48:23 -04:00
alanamarzoev
716469160e
Enable dumping profiling information to timeline format viewable by chrome tracing. ( #703 )
...
* Chrome tracing timeline.
* Modified decode statement.
* Some cleanups and add test.
* Remove example.
* Fix test.
2017-06-30 12:14:11 -04:00
Eric Liang
2d81edfcdc
[rllib] Move a3c implementation from examples/ to python/ray/rllib/ ( #698 )
...
* rllib v0
* fix imports
* lint
* comments
* update docs
* a3c wip
* a3c wip
* report stats
* update doc
* name is too long
* fix small bug
* propagate exception on error
* fetch metrics
* fix lint
2017-06-29 15:49:56 +00:00
Robert Nishihara
efce49cfbc
Bump version to 0.1.2 in preparation for uploading wheels to PyPI. ( #700 )
2017-06-27 04:35:42 +00:00
Robert Nishihara
1941e0f7b1
Fix compilation on CentOS. ( #699 )
2017-06-26 05:54:21 +00:00
Robert Nishihara
0926550661
Remove -mtune and -march compiler flags. ( #697 )
2017-06-26 05:52:45 +00:00
Eric Liang
a674ec958c
[rllib] Move policy gradient and evolution strategies algorithms from examples/ to ray/rllib/ ( #694 )
...
* rllib v0
* fix imports
* lint
* comments
* update docs
2017-06-25 22:13:03 +00:00
Robert Nishihara
8bc9c275fa
Increase the number of log file names and handle errors better in log monitor. ( #693 )
2017-06-25 05:20:50 +00:00