Commit graph

1321 commits

Author SHA1 Message Date
Eric Liang
63d8d11714 [webui] Checkboxes should go to the left of their labels (#932) 2017-09-04 17:05:13 -07:00
Robert Nishihara
d8010723d7 Attempt to wget boost up to 20 times during installation. (#927) 2017-09-04 14:42:29 -07:00
Robert Nishihara
d5eec0c2cd Pin opencv-python version to 3.2.0.8 in dockerfile. (#926) 2017-09-03 23:51:59 -07:00
Robert Nishihara
8ed03b1cf0 Make task timeline work with ipywidgets==7.0.0, change slider default values. (#925)
* Make task timeline work with ipywidgets==7.0.0.

* Change initial UI slider values from 70-100 to 0-100.
2017-09-03 23:15:46 -07:00
Stephanie Wang
ae0212b399 Fix failing task table test (#924) 2017-09-03 22:41:38 -07:00
Peter Schafhalter
2c19ae97a3 Implemented db_client_cache as unordered_map (#921)
* Implemented db_client_cache as unordered_map

* Fix for memory leak

* Fixed linting
2017-09-03 17:26:05 -07:00
Eric Liang
246be812f0 upd (#917) 2017-09-02 23:55:10 -07:00
Eric Liang
1ebfe9608f [rllib] Add downscale and frameskip options for Montezumas (#908)
* up

* update

* fix

* update

* update

* update

* api break

* Update run_multi_node_tests.sh

* fix
2017-09-02 17:20:56 -07:00
Zongheng Yang
7a36430399 doc: mention cython in installation note. (#896)
* doc: mention cython in installation note.

* Add to ubuntu note as well
2017-08-31 00:57:41 -07:00
Philipp Moritz
4e4a4e4e06 Add plasma in-memory object store blog post (#895)
* add plasma in-memory object store blog post

* modifications for Ray blog

* add arrow blog reference

* update

* rename

* Improve formatting.
2017-08-30 23:40:46 -07:00
Stephanie Wang
7496c98010 Fault tolerance race (#894)
* Remove race between local scheduler disconnecting and global scheduler
assigning a task

* Fix number of workers started in component failures test

* Fix race between global scheduler retrying a task assignment and monitor
cleaning up task table. The global scheduler should only retry the task
assignment if the local scheduler is still alive.

* Clean up task_table_update callback if failure

* Look up current local scheduler mapping when retrying actor task submission

* Log warning if no subscribers received a task table update

* Clean up database handle memory in local scheduler
2017-08-30 22:20:50 -07:00
Robert Nishihara
deca29a7eb Bump version to 0.2.0. (#877) 2017-08-29 21:38:35 -07:00
Robert Nishihara
4b76335157 Allow ResNet example to run on multiple machines. (#891)
* Allow a redis address to be passed into the ResNet example.

* Update documentation.
2017-08-29 21:37:53 -07:00
Philipp Moritz
164a8f368e [rllib] Rename algorithms (#890)
* rename algorithms

* fix

* fix jenkins test

* fix documentation

* fix
2017-08-29 16:56:42 -07:00
Robert Nishihara
e1831792f8 For PPO, rename num_agents -> num_workers. (#882) 2017-08-28 23:11:06 -07:00
Robert Nishihara
1afc487baf In setup.py, move cython to setup_requires. (#878)
* In setup.py, move cython to setup_requires and move setuptools_scm to setup_requires.

* Add back pip install of cython when building mac wheels.

* Revert changes to setuptools_scm.

* Check that the correct number of Linux wheels are produced.

* Add back pip install cython when building linux wheels.
2017-08-28 23:07:33 -07:00
Robert Nishihara
60d4d01d06 Use observation filter in compute_action for PPO. (#884) 2017-08-28 23:01:29 -07:00
Richard Liaw
5d72818ddc Generic shared_model class (#880)
Changing `shared_model` class back to `get_model` rather than `ConvolutionalNetwork`
2017-08-28 22:48:07 -07:00
Si-Yuan
8099cdeb9d Fix: 'hyperopt_adaptive' example keeps fake 'best_hyperparameters' (#883) 2017-08-28 22:47:16 -07:00
Wapaul1
4db45c9c54 Improved layout of controls for Web UI (#876)
* Improved layout of controls

* Added explicit labels and some comments

* Fix linting errors
2017-08-28 14:43:34 -07:00
Richard Liaw
bc082e9a9e [rllib] Additional support for Shared Models in A3C (#866)
* Code for Supporting Shared Models

Running (with vnet modification) - needs to be tested for performance

Summaries

Small refactoring + generalized to more domains

Small fix for jenkins

Linting

linting

Addressing changes

Addressing changes

Update envs.py

Addressing changes

convnet

Merge - new model

final touches

final linting

Changing iterations back

removed extra change

changes for fast experimentation

changes to enable a2c

TEMP FOR DEBUGGING

ContinuousActions - Still doesn't work

InvertedPendulum trains with 8 workers - k=200

huber loss

Maxes for InvertedPendulum-v1 - 16w,200steps

temp: working with a2c

Back to shared model

more fixes

small

nit

LSTM to shared models

need to fix last_features

tuning pong

Best record for hitting 0 - with k=16,n=20

nit

a2cremoval

remove A2c reference and nits

nit

removed a2c vestiges

removing a2c

removing example.py

Linting

nit

* Linting + Removing vestigal code

* Final Touches

* nits

* rerun travis
2017-08-28 12:23:14 -07:00
Robert Nishihara
b251f0b6b9 Add pip install instructions to README. (#869) 2017-08-27 19:55:39 -07:00
Eric Liang
c977fe8895 [rllib] Full checkpoint/restore for all algorithms (#875)
* wip

* working for all but dqn

* update

* add train

* rename

* update

* Update test
2017-08-27 18:56:52 -07:00
Robert Nishihara
d43a435c68 Don't redirect worker output to log files if redirect_output=False. (#873)
* Don't redirect worker output to log files if redirect_output=False.

* Fix, handle case where RedirectOutput key is not in Redis.
2017-08-27 14:27:44 -07:00
Eric Liang
617bc4d239 [rllib] Make the free_logstd param generic (#863)
* make free log std param generic

* fixes

* fixes
2017-08-24 12:43:51 -07:00
Eric Liang
46641a642f [rllib] (take 2) Add top-level checkpoint/restore/compute_action APIs to rllib (#868)
* add top-level checkpoint/restore api to rllib

* todos
2017-08-24 00:09:33 -07:00
Robert Nishihara
e6de744ef4 Fix potential bug in redis.cc. (#851) 2017-08-23 20:38:25 -07:00
Philipp Moritz
791bee343f [rllib] Implement GAE for PPO (#849)
* make information available for GAE

* buggy version of GAE estimator

* fix

* add more logging and reweight losses

* fix logging

* fix loss

* adapt advantage calculation

* update gae

* standardize returns

* don't normalize td lambda ret

* fix

* don't standardize advantages

* do standardization earlier

* different standardization

* initializer

* drop into the debugger

* fix tensorflow broadcasting bug

* vf clipping

* don't standardize tdlambdaret

* different standardization

* use huber loss for value function

* refactor -- first half

* it runs

* fix

* update

* documentation

* linting and tests

* fix linting

* naming

* fix

* linting

* fix

* remove prefix madness

* fixes

* fix

* add value function example

* fix linting

* remove newline
2017-08-23 20:35:47 -07:00
Eric Liang
c943ecaa42 Better error message when actor creation is attempted before ray.init() (#858) 2017-08-22 21:20:15 -07:00
Eric Liang
e2f2a7e57a [rllib] Pick preprocessor based on obs shape (#855)
* update

* auto choose
2017-08-22 16:46:55 -07:00
Robert Nishihara
58d06e3f4d Test building wheels in Travis. (#852)
* Test building wheels in Travis.

* Fix bug, bash variable wasn't evaluated because it was in single quotes not double quotes.
2017-08-21 23:48:20 -07:00
Eric Liang
c81821b856 [rllib] Make Pong-v0 + EvolutionStrategies work by sharing preprocessors with PPO (#848)
* fix by sharing preprocessors

* revert param changeg

* Update evolution_strategies.py

* Update catalog.py
2017-08-21 18:51:49 -07:00
Robert Nishihara
be4beb19c1 Changes to build to fix creation of wheels. (#840)
* Pass DPYTHON_EXECUTABLE into cmake for arrow and for ray.

* Add cython to setup.py install_requires.

* Revert custom code for finding python in cmake.

* Correctly find arrow on CentOS.

* In cmake, don't find PythonLibs, just find PYTHON_INCLUDE_DIRS.

* Fix typo.

* Do not use boost shared libraries when building arrow.

* Add six to the setup.py install_requires because it is needed by pyarrow.

* Don't link numbuf against boost_system and boost_filesystem.

* Compile boost when we are on Linux.

* Make numbuf find the correct boost libraries.

* Only use find_package Boost on Linux, suppress output when building boost.

* Changes to wheel building scripts, install cython in mac script.

* Compile flatbuffers ourselves on Linux and pass it in when compiling Arrow.

* Clean up build_flatbuffers.sh and build_boost.sh scripts a little.

* Install cython when building linux wheel.
2017-08-21 17:49:35 -07:00
Robert Nishihara
af71f9616e Initial fault tolerance documentation. (#845)
* Initial fault tolerance documentation.

* Update documentation.
2017-08-19 17:38:14 -07:00
Robert Nishihara
ea8da13938 Remove UT data structures from global scheduler. (#838)
* Replace pending_tasks utarray with vector.

* Replace local_schedulers vector with unordered_map.

* Replace object info table with unordered_map.

* Replace local_scheduler_plasma_map and plasma_local_scheduler_map with unordered maps.

* Remove unnecessary includes.

* Fix linting.

* Bug fixes.

* Add function for computing the amount of data for a task that wouldn't have to be shipped because it is already accessible to a local scheduler.

* Small cleanups.
2017-08-16 22:28:21 -07:00
Robert Nishihara
cf41964816 Fix some bugs causing Travis test failures. (#839)
* Fix bug in which worker has no actor_class attribute.

* Remove case where we check if processes are defunct.
2017-08-16 01:18:32 -07:00
Robert Nishihara
ca53e9ae7b Fix bugs in task timeline visualization. (#836)
* Fix bugs in task timeline visualization.

* Some cleanups.

* Remove print statements.
2017-08-13 23:39:37 -07:00
Robert Nishihara
a75ccc8032 Fix socket error exception in Python 2. (#833) 2017-08-13 23:14:26 -07:00
Robert Nishihara
508eec58f5 Pin cloudpickle version to 0.3 in travis tests. (#834) 2017-08-13 01:16:42 -07:00
alanamarzoev
bfe473fa8c Embedded task trace with object dependencies. (#818)
* Embedded timeline

* Yeah

* Fixed arrows not showing up.

* Fixed arrows not showing up, and added check boxes for the kinds of dependencies that should be included in the trace.

* first

* Fixes

* Fixed typo in comments, added more comments. fixed linting.

* Added more comments.

* Formatting.

* fixes

* Fixed state.py linting.

* Fixed ui.py linting errors.

* Fixed linting errors.

* Renamed task dependencies and included instructions for viewing arrows.

* Fixed according to PR comments.

* Fixed bug.

* Undid changes to metadata blocks.

* Fixes according to comments.

* Fixed linting.

* Fixed linting.

* NOQA keyword added to link line.
2017-08-09 23:00:14 -07:00
Alexey Tumanov
fc885bd918 Adding basic support for a user-interpretable resource label (#761)
* adding support for the user-interpretable label(UIR)

* more plumbing for num_uirs further upstream; set to infty when specified on cmd line

* pass default num_uirs for actors; update GlobalStateAPI

* support num_uirs in ray.init()

* local scheduler resource accounting: support num_uirs; prep for vectorized resource accounting

* global scheduler test updated

* Fix bug introduced by rebase.

* Rename UIR -> CustomResource and add test.

* Small changes and use constexpr instead of macros.

* Linting and some renaming.

* Reorder some code.

* Remove cpus_in_use and fix bug.

* Add another test and make a small change.

* Rephrase documentation about feature stability.
2017-08-08 02:53:59 -07:00
Robert Nishihara
03f2325780 Package pyarrow along with ray. (#822)
* Rough pass at installing pyarrow along with Ray.

* Remove hardcoded path and try to find correct path automatically.

* Add print.

* Fix linting.

* Copy pyarrow files to a location that we manually add to python path in order to avoid interfering with pre-existing pyarrow installations.

* Move call to build.sh back into build_ext in setup.py.

* Ignore some linting errors.

* Fix problem in which pyarrow files to copy were listed before they were built.

* Fix tests by importing ray before pyarrow.
2017-08-07 21:17:28 -07:00
Mitar
0e6e38115f Be more precise which boost dependencies are needed. (#823) 2017-08-07 20:02:48 -07:00
Philipp Moritz
862e56000b [rllib] Unify RLLib examples and add jenkins test for policy gradients (#815)
* add jenkins test

* correct handling of the number of iterations

* convert policy gradient and evolution strategies script

* convert DQN

* fix A3C

* fix

* fix

* fixes

* remove redundant A3C example
2017-08-07 19:05:48 -07:00
Robert Nishihara
dbe3d9351c Prototype actor checkpointing. (#814)
* Initial testing of checkpointing functions.

* Save checkpoints in Redis.

* Pipe checkpoint_interval through remote decorator.

* Add a test.

* Small cleanups.

* Submit dummy tasks when reconstructing tasks before the most recent tasks so that we don't end up reconstructing the arguments for those tasks.

* Remove old checkpoints to save space.

* Fix linting.
2017-08-07 17:52:39 -07:00
Robert Nishihara
d7b10a84b6 Fallback to custom serializer for very long python ints. (#821)
* Fallback to custom serializer for very long python ints.

* Fix linting.

* Fix naming convention and add RETURN_NOT_OK.
2017-08-07 17:21:06 -07:00
Robert Nishihara
3071ba0070 Add correct Python executable to Path when building arrow. (#820)
* Tell cmake which python to use when building arrow.

* Pass different path into cmake when building arrow so that cmake finds the right python.

* Add correct python executable to PATH when running cmake for ray.
2017-08-07 14:47:34 -07:00
Philipp Moritz
0225581078 [rllib] Improve performance for small rollouts (#812)
* batch small rollouts together

* implement minimum number of samples for each task

* add total time

* fix linting

* style

* fix

* factor out parameters and document stuff

* add rollout batchsize

* address comments

* linting

* small fix
2017-08-05 22:13:30 -07:00
alanamarzoev
64eaaaebf0 Show timeline button. (#809) 2017-08-03 20:11:50 -07:00
Richard Liaw
c30fdb4ab0 [rllib] Code for Supporting Shared Models (#775)
* Code for Supporting Shared Models

* Running (with vnet modification) - needs to be tested for performance

* Small fix for jenkins

* Linting

* linting

* Summaries

* Small refactoring + generalized to more domains

* Addressing changes

* Addressing changes

* Update envs.py

* Addressing changes

* convnet

* final touches

* Merge - new model

* final linting

* Changing iterations back

* Policy option removed, fixed small things

* Nits

* nit

* Linting

* Linting
2017-08-03 19:29:01 -07:00