Ray

Ray is an experimental distributed execution framework with a Python-like programming model. It is under development and not ready for general use.

Example Code

Loading ImageNet

TODO: fill this out.

Design Decisions

For a description of our design decisions, see

Setup

Linux, Mac, and other Unix-based systems

After running these instructions, manually add the line source "$RAY_ROOT/setup-env.sh" to your ~/.bashrc file, where "$RAY_ROOT" is the path of the directory containing setup-env.sh.

  1. sudo apt-get update
  2. sudo apt-get install git
  3. git clone https://github.com/amplab/ray.git
  4. cd ray
  5. ./setup.sh
  6. ./build.sh
  7. source setup-env.sh

Windows

Note: A batch file is provided that clones any missing third-party libraries and applies patches to them. Do not open the Visual Studio solution until the batch file has applied the patches; otherwise, if the projects have already been modified, the patches may be rejected, and you may have to revert your changes before re-running the batch file.

  1. Install Microsoft Visual Studio 2015
  2. Install Git
  3. git clone https://github.com/amplab/ray.git
  4. ray\thirdparty\download_thirdparty.bat

Installing Ray on a cluster

These instructions work on EC2, but they may require some modifications to run on your own cluster. In particular, on EC2, running sudo does not require a password, and we currently don't handle the case where a password is needed.

  1. Create a file nodes.txt listing the IP addresses of the nodes in the cluster, one per line. For example

     52.50.28.103
     52.51.210.207
    
  2. Make sure that the nodes can all communicate with one another. On EC2, this can be done by creating a new security group with inbound and outbound rules allowing "all traffic", and then adding all of the nodes in your cluster to that security group.

  3. Run something like

    python scripts/cluster.py --nodes nodes.txt \
                              --key-file key.pem \
                              --username ubuntu \
                              --installation-directory /home/ubuntu/
    

     where you replace nodes.txt, key.pem, ubuntu, and /home/ubuntu/ with the appropriate values. This assumes that you can connect to each IP address in nodes.txt with the command ssh -i key.pem ubuntu@<ip-address>.

  4. The previous command should open a Python interpreter. To install Ray on the cluster, run install_ray() in the interpreter. The interpreter should block until the installation has completed.

  5. To check that the installation succeeded, you can ssh to each node, cd into the directory ray/test/, and run the tests (e.g., python runtest.py).

  6. Now that Ray has been installed, you can start the cluster (the scheduler, object stores, and workers) with the command start_ray("/home/ubuntu/ray/scripts/default_worker.py"), where the argument is the path on each node in the cluster to the worker code that you would like to use. The workers can be restarted with restart_workers("/home/ubuntu/ray/scripts/default_worker.py"), for example if you wish to update the application code running on the workers. The cluster processes (the scheduler, the object stores, and the workers) can be stopped with stop_ray().
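
For reference, the interpreter workflow in steps 4-6 might look like the following session. This is an illustrative sketch only: it assumes the worker script lives at /home/ubuntu/ray/scripts/default_worker.py on every node, and it uses the install_ray, start_ray, restart_workers, and stop_ray commands described above.

    # In the Python interpreter opened by scripts/cluster.py:
    worker_path = "/home/ubuntu/ray/scripts/default_worker.py"  # path on each node; adjust as needed

    install_ray()                 # install Ray on every node; blocks until the installation completes
    start_ray(worker_path)        # start the scheduler, object stores, and workers
    restart_workers(worker_path)  # optional: restart workers, e.g., after updating application code
    stop_ray()                    # stop the scheduler, object stores, and workers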