# Ray
[![Build Status](https://travis-ci.org/amplab/ray.svg?branch=master)](https://travis-ci.org/amplab/ray)
Ray is an experimental distributed extension of Python. It is under development
and not ready to be used.

The goal of Ray is to make it easy to write machine learning applications that
run on a cluster while providing the development and debugging experience of
working on a single machine.

Before jumping into the details, here's a simple Python example for doing a
Monte Carlo estimation of pi (using multiple cores or potentially multiple
machines).
```python
import ray
import numpy as np

# Start a scheduler, an object store, and some workers.
ray.init(start_ray_local=True, num_workers=10)

# Define a remote function for estimating pi.
@ray.remote
def estimate_pi(n):
    x = np.random.uniform(size=n)
    y = np.random.uniform(size=n)
    # The fraction of sampled points inside the quarter circle approximates pi / 4.
    return 4 * np.mean(x ** 2 + y ** 2 < 1)

# Launch 10 tasks, each of which estimates pi.
result_ids = []
for _ in range(10):
    result_ids.append(estimate_pi.remote(100))

# Fetch the results of the tasks and print their average.
estimate = np.mean(ray.get(result_ids))
print("Pi is approximately {}.".format(estimate))
```
Within the for loop, each call to `estimate_pi.remote(100)` sends a message to
the scheduler asking it to schedule the task of running `estimate_pi` with the
argument `100`. This call returns right away without waiting for the actual
estimation of pi to take place. Instead of returning a float, it returns an
**object ID**, which represents the eventual output of the computation (this is
similar to a future).
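For example, here is a minimal sketch of this behavior, reusing `estimate_pi`
from the example above (the variable name `result_id` is just for
illustration):

```python
# The remote call returns immediately with an object ID, not a float.
result_id = estimate_pi.remote(100)
print(result_id)  # prints an object ID standing in for the eventual result
```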
The call to `ray.get(result_ids)` takes a list of object IDs and returns the
actual estimates of pi (waiting until the computations have finished if
necessary).
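As a quick sketch (assuming `result_id` and `result_ids` from the snippets
above), `ray.get` can fetch a single result or, as in the example, a whole list
of them:

```python
# Each call blocks until the corresponding task or tasks have finished.
single_estimate = ray.get(result_id)  # a single float
all_estimates = ray.get(result_ids)   # a list of ten floats, one per task
```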
## Next Steps
- Installation on [Ubuntu](doc/install-on-ubuntu.md), [Mac OS X](doc/install-on-macosx.md), [Windows](doc/install-on-windows.md), [Docker](doc/install-on-docker.md)
- [Tutorial](doc/tutorial.md)
- Documentation
  - [Using Ray with TensorFlow](doc/using-ray-with-tensorflow.md)
  - [Using Ray on a Cluster](doc/using-ray-on-a-cluster.md)
## Example Applications
- [Hyperparameter Optimization](examples/hyperopt/README.md)
- [Batch L-BFGS](examples/lbfgs/README.md)
- [Learning to Play Pong](examples/rl_pong/README.md)
- [Training AlexNet](examples/alexnet/README.md)