2018-01-19 16:16:45 -08:00
Development Tips
================
2019-01-26 00:15:48 -08:00
Compilation
-----------
2018-01-19 16:16:45 -08:00
2019-01-26 00:15:48 -08:00
To speed up compilation, be sure to install Ray with
2018-01-19 16:16:45 -08:00
2019-01-26 00:15:48 -08:00
.. code-block :: shell
2018-01-19 16:16:45 -08:00
2019-01-26 00:15:48 -08:00
cd ray/python
pip install -e . --verbose
2018-01-19 16:16:45 -08:00
2019-01-26 00:15:48 -08:00
The `` -e `` means "editable", so changes you make to files in the Ray
directory will take effect without reinstalling the package. In contrast, if
you do `` python setup.py install `` , files will be copied from the Ray
directory to a directory of Python packages (often something like
`` /home/ubuntu/anaconda3/lib/python3.6/site-packages/ray `` ). This means that
changes you make to files in the Ray directory will not have any effect.
2018-01-19 16:16:45 -08:00
2019-01-26 00:15:48 -08:00
If you run into **Permission Denied** errors when running `` pip install `` ,
you can try adding `` --user `` . You may also need to run something like `` sudo
chown -R $USER /home/ubuntu/anaconda3`` (substituting in the appropriate
path).
2018-01-19 16:16:45 -08:00
2019-12-02 22:19:01 -08:00
If you make changes to the C++ or Python files, you will need to run the build so C++ code is recompiled and/or Python files are redeployed in `ray/python` .
2019-01-26 00:15:48 -08:00
However, you do not need to rerun `` pip install -e . `` . Instead, you can
recompile much more quickly by doing
2018-01-19 16:16:45 -08:00
2019-01-26 00:15:48 -08:00
.. code-block :: shell
2018-01-19 16:16:45 -08:00
2019-06-06 18:18:24 -07:00
cd ray
2019-12-02 22:19:01 -08:00
bash build.sh
2019-06-06 18:18:24 -07:00
This command is not enough to recompile all C++ unit tests. To do so, see
`Testing locally`_ .
2018-01-19 16:16:45 -08:00
2019-01-26 00:15:48 -08:00
Debugging
---------
2018-01-19 16:16:45 -08:00
2019-01-26 00:15:48 -08:00
Starting processes in a debugger
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
2019-02-14 13:50:21 -08:00
When processes are crashing, it is often useful to start them in a debugger.
Ray currently allows processes to be started in the following:
- valgrind
- the valgrind profiler
- the perftools profiler
- gdb
- tmux
To use any of these tools, please make sure that you have them installed on
your machine first (`` gdb `` and `` valgrind `` on MacOS are known to have issues).
Then, you can launch a subset of ray processes by adding the environment
variable `` RAY_{PROCESS_NAME}_{DEBUGGER}=1 `` . For instance, if you wanted to
start the raylet in `` valgrind `` , then you simply need to set the environment
variable `` RAY_RAYLET_VALGRIND=1 `` .
To start a process inside of `` gdb `` , the process must also be started inside of
`` tmux `` . So if you want to start the raylet in `` gdb `` , you would start your
Python script with the following:
.. code-block :: bash
RAY_RAYLET_GDB=1 RAY_RAYLET_TMUX=1 python
2019-05-26 11:27:53 -07:00
2019-02-14 13:50:21 -08:00
You can then list the `` tmux `` sessions with `` tmux ls `` and attach to the
appropriate one.
2018-01-19 16:16:45 -08:00
2019-01-26 00:15:48 -08:00
You can also get a core dump of the `` raylet `` process, which is especially
useful when filing `issues`_ . The process to obtain a core dump is OS-specific,
but usually involves running `` ulimit -c unlimited `` before starting Ray to
allow core dump files to be written.
2018-01-19 16:16:45 -08:00
2019-01-26 00:15:48 -08:00
Inspecting Redis shards
~~~~~~~~~~~~~~~~~~~~~~~
2019-05-26 11:27:53 -07:00
To inspect Redis, you can use the global state API. The easiest way to do this
is to start or connect to a Ray cluster with `` ray.init() `` , then query the API
like so:
2018-01-19 16:16:45 -08:00
2019-01-26 00:15:48 -08:00
.. code-block :: python
2018-01-19 16:16:45 -08:00
2019-01-26 00:15:48 -08:00
ray.init()
2019-05-26 11:27:53 -07:00
ray.nodes()
2019-01-26 00:15:48 -08:00
# Returns current information about the nodes in the cluster, such as:
# [{'ClientID': '2a9d2b34ad24a37ed54e4fcd32bf19f915742f5b',
2019-07-11 13:17:19 +08:00
# 'IsInsertion': True,
2019-01-26 00:15:48 -08:00
# 'NodeManagerAddress': '1.2.3.4',
# 'NodeManagerPort': 43280,
# 'ObjectManagerPort': 38062,
# 'ObjectStoreSocketName': '/tmp/ray/session_2019-01-21_16-28-05_4216/sockets/plasma_store',
# 'RayletSocketName': '/tmp/ray/session_2019-01-21_16-28-05_4216/sockets/raylet',
# 'Resources': {'CPU': 8.0, 'GPU': 1.0}}]
2018-01-19 16:16:45 -08:00
2019-01-26 00:15:48 -08:00
To inspect the primary Redis shard manually, you can also query with commands
like the following.
2018-06-05 20:22:11 -07:00
2019-01-26 00:15:48 -08:00
.. code-block :: python
2018-01-19 16:16:45 -08:00
2019-01-26 00:15:48 -08:00
r_primary = ray.worker.global_worker.redis_client
r_primary.keys("*")
2018-01-19 16:16:45 -08:00
2019-01-26 00:15:48 -08:00
To inspect other Redis shards, you will need to create a new Redis client.
For example (assuming the relevant IP address is `` 127.0.0.1 `` and the
relevant port is `` 1234 `` ), you can do this as follows.
2018-01-19 16:16:45 -08:00
2019-01-26 00:15:48 -08:00
.. code-block :: python
2018-01-19 16:16:45 -08:00
2019-01-26 00:15:48 -08:00
import redis
r = redis.StrictRedis(host='127.0.0.1', port=1234)
2018-01-19 16:16:45 -08:00
2019-01-26 00:15:48 -08:00
You can find a list of the relevant IP addresses and ports by running
2018-01-19 16:16:45 -08:00
2019-01-26 00:15:48 -08:00
.. code-block :: python
2018-01-19 16:16:45 -08:00
2019-01-26 00:15:48 -08:00
r_primary.lrange('RedisShards', 0, -1)
2018-01-19 16:16:45 -08:00
2019-01-26 00:15:48 -08:00
.. _backend-logging:
2018-01-19 16:16:45 -08:00
2019-01-26 00:15:48 -08:00
Backend logging
~~~~~~~~~~~~~~~
The `` raylet `` process logs detailed information about events like task
execution and object transfers between nodes. To set the logging level at
runtime, you can set the `` RAY_BACKEND_LOG_LEVEL `` environment variable before
starting Ray. For example, you can do:
.. code-block :: shell
export RAY_BACKEND_LOG_LEVEL=debug
ray start
This will print any `` RAY_LOG(DEBUG) `` lines in the source code to the
`` raylet.err `` file, which you can find in the `Temporary Files`_ .
Testing locally
---------------
2019-12-18 16:20:05 -08:00
Testing for Python development
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
2019-02-26 16:14:55 -08:00
Suppose that one of the tests (e.g., `` test_basic.py `` ) is failing. You can run
2019-12-18 16:20:05 -08:00
that test locally by running `` python -m pytest -v python/ray/tests/test_basic.py `` .
However, doing so will run all of the tests which can take a while. To run a
specific test that is failing, you can do
2019-01-26 00:15:48 -08:00
.. code-block :: shell
cd ray
2019-02-26 16:14:55 -08:00
python -m pytest -v python/ray/tests/test_basic.py::test_keyword_args
2019-01-26 00:15:48 -08:00
When running tests, usually only the first test failure matters. A single
test failure often triggers the failure of subsequent tests in the same
script.
2019-12-18 16:20:05 -08:00
Testing for C++ development
~~~~~~~~~~~~~~~~~~~~~~~~~~~
2019-06-06 18:18:24 -07:00
To compile and run all C++ tests, you can run:
.. code-block :: shell
cd ray
bazel test $(bazel query 'kind(cc_test, ...)')
2019-12-18 16:20:05 -08:00
Alternatively, you can also run one specific C++ test. You can use:
.. code-block :: shell
cd ray
bazel test $(bazel query 'kind(cc_test, ...)') --test_filter=ClientConnectionTest --test_output=streamed
Creating a pull request
-----------------------
To create a pull request (PR) for your change. First please go through the
`PR template`_ and run through the checklist.
Ray automatically runs continuous integration (CI) tests once PR is opened, it
runs on `Travis-CI <https://travis-ci.com/ray-project/ray/> `_ with multiple CI
test jobs.
Understand CI test jobs
-----------------------
The `Travis CI`_ test folder contains all integration test scripts and they
invoke other test scripts via `` pytest `` , `` bazel `` -based test or other bash
scripts. Some of the examples include:
* Raylet integration tests commands:
* `` src/ray/test/run_gcs_tests.sh ``
* `` src/ray/test/run_core_worker_tests.sh ``
* `` src/ray/test/run_object_manager_tests.sh ``
* Bazel test command:
* `` bazel test --build_tests_only //:all ``
* Ray serving test commands:
* `` python -m pytest python/ray/experimental/serve/tests ``
* `` python python/ray/experimental/serve/examples/echo_full.py ``
* Ray test commands:
* `` python/ray/experimental/test/async_test.py ``
* `` python/ray/tests/py3_test.py ``
If the Travis-CI exception doesn't seems to be related to your change, please
use `this link <https://ray-travis-tracker.herokuapp.com/> `_ to check recent
flake tests.
Format and Linting
------------------
2019-06-06 18:18:24 -07:00
2019-01-26 00:15:48 -08:00
**Running linter locally:** To run the Python linter on a specific file, run
2019-08-05 23:33:14 -07:00
something like `` flake8 ray/python/ray/worker.py `` . You may need to first run
`` pip install flake8 `` .
2019-01-26 00:15:48 -08:00
2019-08-05 23:33:14 -07:00
**Autoformatting code** . We use `yapf <https://github.com/google/yapf> `_ for
linting, and the config file is located at `` .style.yapf `` . We recommend
running `` scripts/yapf.sh `` prior to pushing to format changed files.
Note that some projects such as dataframes and rllib are currently excluded.
2019-01-26 00:15:48 -08:00
2019-12-18 16:20:05 -08:00
**Running CI linter:** The Travis CI linter script has multiple components to
run. We recommend running `` ci/travis/format.sh `` , which contains both linter
for python and C++ codes. In addition, there are other formatting checkers for
components like:
* Python REAME format:
.. code-block :: shell
cd ray/python
python setup.py check --restructuredtext --strict --metadata
* Bazel format:
.. code-block :: shell
./ci/travis/bazel-format.sh
2019-01-26 00:15:48 -08:00
.. _`issues`: https://github.com/ray-project/ray/issues
.. _`Temporary Files`: http://ray.readthedocs.io/en/latest/tempfile.html
2019-12-18 16:20:05 -08:00
.. _`PR template`: https://github.com/ray-project/ray/blob/master/.github/PULL_REQUEST_TEMPLATE.md>
.. _`Travis CI`: https://github.com/ray-project/ray/tree/master/ci/travis>