* first step of pipelining
* pipelining tests & default configs
- added pipelining unit tests in direct_task_transport_test.cc
- added an entry in ray_config_def.h, ray_config.pxi, and ray_config.pxd to configure the parameter controlling the maximum number of tasks that can be in flight to each worker
- consolidated worker_to_lease_client_ and worker_to_lease_client_ hash maps in direct_task_transport.h into a single one called worker_to_lease_entry_
* post-review revisions
* linting, following naming/style convention
* linting
* Drop stale actor table notifications
* build
* Add num_restarts to disconnect handler
* Unit test and increment num_restarts on ALIVE, not RESTARTING
* Wait for pid to exit
* Convert NodeInfo.tsx to a functional component
* Update NodeRowGroup to be a functional component
* lint
* Convert TotalRow to functional component.
* lint
* move node info over to using the sortable table head component. spacing is still a little wonky.
* Factor a NodeWorkerRow class out of NodeRowGroup that will be usable when grouping / ungrouping
* Compilation checkpoint, I factored the worker filtering logic out of node info into the reducer
* Add sort accessors for CPU
* Add sort accessors for Disk
* Add sort accessors for RAM
* add a table sort util for function based accessors (rather than flat attribute-based accessor)
* wip refactor node info features
* wip
* Rendering Checkpoint. I've refactored the features and how they are called to add sorting support. Also reworks the way error counts and log counts are passed to the front-end to remove some ugly logic
* wip
* wip
* wip
* Finish adding sorting and grouping of machine view
* lint
* fix bug in filtering of logs and errors by worker from recent refactor.
* Add export of Cluster Disk feature
* fix some merge issues
Co-authored-by: Max Fitton <max@semprehealth.com>
* remove the put in memory store
* put small objects directly in memory store
* cast data type
* fix another place that uses Put to spill to plasma store
* fix multiple tests related to memory limits
* partially fix test_metrics
* remove non-functioning code
* fix core_worker_test
* refactor put-to-plasma code
* add a flag for the new feature
* add flag to more places
* do a warmup round for the plasma store
* lint
* lint again
* fix warmup store
* Update _raylet.pyx
Co-authored-by: Eric Liang <ekhliang@gmail.com>
* GCS Actor management on by default.
* Fix travis config.
* Change condition.
* Finish the initial race condition fix.
* Lint.
* Refine the codebase.
* Finish the initial version
* Improve logic.
* Remove unnecessary log messages.
* Address code review.
* Add tests
* Revert the handling of the second race condition, which doesn't happen anymore; handle some edge cases; add tests.
* Address the second race condition found.
* Addressed code review.
* Addressed code review.
* Run a new unit test only when gcs actor management is on.