mirror of
https://github.com/vale981/ray
synced 2025-03-06 02:21:39 -05:00

* wip * add * timeout fix * const ref * comments * fix * fix * Move actor state into actor handle * comments 2 * enable by default * temp reorder * some fixes * add debug code * tmp * fix * wip * remove dbg * fix compile * fix * fix check * remove non direct tests * Increment ref count before resolving value * rename * fix another bug * tmp * tmp * Fix object pinning * build change * lint * ActorManager * tmp * ActorManager * fix test component failures * Remove old code * Remove unused * fix * fix * fix resources * fix advanced * eric's diff * blacklist * blacklist * cleanup * annotate * disable tests for now * remove * fix * fix * clean up verbosity * fix test * fix concurrency test * Update .travis.yml * Update .travis.yml * Update .travis.yml * split up analysis suite * split up trial runner suite * fix detached direct actors * fix * split up advanced tesT * lint * fix core worker test hang * fix bad check fail which breaks test_cluster.py in tune * fix some minor diffs in test_cluster * less workers * make less stressful * split up test * retry flaky tests * remove old test flags * fixes * lint * Update worker_pool.cc * fix race * fix * fix bugs in node failure handling * fix race condition * fix bugs in node failure handling * fix race condition * nits * fix test * disable heartbeatS * disable heartbeatS * fix * fix * use worker id * fix max fail * debug exit * fix merge, and apply [PATCH] fix concurrency test * [patch] fix core worker test hang * remove NotifyActorCreation, and return worker on completion of actor creation task * remove actor diied callback * Update core_worker.cc * lint * use task manager * fix merge * fix deadlock * wip * merge conflits * fix * better sysexit handling * better sysexit handling * better sysexit handling * check id * better debug * task failed msg * task failed msg * retry failed tasks with delay * retry failed tasks with delay * clip deps * fix * fix core worker tests * fix task manager test * fix all tests * cleanup * set 
to 0 for direct tests * dont check worker id for ownership rpc * dont check worker id for ownership rpc * debug messages * add comment * remove debug statements * nit * check worker id * fix test * owner * fix tests
33 lines
623 B
Bash
Executable file
#!/bin/bash
# Run a command, suppressing output unless it hangs or crashes.

TMPFILE=$(mktemp)
PID=$$

# Print periodic output so Travis does not kill the job for being silent.
watchdog() {
    # 30 iterations of 5 minutes each: 150 minutes (2.5h) in total.
    for i in $(seq 5 5 150); do
        sleep 300
        echo "This command has been running for more than $i minutes..."
    done
    echo "Command timed out after 2.5h, dumping logs:"
    cat "$TMPFILE"
    echo "TIMED OUT"
    kill -SIGKILL "$PID"
}

watchdog &
WATCHDOG_PID=$!

time "$@" >"$TMPFILE" 2>&1

CODE=$?
if [ "$CODE" -ne 0 ]; then
    tail -n 2000 "$TMPFILE"
    echo "FAILED $CODE"
    kill "$WATCHDOG_PID"
    exit "$CODE"
fi

kill "$WATCHDOG_PID"
exit 0
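
The core idea of the script above (capture all output in a temporary file and show it only when the command fails) can be sketched as a reusable function. This is a minimal illustration under stated assumptions, not part of the original script: the name `run_quiet` is hypothetical, the watchdog is deliberately left out, and the temporary file is removed at the end, which the original does not do.

```shell
# Hypothetical helper, not part of the original script.
# Runs the given command with stdout and stderr captured in a temp file.
# On success nothing is printed; on failure the last 2000 lines of the
# captured output are printed, followed by "FAILED <exit code>".
run_quiet() {
    local tmpfile code=0
    tmpfile=$(mktemp)

    # Capture both streams; remember the exit code without aborting.
    "$@" >"$tmpfile" 2>&1 || code=$?

    if [ "$code" -ne 0 ]; then
        tail -n 2000 "$tmpfile"
        echo "FAILED $code"
    fi

    rm -f "$tmpfile"
    return "$code"
}
```

Called as `run_quiet some_command arg1 arg2`, the function returns the command's own exit code, so it can be used directly in `if` conditions or with `||`.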