hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

Author	SHA1	Message	Date
Sven Mika	0db2046b0a	[RLlib] Policy.compute_log_likelihoods() and SAC refactor. (issue #7107) (#7124) * Exploration API (+EpsilonGreedy sub-class). * Exploration API (+EpsilonGreedy sub-class). * Cleanup/LINT. * Add `deterministic` to generic Trainer config (NOTE: this is still ignored by most Agents). * Add `error` option to deprecation_warning(). * WIP. * Bug fix: Get exploration-info for tf framework. Bug fix: Properly deprecate some DQN config keys. * WIP. * LINT. * WIP. * Split PerWorkerEpsilonGreedy out of EpsilonGreedy. Docstrings. * Fix bug in sampler.py in case Policy has self.exploration = None * Update rllib/agents/dqn/dqn.py Co-Authored-By: Eric Liang <ekhliang@gmail.com> * WIP. * Update rllib/agents/trainer.py Co-Authored-By: Eric Liang <ekhliang@gmail.com> * WIP. * Change requests. * LINT * In tune/utils/util.py::deep_update() Only keep deep_updat'ing if both original and value are dicts. If value is not a dict, set * Completely obsolete syn_replay_optimizer.py's parameters schedule_max_timesteps AND beta_annealing_fraction (replaced with prioritized_replay_beta_annealing_timesteps). * Update rllib/evaluation/worker_set.py Co-Authored-By: Eric Liang <ekhliang@gmail.com> * Review fixes. * Fix default value for DQN's exploration spec. * LINT * Fix recursion bug (wrong parent c'tor). * Do not pass timestep to get_exploration_info. * Update tf_policy.py * Fix some remaining issues with test cases and remove more deprecated DQN/APEX exploration configs. * Bug fix tf-action-dist * DDPG incompatibility bug fix with new DQN exploration handling (which is imported by DDPG). * Switch off exploration when getting action probs from off-policy-estimator's policy. * LINT * Fix test_checkpoint_restore.py. * Deprecate all SAC exploration (unused) configs. * Properly use `model.last_output()` everywhere. Instead of `model._last_output`. * WIP. * Take out set_epsilon from multi-agent-env test (not needed, decays anyway). * WIP. 
* Trigger re-test (flaky checkpoint-restore test). * WIP. * WIP. * Add test case for deterministic action sampling in PPO. * bug fix. * Added deterministic test cases for different Agents. * Fix problem with TupleActions in dynamic-tf-policy. * Separate supported_spaces tests so they can be run separately for easier debugging. * LINT. * Fix autoregressive_action_dist.py test case. * Re-test. * Fix. * Remove duplicate py_test rule from bazel. * LINT. * WIP. * WIP. * SAC fix. * SAC fix. * WIP. * WIP. * WIP. * FIX 2 examples tests. * WIP. * WIP. * WIP. * WIP. * WIP. * Fix. * LINT. * Renamed test file. * WIP. * Add unittest.main. * Make action_dist_class mandatory. * fix * FIX. * WIP. * WIP. * Fix. * Fix. * Fix explorations test case (contextlib cannot find its own nullcontext??). * Force torch to be installed for QMIX. * LINT. * Fix determine_tests_to_run.py. * Fix determine_tests_to_run.py. * WIP * Add Random exploration component to tests (fixed issue with "static-graph randomness" via py_function). * Add Random exploration component to tests (fixed issue with "static-graph randomness" via py_function). * Rename some stuff. * Rename some stuff. * WIP. * WIP. * Fix SAC. * Fix SAC. * Fix strange tf-error in ray core tests. * Fix strange ray-core tf-error in test_memory_scheduling test case. * Fix test_io.py. * LINT. * Update SAC yaml files' config. Co-authored-by: Eric Liang <ekhliang@gmail.com>	2020-02-22 14:19:49 -08:00
Stephanie Wang	4c2de7be54	[core] Ref counting for returning object IDs created by a different process (#7221) * Add regression tests * Refactor, split RemoveSubmittedTaskReferences into submitted and finished paths * Add nested return IDs to UpdateFinishedTaskRefs, rename WrapObjectIds * Basic unit tests pass * Fix unit test and add an out-of-order regression test * Add stored_in_objects to ObjectReferenceCount, regression test now passes * Add an Address to the ReferenceCounter so we can determine ownership * Set the nested return IDs from the TaskManager * Add another test * Simplify * Update src/ray/core_worker/reference_count_test.cc Co-Authored-By: Edward Oakes <ed.nmi.oakes@gmail.com> * Apply suggestions from code review Co-Authored-By: Edward Oakes <ed.nmi.oakes@gmail.com> * comments * Add python test Co-authored-by: Edward Oakes <ed.nmi.oakes@gmail.com>	2020-02-22 13:29:48 -08:00
Sven Mika	e2edca45d4	[RLlib] PPO torch memory leak and unnecessary torch.Tensor creation and gc'ing. (#7238) * Take out stats to analyze memory leak in torch PPO. * WIP * WIP * WIP * WIP * WIP * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * LINT. * Fix determine_tests_to_run.py. * minor change to re-test after determine_tests_to_run.py. * LINT. * update comments. * WIP * WIP * WIP * FIX. * Fix sequence_mask being dependent on torch being installed. * Fix strange ray-core tf-error in test_memory_scheduling test case. * Fix strange ray-core tf-error in test_memory_scheduling test case. * Fix strange ray-core tf-error in test_memory_scheduling test case. * Fix strange ray-core tf-error in test_memory_scheduling test case.	2020-02-22 11:02:31 -08:00
Eric Liang	01dd520797	Remove misleading error message (#7265)	2020-02-21 21:20:40 -08:00
ijrsvt	325fc24afa	Removing unused Pyarrow Info (#7207)	2020-02-21 17:07:26 -08:00
Amog Kamsetty	1737a113be	[Parallel Iterators] Repartition functionality (#7163) * repartition and tests * blacklist lib/ files from import checks * addressing comments and splitting up tests * code readability * adding explicit ref for parent iterator * formatting	2020-02-21 13:20:18 -08:00
mehrdadn	c6f50ecc51	setpgrp fix (#7250)	2020-02-21 13:15:11 -08:00
Edward Oakes	d190e73727	Use our own implementation of parallel_memcopy (#7254)	2020-02-21 11:03:50 -08:00
Sven Mika	cbc808bc6b	[Tests] determine_tests_to_run.sh has a bug affecting RLlib testing to be skipped sometimes. (#7243)	2020-02-20 19:02:17 -08:00
Kai Yang	007333b960	[Java] Support direct call for normal tasks (#7193)	2020-02-21 10:03:34 +08:00
Stephanie Wang	f27bb6eb47	Only hold the RefCount lock if needed (#7249)	2020-02-20 17:10:06 -08:00
Edward Oakes	6c80071a7d	Remove gc.collect() calls from reference counting tests (#7218)	2020-02-20 10:51:02 -08:00
Edward Oakes	16e37416cd	Fix raylet pinning race condition (#7235)	2020-02-20 10:41:36 -08:00
Sven Mika	6043ce710d	Fix old exploration configs. (#7240)	2020-02-20 08:39:16 -08:00
chaokunyang	1ae7c03e86	fix concurrently extract file (#7225)	2020-02-20 20:38:51 +08:00
chaokunyang	d7f8d18a86	fix symbols conflict and add symbols check (#7227)	2020-02-20 19:31:16 +08:00
Siyuan (Ryans) Zhuang	0d210a99c3	Ensure deserialized numpy arrays are immutable (#7181) * ensure numpy arrays are immutable when deserialized from the memory buffer	2020-02-19 23:30:10 -08:00
Stephanie Wang	7e3819a27a	[core] Eagerly evict objects that are no longer in scope (#7220) * Batch free requests, and free when object is unpinned * rename * note	2020-02-19 20:51:38 -08:00
Simon Mo	b804d40c04	Stop vendoring pyarrow (#7233)	2020-02-19 19:01:26 -08:00
Siyuan (Ryans) Zhuang	48c06f5042	Enhance the serialization refcount test for dynamic classes (#7222) * enhance the test for dynamic classes	2020-02-19 18:35:35 -08:00
Eric Liang	46af992efd	[rllib] [experimental] custom RL training pipelines (PG_pl, A2C_pl) (#7213)	2020-02-19 16:07:37 -08:00
Simon Mo	7bef7031c2	Revert "Revert "Revert "Removing Pyarrow dependency (#7146)" (#7209) (#7214)" (#7232)	2020-02-19 13:35:29 -08:00
Sven Mika	d537e9f0d8	[RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155)	2020-02-19 12:18:45 -08:00
Eric Liang	399424c418	[rllib] Fix broken check in eval mode for IMPALA #7217	2020-02-19 11:54:30 -08:00
Simon Mo	e8941b1b79	Revert "Revert "Removing Pyarrow dependency (#7146)" (#7209) (#7214)	2020-02-19 10:08:52 -08:00
Stephanie Wang	f76ce836b2	Distributed ref counting for serialized ObjectIDs (#6945) * Skeleton plus a unit test for simple borrower case * First unit test passes - forward an ID and task returns with 1 submitted task pending on the inner ID * Invariant for contained_in * Unit test passes for testing task return without creating a borrower * Wrap ref count functionality in test case * Fix bad delete * Unit test and fix for borrowers creating more borrowers * Unit test and fix for simple borrowing, but owner sends call after borrower's ref count goes to 0 * Refactor: - keep a sentinel ref count for task argument IDs - keep contained_in_borrowed in addition to contained_in_owned * Unit test for nested IDs passes * Refactor so that an object ID can only be contained in 1 borrowed ID at a time * Add check * Fix * Unit test (passes) to test nesting object IDs but no borrowers created * Unit test for nested objects from different owners passes, refactor to unset contained_in when popping refs * Unit tests for borrowers receiving an ObjectID from multiple sources, skip adding ownership info if we already have it to handle duplicate refs * Unit test for returning object ID passes * More unit tests for returning object IDs pass * Add serialized ID tests * fix serialization issue * remove swap * It builds! 
* debugging and some fixes: - register handler for WaitForRefRemoved - don't create a python reference for arg IDs - pass in client factory into ReferenceCounter - fix bad decrement in PopBorrowerRefs * Fix accounting for serialized IDs: - don't decrement for IDs on dependency resolution, wait until task finished - add object IDs that were inlined when building the arguments to the task spec, pin these on the task executor until task finishes * mu_ -> mutex_ * lint * fix build * clear outer_object_id * add direct call type check * Fix test for direct call IDs and return IDs for actor calls * Fix CoreWorkerClient.Addr() * Remove unneeded lock * Remove unnecessary ObjectID refs * Fix worker holding serialized refs test * Fix hex IDs * fix * fix tests * fix tests * refactor and cleanups * lint * Put inlined Ids in task args and some cleanup * Add back gc.collect() line for test case * Refactor and fixes: - store inlined IDs in RayObject - allow storing objects with inlined IDs in memory store - pin objects that were promoted to plasma * oops * make sure worker ID is set in address, pass in rpc::Address to CoreWorkerClient * todos * cleanups and test builds * Fix tests * Add feature flag * cleanups * address comments and some cleanups * cleanup * fix recursive test * Comments for tests * Turn off ref counting by default * Skip tests * Fix some bugs for test_array.py, java build * Don't include nested objects in the ref count when the feature flag is off * C++ feature flag does not work... * Remove * Turn on python tests and add a warning when plasma objects are evicted before being pinned * Fix build and remove irrelevant test * Fix for java * Revert "Fix build and remove irrelevant test" This reverts commit 056cca9b263ed05b0f9ab2250907338edcbca2d5. 
* Fix ray.internal.free * Fixes and skip some flaky tests * fix java build * fix windows build * Add IDs contained in owned objects * Update src/ray/protobuf/core_worker.proto Co-Authored-By: Edward Oakes <ed.nmi.oakes@gmail.com> * Update src/ray/core_worker/reference_count.cc Co-Authored-By: Edward Oakes <ed.nmi.oakes@gmail.com> * Update src/ray/protobuf/core_worker.proto Co-Authored-By: Edward Oakes <ed.nmi.oakes@gmail.com> * Update src/ray/protobuf/core_worker.proto Co-Authored-By: Edward Oakes <ed.nmi.oakes@gmail.com> * Update src/ray/core_worker/reference_count.h Co-Authored-By: Edward Oakes <ed.nmi.oakes@gmail.com> * Update src/ray/core_worker/reference_count.h Co-Authored-By: Edward Oakes <ed.nmi.oakes@gmail.com> * Update src/ray/core_worker/reference_count.cc Co-Authored-By: Edward Oakes <ed.nmi.oakes@gmail.com> * Apply suggestions from code review Co-Authored-By: Edward Oakes <ed.nmi.oakes@gmail.com> * update * Try to fix ::test_direct_call_serialized_id_eviction Co-authored-by: Edward Oakes <ed.nmi.oakes@gmail.com>	2020-02-18 18:21:34 -08:00
mehrdadn	4a12243336	Use Process instead of pid_t (round 2) (#6882) * Revert "Revert "Use Boost.Process instead of pid_t (#6510)" (#6909)" This reverts commit `bde575b8dd`. * Process wrapper, using Boost.Process on Windows - Reverts `bde575b8dd`. - Re-applies `fb8e3615d5` after some refactoring. * Remove Boost.Process dependency * Don't open /proc file on Linux * Change FATAL to ERROR and modify error message when process doesn't exist	2020-02-18 17:44:46 -08:00
Eric Liang	0aa9373d62	Revert "Removing Pyarrow dependency (#7146)" (#7209) This reverts commit `2116fd3bca`.	2020-02-18 14:12:06 -08:00
Eric Liang	5df801605e	Add ray.util package and move libraries from experimental (#7100)	2020-02-18 13:43:19 -08:00
Eric Liang	fae99ecb8e	[core] Make sure to unsubscribe get dependencies for direct task calls. (#7201) * fix * remove assert	2020-02-17 18:35:25 -08:00
ijrsvt	2116fd3bca	Removing Pyarrow dependency (#7146)	2020-02-17 18:00:13 -08:00
mehrdadn	3bd82d0bcd	Fix various issues/warnings that come up on Jenkins (#7147) * Avoid warning about swap being unlimited Currently we get the following message on Jenkins: "Your kernel does not support swap limit capabilities or the cgroup is not mounted. Memory limited without swap." Since we're not limiting swap anyway, we might as well avoid trying to. https://docs.docker.com/config/containers/resource_constraints/#--memory-swap-details * Fix escaping in re.search() * Fix escaping in _noisy_layer() * Raise a more descriptive error when dashboard data isn't found * Don't error on dashboard files not being found when webui isn't required * Change dashboard error to a warning instead	2020-02-17 16:08:55 -08:00
Alex Wu	734629b4ea	Ssh command format (#7176)	2020-02-17 14:15:42 -08:00
Alind Khare	c6d768be14	[Serve] Added support for no http route services (#7010)	2020-02-17 11:31:30 -08:00
Eric Liang	42aea966ff	[rllib] Convert torch state arrays to tensors during compute actions (#7162) * convert to tensor * normalize fix	2020-02-17 10:26:58 -08:00
fyrestone	a6b8bd47b0	[xlang] Cross language serialize ActorHandle (#7134)	2020-02-17 20:44:56 +08:00
Edward Oakes	b079787c59	Fix flaky test_get_with_timeout (#7175)	2020-02-16 21:10:16 -08:00
Richard Liaw	94e2fcea2e	[sgd] fp16 (apex) and scheduler support + move examples page (#7061) * Init fp16 * fp16 and schedulers * scheduler linking and fp16 * to fp16 * loss scaling and documentation * more documentation * add tests, refactor config * moredocs * more docs * fix logo, add test mode, add fp16 flag * fix tests * fix scheduler * fix apex * improve safety * fix tests * fix tests * remove pin memory default * rm * fix * Update doc/examples/doc_code/raysgd_torch_signatures.py * fix * migrate changes from other PR * ok thanks * pass * signatures * lint' * Update python/ray/experimental/sgd/pytorch/utils.py * Apply suggestions from code review Co-Authored-By: Edward Oakes <ed.nmi.oakes@gmail.com> * should address most comments * comments * fix this ci * fix tests' * testmode Co-authored-by: Edward Oakes <ed.nmi.oakes@gmail.com>	2020-02-16 19:04:08 -08:00
Sven Mika	f0e62d733f	Bazel exclude rllib-option fix. (#7185)	2020-02-16 11:26:03 -08:00
Eric Liang	b7016504e8	[rllib] Only run one set of tests unless rllib or tune dirs are changed. (#7179) * full filter * lint	2020-02-16 08:52:49 -08:00
Siyuan (Ryans) Zhuang	6745459f96	Apply cpython patch bpo-39492 for the reference counting issue in pickle5 (#7177) * apply cpython patch bpo-39492 for the reference count issue	2020-02-15 21:16:13 -08:00
Eric Liang	b6233dff3c	[rllib] Fix bad sample count assert	2020-02-15 17:22:23 -08:00
Sven Mika	2e60f0d4d8	[RLlib] Move all jenkins RLlib-tests into bazel (rllib/BUILD). (#7178) * commit * comment	2020-02-15 14:50:44 -08:00
Edward Oakes	dc5a27dac0	Move ray.experimental.multiprocessing to ray.util.multiprocessing (#7149)	2020-02-14 16:17:05 -08:00
Richard Liaw	52d9189d5d	[autoscaler] port-forward for attach + redis_port (#7145) * port-forward * fixport * force redis port in init mode * test * Update python/ray/tests/test_ray_init.py	2020-02-14 15:17:00 -08:00
Simon Mo	30de1286bd	Use pip install setup.py (#7158)	2020-02-14 13:53:36 -08:00
Adrian O'Grady	fe6ce714a0	[rllib] - TaskPool.completed_prefetch() no longer returns stale object ids after an error (#7139)	2020-02-13 22:30:44 -08:00
Qing Wang	f3703bafa3	[Java] Support concurrent actor calls API. (#7022) * WIP Temp change Attach native thread to jvm * Fix run mode * Address comments.	2020-02-14 13:02:39 +08:00
Alex Wu	0d3687a10d	No warning for docker memory > system memory (#7151)	2020-02-13 15:21:44 -08:00
Edward Oakes	b81b93a9c0	Convert stress tests to projects (#6495)	2020-02-13 09:19:24 -08:00

4056 commits