hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

Author	SHA1	Message	Date
Sven Mika	90b21ce27e	[RLlib] De-flake 3 test cases; Fix `config.simple_optimizer` and `SampleBatch.is_training` warnings. (#17321 )	2021-07-27 14:39:06 -04:00
Vince Jankovics	05c9dfbbda	[RLlib] CV2 to Skimage dependency change (#16841 )	2021-07-21 22:24:18 -04:00
Sven Mika	5a313ba3d6	[RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169 )	2021-07-20 14:58:13 -04:00
Sven Mika	18d173b172	[RLlib] Implement policy_maps (multi-agent case) in RolloutWorkers as LRU caches. (#17031 )	2021-07-19 13:16:03 -04:00
Sven Mika	e0640ad0dc	[RLlib] Fix seeding for ES and ARS. (#16744 )	2021-07-19 13:13:05 -04:00
Sven Mika	649580d735	[RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). (#17046 )	2021-07-15 05:51:24 -04:00
Sven Mika	1fd0eb805e	[RLlib] Redo fix bug normalize vs unsquash actions (original PR made log-likelihood test flakey). (#17014 )	2021-07-13 14:01:30 -04:00
Amog Kamsetty	38b5b6d24c	Revert "[RLlib] Simplify multiagent config (automatically infer class/spaces/config). (#16565 )" (#17036 ) This reverts commit `e4123fff27`.	2021-07-13 09:57:15 -07:00
Sven Mika	e4123fff27	[RLlib] Simplify multiagent config (automatically infer class/spaces/config). (#16565 )	2021-07-13 06:38:14 -04:00
Amog Kamsetty	bc33dc7e96	Revert "[RLlib] Fix bug in policy.py: normalize_actions=True has to call `unsquash_action`, not `normalize_action`." (#17002 ) This reverts commit `7862dd64ea`.	2021-07-12 11:09:14 -07:00
Sven Mika	7862dd64ea	[RLlib] Fix bug in policy.py: normalize_actions=True has to call `unsquash_action`, not `normalize_action`. (#16774 )	2021-07-08 17:31:34 +02:00
Sven Mika	9f6a92163b	[RLlib] Remove old UsageTrackingDict code. (#16867 )	2021-07-08 17:27:52 +02:00
Kai Fricke	10fd7111b3	[rllib] Improve test learning check, fix flaky two step qmix (#16843 )	2021-07-06 19:39:12 +01:00
Sven Mika	7eb1a29426	[RLlib] Fix ModelV2 custom metrics for torch. (#16734 )	2021-07-01 13:01:40 +02:00
Sven Mika	53206dd440	[RLlib] CQL BC loss fixes; PPO/PG/A2\|3C action normalization fixes (#16531 )	2021-06-30 12:32:11 +02:00
mvindiola1	82a3ff795c	[RLlib] ensure curiosity exploration actions are passed in as tf tens… (#15704 )	2021-06-21 10:03:17 -07:00
Sven Mika	d0014cd351	[RLlib] Policies get/set_state fixes and enhancements. (#16354 )	2021-06-15 13:08:43 +02:00
Sven Mika	2d34216660	[RLlib] APEX-DQN: Bug fix for torch and add learning test. (#15762 )	2021-05-20 09:27:03 +02:00
Sven Mika	2303851c3c	[RLlib] Torch multi-GPU + LSTM/RNN bug fix. (#15492 )	2021-05-18 11:51:05 +02:00
Sven Mika	a36b9305d4	[RLlib] Better error message when deep-learning framework not installed. (#15735 )	2021-05-18 11:06:05 +02:00
Sven Mika	bc09e75b78	[RLlib] Fix 3 flakey test cases. (#15785 )	2021-05-16 12:20:33 +02:00
Michael Luo	4cbe13cdfd	[RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. (#15603 ) Co-authored-by: Sven Mika <sven@anyscale.io> Co-authored-by: sven1977 <svenmika1977@gmail.com>	2021-05-04 19:06:19 +02:00
Amog Kamsetty	ebc44c3d76	[CI] Upgrade flake8 to 3.9.1 (#15527 ) * formatting * format util * format release * format rllib/agents * format rllib/env * format rllib/execution * format rllib/evaluation * format rllib/examples * format rllib/policy * format rllib utils and tests * format streaming * more formatting * update requirements files * fix rllib type checking * updates * update * fix circular import * Update python/ray/tests/test_runtime_env.py * noqa	2021-05-03 14:23:28 -07:00
Sven Mika	e973b726c2	[RLlib] Support native tf.keras.Models (part 2) - Default keras models for Vision/RNN/Attention. (#15273 )	2021-04-30 19:26:30 +02:00
Sven Mika	bb8a286cbc	[RLlib] Support native tf.keras.Model (milestone toward obsoleting ModelV2 class). (#14684 )	2021-04-27 10:44:54 +02:00
Sven Mika	7e1a191f17	[RLlib] Remove all remaining tf- and MuJoCo warnings from RLlib. (#15454 )	2021-04-22 19:20:19 +02:00
Sven Mika	cecfc3b43b	[RLlib] Multi-GPU support for Torch algorithms. (#14709 )	2021-04-16 09:16:24 +02:00
Sven Mika	8b3554e37e	[RLlib] Remove all (already soft-deprecated) `SampleBatch.data` from code. (#15335 )	2021-04-15 19:19:51 +02:00
Sven Mika	b267f1f1ba	[RLlib] Add support for Int-Box action spaces. (#15012 )	2021-04-11 13:16:01 +02:00
Sven Mika	8698cf9bc8	[RLlib] Fix param noise test case on CI. (#14926 )	2021-03-25 12:33:23 +01:00
Sven Mika	69202c6a7d	[RLlib] Obsolete usage tracking dict via sample batch. (#13065 )	2021-03-17 08:18:15 +01:00
Sven Mika	732197e23a	[RLlib] Multi-GPU for tf-DQN/PG/A2C. (#13393 )	2021-03-08 15:41:27 +01:00
Sven Mika	8000258333	[RLlib] R2D2 Implementation. (#13933 )	2021-02-25 12:18:11 +01:00
Sven Mika	eb0038612f	[RLlib] Extend on_learn_on_batch callback to allow for custom metrics to be added. (#13584 )	2021-02-08 15:02:19 +01:00
Sven Mika	52c94b7ee9	[RLlib] Allow SAC to use custom models as Q- or policy nets and deprecate "state-preprocessor" for image spaces. (#13522 )	2021-02-02 13:05:58 +01:00
Sven Mika	2e3655e8a9	[RLlib] Issue 9071 A3C w/ RNN not working due to VF assuming no RNN. (#13238 )	2021-01-19 14:22:36 +01:00
Sven Mika	1f00f834ac	[RLlib] Solve PyTorch/TF-eager A3C async race condition between calling model and its value function. (#13467 )	2021-01-18 10:29:03 -08:00
Sven Mika	56878221ed	[RLlib] Redo: Make TFModelV2 fully modular like TorchModelV2 (soft-deprecate register_variables, unify var names wrt torch). (#13363 )	2021-01-14 14:44:33 +01:00
Sven Mika	d49c3fae0b	[RLlib] Trajectory View API: Atari framestacking. (#13315 )	2021-01-13 08:53:34 +01:00
Kai Fricke	25f10a947a	Revert "[RLlib] Make TFModelV2 behave more like TorchModelV2: Obsolete register_variables. Unify variable dicts. (#13339 )" (#13361 ) This reverts commit `e2b2abb88b`.	2021-01-12 12:33:57 +01:00
Sven Mika	e2b2abb88b	[RLlib] Make TFModelV2 behave more like TorchModelV2: Obsolete register_variables. Unify variable dicts. (#13339 )	2021-01-11 22:42:30 +01:00
Sven Mika	6f342a2221	[RLlib] Preparatory PR for: Documentation on Model Building. (#13260 )	2021-01-08 10:56:09 +01:00
Sven Mika	a5b39ef8e2	[RLlib] Fix missing "info_batch" arg (None) in `compute_actions` calls. (#13237 )	2021-01-07 21:25:02 +01:00
Sven Mika	8726521604	[RLlib] JAXPolicy prep PR #2 (move get_activation_fn (backward-compatibly), minor fixes and preparations). (#13091 )	2020-12-30 22:30:52 -05:00
Sven Mika	99ae7bae05	[RLlib] JAXPolicy prep. PR #1 . (#13077 )	2020-12-26 20:14:18 -05:00
Sven Mika	1e74187179	[RLlib] TorchPolicies: Accessing "infos" dict in train_batch causes `TypeError`. (#13039 )	2020-12-23 11:30:50 -05:00
Sven Mika	01faeabc17	[RLlib] Issue 12789: RLlib throws the warning "The given NumPy array is not writeable" (#12793 )	2020-12-22 09:28:07 -05:00
Sven Mika	b2bcab711d	[RLlib] Attention Nets: tf (#12753 )	2020-12-20 20:22:32 -05:00
Sven Mika	74c98ac38e	[RLlib] Issue 12244: Unable to restore multi-agent PPOTFPolicy's Model (from exported). (#12786 )	2020-12-11 16:13:38 +01:00
Sven Mika	340b1e99fc	[RLlib] Fix JAX import bug. (#12621 )	2020-12-07 11:05:08 -08:00

1 2 3 4 5

209 commits