kourosh hakhamaneshi
3b3c20209b
[RLlib] Fix dqn reproducibility ( #27459 )
2022-08-09 15:56:44 -07:00
Charles Sun
c358305ca6
[RLlib] DatasetReader action normalization. ( #27356 )
2022-08-09 16:54:03 +02:00
Sven Mika
537f7c65c1
[RLlib] CRR framework torch by default. ( #27161 )
2022-08-09 16:53:00 +02:00
kourosh hakhamaneshi
b84dd38f01
[RLlib] Add __getitem__ to MultiAgentBatch to access policy_batches. ( #27619 )
2022-08-09 16:51:26 +02:00
kourosh hakhamaneshi
98b9fa6944
[RLlib] Hotfix for connector tests ( #27654 )
...
hot fix for rllib connector tests
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>
2022-08-08 15:12:47 -07:00
Artur Niederfahrenhorst
4fe47d069f
[RLlib] Require ApeX LR schedule test to produce learner info. ( #27557 )
2022-08-08 18:19:02 +02:00
kourosh hakhamaneshi
3b2a8427af
[RLlib] Fix SampleBatch to_device(). ( #27572 )
2022-08-08 18:18:33 +02:00
Jun Gong
a61095a480
[RLlib] fix bandit pre-merge tests ( #27554 )
2022-08-07 17:48:29 -07:00
Jun Gong
5f07987ab1
[RLlib] Fix connector examples ( #27583 )
2022-08-07 17:48:09 -07:00
Jun Gong
f8b2128f16
[RLlib] async_request_test needs to run exclusively. ( #27603 )
2022-08-07 17:47:29 -07:00
Avnish Narayan
55209692ee
[RLlib] Deflake MARWIL and BC and remove memory leak from torch MARWIL policy ( #27406 )
2022-08-03 16:53:12 -07:00
Rohan Potdar
5b6a58ed28
[RLlib] Add OPE Learning Tests ( #27154 )
2022-08-02 17:51:38 -07:00
Avnish Narayan
00f9438101
[RLlib] Training step docs. ( #27344 )
2022-08-02 23:41:45 +02:00
Jun Gong
61add8ede6
[RLlib] Fix the last cartpole-crashing premerge test. ( #27315 )
2022-08-02 20:08:33 +02:00
kourosh hakhamaneshi
bda5026428
[RLlib] Fix A2C release tests ( #27314 )
2022-08-02 10:44:52 -07:00
kourosh hakhamaneshi
8d848890f1
[RLlib] Fix default view_requirement in policy.py ( #27255 )
2022-08-02 10:44:07 -07:00
Artur Niederfahrenhorst
a598458c46
[RLlib] Fix complex torch one-hot and flattened layers not being added to module list. ( #27304 )
2022-08-01 15:52:28 +02:00
Steven Morad
d0a8e3c36f
[RLlib] User-friendly RNN sequencing. ( #27087 )
2022-08-01 15:32:22 +02:00
Steven Morad
77318abfaf
[RLlib] Warn on PPO infinite KL loss term. ( #26629 )
2022-08-01 12:55:26 +02:00
Jun Gong
e6e10ce4cf
[RLlib] Revert 41c9ef70. ( #27243 )
...
Why are these changes needed?
Also:
- Add validation to make sure multi-GPU and micro-batching are not used together.
- Update A2C learning test to hit the microbatching branch.
- Minor comment updates.
2022-07-29 11:05:15 -07:00
Kai Fricke
1d3c167bfe
[rllib/release] Fix rllib connect test with Tuner() API ( #27155 )
...
Currently failing because the Tune framework example does not return fitting results.
Signed-off-by: Kai Fricke <kai@anyscale.com>
2022-07-28 11:08:02 +01:00
Eric Liang
a4434fac7f
[docs] Fix the remaining style violations in docstrings and add lint rule ( #27033 )
2022-07-27 22:24:20 -07:00
xwjiang2010
eb69c1ca28
[air] Add annotation for Tune module. ( #27060 )
...
Co-authored-by: Kai Fricke <kai@anyscale.com>
2022-07-27 13:53:46 -07:00
Malinda
1d789aee63
[RLlib/Serve/Release tests] Few code refactoring for better use of efficient NumPy functions. ( #26284 )
2022-07-27 22:38:35 +02:00
Jun Gong
e1cf0cc982
[RLlib] Deflake cartpole crashing tests. ( #27097 )
...
Make sure cartpole crashing tests are not flaky.
2022-07-27 12:50:34 -07:00
Jun Gong
acf2bf9b2f
[RLlib] Get rid of all these deprecation warnings. ( #27085 )
2022-07-27 10:48:54 -07:00
Kai Fricke
8fda425eca
[tune/rllib] Hotfix ml_utils deprecation import error ( #27095 )
...
The changes conflicted with a recently merged PR that refactored the package structure (#27005).
Signed-off-by: Kai Fricke <kai@anyscale.com>
2022-07-27 16:11:58 +01:00
Kai Fricke
a5ea99cf95
[rfc] [tune/rllib] Fetch _progress_metrics from trainable for verbose=2 display ( #26967 )
...
RLlib's trainables produce a large number of metrics, which makes the log output with verbose=2 illegible. This PR introduces a private `_progress_metrics` property for trainables. If set, the trial progress callback will print only these metrics by default, unless overridden, e.g. with a custom `TrialProgressCallback`.
2022-07-27 16:04:23 +01:00
Amog Kamsetty
862d10c162
[AIR] Remove ML code from ray.util ( #27005 )
...
Removes all ML related code from `ray.util`
Removes:
- `ray.util.xgboost`
- `ray.util.lightgbm`
- `ray.util.horovod`
- `ray.util.ray_lightning`
Moves `ray.util.ml_utils` to other locations
Closes #23900
Signed-off-by: Amog Kamsetty <amogkamsetty@yahoo.com>
Signed-off-by: Kai Fricke <kai@anyscale.com>
Co-authored-by: Kai Fricke <kai@anyscale.com>
2022-07-27 14:24:19 +01:00
xwjiang2010
fcf897ee72
[air] update rllib example to use Tuner API. ( #26987 )
...
update rllib example to use Tuner API.
Signed-off-by: xwjiang2010 <xwjiang2010@gmail.com>
2022-07-27 12:12:59 +01:00
Jun Gong
c7ae787cc8
[RLlib] Beef up worker failure test. ( #26953 )
2022-07-27 00:10:45 -07:00
Jun Gong
a22457b548
[RLlib] Small bug fix ( #27003 )
2022-07-27 00:02:18 -07:00
Jun Gong
54df8bfe42
[RLlib] Try to checkpoint a durable policy name ( #27016 )
2022-07-27 00:01:14 -07:00
kourosh hakhamaneshi
8ddcf89096
[RLlib] Implemented ViewRequirementConnector ( #26998 )
2022-07-26 21:52:14 -07:00
Rohan Potdar
deccf33912
[RLlib]: Add Off-Policy Estimation docs ( #26809 )
...
Co-authored-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>
2022-07-26 13:57:56 -07:00
Jun Gong
ca5e0dcaf4
[RLLib] Record framework and algorithm used by an RLlib run. ( #26956 )
...
Automatically record framework and algorithm used by RLlib jobs.
For better planning.
2022-07-25 16:16:36 -07:00
kourosh hakhamaneshi
5030a4c1d3
[RLlib] Simplify agent collector ( #26803 )
2022-07-25 13:17:17 -07:00
Artur Niederfahrenhorst
b1594260ba
[RLlib] Small SlateQ example fix. ( #26948 )
2022-07-25 15:12:42 +02:00
Avnish Narayan
41c9ef709a
[RLlib] Using PG when not doing microbatching kills A2C performance. ( #26844 )
2022-07-25 15:11:26 +02:00
Artur Niederfahrenhorst
e9a8f7d9ae
[RLlib] Unify gnorm mixin for tf and torch policies. ( #26102 )
2022-07-24 15:31:09 +02:00
Ishant Mrinal
b32c784c7f
[RLLib] RE3 exploration algorithm TF2 framework support ( #25221 )
2022-07-23 18:05:01 -07:00
Rohan Potdar
a53bbe49bf
[RLlib]: Raise deprecation warning in MARWIL OPE methods. ( #26893 )
2022-07-23 13:55:40 +02:00
Rohan Potdar
97bcf38ec0
[RLlib] Fix torch None conversion in torch_utils.py::convert_to_torch_tensor. ( #26863 )
2022-07-23 13:54:57 +02:00
Rohan Potdar
69f6b843da
[RLlib] Test output length in DatasetReader with default IOContext. ( #26852 )
2022-07-23 13:53:59 +02:00
Rohan Potdar
2f22262d39
[RLlib]: Fix SampleBatch.split_by_episode to use dones if episode id is not available ( #26492 )
2022-07-22 16:46:05 -07:00
Steven Morad
259429bdc3
Bump gym dep to 0.24 ( #26190 )
...
Co-authored-by: Steven Morad <smorad@anyscale.com>
Co-authored-by: Avnish <avnishnarayan@gmail.com>
Co-authored-by: Avnish Narayan <38871737+avnishn@users.noreply.github.com>
2022-07-22 12:37:16 -07:00
Avnish Narayan
2a0ef663c9
[rllib] Use compress observations where replay buffers and image obs are used in tuned examples ( #26735 )
2022-07-22 10:10:51 -07:00
Fabian Witter
dc2ad6c8b4
[RLlib] Fix ModelCatalog for nested complex inputs ( #25620 )
2022-07-22 00:45:25 -07:00
Jun Gong
6c1acd1a2f
[RLlib] Quick state buffer connector fix ( #26836 )
2022-07-21 20:43:59 -07:00
Jun Gong
0bc560bd54
[RLlib] Make sure we step() after adding init_obs. ( #26827 )
2022-07-21 20:43:46 -07:00