akern40
0cb2c602db
[rllib] Fixes typo in RolloutWorker.__init__ ( #17583 )
...
Fixes the typo in RolloutWorker.__init__, closes #17582
2021-08-13 13:17:36 -07:00
mguarin0
3e010c5760
[rllib] bug fix for rllib pettingzoo pistonball_v4 example ( #17701 )
...
* bug fix for rllib pettingzoo pistonball_v4 example
* adding test for PR 17701
* ran scripts/format.sh
* ok
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-08-12 00:25:00 -07:00
Sven Mika
7f2b3c0824
[RLlib] Issue 17667: CQL-torch + GPU not working (due to simple_optimizer=False; must use simple optimizer!). ( #17742 )
2021-08-11 18:30:21 +02:00
Julius Frost
6891dee6ea
[RLlib] Better exceptions with traceback in TorchPolicy ( #17690 )
2021-08-11 15:01:07 +02:00
Sven Mika
811d71b368
[RLlib] Issue 17653: Torch multi-GPU (>1) broken for LSTMs. ( #17657 )
2021-08-11 12:44:35 +02:00
Sven Mika
29f20cccb6
[RLlib] Issue 17706: AttributeError: 'numpy.ndarray' object has no attribute 'items'" on certain turn-based MultiAgentEnvs with Dict obs space. ( #17735 )
2021-08-11 12:33:35 +02:00
J K Terry
48e32555c8
[rllib] Update PettingZoo dependency versions ( #17702 )
...
* update pettingzoo dependency versions
* pettingzoo verison
* fix tests
2021-08-11 01:19:19 -07:00
Julius Frost
9322f6aab5
[rllib] Fix classes decorated with @Deprecated to be classes instead of methods ( #17666 )
...
* fix deprecated classes from being methods
* format
2021-08-10 18:25:31 -07:00
Amog Kamsetty
0b8489dcc6
Revert "[RLlib] Add support for multi-GPU to DDPG. ( #17586 )" ( #17707 )
...
This reverts commit 0eb0e0ff58
.
2021-08-10 10:50:21 -07:00
Amog Kamsetty
77f28f1c30
Revert "[RLlib] Fix Trainer.add_policy
for num_workers>0 (self play example scripts). ( #17566 )" ( #17709 )
...
This reverts commit 3b447265d8
.
2021-08-10 10:50:01 -07:00
Sven Mika
3b447265d8
[RLlib] Fix Trainer.add_policy
for num_workers>0 (self play example scripts). ( #17566 )
2021-08-05 11:41:18 -04:00
Sven Mika
0eb0e0ff58
[RLlib] Add support for multi-GPU to DDPG. ( #17586 )
2021-08-05 11:39:51 -04:00
Sven Mika
3013d9b341
[RLlib] Fix "Cannot convert a symbolic Tensor (default_policy/strided_slice_3:0) to a numpy array. This error may indicate that you're trying to pass a Tensor to a NumPy call, which is not supported" ( #17587 )
2021-08-05 11:39:15 -04:00
Kai Fricke
5d56a8aac5
[RLlib] Fix ignoring "sample_collector" config key ( #17460 )
2021-08-04 10:27:35 -04:00
Sven Mika
b539560429
[RLlib] Remove SampleBatch._get_slice_indices() deprecated logging spam (moderate). ( #17567 )
2021-08-04 04:21:32 -04:00
Sven Mika
5107d16ae5
[RLlib] Add @Deprecated decorator to simplify/unify deprecation of classes, methods, functions. ( #17530 )
2021-08-03 18:30:02 -04:00
Sven Mika
924f11cd45
[RLlib] Torch algos use now-framework-agnostic MultiGPUTrainOneStep execution op (~33% speedup for PPO-torch + GPU). ( #17371 )
2021-08-03 11:35:49 -04:00
Sven Mika
8a844ff840
[RLlib] Issues: 17397, 17425, 16715, 17174. When on driver, Torch|TFPolicy should not use ray.get_gpu_ids()
(b/c no GPUs assigned by ray). ( #17444 )
2021-08-02 17:29:59 -04:00
Kai Fricke
44d209dd5f
[tune] re-enable tensorboardx without torch installed ( #17403 )
2021-07-29 10:39:38 +01:00
kk-55
a7f8dc9d77
[RLlib] New and changed version of parametric actions cartpole example + small suggested update in policy_client.py ( #15664 )
2021-07-28 15:25:09 -04:00
Julius Frost
d7a5ec1830
[RLlib] SAC tuple observation space fix ( #17356 )
2021-07-28 12:39:28 -04:00
Sven Mika
0d8fce8fd8
[RLlib] Discussion 2294: Custom vector env example and fix. ( #16083 )
2021-07-28 10:40:04 -04:00
Rohan138
f30b444bac
[Rllib] set self._allow_unknown_config ( #17335 )
...
Co-authored-by: Sven Mika <sven@anyscale.io>
2021-07-28 11:48:41 +01:00
Sven Mika
58da5c1c9b
[RLlib] Discussion 3001: Fix comment on internal state shape (must be [B x S=state dim]). ( #17341 )
2021-07-27 21:41:53 -04:00
Sven Mika
90b21ce27e
[RLlib] De-flake 3 test cases; Fix config.simple_optimizer
and SampleBatch.is_training
warnings. ( #17321 )
2021-07-27 14:39:06 -04:00
Stefan Schneider
489febc6b2
[RLlib] Better example scripts: Description --no-tune and --local-mode CLI options ( #17038 )
2021-07-26 22:25:48 -04:00
Julius Frost
16be091702
[RLlib] Refactor if __name__ == "__main__"
into main()
method in rollout/train.py for better reusability ( #17315 )
2021-07-26 11:12:59 -04:00
Sven Mika
5231fdd996
[Testing] Split RLlib example scripts CI tests into 4 jobs (from 2). ( #17331 )
2021-07-26 10:52:55 -04:00
Sven Mika
0c5c70b584
[RLlib] Discussion 247: Allow remote sub-envs (within vectorized) to be used with custom APIs. ( #17118 )
2021-07-25 16:55:51 -04:00
Chris Bamford
29768a7c01
[RLLib] (P1 regression) Fixing view requirements in compute actions ( #15856 )
2021-07-25 14:25:07 -04:00
ddworak94
fba8461663
[RLlib] Add RNN-SAC agent ( #16577 )
...
Shoutout to @ddworak94 :)
2021-07-25 10:04:52 -04:00
Sven Mika
7bc4376466
[RLlib] Example script: Simple league-based self-play w/ open spiel env (markov soccer or connect-4). ( #17077 )
2021-07-22 10:59:13 -04:00
Richard Liaw
a78a2263e5
[RLlib] Fix reverted RockPaperScissors Pettingzoo example ( #16896 )
2021-07-22 10:55:07 -04:00
Vince Jankovics
05c9dfbbda
[RLlib] CV2 to Skimage dependency change ( #16841 )
2021-07-21 22:24:18 -04:00
Julius Frost
0b1b6222bc
[rllib] Add merge_trainer_config arguments to trainer template ( #17160 )
2021-07-21 15:43:06 -07:00
Sven Mika
5a313ba3d6
[RLlib] Refactor: All tf static graph code should reside inside Policy class. ( #17169 )
2021-07-20 14:58:13 -04:00
Amog Kamsetty
cb74053ee5
Retry remove gpustat
dependency ( #17115 )
...
* remove gpustat
* move psutil imports
2021-07-19 11:14:10 -07:00
Sven Mika
18d173b172
[RLlib] Implement policy_maps (multi-agent case) in RolloutWorkers as LRU caches. ( #17031 )
2021-07-19 13:16:03 -04:00
Sven Mika
e0640ad0dc
[RLlib] Fix seeding for ES and ARS. ( #16744 )
2021-07-19 13:13:05 -04:00
Sven Mika
649580d735
[RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). ( #17046 )
2021-07-15 05:51:24 -04:00
kk-55
13094a3f1c
AttributeError: 'numpy.ndarray' object has no attribute 'get_shape' when running with framework config tf2 or tfe ( #16868 )
2021-07-15 10:47:16 +01:00
Sven Mika
ce6dfc9b2d
[RLlib] Update tf1.x vs tf2.x documentation and eager example script. ( #17030 )
2021-07-13 20:02:17 -04:00
Grzegorz Bartyzel
d553d4da6c
[RLlib] DQN (Rainbow): Fix torch noisy layer support and loss ( #16716 )
2021-07-13 16:48:06 -04:00
Sven Mika
1fd0eb805e
[RLlib] Redo fix bug normalize vs unsquash actions (original PR made log-likelihood test flakey). ( #17014 )
2021-07-13 14:01:30 -04:00
Antoine Galataud
16f1011c07
[RLlib] Issue 15910: APEX current learning rate not updated on local worker ( #15911 )
2021-07-13 14:01:00 -04:00
Amog Kamsetty
38b5b6d24c
Revert "[RLlib] Simplify multiagent config (automatically infer class/spaces/config). ( #16565 )" ( #17036 )
...
This reverts commit e4123fff27
.
2021-07-13 09:57:15 -07:00
Kai Fricke
27d80c4c88
[RLlib] ONNX export for tensorflow (1.x) and torch ( #16805 )
2021-07-13 12:38:11 -04:00
Kai Fricke
3380b68b54
[RLlib] Issue 16683: Fix last infos dict ( #16999 ).
2021-07-13 11:33:48 -04:00
Sven Mika
e4123fff27
[RLlib] Simplify multiagent config (automatically infer class/spaces/config). ( #16565 )
2021-07-13 06:38:14 -04:00
Amog Kamsetty
df3dd81348
[rllib] skip highly flaky tests ( #17010 )
2021-07-12 11:18:28 -07:00