Sven Mika
3f89f35e52
[RLlib] Better error messages and hints; + failure-mode tests; ( #18466 )
2021-09-10 16:52:47 +02:00
Sven Mika
cabaa3b3c6
[RLlib Testing] Add A3C/APPO/BC/DDPPO/MARWIL/CQL/ES/ARS/TD3 to weekly learning tests. ( #18381 )
2021-09-07 11:48:41 +02:00
Sven Mika
e3e6ed7aaa
[RLlib] Issues 17844, 18034: Fix n-step > 1 bug. ( #18358 )
2021-09-06 12:14:20 +02:00
Sven Mika
599e589481
[RLlib] Move existing fake multi-GPU learning tests into separate buildkite job. ( #18065 )
2021-08-31 14:56:53 +02:00
Sven Mika
4888d7c9af
[RLlib] Replay buffers: Add config option to store contents in checkpoints. ( #17999 )
2021-08-31 12:21:49 +02:00
Sven Mika
8acb469b04
[RLlib; Testing] Green all RLlib nightly tests. ( #18073 )
2021-08-26 14:09:20 +02:00
Sven Mika
9883505e84
[RLlib] Add [LSTM=True + multi-GPU]-tests to nightly RLlib testing suite (for all algos supporting RNNs, except R2D2, RNNSAC, and DDPPO). ( #18017 )
2021-08-24 21:55:27 +02:00
Sven Mika
ff7740e508
[RLlib] Fix the git-bisect helper/debugging script. ( #18040 )
2021-08-24 18:08:29 +02:00
Sven Mika
494ddd98c1
[RLlib] Replace "seq_lens" w/ SampleBatch.SEQ_LENS. ( #17928 )
2021-08-21 17:05:48 +02:00
Clark Zinzow
d958457d07
[Core] Second pass at privatizing APIs. ( #17885 )
...
* gcs_utils
* resource_spec
* profiling
* ray_perf and ray_cluster_perf
* test_utils
2021-08-18 20:56:33 -07:00
Sven Mika
f18213712f
[RLlib] Redo: "fix self play example scripts" PR (17566) ( #17895 )
...
* wip.
* wip.
* wip.
* wip.
* wip.
* wip.
* wip.
* wip.
* wip.
2021-08-17 09:13:35 -07:00
Ian Rodney
2f200e5c2b
[Client] Pass ray.init()
args to the remote server ( #17776 )
2021-08-16 12:34:01 -07:00
dependabot[bot]
91d01f7211
[RLlib](deps): Bump tensorflow from 2.4.1 to 2.5.0 in /python/requirements/rllib ( #15849 )
2021-08-16 10:55:48 +02:00
Sven Mika
2bd2ee7a73
[RLlib] SampleBatch: Docstring- and API cleanups; Add support for nested data. ( #17485 )
2021-08-16 06:08:14 +02:00
mguarin0
3e010c5760
[rllib] bug fix for rllib pettingzoo pistonball_v4 example ( #17701 )
...
* bug fix for rllib pettingzoo pistonball_v4 example
* adding test for PR 17701
* ran scripts/format.sh
* ok
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-08-12 00:25:00 -07:00
Sven Mika
7f2b3c0824
[RLlib] Issue 17667: CQL-torch + GPU not working (due to simple_optimizer=False; must use simple optimizer!). ( #17742 )
2021-08-11 18:30:21 +02:00
Amog Kamsetty
77f28f1c30
Revert "[RLlib] Fix Trainer.add_policy
for num_workers>0 (self play example scripts). ( #17566 )" ( #17709 )
...
This reverts commit 3b447265d8
.
2021-08-10 10:50:01 -07:00
Sven Mika
3b447265d8
[RLlib] Fix Trainer.add_policy
for num_workers>0 (self play example scripts). ( #17566 )
2021-08-05 11:41:18 -04:00
Sven Mika
8a844ff840
[RLlib] Issues: 17397, 17425, 16715, 17174. When on driver, Torch|TFPolicy should not use ray.get_gpu_ids()
(b/c no GPUs assigned by ray). ( #17444 )
2021-08-02 17:29:59 -04:00
Kai Fricke
44d209dd5f
[tune] re-enable tensorboardx without torch installed ( #17403 )
2021-07-29 10:39:38 +01:00
ddworak94
fba8461663
[RLlib] Add RNN-SAC agent ( #16577 )
...
Shoutout to @ddworak94 :)
2021-07-25 10:04:52 -04:00
Sven Mika
5a313ba3d6
[RLlib] Refactor: All tf static graph code should reside inside Policy class. ( #17169 )
2021-07-20 14:58:13 -04:00
Sven Mika
649580d735
[RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). ( #17046 )
2021-07-15 05:51:24 -04:00
Amog Kamsetty
38b5b6d24c
Revert "[RLlib] Simplify multiagent config (automatically infer class/spaces/config). ( #16565 )" ( #17036 )
...
This reverts commit e4123fff27
.
2021-07-13 09:57:15 -07:00
Sven Mika
e4123fff27
[RLlib] Simplify multiagent config (automatically infer class/spaces/config). ( #16565 )
2021-07-13 06:38:14 -04:00
Julius Frost
a88b217d3f
[rllib] Enhancements to Input API for customizing offline datasets ( #16957 )
...
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-07-10 15:05:25 -07:00
Sven Mika
53206dd440
[RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes ( #16531 )
2021-06-30 12:32:11 +02:00
Travis Addair
e5dfa4cfb9
[tune] Only use TBXLoggerCallback when torch is installed ( #16695 )
...
* [tune] Only use TBXLoggerCallback when torch is installed
* Fix lint
* fix
* Update python/ray/tune/utils/callback.py
Co-authored-by: Amog Kamsetty <amogkamsetty@yahoo.com>
Co-authored-by: Amog Kamsetty <amogkam@users.noreply.github.com>
2021-06-28 16:34:20 -07:00
Sven Mika
be6db06485
[RLlib] Re-do: Trainer: Support add and delete Policies. ( #16569 )
2021-06-21 13:46:01 +02:00
Amog Kamsetty
bd3cbfc56a
Revert "[RLlib] Allow policies to be added/deleted on the fly. ( #16359 )" ( #16543 )
...
This reverts commit e78ec370a9
.
2021-06-18 12:21:49 -07:00
Sven Mika
e78ec370a9
[RLlib] Allow policies to be added/deleted on the fly. ( #16359 )
2021-06-18 10:31:30 +02:00
matthewdeng
138b273136
[rllib] Add tests for examples using ray client ( #16271 )
...
* [rllib] add tests for examples using ray client
* rename test_client to test_ray_client
2021-06-09 10:39:14 -07:00
Sven Mika
e2be41b407
[RLlib] MARWIL + BC: Various fixes and enhancements. ( #16218 )
2021-06-03 22:29:00 +02:00
Sven Mika
c9d220bcda
[RLlib] Upgrade RLlib regression test scripts to new testing tool - RLlib release logs for 1.4. ( #16080 )
2021-06-01 17:39:18 +02:00
Sven Mika
f6302d81be
[RLlib] Discussion 2210: BC algo broken, if "advantages" missing in offline data. ( #16019 )
2021-05-25 08:47:17 +02:00
Sven Mika
e80095591c
[RLlib] Entropy coeff schedule bug fix and git bisect script. ( #15937 )
2021-05-20 18:15:10 +02:00
Steven Morad
d8eed68af2
[RLlib] Add differentiable neural computer example ( #14844 )
2021-05-19 09:15:39 +02:00
Sven Mika
c4a3e1589b
[RLlib] CQL: Bug fixes and OPE example added to test and offline_rl.py example. ( #15761 )
2021-05-13 09:17:23 +02:00
Sven Mika
16ddab49f5
[RLlib] Trainer._evaluate -> Trainer.evaluate; Also make evaluation possible w/o evaluation worker set. ( #15591 )
2021-05-12 12:16:00 +02:00
Michael Luo
4cbe13cdfd
[RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. ( #15603 )
...
Co-authored-by: Sven Mika <sven@anyscale.io>
Co-authored-by: sven1977 <svenmika1977@gmail.com>
2021-05-04 19:06:19 +02:00
Amog Kamsetty
ebc44c3d76
[CI] Upgrade flake8 to 3.9.1 ( #15527 )
...
* formatting
* format util
* format release
* format rllib/agents
* format rllib/env
* format rllib/execution
* format rllib/evaluation
* format rllib/examples
* format rllib/policy
* format rllib utils and tests
* format streaming
* more formatting
* update requirements files
* fix rllib type checking
* updates
* update
* fix circular import
* Update python/ray/tests/test_runtime_env.py
* noqa
2021-05-03 14:23:28 -07:00
Sven Mika
e973b726c2
[RLlib] Support native tf.keras.Models (part 2) - Default keras models for Vision/RNN/Attention. ( #15273 )
2021-04-30 19:26:30 +02:00
Sven Mika
fc3a65f9d4
[RLlib] Split test_checkpoint_restore tests into 3 and make each "large" (from "enormous"). ( #15499 )
2021-04-30 12:33:12 +02:00
Sven Mika
354c960fff
[RLlib] Fix test_dependency_torch and fix custom logger support for RLlib. ( #15120 )
2021-04-24 08:13:41 +02:00
Sven Mika
bbfa8ffec9
[RLlib] Minor release 1.3 warnings cleanups. ( #15272 )
2021-04-14 14:03:15 +02:00
Sven Mika
b267f1f1ba
[RLlib] Add support for Int-Box action spaces. ( #15012 )
2021-04-11 13:16:01 +02:00
Yi Cheng
e552e3f19c
Skip test_dependency_torch ( #15123 )
2021-04-05 18:02:10 -07:00
Kai Fricke
d33b0e4bc3
[tune] Reconcile placement groups every N seconds to avoid bottlenecks when running many short trials ( #15011 )
...
Closes a release blocking issue
2021-04-01 17:04:44 +02:00
Michael Luo
b84575c092
[RLlib] 2 RLlib Flaky Tests ( #14930 )
2021-03-30 19:21:13 +02:00
Eric Liang
b90cc51c27
[RLlib] Attempt splitting rollout test to avoid initial timeout ( #14999 )
2021-03-30 19:20:02 +02:00