Sven Mika
4b10bdf8fc
[RLlib] rollout.py - Add multi-agent test case. ( #9981 )
2020-08-10 19:44:23 +02:00
Barak Michener
8e76796fd0
ci: Redo format.sh --all
script & backfill lint fixes ( #9956 )
2020-08-07 16:49:49 -07:00
Sven Mika
5d5643e633
[RLlib] Add informative error message when bad Conv2D stack is used with fixed num_outputs
(no flattening at end). ( #9966 )
2020-08-07 12:04:17 +02:00
Eric Liang
668f555755
[rllib] Clean up outdated docs #9915
2020-08-06 18:29:04 -07:00
Sven Mika
57690a3a9f
[RLlib] Trajectory view API - 02 actual API scaffold ( #9753 )
2020-08-06 10:54:20 +02:00
Sven Mika
19d785b947
[LINT] Except RLlib from checking for flake8 error F821 ( #9946 )
2020-08-06 10:44:37 +02:00
Sven Mika
9b90f7db67
[RLlib] Missing type annotations policy templates. ( #9846 )
2020-08-06 05:33:24 +02:00
Michael Luo
4d7bd8c892
[RLlib] Implementation of "Model-based Meta Policy Optimization" (MB MPO) ( #9409 )
2020-08-02 18:12:09 +02:00
Sven Mika
e540e425e4
[RLlib] rllib rollout
test and bug fixes. ( #9779 )
2020-07-30 16:17:03 +02:00
Sven Mika
f6bd12eb18
[RLlib] Add tensor-based tests for Schedules and fix some bugs related to using Schedules with tensor time input. ( #9782 )
2020-07-30 12:49:32 +02:00
Miguel Morales
372114b4ed
Update sampler.py ( #9805 )
...
Minor fix for warning string
2020-07-29 22:58:35 -07:00
Sven Mika
b0b0463161
[RLlib] Trajectory View API (preparatory cleanup and enhancements). ( #9678 )
2020-07-29 21:15:09 +02:00
Sven Mika
ff9c1dac88
[RLlib] Issue 9667 DDPG Torch bugs and enhancements. ( #9680 )
2020-07-28 14:15:03 +02:00
Sven Mika
e6ea33a03c
[RLlib] Enhance reward clipping test; add action_clipping tests. ( #9684 )
2020-07-28 10:44:54 +02:00
Michael Luo
b51ab2af66
[RLlib] Offline Type Annotations ( #9676 )
...
* Offline Annotations
* Modifications
* Fixed circular dependencies
* Linter fix
2020-07-27 14:01:17 -07:00
Sven Mika
5dc4b6686e
[RLlib] Implement DQN PyTorch distributional head. ( #9589 )
2020-07-25 09:29:24 +02:00
Petros Christodoulou
46c64c90d0
fixed simplex initialisation seeding bug ( #9660 )
...
Co-authored-by: Petros Christodoulou <petrochr@amazon.com>
2020-07-24 14:22:41 -07:00
Sven Mika
e4c5d3526f
Issue 9631: Tf1.14 does not have tf.config.list_physical_devices. ( #9681 )
2020-07-24 21:48:58 +02:00
Eric Liang
590943a499
[rllib] Type annotations for model classes ( #9646 )
2020-07-24 12:01:46 -07:00
Eric Liang
5acd3e66dd
[rllib] Fix torch TD error, IMPALA LR updates ( #9477 )
...
* update
* add test
* lint
* fix super call
* speed es test up
2020-07-23 12:50:25 -07:00
Raphael Avalos
5303c3abe3
Fix TorchDeterministic ( #9241 )
2020-07-23 10:43:20 -07:00
Sven Mika
75592e664f
Issue 9568: rllib train
framework in config gets overridden with tf. ( #9572 )
2020-07-21 22:02:24 +02:00
Raphael Avalos
440c9c42be
[RLlib] Fix combination of lockstep and multiple agnts controlled by the same policy. ( #9521 )
...
* Change aggregation when lockstep is activated.
Modification of MultiAgentBatch.timeslices to support the combination of lockstep and multiple agents controlled by the same policy.
fix ray-project/ray#9295
* Line too long.
2020-07-19 23:03:12 -07:00
Sven Mika
887cf5eca7
MADDPG learning confirmation test. ( #9538 )
2020-07-17 20:18:02 +02:00
Sven Mika
78dfed2683
[RLlib] Issue 8384: QMIX doesn't learn anything. ( #9527 )
2020-07-17 12:14:34 +02:00
Sven Mika
8204717eed
[RLlib] Issue 9218: PyTorch Policy places Model on GPU even with num_gpus=0 ( #9516 )
2020-07-17 05:53:25 +02:00
Michael Luo
94fcd43593
[rllib] MAML Transform ( #9463 )
...
* MAML Transform
* Moved Inner Adapt to Method in Execution Plan
2020-07-16 11:11:33 -07:00
Sven Mika
935d8308fb
[RLlib] Issue #9437 (PyTorch converts to CPU tensor, even if on GPU). ( #9497 )
2020-07-16 14:55:50 +02:00
Sven Mika
617eb8f279
[RLlib] Issue 9402 MARWIL producing nan rewards. ( #9429 )
2020-07-14 05:07:16 +02:00
Sven Mika
03ab86567f
[RLlib] Layout of Trajectory View API (new class: Trajectory; not used yet). ( #9269 )
2020-07-14 04:27:49 +02:00
Tanay Wakhare
15aa08a3d1
[RLLib] WindowStat bug fix ( #9213 )
...
* WindowStat error catching, which processes NaNs properly instead of erroring. This ought to resolve issue #7910 .
https://github.com/ray-project/ray/issues/7910
2020-07-12 23:01:32 +02:00
Tanay Wakhare
3536d8e4b3
Masking error. With t*valid_mask, we get the error np.inf*0 = np.inf ( #9407 )
2020-07-12 22:59:35 +02:00
Sven Mika
fcdf410ae1
[RLlib] Tf2.x native. ( #8752 )
2020-07-11 22:06:35 +02:00
Sven Mika
14160ca58c
[RLlib] Issue #9366 (DQN w/o dueling produces invalid actions). ( #9386 )
2020-07-10 12:43:03 +02:00
Hao Chen
d49dadf891
Change Python's ObjectID
to ObjectRef
( #9353 )
2020-07-10 17:49:04 +08:00
Sven Mika
01125b8fcf
[RLlib] DQN rainbow eager-mode (keras style NoisyLayer) (preparation for native tf2.x support). ( #9304 )
2020-07-09 10:44:10 +02:00
Sven Mika
4da0e542d5
[RLlib] DDPG and SAC eager support (preparation for tf2.x) ( #9204 )
2020-07-08 16:12:20 +02:00
Benjamin Black
1425cdf834
Pettingzoo environment support ( #9271 )
...
* added pettingzoo wrapper env and example
* added docs, examples for pettingzoo env support
* fixed pettingzoo env flake8, added test
* fixed pettingzoo env import
* fixed pettingzoo env import
* fixed pettingzoo import issue
* fixed pettingzoo test
* fixed linting problem
* fixed bad quotes
* future proofed pettingzoo dependency
* fixed ray init in pettingzoo env
* lint
* manual lint
Co-authored-by: Eric Liang <ekhliang@gmail.com>
2020-07-06 21:32:26 -07:00
Piotr Januszewski
155cc81e40
Clarify training intensity configuration docstring ( #9244 ) ( #9306 )
2020-07-05 20:07:27 -07:00
Sven Mika
f43d934817
[RLlib] Type annotations for policy. ( #9248 )
2020-07-05 13:09:51 +02:00
Michael Luo
851d02463b
[Doc] RLlib Algorithms Documentation: MAML + PyTorch MAML ( #9189 )
2020-07-03 11:05:15 -07:00
Eric Liang
4b62a888cc
[rllib] Remove deprecated policy optimizer package. ( #9262 )
2020-07-02 14:39:40 -07:00
Sven Mika
5b2a97597b
[RLlib] Retire try_import_tree
(should be installed along with other requirements). ( #9211 )
...
- Retire try_import_tree.
- Stabilize test_supported_multi_agent.py.
2020-07-02 13:06:34 +02:00
Sven Mika
c4ccbfdfa9
[RLlib] tf-eager support for ES and ARS (tf2.x preparation). ( #9207 )
2020-07-02 13:03:10 +02:00
Richard Liaw
d35f0e40d0
[tune] Use public methods for trainable ( #9184 )
2020-07-01 11:00:00 -07:00
Sven Mika
b4c0b942fe
[RLlib] Remove requirement for dataclasses in rllib (not supported in py3.5) ( #9237 )
2020-07-01 17:31:44 +02:00
Sven Mika
43043ee4d5
[RLlib] Tf2x preparation; part 2 (upgrading try_import_tf()
). ( #9136 )
...
* WIP.
* Fixes.
* LINT.
* WIP.
* WIP.
* Fixes.
* Fixes.
* Fixes.
* Fixes.
* WIP.
* Fixes.
* Test
* Fix.
* Fixes and LINT.
* Fixes and LINT.
* LINT.
2020-06-30 10:13:20 +02:00
Tanay Wakhare
efcee9f1de
[RLlib] MADDPG bug fix (issue https://github.com/ray-project/ray/issues/8483 ) ( #9110 )
...
* Bug fix for https://githhub.com/ray-project/ray/issues/8483
We need to pass in a framework explicitly with the new defaults. Further, the actual bug was that policies were being sorted alphabetically in the MADDPG init(), which led to incorrect initialization.
* Linting
2020-06-30 00:27:32 -07:00
Sven Mika
0d37103f84
[RLlib] Prototype: Model Trajectory View API, part 0 ( #9171 )
2020-06-30 05:33:19 +02:00
Michael Luo
eb8eb2c71a
[RLLib] DM Control Suite Wrapper and Examples ( #9031 )
...
* DM Control Suite Added
* Added License
* Changes
* Test fixes
2020-06-29 17:58:29 -07:00