Sven Mika
8a891b3c30
[RLlib] SAC n_step > 1. ( #10567 )
2020-09-05 22:26:42 +02:00
Sven Mika
244aafdcf8
[RLlib] Curiosity enhancements. ( #10373 )
2020-09-05 13:14:24 +02:00
architkulkarni
6ae9e76b81
[RLlib] Fix seeding issue ( #10589 )
2020-09-04 17:17:53 -07:00
Justin Terry
352718610d
Multi-agent Algorithm Documentation Updates ( #9722 )
2020-09-03 22:37:46 -07:00
Sven Mika
715ee8dfc9
[RLlib] Issue 10469: Callbacks should receive env idx ... ( #10477 )
2020-09-03 17:27:05 +02:00
Sven Mika
ef18893fb5
[RLlib] PPO, APPO, and DD-PPO code cleanup. ( #10420 )
2020-09-02 14:03:01 +02:00
Eric Liang
2a204260a8
[api] Second round of 1.0 API changes: exceptions, num_return_vals ( #10377 )
2020-08-28 19:57:02 -07:00
Eric Liang
519354a39a
[api] Initial API deprecations for Ray 1.0 ( #10325 )
2020-08-28 15:03:50 -07:00
Sven Mika
93120e0347
Unity3D API Fixes (recent changes in Unity's MLAgents API caused errors on RLlib side). ( #10285 )
2020-08-26 14:16:08 +02:00
Michael Luo
4e9888ce2f
[RLlib] Dreamer ( #10172 )
2020-08-26 13:24:05 +02:00
Olli Huotari
0dae50b5eb
Fixed num_atoms>1 in pytorch ( #10330 )
2020-08-25 23:10:20 -07:00
Eric Liang
deea1861ab
[rllib] Try fixing torch GPU and masking errors ( #10168 )
2020-08-25 18:34:19 -07:00
Benjamin Black
2689fb439c
Fixed pettingzoo env example ( #9973 )
2020-08-25 13:22:25 +02:00
raoul-khour-ts
c8c4832794
Prevent Local Worker creation from blocking remote worker creation by creating remote workers before local worker ( #10245 )
...
* create remote workers before local worker
* reformatted
2020-08-24 12:29:55 -07:00
Michael Luo
48a39d7cb9
[RLlib] Deepmind Control Suite Examples ( #9751 )
2020-08-23 12:53:08 +02:00
krfricke
c31876002d
[tune/rllib] made wandb compatible with rllib trainables ( #10252 )
2020-08-21 17:25:52 -07:00
Sven Mika
e968b52cb7
[RLlib] Trajectory view API - 03 Fast LSTM + prev actions/rewards ( #9950 )
2020-08-21 12:35:16 +02:00
Sven Mika
d14b501692
[RLlib] First attempt at cleaning up algo code in RLlib: PG. ( #10115 )
2020-08-20 17:05:57 +02:00
Raphael Avalos
8b704eb419
Small fix for Cuda Torch DQN. ( #10177 )
2020-08-19 13:28:05 -07:00
Sven Mika
2cbe29a7fa
[RLlib] Curiosity minor fixes, do-overs, and testing. ( #10143 )
2020-08-19 17:49:50 +02:00
Tomasz Wrona
aff7f19360
[tune] Added logger_config field ( #8521 )
...
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-08-18 11:10:22 -07:00
Eric Liang
ca133e2699
[rllib] Remove extra model config kwargs passed incorrectly for Torch models ( #10055 )
2020-08-17 11:12:20 -07:00
Julius Frost
dc659ae89a
make action probabilities a numpy array ( #10122 )
2020-08-16 11:25:12 -07:00
Olli Huotari
9ff599cbb8
torch policy now includes model.metrics ( #10121 )
...
* torch policy now includes model.metrics
* Fixed tests to work with custom metrics
* Forgot to run format.sh
2020-08-15 10:43:11 -07:00
Sven Mika
aeb5be7733
[RLlib] Trajectory View API (part 2.5): Actual implementations (not used yet) of a SampleCollector. ( #10112 )
2020-08-15 15:09:00 +02:00
Sven Mika
2256047876
[RLlib] Rename rllib.utils.types into typing to match built-in python module's name. ( #10114 )
2020-08-15 13:24:22 +02:00
Chua Cheow Huan
ea51e94729
[rllib] Learning rate schedule for DDPPO. ( #10006 )
...
* Get shared metrics, increment counter & set global vars for remote workers.
* Add unit test to test lr_schedule for DDPPO.
* Broadcast the local set of global vars to remote workers instead of independently setting the global vars on each rollout worker.
2020-08-15 00:51:45 -07:00
Tanay Wakhare
1826b29757
[RLlib] Curiosity (intrinsic motivation) Exploration module. ( #9912 )
2020-08-13 20:14:16 +02:00
Sven Mika
66d204e078
[RLlib] Model documentation enhancements. ( #10011 )
2020-08-13 13:36:40 +02:00
Sven Mika
0effcda3e4
Add missing int-casts for all shape calculating code (using np.product([some shape])). ( #10092 )
2020-08-13 12:04:22 +02:00
Julius Frost
6d9d2b320a
[RLlib] Support windows drives other than C drive for the offline json API ( #9909 )
2020-08-13 11:57:54 +02:00
yncxcw
32cd94b750
[Core] Do not convert gpu id to int ( #9744 )
...
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-08-11 12:09:46 -07:00
Sven Mika
4b10bdf8fc
[RLlib] rollout.py - Add multi-agent test case. ( #9981 )
2020-08-10 19:44:23 +02:00
Barak Michener
8e76796fd0
ci: Redo format.sh --all
script & backfill lint fixes ( #9956 )
2020-08-07 16:49:49 -07:00
Sven Mika
5d5643e633
[RLlib] Add informative error message when bad Conv2D stack is used with fixed num_outputs
(no flattening at end). ( #9966 )
2020-08-07 12:04:17 +02:00
Eric Liang
668f555755
[rllib] Clean up outdated docs #9915
2020-08-06 18:29:04 -07:00
Sven Mika
57690a3a9f
[RLlib] Trajectory view API - 02 actual API scaffold ( #9753 )
2020-08-06 10:54:20 +02:00
Sven Mika
19d785b947
[LINT] Except RLlib from checking for flake8 error F821 ( #9946 )
2020-08-06 10:44:37 +02:00
Sven Mika
9b90f7db67
[RLlib] Missing type annotations policy templates. ( #9846 )
2020-08-06 05:33:24 +02:00
Michael Luo
4d7bd8c892
[RLlib] Implementation of "Model-based Meta Policy Optimization" (MB MPO) ( #9409 )
2020-08-02 18:12:09 +02:00
Sven Mika
e540e425e4
[RLlib] rllib rollout
test and bug fixes. ( #9779 )
2020-07-30 16:17:03 +02:00
Sven Mika
f6bd12eb18
[RLlib] Add tensor-based tests for Schedules and fix some bugs related to using Schedules with tensor time input. ( #9782 )
2020-07-30 12:49:32 +02:00
Miguel Morales
372114b4ed
Update sampler.py ( #9805 )
...
Minor fix for warning string
2020-07-29 22:58:35 -07:00
Sven Mika
b0b0463161
[RLlib] Trajectory View API (preparatory cleanup and enhancements). ( #9678 )
2020-07-29 21:15:09 +02:00
Sven Mika
ff9c1dac88
[RLlib] Issue 9667 DDPG Torch bugs and enhancements. ( #9680 )
2020-07-28 14:15:03 +02:00
Sven Mika
e6ea33a03c
[RLlib] Enhance reward clipping test; add action_clipping tests. ( #9684 )
2020-07-28 10:44:54 +02:00
Michael Luo
b51ab2af66
[RLlib] Offline Type Annotations ( #9676 )
...
* Offline Annotations
* Modifications
* Fixed circular dependencies
* Linter fix
2020-07-27 14:01:17 -07:00
Sven Mika
5dc4b6686e
[RLlib] Implement DQN PyTorch distributional head. ( #9589 )
2020-07-25 09:29:24 +02:00
Petros Christodoulou
46c64c90d0
fixed simplex initialisation seeding bug ( #9660 )
...
Co-authored-by: Petros Christodoulou <petrochr@amazon.com>
2020-07-24 14:22:41 -07:00
Sven Mika
e4c5d3526f
Issue 9631: Tf1.14 does not have tf.config.list_physical_devices. ( #9681 )
2020-07-24 21:48:58 +02:00