Sven Mika
|
ea4a22249c
|
[RLlib] Add simple action-masking example script/env/model (tf and torch). (#18494)
|
2021-09-11 23:08:09 +02:00 |
|
Sven Mika
|
8ea1bc5ff9
|
[RLlib] Allow for more than 2^31 policy timesteps. (#11301)
|
2020-10-12 13:49:11 -07:00 |
|
Barak Michener
|
8e76796fd0
|
ci: Redo format.sh --all script & backfill lint fixes (#9956)
|
2020-08-07 16:49:49 -07:00 |
|
Sven Mika
|
e6ea33a03c
|
[RLlib] Enhance reward clipping test; add action_clipping tests. (#9684)
|
2020-07-28 10:44:54 +02:00 |
|
Sven Mika
|
d8a081a185
|
[RLlib] Unity3D integration (n Unity3D clients vs learning server). (#8590)
|
2020-05-30 22:48:34 +02:00 |
|
Sven Mika
|
42991d723f
|
[RLlib] rllib/examples folder restructuring (#8250)
Cleans up the rllib/examples folder by moving all example Envs into rllib/examples/env (so they can be used by other scripts and tests as well).
|
2020-05-01 22:59:34 +02:00 |
|