Julius Frost
a88b217d3f
[rllib] Enhancements to Input API for customizing offline datasets ( #16957 )
...
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-07-10 15:05:25 -07:00
Francesco Stranieri
01c533c171
[rlib] Independent bound for each dimension AssertionError #16845 ( #16860 )
...
* Fix AssertionError for Box space type
Restored support for Box space type with independent bound for each dimension.
* Removed unnecessary assertion for Box space type
2021-07-10 14:48:35 -07:00
Sven Mika
7862dd64ea
[RLlib] Fix bug in policy.py: normalize_actions=True has to call unsquash_action
, not normalize_action
. ( #16774 )
2021-07-08 17:31:34 +02:00
Sven Mika
9f6a92163b
[RLlib] Remove old UsageTrackingDict code. ( #16867 )
2021-07-08 17:27:52 +02:00
Kai Fricke
10fd7111b3
[rllib] Improve test learning check, fix flaky two step qmix ( #16843 )
2021-07-06 19:39:12 +01:00
Amog Kamsetty
ecb632140f
Revert "RockPaperScissors Pettingzoo" ( #16886 )
...
This reverts commit bf3e3225b6
.
2021-07-06 09:43:47 -07:00
Rodrigo de Lazcano
bf3e3225b6
RockPaperScissors Pettingzoo ( #16725 )
2021-07-05 09:52:08 -07:00
Julius Frost
7842bda50a
[rllib] Fix to allow input strings that are not file paths ( #16830 )
2021-07-03 01:12:47 -07:00
Amog Kamsetty
33f31f53c8
[Rllib] Torch Backwards Compatibility ( #16813 )
2021-07-01 19:17:54 -07:00
Rodrigo de Lazcano
5072d86323
[rllib] parallel pettingzoo import ( #16722 )
2021-07-01 18:37:59 -07:00
Julius Frost
ada0552f16
[rllib] d4rl: fix for paths with multiple periods ( #16721 )
2021-07-01 18:35:50 -07:00
Sven Mika
7eb1a29426
[RLlib] Fix ModelV2 custom metrics for torch. ( #16734 )
2021-07-01 13:01:40 +02:00
Sven Mika
ce3e550c43
[RLlib] Enhance comment in example script multi_agent_custom_policy. ( #16740 )
2021-07-01 10:28:38 +02:00
Sven Mika
53206dd440
[RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes ( #16531 )
2021-06-30 12:32:11 +02:00
Amog Kamsetty
abd16a8438
[RLlib] Skip two_step_game_qmix test ( #16758 )
2021-06-29 14:27:48 -07:00
Travis Addair
e5dfa4cfb9
[tune] Only use TBXLoggerCallback when torch is installed ( #16695 )
...
* [tune] Only use TBXLoggerCallback when torch is installed
* Fix lint
* fix
* Update python/ray/tune/utils/callback.py
Co-authored-by: Amog Kamsetty <amogkamsetty@yahoo.com>
Co-authored-by: Amog Kamsetty <amogkam@users.noreply.github.com>
2021-06-28 16:34:20 -07:00
Amog Kamsetty
be1f6d59fa
[CI] Re-try Tag rllib flaky tests ( #16680 )
2021-06-28 18:42:54 +02:00
AnnaKosiorek
1e709771b2
[rllib][minor] clarification of the softmax axis in dqn_torch_policy ( #16311 )
...
pytorch nn.functional.softmax (unlike tf.nn.softmax) calculates softmax along zeroth dimension by default
2021-06-26 11:19:54 -07:00
architkulkarni
35039869ee
Revert "[RLlib] Add some learning tests to rllib-flaky ( #16604 )" ( #16677 )
...
This reverts commit d1510911e0
.
2021-06-25 09:37:58 -07:00
Eric Liang
1c709cbeb3
Fix typing ( #16668 )
2021-06-24 22:06:33 -07:00
Amog Kamsetty
d1510911e0
[RLlib] Add some learning tests to rllib-flaky ( #16604 )
2021-06-25 00:28:54 +02:00
Sven Mika
c95dea51e9
[RLlib] External env enhancements + more examples. ( #16583 )
2021-06-23 09:09:01 +02:00
mvindiola1
82a3ff795c
[RLlib] ensure curiosity exploration actions are passed in as tf tens… ( #15704 )
2021-06-21 10:03:17 -07:00
Benjamin D. Killeen
50049f86d0
[rllib] check if self.env is not None
explicitly ( #15634 )
...
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-06-21 10:02:13 -07:00
Sven Mika
be6db06485
[RLlib] Re-do: Trainer: Support add and delete Policies. ( #16569 )
2021-06-21 13:46:01 +02:00
Sven Mika
169ddabae7
[RLlib] Issue 15973: Trainer.with_updates(validate_config=...) behaves confusingly. ( #16429 )
2021-06-19 22:42:00 +02:00
Sven Mika
79a9d6d517
[RLlib] Issues 16287 and 16200: RLlib not rendering custom multi-agent Envs. ( #16428 )
2021-06-19 08:57:53 +02:00
Amog Kamsetty
bd3cbfc56a
Revert "[RLlib] Allow policies to be added/deleted on the fly. ( #16359 )" ( #16543 )
...
This reverts commit e78ec370a9
.
2021-06-18 12:21:49 -07:00
Sven Mika
2900a06dd7
[RLlib] Issue 14503: SAC not allowing custom action distributions. ( #16427 )
2021-06-18 17:27:29 +02:00
Sven Mika
e78ec370a9
[RLlib] Allow policies to be added/deleted on the fly. ( #16359 )
2021-06-18 10:31:30 +02:00
Sven Mika
a5831f9429
[RLlib] Fix bandit example scripts and add all scripts to CI testing suite.
2021-06-15 13:30:31 +02:00
Sven Mika
d0014cd351
[RLlib] Policies get/set_state fixes and enhancements. ( #16354 )
2021-06-15 13:08:43 +02:00
Chris Bamford
fd1a97e39f
[RLlib] Memory leak docs ( #15908 )
2021-06-10 18:10:21 +02:00
Sven Mika
3d4dc60e2e
[RLlib] CQL iteration count fixes: Remove dummy buffer and unnecessary store op from exec_plan. ( #16332 )
2021-06-10 07:49:17 +02:00
matthewdeng
138b273136
[rllib] Add tests for examples using ray client ( #16271 )
...
* [rllib] add tests for examples using ray client
* rename test_client to test_ray_client
2021-06-09 10:39:14 -07:00
Sven Mika
4b8dadccbd
[RLlib] Fix PR 16162: Having added sleep to _NextValueNotReady
causes TD3 tests to become flakey. ( #16309 )
2021-06-08 07:27:02 -07:00
Gerges Dib
f8cf4a1985
[RLlib] Fixed import tensorflow when module not available ( #16171 )
2021-06-04 10:07:59 +02:00
Sven Mika
e2be41b407
[RLlib] MARWIL + BC: Various fixes and enhancements. ( #16218 )
2021-06-03 22:29:00 +02:00
Sven Mika
c9d220bcda
[RLlib] Upgrade RLlib regression test scripts to new testing tool - RLlib release logs for 1.4. ( #16080 )
2021-06-01 17:39:18 +02:00
Chris Bamford
1e3721ef4a
[RLlib] Remove bad spinlocks to allow pytorch GPU scheduler to interrupt. ( #16162 )
2021-06-01 16:40:28 +02:00
Sven Mika
5fe34862ce
[RLlib] DDPG torch GPU bug. ( #16133 )
2021-05-28 22:09:25 +02:00
Sven Mika
33a69135cb
[RLlib] Issue 16117: DQN/APEX torch not working on GPU. ( #16118 )
2021-05-28 09:12:53 +02:00
Sven Mika
f6302d81be
[RLlib] Discussion 2210: BC algo broken, if "advantages" missing in offline data. ( #16019 )
2021-05-25 08:47:17 +02:00
Eric Liang
810f5c803a
Disable flaky object spilling test on OSX & adjust test timeouts ( #15986 )
...
* blacklist
* move it
* adjust according to bazel timeouts
* fix build
* move to large
* Update BUILD
2021-05-24 09:49:59 -07:00
Steven Morad
581d63e607
[RLlib] Fix dnc input shape ( #15939 )
...
Co-authored-by: Steven Morad <sm2558@cam.ac.uk>
2021-05-20 19:06:02 -07:00
Sven Mika
e80095591c
[RLlib] Entropy coeff schedule bug fix and git bisect script. ( #15937 )
2021-05-20 18:15:10 +02:00
Sven Mika
03c7c530a9
[RLlib] Issue 15483: Wrong init states (should be non-zero if ModelV2.get_initial_state
returns non-zero values). ( #15733 )
2021-05-20 09:28:09 +02:00
Sven Mika
2d34216660
[RLlib] APEX-DQN: Bug fix for torch and add learning test. ( #15762 )
2021-05-20 09:27:03 +02:00
Sven Mika
eaa7f6696d
[RLlib] Issue 15887: MARWIL adv norm update mismatch for tf (static-graph) vs torch versions. ( #15898 )
2021-05-19 15:44:11 -07:00
Stefan Schneider
55709bac7a
[RLlib] Examples for training, saving, loading, testing an agent with SB & RLlib ( #15897 )
2021-05-19 16:36:59 +02:00