Artur Niederfahrenhorst
|
d76ef9add5
|
[RLLib] Fix RNNSAC example failing on CI + fixes for recurrent models for other Q Learning Algos. (#24923)
|
2022-05-24 14:39:43 +02:00 |
|
Artur Niederfahrenhorst
|
cd16dc4dae
|
[RLlib] Fix estimated buffer size in replay buffers. (#24848)
|
2022-05-22 21:03:23 +02:00 |
|
Artur Niederfahrenhorst
|
fb2915d26a
|
[RLlib] Replay Buffer API and Ape-X. (#24506)
|
2022-05-17 13:43:49 +02:00 |
|
Avnish Narayan
|
f2bb6f6806
|
[RLlib] Impala training iteration fn (#23454)
|
2022-05-05 16:11:08 +02:00 |
|
Sven Mika
|
627b9f2e88
|
[RLlib] QMIX training iteration function and new replay buffer API. (#24164)
|
2022-04-27 14:24:20 +02:00 |
|
Artur Niederfahrenhorst
|
e57ce7efd6
|
[RLlib] Replay Buffer API and Training Iteration Fn for DQN. (#23420)
|
2022-04-18 12:20:12 +02:00 |
|
Artur Niederfahrenhorst
|
02a50f02b7
|
[RLlib] RepayBuffer: _hit_counts working again. (#23586)
|
2022-04-07 10:56:25 +02:00 |
|
Artur Niederfahrenhorst
|
9a64bd4e9b
|
[RLlib] Simple-Q uses training iteration fn (instead of execution_plan); ReplayBuffer API for Simple-Q (#22842)
|
2022-03-29 14:44:40 +02:00 |
|
Artur Niederfahrenhorst
|
32ad6c6ef1
|
[RLlib] Replay Buffer capacity check (#23523)
|
2022-03-29 12:06:27 +02:00 |
|
Siyuan (Ryans) Zhuang
|
0c74ecad12
|
[Lint] Cleanup incorrectly formatted strings (Part 1: RLLib). (#23128)
|
2022-03-15 17:34:21 +01:00 |
|
Artur Niederfahrenhorst
|
37d129a965
|
[RLlib] ReplayBuffer API: Test cases. (#22390)
|
2022-03-08 16:54:12 +01:00 |
|
Artur Niederfahrenhorst
|
dea3574050
|
[RLlib] Replay Buffer API (#22114)
|
2022-02-09 15:04:43 +01:00 |
|