1
0
Fork 0
mirror of https://github.com/vale981/ray synced 2025-03-13 22:56:38 -04:00
Commit graph

20 commits

Author SHA1 Message Date
Artur Niederfahrenhorst
d76ef9add5
[RLLib] Fix RNNSAC example failing on CI + fixes for recurrent models for other Q Learning Algos. () 2022-05-24 14:39:43 +02:00
Artur Niederfahrenhorst
cd16dc4dae
[RLlib] Fix estimated buffer size in replay buffers. () 2022-05-22 21:03:23 +02:00
Steven Morad
501d932449
[RLlib] SAC, RNNSAC, and CQL TrainerConfig objects () 2022-05-22 19:58:47 +02:00
Artur Niederfahrenhorst
fb2915d26a
[RLlib] Replay Buffer API and Ape-X. () 2022-05-17 13:43:49 +02:00
Max Pumperla
6a6c58b5b4
[RLlib] Config objects for DDPG and SimpleQ. () 2022-05-12 16:12:42 +02:00
Artur Niederfahrenhorst
95d4a83a87
[RLlib] R2D2 Replay Buffer API integration. () 2022-05-10 20:36:14 +02:00
Sven Mika
44a51610c2
[RLlib] SlateQ config objects. () 2022-05-10 20:07:18 +02:00
Artur Niederfahrenhorst
8d906f9bf8
[RLlib] SAC with new Replay Buffer API. () 2022-05-09 14:33:02 +02:00
Artur Niederfahrenhorst
bd2fdf4752
[RLlib] Automate sequences in timeslice_along_seq_lens_with_overlap(). () 2022-05-09 11:55:06 +02:00
Avnish Narayan
f2bb6f6806
[RLlib] Impala training iteration fn () 2022-05-05 16:11:08 +02:00
Artur Niederfahrenhorst
86bc9ecce2
[RLlib] DDPG Training iteration fn & Replay Buffer API () 2022-05-05 09:41:38 +02:00
Sven Mika
627b9f2e88
[RLlib] QMIX training iteration function and new replay buffer API. () 2022-04-27 14:24:20 +02:00
Sven Mika
bb4e5cb70a
[RLlib] CQL: training iteration function. () 2022-04-26 14:28:39 +02:00
Artur Niederfahrenhorst
e57ce7efd6
[RLlib] Replay Buffer API and Training Iteration Fn for DQN. () 2022-04-18 12:20:12 +02:00
Artur Niederfahrenhorst
02a50f02b7
[RLlib] RepayBuffer: _hit_counts working again. () 2022-04-07 10:56:25 +02:00
Artur Niederfahrenhorst
9a64bd4e9b
[RLlib] Simple-Q uses training iteration fn (instead of execution_plan); ReplayBuffer API for Simple-Q () 2022-03-29 14:44:40 +02:00
Artur Niederfahrenhorst
32ad6c6ef1
[RLlib] Replay Buffer capacity check () 2022-03-29 12:06:27 +02:00
Siyuan (Ryans) Zhuang
0c74ecad12
[Lint] Cleanup incorrectly formatted strings (Part 1: RLLib). () 2022-03-15 17:34:21 +01:00
Artur Niederfahrenhorst
37d129a965
[RLlib] ReplayBuffer API: Test cases. () 2022-03-08 16:54:12 +01:00
Artur Niederfahrenhorst
dea3574050
[RLlib] Replay Buffer API () 2022-02-09 15:04:43 +01:00