kourosh hakhamaneshi
|
aec79afda1
|
[RLlib] Fixes CRR flakeyness (#26770)
|
2022-07-20 12:08:57 -07:00 |
|
Avnish Narayan
|
1243ed62bf
|
[RLlib] Make Dataset reader default reader and enable CRR to use dataset (#26304)
Co-authored-by: avnish <avnish@avnishs-MBP.local.meter>
|
2022-07-08 12:43:35 -07:00 |
|
kourosh hakhamaneshi
|
f421730b47
|
[RLlib] Added expectation advantage_type option to CRR. (#26142)
|
2022-06-28 15:40:09 +02:00 |
|
Avnish Narayan
|
d859b84058
|
[RLlib] Add compute log likelihoods test for CRR. (#25905)
|
2022-06-21 16:06:10 +02:00 |
|
kourosh hakhamaneshi
|
25940cb95b
|
[RLlib] CRR documentation. (#25667)
|
2022-06-14 12:45:36 +02:00 |
|
Sven Mika
|
130b7eeaba
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
|
Sven Mika
|
7c39aa5fac
|
[RLlib] Trainer.training_iteration -> Trainer.training_step; Iterations vs reportings: Clarification of terms. (#25076)
|
2022-06-10 17:09:18 +02:00 |
|
Sven Mika
|
388fb98c79
|
[RLlib] CRR Tests fixes. (#25586)
|
2022-06-08 19:18:55 +02:00 |
|
kourosh hakhamaneshi
|
4cdd508f70
|
[RLlib] Added CRR implementation. (#25499)
|
2022-06-08 11:42:02 +02:00 |
|