ray/rllib/offline at eb12033612f4a8833f5244aaa14376dcd0c60c3b - hiro/ray

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

History

Michael Luo 587f207c2f [RLlib] Support for D4RL + Semi-working CQL Benchmark (#13550 )		2021-01-21 16:43:55 +01:00
..
__init__.py	[RLlib] Support for D4RL + Semi-working CQL Benchmark (#13550 )	2021-01-21 16:43:55 +01:00
d4rl_reader.py	[RLlib] Support for D4RL + Semi-working CQL Benchmark (#13550 )	2021-01-21 16:43:55 +01:00
input_reader.py	[RLlib] Trajectory view API docs. (#12718 )	2020-12-30 17:32:21 -08:00
io_context.py	[RLlib] BC/MARWIL/recurrent nets minor cleanups and bug fixes. (#13064 )	2020-12-27 09:46:03 -05:00
is_estimator.py	[RLlib] In OffPolicyEstimators (Offline RL): Include last step of trajectory (#12619 )	2020-12-08 12:39:40 +01:00
json_reader.py	[RLlib] BC/MARWIL/recurrent nets minor cleanups and bug fixes. (#13064 )	2020-12-27 09:46:03 -05:00
json_writer.py	[RLlib] BC/MARWIL/recurrent nets minor cleanups and bug fixes. (#13064 )	2020-12-27 09:46:03 -05:00
mixed_input.py	[rllib] Forgot to pass ioctx to child json readers (#11839 )	2020-11-05 22:07:57 -08:00
off_policy_estimator.py	[RLlib] Fix offline logp vs prob bug in OffPolicyEstimator class. (#12158 )	2020-11-20 08:59:43 +01:00
output_writer.py	[RLlib] SAC algo cleanup. (#10825 )	2020-09-20 11:27:02 +02:00
shuffled_input.py	[RLlib] SAC algo cleanup. (#10825 )	2020-09-20 11:27:02 +02:00
wis_estimator.py	[RLlib] In OffPolicyEstimators (Offline RL): Include last step of trajectory (#12619 )	2020-12-08 12:39:40 +01:00