ray/rllib/offline
Julius Frost a88b217d3f
[rllib] Enhancements to Input API for customizing offline datasets (#16957)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-07-10 15:05:25 -07:00
..
__init__.py [RLlib] Support for D4RL + Semi-working CQL Benchmark (#13550) 2021-01-21 16:43:55 +01:00
d4rl_reader.py [RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. (#15603) 2021-05-04 19:06:19 +02:00
input_reader.py [RLlib] Trajectory view API docs. (#12718) 2020-12-30 17:32:21 -08:00
io_context.py [rllib] Enhancements to Input API for customizing offline datasets (#16957) 2021-07-10 15:05:25 -07:00
is_estimator.py [RLlib] CQL: Bug fixes and OPE example added to test and offline_rl.py example. (#15761) 2021-05-13 09:17:23 +02:00
json_reader.py [RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531) 2021-06-30 12:32:11 +02:00
json_writer.py [RLlib] Remove all (already soft-deprecated) SampleBatch.data from code. (#15335) 2021-04-15 19:19:51 +02:00
mixed_input.py [rllib] Enhancements to Input API for customizing offline datasets (#16957) 2021-07-10 15:05:25 -07:00
off_policy_estimator.py [RLlib] Fix bug in policy.py: normalize_actions=True has to call unsquash_action, not normalize_action. (#16774) 2021-07-08 17:31:34 +02:00
output_writer.py [RLlib] SAC algo cleanup. (#10825) 2020-09-20 11:27:02 +02:00
shuffled_input.py [RLlib] SAC algo cleanup. (#10825) 2020-09-20 11:27:02 +02:00
wis_estimator.py [RLlib] In OffPolicyEstimators (Offline RL): Include last step of trajectory (#12619) 2020-12-08 12:39:40 +01:00