ray/rllib/offline
Kai Fricke 8c2e471265
[AIR] Add RLTrainer interface, implementation, and examples (#23465)
This PR adds a RLTrainer to Ray AIR. It works for both offline and online use cases. In offline training, it will leverage the datasets key of the Trainer API to specify a dataset reader input, used e.g. in Behavioral Cloning (BC). In online training, it is a wrapper around the rllib trainables making use of the parameter layering enabled by the Trainer API.
2022-04-08 17:16:42 -07:00
..
__init__.py [RLlib] Request CPU resources in Trainer.default_resource_request() if using dataset input. (#21948) 2022-02-02 10:20:37 +01:00
d4rl_reader.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
dataset_reader.py [AIR] Add RLTrainer interface, implementation, and examples (#23465) 2022-04-08 17:16:42 -07:00
dataset_writer.py [rllib] Fix error messages and example for dataset writer (#23419) 2022-03-28 19:53:12 +01:00
input_reader.py [docs] fix doctests and activate CI (#23418) 2022-03-24 17:04:02 -07:00
io_context.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
is_estimator.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
json_reader.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
json_writer.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
mixed_input.py [docs] fix doctests and activate CI (#23418) 2022-03-24 17:04:02 -07:00
off_policy_estimator.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
output_writer.py [RLlib; Docs overhaul] Docstring cleanup: Offline. (#19808) 2021-11-01 10:59:53 +01:00
resource.py [RLlib] Request CPU resources in Trainer.default_resource_request() if using dataset input. (#21948) 2022-02-02 10:20:37 +01:00
shuffled_input.py [RLlib; Docs overhaul] Docstring cleanup: Offline. (#19808) 2021-11-01 10:59:53 +01:00
wis_estimator.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00