ray/rllib/offline at 3c0a3f4cc1fcb57bd47685487a637d33dce9e406 - hiro/ray

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

History

Kai Fricke 8c2e471265 [AIR] Add RLTrainer interface, implementation, and examples (#23465 ) This PR adds a RLTrainer to Ray AIR. It works for both offline and online use cases. In offline training, it will leverage the datasets key of the Trainer API to specify a dataset reader input, used e.g. in Behavioral Cloning (BC). In online training, it is a wrapper around the rllib trainables making use of the parameter layering enabled by the Trainer API.		2022-04-08 17:16:42 -07:00
..
__init__.py	[RLlib] Request CPU resources in `Trainer.default_resource_request()` if using dataset input. (#21948 )	2022-02-02 10:20:37 +01:00
d4rl_reader.py	[CI] Format Python code with Black (#21975 )	2022-01-29 18:41:57 -08:00
dataset_reader.py	[AIR] Add RLTrainer interface, implementation, and examples (#23465 )	2022-04-08 17:16:42 -07:00
dataset_writer.py	[rllib] Fix error messages and example for dataset writer (#23419 )	2022-03-28 19:53:12 +01:00
input_reader.py	[docs] fix doctests and activate CI (#23418 )	2022-03-24 17:04:02 -07:00
io_context.py	[CI] Format Python code with Black (#21975 )	2022-01-29 18:41:57 -08:00
is_estimator.py	[CI] Format Python code with Black (#21975 )	2022-01-29 18:41:57 -08:00
json_reader.py	[CI] Format Python code with Black (#21975 )	2022-01-29 18:41:57 -08:00
json_writer.py	[CI] Format Python code with Black (#21975 )	2022-01-29 18:41:57 -08:00
mixed_input.py	[docs] fix doctests and activate CI (#23418 )	2022-03-24 17:04:02 -07:00
off_policy_estimator.py	[CI] Format Python code with Black (#21975 )	2022-01-29 18:41:57 -08:00
output_writer.py	[RLlib; Docs overhaul] Docstring cleanup: Offline. (#19808 )	2021-11-01 10:59:53 +01:00
resource.py	[RLlib] Request CPU resources in `Trainer.default_resource_request()` if using dataset input. (#21948 )	2022-02-02 10:20:37 +01:00
shuffled_input.py	[RLlib; Docs overhaul] Docstring cleanup: Offline. (#19808 )	2021-11-01 10:59:53 +01:00
wis_estimator.py	[CI] Format Python code with Black (#21975 )	2022-01-29 18:41:57 -08:00