ray/rllib/policy at 2b7d90776279e78bf03f7987defc67f0b6eb5b7c - hiro/ray

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-09 04:46:38 -04:00

History

simonsays1980 7b33dc21dc [RLlib] Fix update model view requirements from init state for bare-metal policies with custom view-reqs. (#17867 ) * Changed '_update_model_view_requirements_from_init_state()' to adopt the 'shift' in view_requirements from a user-defined policy that inherits directly from Policy. * Added slightly modifed version of Sven's suggestion. Like this any user-defined attributes of the ViewRequirement of the state get conserved. * I saw that the code in _update_model_view_requirements_from_init_state() had changed and is not identical to my locally installed version. In the new version view_requirements from the model and the policy get united and therefore a loop runs through this unified list. Code should run now in the present version * Apply suggestions from code review		2021-08-17 11:49:24 +02:00
..
tests	[RLlib] SampleBatch: Docstring- and API cleanups; Add support for nested data. (#17485 )	2021-08-16 06:08:14 +02:00
__init__.py	[RLlib] JAXPolicy prep. PR #1 . (#13077 )	2020-12-26 20:14:18 -05:00
dynamic_tf_policy.py	[RLlib] Torch algos use now-framework-agnostic MultiGPUTrainOneStep execution op (~33% speedup for PPO-torch + GPU). (#17371 )	2021-08-03 11:35:49 -04:00
eager_tf_policy.py	[RLlib] Test cases/BUILD cleanup; split "everything else" (longest running one rn) tests in 2. (#17640 )	2021-08-16 22:01:01 +02:00
policy.py	[RLlib] Fix update model view requirements from init state for bare-metal policies with custom view-reqs. (#17867 )	2021-08-17 11:49:24 +02:00
policy_map.py	Revert "[RLlib] Fix `Trainer.add_policy` for num_workers>0 (self play example scripts). (#17566 )" (#17709 )	2021-08-10 10:50:01 -07:00
policy_template.py	[RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. (#15603 )	2021-05-04 19:06:19 +02:00
rnn_sequencing.py	[RLlib] SampleBatch: Docstring- and API cleanups; Add support for nested data. (#17485 )	2021-08-16 06:08:14 +02:00
sample_batch.py	[RLlib] Test cases/BUILD cleanup; split "everything else" (longest running one rn) tests in 2. (#17640 )	2021-08-16 22:01:01 +02:00
tf_policy.py	[RLlib] Add @Deprecated decorator to simplify/unify deprecation of classes, methods, functions. (#17530 )	2021-08-03 18:30:02 -04:00
tf_policy_template.py	[RLlib] Torch algos use now-framework-agnostic MultiGPUTrainOneStep execution op (~33% speedup for PPO-torch + GPU). (#17371 )	2021-08-03 11:35:49 -04:00
torch_policy.py	[RLlib] Test cases/BUILD cleanup; split "everything else" (longest running one rn) tests in 2. (#17640 )	2021-08-16 22:01:01 +02:00
torch_policy_template.py	[RLlib] Add @Deprecated decorator to simplify/unify deprecation of classes, methods, functions. (#17530 )	2021-08-03 18:30:02 -04:00
view_requirement.py	[RLlib] Remove all non-trajectory view API code. (#14860 )	2021-03-23 09:50:18 -07:00