mirror of
https://github.com/vale981/ray
synced 2025-03-06 18:41:40 -05:00
![]() * Fix DDPG, since it is based on GenericOffPolicyTrainer. * Fix QMix, SAC, and MADDPA too. * Undo QMix change. * Fix DQN input batch type. Always use SampleBatch. * apex ddpg should not use replay_buffer_config yet. * Make eager tf policy to use SampleBatch. * lint * LINT. * Re-enable RLlib broken tests to make sure things work ok now. * fixes. Co-authored-by: sven1977 <svenmika1977@gmail.com> |
||
---|---|---|
.. | ||
tests | ||
__init__.py | ||
dynamic_tf_policy.py | ||
eager_tf_policy.py | ||
policy.py | ||
policy_map.py | ||
policy_template.py | ||
rnn_sequencing.py | ||
sample_batch.py | ||
tf_policy.py | ||
tf_policy_template.py | ||
torch_policy.py | ||
torch_policy_template.py | ||
view_requirement.py |