mirror of
https://github.com/vale981/ray
synced 2025-03-09 04:46:38 -04:00
* WIP. * Fix float32 conversion in OneHot preprocessor (would cause float64 in eager, then NN-matmul-failure). Add proper seq-len + state-in construction in eager_tf_policy.py::_compute_gradients(). * LINT. * eager_tf_policy.py: Only set samples["seq_lens"] if RNN. Otherwise, eager-tracing will throw flattened-dict key-mismatch error. * Move issue code to examples folder. Co-authored-by: Eric Liang <ekhliang@gmail.com> |
||
---|---|---|
__init__.py | ||
episode.py | ||
interface.py | ||
metrics.py | ||
policy_evaluator.py | ||
policy_graph.py | ||
postprocessing.py | ||
rollout_metrics.py | ||
rollout_worker.py | ||
sample_batch.py | ||
sample_batch_builder.py | ||
sampler.py | ||
tf_policy_graph.py | ||
torch_policy_graph.py | ||
worker_set.py |