Mirror of https://github.com/vale981/ray, synced 2025-03-08 19:41:38 -05:00
* Cleaner and more correct treatment of agent states in rollout.py
* support lstm_use_prev_action_reward in rollout.py
* Linter.
* appease flake8
* Use _DUMMY_AGENT_ID instead of 0.
* All agents have a policy_agent_mapping. Reset the mapping cache at the start of each episode.
* Update rollout.py
* Fix rollout.py for single-agent envs.
* Use agent_id, not policy_id.

| Name |
|---|
| benchmarks |
| ray |
| asv.conf.json |
| build-wheel-macos.sh |
| build-wheel-manylinux1.sh |
| README-benchmarks.rst |
| README-building-wheels.md |
| setup.py |