mirror of
https://github.com/vale981/ray
synced 2025-03-06 10:31:39 -05:00
* Validate that entropy coeff is not an integer: passing an integer value for entropy coeff, such as 0, raises an error somewhere inside the TF policy graph, so this checks that the entropy coeff is a float. * Cast to float instead, and move this check after the negative value check. |
||
---|---|---|
test | ||
__init__.py | ||
appo.py | ||
appo_policy.py | ||
ppo.py | ||
ppo_policy.py | ||
utils.py |
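
The commit message above describes a config check: reject a negative entropy coeff first, then cast the value to a float so that an integer such as 0 never reaches the TF policy graph. A minimal sketch of that ordering follows. The function name `validate_entropy_coeff`, the plain-dict config, and the choice of `ValueError` are assumptions for illustration, not the code in `ppo.py`.

```python
def validate_entropy_coeff(config):
    """Hypothetical helper: normalize config["entropy_coeff"] in place.

    Assumes `config` is a plain dict with an "entropy_coeff" key.
    """
    coeff = config["entropy_coeff"]

    # Negative-value check comes first, as the commit message describes.
    if coeff < 0:
        raise ValueError("entropy_coeff must be >= 0")

    # Cast instead of rejecting: an int such as 0 becomes 0.0.
    config["entropy_coeff"] = float(coeff)
    return config


if __name__ == "__main__":
    cfg = validate_entropy_coeff({"entropy_coeff": 0})
    print(type(cfg["entropy_coeff"]).__name__)
```

Casting rather than raising keeps configs such as `{"entropy_coeff": 0}` usable, which matches the second bullet of the commit message ("Cast to float instead").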