ray/rllib/policy at fca9dc73e1abcfb2762723a0a7052fd2094eba61 - hiro/ray

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 18:41:40 -05:00

History

Sven Mika 20ef4a8603 [RLlib] Cleanup/unify all test cases. (#7533 )		2020-03-11 20:39:47 -07:00
..
tests	[RLlib] Cleanup/unify all test cases. (#7533 )	2020-03-11 20:39:47 -07:00
__init__.py	Remove future imports (#6724 )	2020-01-09 00:15:48 -08:00
dynamic_tf_policy.py	[rllib] Make timestep a required arg for exploration classes (#7380 )	2020-03-04 13:00:37 -08:00
eager_tf_policy.py	[RLlib] SAC add discrete action support. (#7320 )	2020-03-06 10:37:12 -08:00
policy.py	[rllib] Fix per-worker exploration in Ape-X; make more kwargs required for future safety (#7504 )	2020-03-10 11:14:14 -07:00
rnn_sequencing.py	Remove future imports (#6724 )	2020-01-09 00:15:48 -08:00
sample_batch.py	Remove future imports (#6724 )	2020-01-09 00:15:48 -08:00
tf_policy.py	[rllib] Fix per-worker exploration in Ape-X; make more kwargs required for future safety (#7504 )	2020-03-10 11:14:14 -07:00
tf_policy_template.py	[RLlib] Policy.compute_log_likelihoods() and SAC refactor. (issue #7107 ) (#7124 )	2020-02-22 14:19:49 -08:00
torch_policy.py	[rllib] Fix per-worker exploration in Ape-X; make more kwargs required for future safety (#7504 )	2020-03-10 11:14:14 -07:00
torch_policy_template.py	[RLlib] Issue 7421: can't convert cuda tensor to numpy in torch ppo. (#7445 )	2020-03-06 12:45:30 -08:00