ray/rllib/policy at ee0eb44a326dac95fe9a485e9b6055706efd849b - hiro/ray

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 02:21:39 -05:00

History

Eric Liang f48da50e1c [rllib] observation function api for multi-agent (#8236 )		2020-05-04 22:13:49 -07:00
..
tests	[RLlib] Remove TupleActions and support arbitrarily nested action spaces. (#8143 )	2020-04-28 14:59:16 +02:00
__init__.py	[rllib] observation function api for multi-agent (#8236 )	2020-05-04 22:13:49 -07:00
dynamic_tf_policy.py	[RLlib] SAC Torch (incl. Atari learning) (#7984 )	2020-04-15 13:25:16 +02:00
eager_tf_policy.py	[RLlib] Remove TupleActions and support arbitrarily nested action spaces. (#8143 )	2020-04-28 14:59:16 +02:00
policy.py	[rllib] observation function api for multi-agent (#8236 )	2020-05-04 22:13:49 -07:00
rnn_sequencing.py	[Testing] Do not run any non-RLlib/core tests if only RLLib affected (except wheels). (#7892 )	2020-04-09 14:36:06 -07:00
sample_batch.py	[RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798 )	2020-04-01 00:43:21 -07:00
tf_policy.py	[RLlib] Remove tf.py_function from all Schedule classes (not differentiable and causes other bugs in MA setups). (#8304 )	2020-05-04 23:53:38 +02:00
tf_policy_template.py	[RLlib] DQN torch version. (#7597 )	2020-04-06 11:56:16 -07:00
torch_policy.py	[RLlib] Fix issue 8135 (DDPG inf actions when using [-inf,inf] action space). (#8302 )	2020-05-04 22:27:30 +02:00
torch_policy_template.py	[RLlib] PyTorch version of APPO. (#8120 )	2020-04-23 09:11:12 +02:00