ray/rllib/policy
2020-05-04 22:13:49 -07:00
..
tests [RLlib] Remove TupleActions and support arbitrarily nested action spaces. (#8143) 2020-04-28 14:59:16 +02:00
__init__.py [rllib] observation function api for multi-agent (#8236) 2020-05-04 22:13:49 -07:00
dynamic_tf_policy.py [RLlib] SAC Torch (incl. Atari learning) (#7984) 2020-04-15 13:25:16 +02:00
eager_tf_policy.py [RLlib] Remove TupleActions and support arbitrarily nested action spaces. (#8143) 2020-04-28 14:59:16 +02:00
policy.py [rllib] observation function api for multi-agent (#8236) 2020-05-04 22:13:49 -07:00
rnn_sequencing.py [Testing] Do not run any non-RLlib/core tests if only RLLib affected (except wheels). (#7892) 2020-04-09 14:36:06 -07:00
sample_batch.py [RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798) 2020-04-01 00:43:21 -07:00
tf_policy.py [RLlib] Remove tf.py_function from all Schedule classes (not differentiable and causes other bugs in MA setups). (#8304) 2020-05-04 23:53:38 +02:00
tf_policy_template.py [RLlib] DQN torch version. (#7597) 2020-04-06 11:56:16 -07:00
torch_policy.py [RLlib] Fix issue 8135 (DDPG inf actions when using [-inf,inf] action space). (#8302) 2020-05-04 22:27:30 +02:00
torch_policy_template.py [RLlib] PyTorch version of APPO. (#8120) 2020-04-23 09:11:12 +02:00