ray/rllib/examples at 17e3c545d91b979586cdc5a99e1c53da77b00a8a - hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

Eric Liang 17e3c545d9 [rllib] Fix truncate episodes mode in central critic example (#8073)	2020-04-20 12:58:01 -07:00
export	Change /tmp to platform-specific temporary directory (#7529)	2020-03-16 18:10:14 -07:00
serving	[rllib] Add high-performance external application connector (#7641)	2020-03-20 12:43:57 -07:00
__init__.py	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
autoregressive_action_dist.py	[RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155)	2020-02-19 12:18:45 -08:00
batch_norm_model.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
cartpole_lstm.py	[RLlib] Move all jenkins RLlib-tests into bazel (rllib/BUILD). (#7178)	2020-02-15 14:50:44 -08:00
centralized_critic.py	[rllib] Fix truncate episodes mode in central critic example (#8073)	2020-04-20 12:58:01 -07:00
centralized_critic_2.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
custom_env.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
custom_eval.py	[RLlib] Move all jenkins RLlib-tests into bazel (rllib/BUILD). (#7178)	2020-02-15 14:50:44 -08:00
custom_fast_model.py	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
custom_keras_cnn_plus_rnn_model.py	[RLlib] Fix broken example: tf-eager with custom-RNN (#6732). (#7021)	2020-02-06 09:44:08 -08:00
custom_keras_model.py	[RLlib] DQN torch version. (#7597)	2020-04-06 11:56:16 -07:00
custom_keras_rnn_model.py	[RLlib] Working/learning example: PPO + torch + LSTM. (#7797)	2020-03-31 22:00:28 -07:00
custom_loss.py	[RLlib] Move all jenkins RLlib-tests into bazel (rllib/BUILD). (#7178)	2020-02-15 14:50:44 -08:00
custom_metrics_and_callbacks.py	[RLlib] Added DefaultCallbacks which replaces old callbacks dict interface (#6972)	2020-04-16 16:06:42 -07:00
custom_metrics_and_callbacks_legacy.py	[RLlib] Added DefaultCallbacks which replaces old callbacks dict interface (#6972)	2020-04-16 16:06:42 -07:00
custom_tf_policy.py	[RLlib] Move all jenkins RLlib-tests into bazel (rllib/BUILD). (#7178)	2020-02-15 14:50:44 -08:00
custom_torch_policy.py	[RLlib] Assert correct policy class being used in Worker. (#7769)	2020-03-30 14:03:29 -07:00
custom_torch_rnn_model.py	[RLlib] Working/learning example: PPO + torch + LSTM. (#7797)	2020-03-31 22:00:28 -07:00
custom_train_fn.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
dmlab_watermaze.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
eager_execution.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
hierarchical_training.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
multi_agent_cartpole.py	[RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155)	2020-02-19 12:18:45 -08:00
multi_agent_custom_policy.py	[RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155)	2020-02-19 12:18:45 -08:00
multi_agent_two_trainers.py	[RLlib] DQN torch version. (#7597)	2020-04-06 11:56:16 -07:00
parametric_action_cartpole.py	[RLlib] DQN torch version. (#7597)	2020-04-06 11:56:16 -07:00
random_env.py	[RLlib] Fix issue (bug): LSTM + non-shared vf + PPO + tuple actions (#6890)	2020-01-24 10:29:35 -08:00
rock_paper_scissors_multiagent.py	[RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798)	2020-04-01 00:43:21 -07:00
rollout_worker_custom_workflow.py	[RLlib] Policy.compute_log_likelihoods() and SAC refactor. (issue #7107) (#7124)	2020-02-22 14:19:49 -08:00
saving_experiences.py	Change /tmp to platform-specific temporary directory (#7529)	2020-03-16 18:10:14 -07:00
twostep_game.py	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00