Mirror of https://github.com/vale981/ray (synced 2025-03-08 19:41:38 -05:00).
Latest commit: Exploration API (+EpsilonGreedy sub-class)

* Exploration API (+EpsilonGreedy sub-class); see the epsilon-greedy sketch after this list.
* Cleanup/LINT.
* Add `deterministic` to generic Trainer config (NOTE: this is still ignored by most Agents).
* Add `error` option to deprecation_warning() (sketched after this list).
* WIP.
* Bug fix: Get exploration-info for tf framework. Bug fix: Properly deprecate some DQN config keys.
* WIP.
* LINT.
* WIP.
* Split PerWorkerEpsilonGreedy out of EpsilonGreedy. Docstrings.
* Fix bug in sampler.py in case Policy has self.exploration = None.
* Update rllib/agents/dqn/dqn.py (Co-Authored-By: Eric Liang <ekhliang@gmail.com>).
* WIP.
* Update rllib/agents/trainer.py (Co-Authored-By: Eric Liang <ekhliang@gmail.com>).
* WIP.
* Address change requests.
* LINT.
* In tune/utils/util.py::deep_update(): only keep deep-updating if both original and value are dicts; if value is not a dict, set it directly (merge rule sketched after this list).
* Completely obsolete sync_replay_optimizer.py's parameters schedule_max_timesteps AND beta_annealing_fraction (replaced with prioritized_replay_beta_annealing_timesteps).
* Update rllib/evaluation/worker_set.py (Co-Authored-By: Eric Liang <ekhliang@gmail.com>).
* Review fixes.
* Fix default value for DQN's exploration spec.
* LINT.
* Fix recursion bug (wrong parent constructor).
* Do not pass timestep to get_exploration_info.
* Update tf_policy.py.
* Fix some remaining issues with test cases and remove more deprecated DQN/APEX exploration configs.
* Bug fix in tf-action-dist.
* DDPG incompatibility bug fix with new DQN exploration handling (which is imported by DDPG).
* Switch off exploration when getting action probs from off-policy-estimator's policy.
* LINT.
* Fix test_checkpoint_restore.py.
* Deprecate all (unused) SAC exploration configs.
* Properly use `model.last_output()` everywhere instead of `model._last_output`.
* WIP.
* Take out set_epsilon from multi-agent-env test (not needed, decays anyway).
* WIP.
* Trigger re-test (flaky checkpoint-restore test).
* WIP.
* WIP.
* Add test case for deterministic action sampling in PPO.
* Bug fix.
* Add deterministic test cases for different Agents.
* Fix problem with TupleActions in dynamic-tf-policy.
* Separate supported_spaces tests so they can be run separately for easier debugging.
* LINT.
* Fix autoregressive_action_dist.py test case.
* Re-test.
* Fix.
* Remove duplicate py_test rule from bazel.
* LINT.
* WIP.
* WIP.
* SAC fix.
* SAC fix.
* WIP.
* WIP.
* WIP.
* Fix 2 examples tests.
* WIP.
* WIP.
* WIP.
* WIP.
* WIP.
* Fix.
* LINT.
* Renamed test file.
* WIP.
* Add unittest.main.
* Make action_dist_class mandatory.
* Fix.
* Fix.
* WIP.
* WIP.
* Fix.
* Fix.
* Fix explorations test case (contextlib cannot find its own nullcontext??).
* Force torch to be installed for QMIX.
* LINT.
* Fix determine_tests_to_run.py.
* Fix determine_tests_to_run.py.
* WIP.
* Add Random exploration component to tests (fixed issue with "static-graph randomness" via py_function).
* Add Random exploration component to tests (fixed issue with "static-graph randomness" via py_function).
* Rename some stuff.
* Rename some stuff.
* WIP.
* Update.
* WIP.
* Gumbel Softmax Dist. (sampling trick sketched after this list).
* WIP.
* WIP.
* WIP.
* WIP.
* WIP.
* WIP.
* WIP.
* WIP.
* WIP.
* Hypertune.
* Hypertune.
* Hypertune.
* Lock-in.
* Cleanup.
* LINT.
* Fix.
* Update rllib/policy/eager_tf_policy.py (Co-Authored-By: Kristian Hartikainen <kristian.hartikainen@gmail.com>).
* Update rllib/agents/sac/sac_policy.py (Co-Authored-By: Kristian Hartikainen <kristian.hartikainen@gmail.com>).
* Update rllib/agents/sac/sac_policy.py (Co-Authored-By: Kristian Hartikainen <kristian.hartikainen@gmail.com>).
* Update rllib/models/tf/tf_action_dist.py (Co-Authored-By: Kristian Hartikainen <kristian.hartikainen@gmail.com>).
* Update rllib/models/tf/tf_action_dist.py (Co-Authored-By: Kristian Hartikainen <kristian.hartikainen@gmail.com>).
* Fix items from review comments.
* Add dm_tree to RLlib dependencies.
* Add dm_tree to RLlib dependencies.
* Fix DQN test cases ((Torch)Categorical).
* Fix wrong pip install.

Co-authored-by: Eric Liang <ekhliang@gmail.com>
Co-authored-by: Kristian Hartikainen <kristian.hartikainen@gmail.com>
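The PR's centerpiece is the new Exploration API with an EpsilonGreedy sub-class. As a hedged illustration of the technique only (the class and method names below are hypothetical, not RLlib's actual interface), a minimal epsilon-greedy selector with a linearly decaying schedule might look like:

```python
import random

class EpsilonGreedy:
    """Minimal epsilon-greedy sketch (hypothetical names, not RLlib's API).

    Picks a uniformly random action with probability epsilon, otherwise the
    greedy (argmax-Q) action; epsilon decays linearly from initial_eps to
    final_eps over epsilon_timesteps environment steps.
    """

    def __init__(self, initial_eps=1.0, final_eps=0.02, epsilon_timesteps=10000):
        self.initial_eps = initial_eps
        self.final_eps = final_eps
        self.epsilon_timesteps = epsilon_timesteps

    def epsilon(self, timestep):
        # Linear interpolation from initial_eps down to final_eps.
        frac = min(timestep / self.epsilon_timesteps, 1.0)
        return self.initial_eps + frac * (self.final_eps - self.initial_eps)

    def select_action(self, q_values, timestep, deterministic=False):
        # With deterministic=True (cf. the new generic Trainer config key),
        # exploration is switched off and we always act greedily.
        if deterministic or random.random() > self.epsilon(timestep):
            return max(range(len(q_values)), key=lambda a: q_values[a])
        return random.randrange(len(q_values))
```

Under the same reading, the PerWorkerEpsilonGreedy variant split out in the PR would give each rollout worker its own epsilon (a fixed per-worker value in the spirit of Ape-X) rather than one shared decaying schedule.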
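The `error` option added to deprecation_warning() plausibly escalates a soft warning into a hard failure. A sketch under that assumption (the signature and error type here are guesses, not Ray's actual utility):

```python
import warnings

def deprecation_warning(old, new=None, error=False):
    # Hypothetical sketch: with error=True, deprecated usage raises
    # (here a ValueError) instead of merely emitting a DeprecationWarning.
    msg = "`{}` has been deprecated.".format(old)
    if new is not None:
        msg += " Use `{}` instead.".format(new)
    if error:
        raise ValueError(msg)
    warnings.warn(msg, DeprecationWarning)
```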
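The deep_update() change above boils down to one rule: recurse only while both the existing entry and the incoming value are dicts; anything else simply overwrites. A self-contained sketch of that merge rule (illustrative, not the actual tune/utils/util.py code):

```python
def deep_update(original, new_dict):
    """Recursively merge new_dict into original (sketch of the rule above).

    Keeps recursing only while both the existing value and the new value
    are dicts; any non-dict value overwrites the existing entry.
    """
    for key, value in new_dict.items():
        if isinstance(original.get(key), dict) and isinstance(value, dict):
            deep_update(original[key], value)
        else:
            original[key] = value
    return original

# Example: nested dicts are merged, scalars are overwritten.
config = {"exploration": {"type": "EpsilonGreedy", "initial_eps": 1.0}}
deep_update(config, {"exploration": {"initial_eps": 0.5}, "lr": 1e-3})
assert config["exploration"]["type"] == "EpsilonGreedy"
assert config["exploration"]["initial_eps"] == 0.5
```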
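The "Gumbel Softmax Dist." entries refer to the Gumbel-softmax (a.k.a. concrete) distribution, which yields approximately one-hot, differentiable samples from a categorical. The sampling trick itself, in illustrative NumPy rather than the PR's TF code:

```python
import numpy as np

def gumbel_softmax_sample(logits, temperature=1.0, rng=None):
    """Draw one relaxed one-hot sample via the Gumbel-softmax trick.

    Adds Gumbel(0, 1) noise to the logits, then applies a temperature-scaled
    softmax; as temperature -> 0, samples approach hard one-hot vectors.
    """
    if rng is None:
        rng = np.random.default_rng()
    u = rng.uniform(low=1e-10, high=1.0, size=np.shape(logits))
    gumbel = -np.log(-np.log(u))  # Gumbel(0, 1) noise
    y = (np.asarray(logits) + gumbel) / temperature
    y = y - y.max()  # for numerical stability of the softmax
    e = np.exp(y)
    return e / e.sum()

# Example: a low temperature pushes the sample toward a hard one-hot.
print(gumbel_softmax_sample([2.0, 0.5, -1.0], temperature=0.1))
```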
Directory contents:
_static
_templates
images
raysgd
a2c-arch.svg
actors.rst
advanced.rst
apex-arch.svg
apex.png
async_api.rst
autoscaler-status.png
autoscaling.rst
cluster-index.rst
conf.py
configure.rst
custom_directives.py
custom_metric.png
ddppo-arch.svg
deploy-on-kubernetes.rst
deploy-on-yarn.rst
deploying-on-slurm.rst
development.rst
dqn-arch.svg
es.png
fault-tolerance.rst
getting-involved.rst
impala-arch.svg
impala.png
index.rst
installation.rst
iter.rst
joblib.rst
memory-management.rst
multi-agent.svg
multi-flat.svg
multiprocessing.rst
offline-q.png
package-ref.rst
pandas_on_ray.rst
pbt.png
ppo-arch.svg
ppo.png
profiling.rst
projects.rst
pytorch.png
ray-tune-parcoords.png
ray-tune-tensorboard.png
ray-tune-viskit.png
rllib-algorithms.rst
rllib-api.svg
rllib-components.svg
rllib-concepts.rst
rllib-config.svg
rllib-dev.rst
rllib-env.rst
rllib-envs.svg
rllib-examples.rst
rllib-models.rst
rllib-offline.rst
rllib-package-ref.rst
rllib-stack.svg
rllib-toc.rst
rllib-training.rst
rllib.rst
rock-paper-scissors.png
serialization.rst
serve.rst
sgd.png
starting-ray.rst
tensorflow.png
throughput.png
timeline.png
troubleshooting.rst
tune-advanced-tutorial.rst
tune-contrib.rst
tune-design.rst
tune-distributed.rst
tune-examples.rst
tune-package-ref.rst
tune-schedulers.rst
tune-searchalg.rst
tune-tutorial.rst
tune-usage.rst
tune.rst
using-ray-on-a-cluster.rst
using-ray-with-gpus.rst
using-ray-with-pytorch.rst
using-ray-with-tensorflow.rst
using-ray.rst
walkthrough.rst