hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

Author	SHA1	Message	Date
Eric Liang	905258dbc1	Clean up docstyle in python modules and add LINT rule (#25272 )	2022-06-01 11:27:54 -07:00
kourosh hakhamaneshi	3815e52a61	[RLlib] Agents to algos: DQN w/o Apex and R2D2, DDPG/TD3, SAC, SlateQ, QMIX, PG, Bandits (#24896 )	2022-05-19 18:30:42 +02:00
Max Pumperla	6a6c58b5b4	[RLlib] Config objects for DDPG and SimpleQ. (#24339 )	2022-05-12 16:12:42 +02:00
Balaji Veeramani	7f1bacc7dc	[CI] Format Python code with Black (#21975 ) See #21316 and #21311 for the motivation behind these changes.	2022-01-29 18:41:57 -08:00
Jun Gong	2317c693cf	[RLlib] Use SampleBrach instead of input dict whenever possible (#20746 )	2021-12-02 13:11:26 +01:00
Stefan Schneider	2b3d0c691f	[RLlib] Document and extend action mask example. (#20390 ) Co-authored-by: Richard Liaw <rliaw@berkeley.edu> Co-authored-by: Sven Mika <sven@anyscale.io> Co-authored-by: sven1977 <svenmika1977@gmail.com>	2021-11-16 13:20:41 +01:00
Sven Mika	a931076f59	[RLlib] Tf2 + eager-tracing same speed as framework=tf; Add more test coverage for tf2+tracing. (#19981 )	2021-11-05 16:10:00 +01:00
Sven Mika	cf21c634a3	[RLlib] Fix deprecated warning for torch_ops.py (soft-replaced by torch_utils.py). (#19982 )	2021-11-03 10:00:46 +01:00
Sven Mika	0b308719f8	[RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829 )	2021-11-01 21:46:02 +01:00
gjoliver	39b0faa3ec	[RLlib]: bug fix, should be input_dict['is_training'] (#19805 )	2021-10-27 23:30:43 +02:00
gjoliver	c3c42278e4	[RLlib] clean up all the SampleBatch['is_training'] deprecation warnings (#19652 ) * [RLlib] clean up all the SampleBatch['is_training'] deprecation warnings. * wip	2021-10-25 09:38:56 +02:00
Sven Mika	ea4a22249c	[RLlib] Add simple action-masking example script/env/model (tf and torch). (#18494 )	2021-09-11 23:08:09 +02:00
Sven Mika	494ddd98c1	[RLlib] Replace "seq_lens" w/ SampleBatch.SEQ_LENS. (#17928 )	2021-08-21 17:05:48 +02:00
kk-55	a7f8dc9d77	[RLlib] New and changed version of parametric actions cartpole example + small suggested update in policy_client.py (#15664 )	2021-07-28 15:25:09 -04:00
ddworak94	fba8461663	[RLlib] Add RNN-SAC agent (#16577 ) Shoutout to @ddworak94 :)	2021-07-25 10:04:52 -04:00
Sven Mika	7eb1a29426	[RLlib] Fix ModelV2 custom metrics for torch. (#16734 )	2021-07-01 13:01:40 +02:00
Steven Morad	581d63e607	[RLlib] Fix dnc input shape (#15939 ) Co-authored-by: Steven Morad <sm2558@cam.ac.uk>	2021-05-20 19:06:02 -07:00
Steven Morad	d8eed68af2	[RLlib] Add differentiable neural computer example (#14844 )	2021-05-19 09:15:39 +02:00
Michael Luo	4cbe13cdfd	[RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. (#15603 ) Co-authored-by: Sven Mika <sven@anyscale.io> Co-authored-by: sven1977 <svenmika1977@gmail.com>	2021-05-04 19:06:19 +02:00
Amog Kamsetty	ebc44c3d76	[CI] Upgrade flake8 to 3.9.1 (#15527 ) * formatting * format util * format release * format rllib/agents * format rllib/env * format rllib/execution * format rllib/evaluation * format rllib/examples * format rllib/policy * format rllib utils and tests * format streaming * more formatting * update requirements files * fix rllib type checking * updates * update * fix circular import * Update python/ray/tests/test_runtime_env.py * noqa	2021-05-03 14:23:28 -07:00
Sven Mika	e973b726c2	[RLlib] Support native tf.keras.Models (part 2) - Default keras models for Vision/RNN/Attention. (#15273 )	2021-04-30 19:26:30 +02:00
Sven Mika	bb8a286cbc	[RLlib] Support native tf.keras.Model (milestone toward obsoleting ModelV2 class). (#14684 )	2021-04-27 10:44:54 +02:00
Sven Mika	4f66309e19	[RLlib] Redo issue 14533 tf enable eager exec (#14984 )	2021-03-29 20:07:44 +02:00
SangBin Cho	fa5f961d5e	Revert "[RLlib] Issue 14533: `tf.enable_eager_execution()` must be called at beginning. (#14737 )" (#14918 ) This reverts commit `3e389d5812`.	2021-03-25 00:42:01 -07:00
Sven Mika	3e389d5812	[RLlib] Issue 14533: `tf.enable_eager_execution()` must be called at beginning. (#14737 )	2021-03-24 12:54:27 +01:00
Sven Mika	0a0d9183fe	[RLlib] Trajectory view API example script (enhancements and tf2 support). (#13786 )	2021-02-02 18:42:18 +01:00
Sven Mika	52c94b7ee9	[RLlib] Allow SAC to use custom models as Q- or policy nets and deprecate "state-preprocessor" for image spaces. (#13522 )	2021-02-02 13:05:58 +01:00
Sven Mika	56878221ed	[RLlib] Redo: Make TFModelV2 fully modular like TorchModelV2 (soft-deprecate register_variables, unify var names wrt torch). (#13363 )	2021-01-14 14:44:33 +01:00
Kai Fricke	25f10a947a	Revert "[RLlib] Make TFModelV2 behave more like TorchModelV2: Obsolete register_variables. Unify variable dicts. (#13339 )" (#13361 ) This reverts commit `e2b2abb88b`.	2021-01-12 12:33:57 +01:00
Sven Mika	e2b2abb88b	[RLlib] Make TFModelV2 behave more like TorchModelV2: Obsolete register_variables. Unify variable dicts. (#13339 )	2021-01-11 22:42:30 +01:00
Sven Mika	9dd9f72111	[RLlib] Add more detailed Documentation on Model building API (#13261 )	2021-01-09 12:38:29 +01:00
Sven Mika	6f342a2221	[RLlib] Preparatory PR for: Documentation on Model Building. (#13260 )	2021-01-08 10:56:09 +01:00
Sven Mika	d5604eaba3	[RLlib] Attention nets PyTorch support and cleanup (using traj. view API). (#12029 )	2020-12-21 18:38:34 -08:00
Sven Mika	841d93d366	[RLlib] Issue 12233 shared tf layers example not really shared (only works for tf1.x, not tf2.x). (#12399 )	2020-11-25 11:27:19 -08:00
Sven Mika	62c7ab5182	[RLlib] Trajectory view API: Enable by default for PPO, IMPALA, PG, A3C (tf and torch). (#11747 )	2020-11-12 16:27:34 +01:00
Sven Mika	c17169dc11	[RLlib] Fix all example scripts to run on GPUs. (#11105 )	2020-10-02 23:07:44 +02:00
Sven Mika	36bda8432b	[RLlib] Trajectory view API: Simple List Collector (on by default for PPO); LSTM-agnostic (#11056 )	2020-10-01 16:57:10 +02:00
Sven Mika	28ab797cf5	[RLlib] Deprecate old classes, methods, functions, config keys (in prep for RLlib 1.0). (#10544 )	2020-09-06 10:58:00 +02:00
Eric Liang	deea1861ab	[rllib] Try fixing torch GPU and masking errors (#10168 )	2020-08-25 18:34:19 -07:00
Olli Huotari	9ff599cbb8	torch policy now includes model.metrics (#10121 ) * torch policy now includes model.metrics * Fixed tests to work with custom metrics * Forgot to run format.sh	2020-08-15 10:43:11 -07:00
Sven Mika	78dfed2683	[RLlib] Issue 8384: QMIX doesn't learn anything. (#9527 )	2020-07-17 12:14:34 +02:00
Sven Mika	f43d934817	[RLlib] Type annotations for policy. (#9248 )	2020-07-05 13:09:51 +02:00
Sven Mika	43043ee4d5	[RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136 ) * WIP. * Fixes. * LINT. * WIP. * WIP. * Fixes. * Fixes. * Fixes. * Fixes. * WIP. * Fixes. * Test * Fix. * Fixes and LINT. * Fixes and LINT. * LINT.	2020-06-30 10:13:20 +02:00
Sven Mika	af1203b9df	[RLlib] Issue 8507 (PyTorch does not support custom loss). (#9142 )	2020-06-26 09:52:22 +02:00
Sven Mika	4fd8977eaf	[RLlib] Minor cleanup in preparation to tf2.x support. (#9130 ) * WIP. * Fixes. * LINT. * Fixes. * Fixes and LINT. * WIP.	2020-06-25 19:01:32 +02:00
Sven Mika	7008902cff	[RLlib] Minor `rllib.utils` cleanup. (#8932 )	2020-06-16 08:52:20 +02:00
Sven Mika	0ba7472da9	[Testing] Fix LINT/sphinx errors. (#8874 )	2020-06-10 15:41:59 +02:00
Eric Liang	be26a7b1b0	[rllib] Support for complex / variable-length observation spaces (#8393 )	2020-06-06 12:22:19 +02:00
Sven Mika	d8a081a185	[RLlib] Unity3D integration (n Unity3D clients vs learning server). (#8590 )	2020-05-30 22:48:34 +02:00
Sven Mika	0422e9c5a8	[RLlib] Add 2 Transformer learning test cases on StatelessCartPole (PPO and IMPALA). (#8624 )	2020-05-27 10:19:47 +02:00

1 2

54 commits