hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

Author	SHA1	Message	Date
Sven Mika	494ddd98c1	[RLlib] Replace "seq_lens" w/ SampleBatch.SEQ_LENS. (#17928 )	2021-08-21 17:05:48 +02:00
simonsays1980	60aee4a330	[RLlib] Add example script for bare metal Policy with custom `view_requirements`. (#17896 )	2021-08-20 12:17:13 +02:00
Sven Mika	8248ba531b	[RLlib] Redo #17410 : Example script: Remote worker envs with inference done on main node. (#17960 )	2021-08-20 08:02:18 +02:00
Alex Wu	318ba6fae0	Revert "[RLlib] Add example script for how to have n remote (parallel) envs with inference happening on "main" (possibly GPU) node. (#17410 )" (#17951 ) This reverts commit `8fc16b9a18`.	2021-08-19 07:55:10 -07:00
Sven Mika	8fc16b9a18	[RLlib] Add example script for how to have n remote (parallel) envs with inference happening on "main" (possibly GPU) node. (#17410 )	2021-08-19 12:14:50 +02:00
Sven Mika	a428f10ebe	[RLlib] Add multi-GPU learning tests to nightly. (#17778 )	2021-08-18 17:21:01 +02:00
Sven Mika	f18213712f	[RLlib] Redo: "fix self play example scripts" PR (17566) (#17895 ) * wip. * wip. * wip. * wip. * wip. * wip. * wip. * wip. * wip.	2021-08-17 09:13:35 -07:00
Stefan Schneider	eab9c25856	[RLlib] Better example scripts: Description --no-tune and --local-mode CLI options (autoregressive_action_dist.py) (#17705 )	2021-08-16 22:08:13 +02:00
mguarin0	3e010c5760	[rllib] bug fix for rllib pettingzoo pistonball_v4 example (#17701 ) * bug fix for rllib pettingzoo pistonball_v4 example * adding test for PR 17701 * ran scripts/format.sh * ok Signed-off-by: Richard Liaw <rliaw@berkeley.edu> Co-authored-by: Richard Liaw <rliaw@berkeley.edu>	2021-08-12 00:25:00 -07:00
J K Terry	48e32555c8	[rllib] Update PettingZoo dependency versions (#17702 ) * update pettingzoo dependency versions * pettingzoo verison * fix tests	2021-08-11 01:19:19 -07:00
Amog Kamsetty	77f28f1c30	Revert "[RLlib] Fix `Trainer.add_policy` for num_workers>0 (self play example scripts). (#17566 )" (#17709 ) This reverts commit `3b447265d8`.	2021-08-10 10:50:01 -07:00
Sven Mika	3b447265d8	[RLlib] Fix `Trainer.add_policy` for num_workers>0 (self play example scripts). (#17566 )	2021-08-05 11:41:18 -04:00
Sven Mika	5107d16ae5	[RLlib] Add @Deprecated decorator to simplify/unify deprecation of classes, methods, functions. (#17530 )	2021-08-03 18:30:02 -04:00
kk-55	a7f8dc9d77	[RLlib] New and changed version of parametric actions cartpole example + small suggested update in policy_client.py (#15664 )	2021-07-28 15:25:09 -04:00
Sven Mika	0d8fce8fd8	[RLlib] Discussion 2294: Custom vector env example and fix. (#16083 )	2021-07-28 10:40:04 -04:00
Stefan Schneider	489febc6b2	[RLlib] Better example scripts: Description --no-tune and --local-mode CLI options (#17038 )	2021-07-26 22:25:48 -04:00
Sven Mika	0c5c70b584	[RLlib] Discussion 247: Allow remote sub-envs (within vectorized) to be used with custom APIs. (#17118 )	2021-07-25 16:55:51 -04:00
ddworak94	fba8461663	[RLlib] Add RNN-SAC agent (#16577 ) Shoutout to @ddworak94 :)	2021-07-25 10:04:52 -04:00
Sven Mika	7bc4376466	[RLlib] Example script: Simple league-based self-play w/ open spiel env (markov soccer or connect-4). (#17077 )	2021-07-22 10:59:13 -04:00
Richard Liaw	a78a2263e5	[RLlib] Fix reverted RockPaperScissors Pettingzoo example (#16896 )	2021-07-22 10:55:07 -04:00
Sven Mika	5a313ba3d6	[RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169 )	2021-07-20 14:58:13 -04:00
Sven Mika	649580d735	[RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). (#17046 )	2021-07-15 05:51:24 -04:00
Sven Mika	ce6dfc9b2d	[RLlib] Update tf1.x vs tf2.x documentation and eager example script. (#17030 )	2021-07-13 20:02:17 -04:00
Amog Kamsetty	38b5b6d24c	Revert "[RLlib] Simplify multiagent config (automatically infer class/spaces/config). (#16565 )" (#17036 ) This reverts commit `e4123fff27`.	2021-07-13 09:57:15 -07:00
Kai Fricke	27d80c4c88	[RLlib] ONNX export for tensorflow (1.x) and torch (#16805 )	2021-07-13 12:38:11 -04:00
Sven Mika	e4123fff27	[RLlib] Simplify multiagent config (automatically infer class/spaces/config). (#16565 )	2021-07-13 06:38:14 -04:00
Julius Frost	a88b217d3f	[rllib] Enhancements to Input API for customizing offline datasets (#16957 ) Co-authored-by: Richard Liaw <rliaw@berkeley.edu>	2021-07-10 15:05:25 -07:00
Kai Fricke	10fd7111b3	[rllib] Improve test learning check, fix flaky two step qmix (#16843 )	2021-07-06 19:39:12 +01:00
Amog Kamsetty	ecb632140f	Revert "RockPaperScissors Pettingzoo" (#16886 ) This reverts commit `bf3e3225b6`.	2021-07-06 09:43:47 -07:00
Rodrigo de Lazcano	bf3e3225b6	RockPaperScissors Pettingzoo (#16725 )	2021-07-05 09:52:08 -07:00
Sven Mika	7eb1a29426	[RLlib] Fix ModelV2 custom metrics for torch. (#16734 )	2021-07-01 13:01:40 +02:00
Sven Mika	ce3e550c43	[RLlib] Enhance comment in example script multi_agent_custom_policy. (#16740 )	2021-07-01 10:28:38 +02:00
Sven Mika	53206dd440	[RLlib] CQL BC loss fixes; PPO/PG/A2\|3C action normalization fixes (#16531 )	2021-06-30 12:32:11 +02:00
Sven Mika	c95dea51e9	[RLlib] External env enhancements + more examples. (#16583 )	2021-06-23 09:09:01 +02:00
Sven Mika	be6db06485	[RLlib] Re-do: Trainer: Support add and delete Policies. (#16569 )	2021-06-21 13:46:01 +02:00
Sven Mika	169ddabae7	[RLlib] Issue 15973: Trainer.with_updates(validate_config=...) behaves confusingly. (#16429 )	2021-06-19 22:42:00 +02:00
Sven Mika	79a9d6d517	[RLlib] Issues 16287 and 16200: RLlib not rendering custom multi-agent Envs. (#16428 )	2021-06-19 08:57:53 +02:00
Amog Kamsetty	bd3cbfc56a	Revert "[RLlib] Allow policies to be added/deleted on the fly. (#16359 )" (#16543 ) This reverts commit `e78ec370a9`.	2021-06-18 12:21:49 -07:00
Sven Mika	e78ec370a9	[RLlib] Allow policies to be added/deleted on the fly. (#16359 )	2021-06-18 10:31:30 +02:00
Steven Morad	581d63e607	[RLlib] Fix dnc input shape (#15939 ) Co-authored-by: Steven Morad <sm2558@cam.ac.uk>	2021-05-20 19:06:02 -07:00
Stefan Schneider	55709bac7a	[RLlib] Examples for training, saving, loading, testing an agent with SB & RLlib (#15897 )	2021-05-19 16:36:59 +02:00
Steven Morad	d8eed68af2	[RLlib] Add differentiable neural computer example (#14844 )	2021-05-19 09:15:39 +02:00
Rick Lan	3b1b1d74fe	[rllib] Read "logger_config" first before "prefix". (#15871 )	2021-05-18 10:50:46 -07:00
Sven Mika	d2c755ccef	[RLlib] Examples scripts add argparse help and replace `--torch` with `--framework`. (#15832 )	2021-05-18 13:18:12 +02:00
Sven Mika	839fc59224	[RLlib] CQL TensorFlow support (#15841 )	2021-05-18 11:10:46 +02:00
Sven Mika	308ea62430	[RLlib] Fix "seed" setting to work in all frameworks and w/ all CUDA versions. (#15682 )	2021-05-18 11:00:24 +02:00
Sven Mika	f25d58492d	[Testing] Dependabot for RLlib. (#15812 )	2021-05-17 18:24:13 +02:00
Sven Mika	d89fb82bfb	[RLlib] Add simple curriculum learning API and example script. (#15740 )	2021-05-16 17:35:10 +02:00
Sven Mika	ebc6d8692a	[RLlib] Docs: Example scripts and blogs documentation update. (#15763 )	2021-05-16 15:24:38 +02:00
Sven Mika	c4a3e1589b	[RLlib] CQL: Bug fixes and OPE example added to test and offline_rl.py example. (#15761 )	2021-05-13 09:17:23 +02:00

1 2 3 4 5

226 commits