hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

Author	SHA1	Message	Date
Augusto Yao	0d90a17426	Pass cleanup argument to start_monitor. (#1040 )	2017-09-30 15:35:25 -07:00
Wapaul1	97b3355adc	Register Class Only Creates Entry in Redis Once (#1038 ) Don't export the same custom class definition multiple times.	2017-09-30 15:30:27 -07:00
Richard Liaw	16e82b43d1	[rllib] Changes for preprocessors (#1033 ) * Changes for preprocessors * removed comments * Changes + push for lint * linted * adding dependency for travis * linting won't pass * reordering * needed for testing * added comments * pip it * pip dependencies	2017-09-30 13:11:20 -07:00
Alexey Tumanov	2d0f439b7b	hugepage + plasma directory support plumbing + documentation (#1030 ) * hugepage + plasma directory support plumbing + documentation * Indentation fix. * huge_pages_enabled --> huge_pages * One more change	2017-09-30 09:56:52 -07:00
Robert Nishihara	b991dc8900	Add flag for ignoring the UI, don't start UI in jenkins tests. (#1021 )	2017-09-29 15:22:51 -07:00
Eric Liang	9f3a4fce50	[rllib] Parallelize sample collection and gradient computation in DQN (#746 ) * wip * works with cartpole * lint * fix pg * comment * action dist rename * preprocessor * fix test * typo * fix the action[0] nonsense * revert * satisfy the lint * wip * wip * works with cartpole * lint * fix pg * comment * action dist rename * preprocessor * fix test * typo * fix the action[0] nonsense * revert * satisfy the lint * Minor indentation changes. * fix merge * add humanoid * initial dqn refactor * remove tfutil * fix calls * fix tf errors 1 * closer * runs now * lint * tensorboard graph * fix linting * more 4 space * fix * fix linT * more lint * oops * es parity * remove example.py * fix training bug * add cartpole demo * try fixing cartpole * allow model options, configure cartpole * debug * simplify * no dueling * avoid out of file handles * Test dqn in jenkins. * Minor formatting. * lint * fix py3 * fix issue * remove chekcpoint * revert * Fixit * sanity check configs * update cuda * fix * parallel gradient computation * update * upd * bug * upd * always record training stats * fix * comments * revert assert * add gpu mask * fofset * a tie * Merge * fix * fix * fix examples * A3C -> DQN * fix dqn test * remove submodule * fix linting	2017-09-29 00:06:51 -07:00
Eric Liang	19562f6ce5	[rllib] Fix issues with PPO model restoration (#1018 ) * fix filter * add test * lint * fix * commit * Update a3c.py	2017-09-28 13:12:06 -07:00
Zongheng Yang	5a50e80b63	Make Monitor remove dead Redis entries from exiting drivers. (#994 ) * WIP: removing OL, OI, TT on client exit; no saving yet. * ray_redis_module.cc: update header comment. * Cleanup: just the removal. * Reformat via yapf: use pep8 style instead of google. * Checkpoint addressing comments (partially) * Add 'b' marker before strings (py3 compat) * Add MonitorTest. * Use `isort` to sort imports. * Remove some loggings * Fix flake8 noqa marker runtest.py * Try to separate tests out to monitor_test.py * Rework cleanup algorithm: correct logic * Extend tests to cover multi-shard cases * Add some small comments and formatting changes.	2017-09-26 00:11:38 -07:00
Eric Liang	5c70faf76b	Update common.py (#996 )	2017-09-19 10:10:56 -07:00
gycn	a432285e77	Disable parallelization for Actors and ray.wait for debugging (#961 ) Support actors and ray.wait in PYTHON_MODE.	2017-09-17 00:12:50 -07:00
Philipp Moritz	73f40bd844	[rllib] user defined preprocessor (#985 ) * add register_preprocessor to ModelCatalog * add pytest * make staticmethod a classmethod * update * install gym on travis * fix linting * fix	2017-09-16 15:53:19 -07:00
Eric Liang	98142ef51f	fix checkpoint (#988 )	2017-09-16 15:29:36 -07:00
Philipp Moritz	6601bb5f9e	[rllib] Make observation filter optional (#940 ) * make observation filter optional * fix linting	2017-09-14 17:37:19 -07:00
Richard Liaw	d516d9440e	Fixing local directory (#977 ) * Fixing local directory Enables ability to set custom local directory; code may be messy. * Create all intermediate parent directories	2017-09-14 10:33:52 -07:00
Philipp Moritz	1eb8c83314	[rllib] Initial RLLib documentation (#969 ) * initial documentation for RLLib * more RL documentation * fix linting * fix comments * update * fix	2017-09-12 23:38:21 -07:00
Eric Liang	9f42ef6a4f	[rllib] Make sure to always record stats like time elapsed, timesteps (#965 ) * always record training stats * fix * comments * revert assert * nan * fix	2017-09-12 14:28:16 -07:00
Eric Liang	e17412a72b	fix free log std param (#964 )	2017-09-11 18:52:48 -07:00
Stephanie Wang	99c8b1f38c	Actor fault tolerance using object lineage reconstruction (#902 ) * Revert Python actor reconstruction * Actor reconstruction using object lineage * Add dummy arguments and return values for actor tasks * Pin dummy outputs for actor tasks * Skip checkpointing test for now * TODOs * minor edits * Generate dummy object dependencies in Python, not C * Fix linting. * Move actor counter and dummy objects inside of the actor handle * Refactor Worker._process_task, suppress exception propagation for sequential actor tasks	2017-09-10 19:29:28 -07:00
Eric Liang	d8aa826e63	[webui] Scalability fixes for the task timeline and visualizations (#935 ) * fixes * comments * fix test * Update ui.py * upd * Fix linting.	2017-09-10 15:47:44 -07:00
Robert Nishihara	f3c1248d98	Clone catapult and generate html files during installation. (#956 ) * Clone catapult and generate static html during setup. * Include UI files in installation. * Fix directory to clone catapult to and fix linting. * Use absolute path. * Make sure we find a sufficiently new version of python2 when building wheels. * Copy the trace_viewer_full.html file to the local directory if it is not present. * Make sure wheels fail to build if UI is not included.	2017-09-10 13:41:16 -07:00
Philipp Moritz	546ba23ceb	Upgrade to latest arrow to include set serialization speedups (#957 ) * update arrow to pull in the set serialization speedups * remove _register_class for set	2017-09-10 00:12:17 -07:00
Eric Liang	953878364e	[webui] Print out timeline link for full-screen trace viewing (#936 ) * up * update	2017-09-06 01:41:21 -07:00
Wapaul1	e19e2c6284	Print jupyter notebook token when starting web UI. (#887 ) * User now only needs to copy url to get to notebook * Fixed duplicate code * Added function to print url * Added exception for calling function on worker * Stored webui url in Redis * Fix linting and simplify code. * Now uses 24 bytes hex token * Fixed python 3 compatibility * Fix linting and python 3 compat * Added comment explaining generating the token. * Removed newline * Small fixes. * Fixed jenkins failure * Rebased and changed formatting * Revert "changed formatting" This reverts commit 226510cf0cdcaab9cf42ad30bd9588a963683592.	2017-09-05 23:31:44 -07:00
Robert Nishihara	853969225b	Sleep longer when starting plasma manager in valgrind case to catch errors where port bind fails. (#934 )	2017-09-05 20:58:12 -07:00
Philipp Moritz	7030ef366f	Rebase Ray on latest arrow (remove numbuf from Ray). (#910 ) * remove some stuff * put get roundtrip working * fixes * more fixes * cleanup * fix tests * latest arrow * fixes * fix tests * fix linting * rebase * fixes * fix bug * bring back libgcc error * fix linting * use official arrow repo * fixes	2017-09-04 22:58:49 -07:00
Eric Liang	a2814567e1	[webui] Quick fix to timeline on task failure (#930 ) * foo * update * Move _add_missing_timestamps to task_profiles function.	2017-09-04 22:58:19 -07:00
Eric Liang	63d8d11714	[webui] Checkboxes should go to the left of their labels (#932 )	2017-09-04 17:05:13 -07:00
Robert Nishihara	8ed03b1cf0	Make task timeline work with ipywidgets==7.0.0, change slider default values. (#925 ) * Make task timeline work with ipywidgets==7.0.0. * Change initial UI slider values from 70-100 to 0-100.	2017-09-03 23:15:46 -07:00
Eric Liang	246be812f0	upd (#917 )	2017-09-02 23:55:10 -07:00
Eric Liang	1ebfe9608f	[rllib] Add downscale and frameskip options for Montezumas (#908 ) * up * update * fix * update * update * update * api break * Update run_multi_node_tests.sh * fix	2017-09-02 17:20:56 -07:00
Robert Nishihara	deca29a7eb	Bump version to 0.2.0. (#877 )	2017-08-29 21:38:35 -07:00
Philipp Moritz	164a8f368e	[rllib] Rename algorithms (#890 ) * rename algorithms * fix * fix jenkins test * fix documentation * fix	2017-08-29 16:56:42 -07:00
Robert Nishihara	e1831792f8	For PPO, rename num_agents -> num_workers. (#882 )	2017-08-28 23:11:06 -07:00
Robert Nishihara	1afc487baf	In setup.py, move cython to setup_requires. (#878 ) * In setup.py, move cython to setup_requires and move setuptools_scm to setup_requires. * Add back pip install of cython when building mac wheels. * Revert changes to setuptools_scm. * Check that the correct number of Linux wheels are produced. * Add back pip install cython when building linux wheels.	2017-08-28 23:07:33 -07:00
Robert Nishihara	60d4d01d06	Use observation filter in compute_action for PPO. (#884 )	2017-08-28 23:01:29 -07:00
Richard Liaw	5d72818ddc	Generic `shared_model` class (#880 ) Changing `shared_model` class back to `get_model` rather than `ConvolutionalNetwork`	2017-08-28 22:48:07 -07:00
Wapaul1	4db45c9c54	Improved layout of controls for Web UI (#876 ) * Improved layout of controls * Added explicit labels and some comments * Fix linting errors	2017-08-28 14:43:34 -07:00
Richard Liaw	bc082e9a9e	[rllib] Additional support for Shared Models in A3C (#866 ) * Code for Supporting Shared Models Running (with vnet modification) - needs to be tested for performance Summaries Small refactoring + generalized to more domains Small fix for jenkins Linting linting Addressing changes Addressing changes Update envs.py Addressing changes convnet Merge - new model final touches final linting Changing iterations back removed extra change changes for fast experimentation changes to enable a2c TEMP FOR DEBUGGING ContinuousActions - Still doesn't work InvertedPendulum trains with 8 workers - k=200 huber loss Maxes for InvertedPendulum-v1 - 16w,200steps temp: working with a2c Back to shared model more fixes small nit LSTM to shared models need to fix last_features tuning pong Best record for hitting 0 - with k=16,n=20 nit a2cremoval remove A2c reference and nits nit removed a2c vestiges removing a2c removing example.py Linting nit * Linting + Removing vestigal code * Final Touches * nits * rerun travis	2017-08-28 12:23:14 -07:00
Eric Liang	c977fe8895	[rllib] Full checkpoint/restore for all algorithms (#875 ) * wip * working for all but dqn * update * add train * rename * update * Update test	2017-08-27 18:56:52 -07:00
Robert Nishihara	d43a435c68	Don't redirect worker output to log files if redirect_output=False. (#873 ) * Don't redirect worker output to log files if redirect_output=False. * Fix, handle case where RedirectOutput key is not in Redis.	2017-08-27 14:27:44 -07:00
Eric Liang	617bc4d239	[rllib] Make the free_logstd param generic (#863 ) * make free log std param generic * fixes * fixes	2017-08-24 12:43:51 -07:00
Eric Liang	46641a642f	[rllib] (take 2) Add top-level checkpoint/restore/compute_action APIs to rllib (#868 ) * add top-level checkpoint/restore api to rllib * todos	2017-08-24 00:09:33 -07:00
Philipp Moritz	791bee343f	[rllib] Implement GAE for PPO (#849 ) * make information available for GAE * buggy version of GAE estimator * fix * add more logging and reweight losses * fix logging * fix loss * adapt advantage calculation * update gae * standardize returns * don't normalize td lambda ret * fix * don't standardize advantages * do standardization earlier * different standardization * initializer * drop into the debugger * fix tensorflow broadcasting bug * vf clipping * don't standardize tdlambdaret * different standardization * use huber loss for value function * refactor -- first half * it runs * fix * update * documentation * linting and tests * fix linting * naming * fix * linting * fix * remove prefix madness * fixes * fix * add value function example * fix linting * remove newline	2017-08-23 20:35:47 -07:00
Eric Liang	c943ecaa42	Better error message when actor creation is attempted before ray.init() (#858 )	2017-08-22 21:20:15 -07:00
Eric Liang	e2f2a7e57a	[rllib] Pick preprocessor based on obs shape (#855 ) * update * auto choose	2017-08-22 16:46:55 -07:00
Eric Liang	c81821b856	[rllib] Make Pong-v0 + EvolutionStrategies work by sharing preprocessors with PPO (#848 ) * fix by sharing preprocessors * revert param changeg * Update evolution_strategies.py * Update catalog.py	2017-08-21 18:51:49 -07:00
Robert Nishihara	be4beb19c1	Changes to build to fix creation of wheels. (#840 ) * Pass DPYTHON_EXECUTABLE into cmake for arrow and for ray. * Add cython to setup.py install_requires. * Revert custom code for finding python in cmake. * Correctly find arrow on CentOS. * In cmake, don't find PythonLibs, just find PYTHON_INCLUDE_DIRS. * Fix typo. * Do not use boost shared libraries when building arrow. * Add six to the setup.py install_requires because it is needed by pyarrow. * Don't link numbuf against boost_system and boost_filesystem. * Compile boost when we are on Linux. * Make numbuf find the correct boost libraries. * Only use find_package Boost on Linux, suppress output when building boost. * Changes to wheel building scripts, install cython in mac script. * Compile flatbuffers ourselves on Linux and pass it in when compiling Arrow. * Clean up build_flatbuffers.sh and build_boost.sh scripts a little. * Install cython when building linux wheel.	2017-08-21 17:49:35 -07:00
Robert Nishihara	cf41964816	Fix some bugs causing Travis test failures. (#839 ) * Fix bug in which worker has no actor_class attribute. * Remove case where we check if processes are defunct.	2017-08-16 01:18:32 -07:00
Robert Nishihara	ca53e9ae7b	Fix bugs in task timeline visualization. (#836 ) * Fix bugs in task timeline visualization. * Some cleanups. * Remove print statements.	2017-08-13 23:39:37 -07:00
Robert Nishihara	a75ccc8032	Fix socket error exception in Python 2. (#833 )	2017-08-13 23:14:26 -07:00

... 116 117 118 119 120 ...

6051 commits