Revisions of python-dask
Dominique Leuenberger (dimstar_suse)
accepted
request 1171090
from
Dirk Mueller (dirkmueller)
(revision 77)
Ana Guerrero (anag+factory)
accepted
request 1146835
from
Matej Cepl (mcepl)
(revision 75)
Forwarded request #1146758 from bnavigator - Update to 2024.2.0 * Deprecate Dask DataFrame implementation * Improved tokenization * https://docs.dask.org/en/stable/changelog.html#v2024-2-0 - Really drop python39 from testing instead of testing it with every other test flavor
Ana Guerrero (anag+factory)
accepted
request 1142781
from
Dirk Mueller (dirkmueller)
(revision 74)
- update to 2024.1.1: * This release contains compatibility updates for the latest pandas and scipy releases. See :pr:`10834`, :pr:`10849`, :pr:`10845`, and :pr-distributed:`8474` from `crusaderky`_ for details.
Ana Guerrero (anag+factory)
accepted
request 1140136
from
Dirk Mueller (dirkmueller)
(revision 73)
- update to 2024.1.0: * Released on January 12, 2024 * P2P rechunking now utilizes the relationships between input and output chunks. For situations that do not require all-to- all data transfer, this may significantly reduce the runtime and memory/disk footprint. It also enables task culling. * The fastparquet Parquet engine has been deprecated. Users should migrate to the pyarrow engine by installing PyArrow and removing engine="fastparquet" in read_parquet or to_parquet calls. * This release improves serialization robustness for arbitrary data. Previously there were some cases where serialization could fail for non-msgpack serializable data. In those cases we now fallback to using pickle. * Deprecate shuffle keyword in favour of shuffle_method for DataFrame methods (:pr:`10738`) `Hendrik Makait`_ * Deprecate automatic argument inference in repartition * Deprecate compute parameter in set_index * Deprecate inplace in eval * Deprecate Series.view * Deprecate npartitions="auto" for set_index & sort_values
Ana Guerrero (anag+factory)
accepted
request 1135096
from
Factory Maintainer (factory-maintainer)
(revision 72)
Automatic submission by obs-autosubmit
Dominique Leuenberger (dimstar_suse)
accepted
request 1132242
from
Factory Maintainer (factory-maintainer)
(revision 71)
Automatic submission by obs-autosubmit
Ana Guerrero (anag+factory)
accepted
request 1127184
from
Ondřej Súkup (mimi_vx)
(revision 70)
- Update to 2023.11.0 * Zero-copy P2P Array Rechunking * Deprecating PyArrow <14.0.1 * Improved PyArrow filesystem for Parquet * Improve Type Reconciliation in P2P Shuffling * official support for Python 3.12 * Reduced memory pressure for multi array reductions * improved P2P shuffling robustness * Reduced scheduler CPU load for large graphs (forwarded request 1127183 from mimi_vx)
Dominique Leuenberger (dimstar_suse)
accepted
request 1092262
from
Dirk Mueller (dirkmueller)
(revision 68)
Dominique Leuenberger (dimstar_suse)
accepted
request 1090990
from
Steve Kowalik (StevenK)
(revision 67)
- Tighten bokeh requirement to match distributed. - Update to 2023.5.1 * This release drops support for Python 3.8. As of this release Dask supports Python 3.9, 3.10, and 3.11. ## Enhancements * Drop Python 3.8 support (GH#10295) Thomas Grainger * Change Dask Bag partitioning scheme to improve cluster saturation (GH#10294) Jacob Tomlinson * Generalize dd.to_datetime for GPU-backed collections, introduce get_meta_library utility (GH#9881) Charles Blackmon-Luca * Add na_action to DataFrame.map (GH#10305) Patrick Hoefler * Raise TypeError in DataFrame.nsmallest and DataFrame.nlargest when columns is not given (GH#10301) Patrick Hoefler * Improve sizeof for pd.MultiIndex (GH#10230) Patrick Hoefler * Support duplicated columns in a bunch of DataFrame methods (GH#10261) Patrick Hoefler * Add numeric_only support to DataFrame.idxmin and DataFrame.idxmax (GH#10253) Patrick Hoefler * Implement numeric_only support for DataFrame.quantile (GH#10259) Patrick Hoefler * Add support for numeric_only=False in DataFrame.std (GH#10251) Patrick Hoefler * Implement numeric_only=False for GroupBy.cumprod and GroupBy.cumsum (GH#10262) Patrick Hoefler * Implement numeric_only for skew and kurtosis (GH#10258) Patrick Hoefler * mask and where should accept a callable (GH#10289) Irina Truong * Fix conversion from Categorical to pa.dictionary in read_parquet (GH#10285) Patrick Hoefler
Dominique Leuenberger (dimstar_suse)
accepted
request 1070217
from
Dirk Mueller (dirkmueller)
(revision 62)
Dominique Leuenberger (dimstar_suse)
accepted
request 1063413
from
Daniel Garcia (dgarcia)
(revision 60)
- Update to 2023.1.1 ## Enhancements * Add to_backend method to Array and _Frame (GH#9758) Richard (Rick) Zamora * Small fix for timestamp index divisions in pandas 2.0 (GH#9872) Irina Truong * Add numeric_only to DataFrame.cov and DataFrame.corr (GH#9787) James Bourbeau * Fixes related to group_keys default change in pandas 2.0 (GH#9855) Irina Truong * infer_datetime_format compatibility for pandas 2.0 (GH#9783) James Bourbeau ## Bug Fixes * Fix serialization bug in BroadcastJoinLayer (GH#9871) Richard (Rick) Zamora * Satisfy broadcast argument in DataFrame.merge (GH#9852) Richard (Rick) Zamora * Fix pyarrow parquet columns statistics computation (GH#9772) aywandji ## Documentation * Fix “duplicate explicit target name” docs warning (GH#9863) Chiara Marmo * Fix code formatting issue in “Defining a new collection backend” docs (GH#9864) Chiara Marmo * Update dashboard documentation for memory plot (GH#9768) Jayesh Manani * Add docs section about no-worker tasks (GH#9839) Florian Jetter ## Maintenance * Additional updates for detecting a distributed scheduler (GH#9890) James Bourbeau * Update gpuCI RAPIDS_VER to 23.04 (GH#9876) * Reverse precedence between collection and distributed default (GH#9869) Florian Jetter * Update xarray-contrib/issue-from-pytest-log to version 1.2.6 (GH#9865) James Bourbeau * Dont require dask config shuffle default (GH#9826) Florian Jetter * Un-xfail datetime64 Parquet roundtripping tests for new fastparquet (GH#9811) James Bourbeau * Add option to manually run upstream CI build (GH#9853) James Bourbeau * Use custom timeout in CI builds (GH#9844) James Bourbeau * Remove kwargs from make_blockwise_graph (GH#9838) Florian Jetter * Ignore warnings on persist call in test_setitem_extended_API_2d_mask (GH#9843) Charles Blackmon-Luca * Fix running S3 tests locally (GH#9833) James Bourbeau - Release 2023.1.0 ## Enhancements * Use distributed default clients even if no config is set (GH#9808) Florian Jetter * Implement ma.where and ma.nonzero (GH#9760) Erik Holmgren * Update zarr store creation functions (GH#9790) Ryan Abernathey * iteritems compatibility for pandas 2.0 (GH#9785) James Bourbeau * Accurate sizeof for pandas string[python] dtype (GH#9781) crusaderky * Deflate sizeof() of duplicate references to pandas object types (GH#9776) crusaderky * GroupBy.__getitem__ compatibility for pandas 2.0 (GH#9779) James Bourbeau * append compatibility for pandas 2.0 (GH#9750) James Bourbeau * get_dummies compatibility for pandas 2.0 (GH#9752) James Bourbeau * is_monotonic compatibility for pandas 2.0 (GH#9751) James Bourbeau * numpy=1.24 compatability (GH#9777) James Bourbeau ## Documentation * Remove duplicated encoding kwarg in docstring for to_json (GH#9796) Sultan Orazbayev * Mention SubprocessCluster in LocalCluster documentation (GH#9784) Hendrik Makait * Move Prometheus docs to dask/distributed (GH#9761) crusaderky ## Maintenance * Temporarily ignore RuntimeWarning in test_setitem_extended_API_2d_mask (GH#9828) James Bourbeau * Fix flaky test_threaded.py::test_interrupt (GH#9827) Hendrik Makait * Update xarray-contrib/issue-from-pytest-log in upstream report (GH#9822) James Bourbeau * pip install dask on gpuCI builds (GH#9816) Charles Blackmon-Luca * Bump actions/checkout from 3.2.0 to 3.3.0 (GH#9815) * Resolve sqlalchemy import failures in mindeps testing (GH#9809) Charles Blackmon-Luca * Ignore sqlalchemy.exc.RemovedIn20Warning (GH#9801) Thomas Grainger * xfail datetime64 Parquet roundtripping tests for pandas 2.0 (GH#9786) James Bourbeau * Remove sqlachemy 1.3 compatibility (GH#9695) McToel * Reduce size of expected DoK sparse matrix (GH#9775) Elliott Sales de Andrade * Remove executable flag from dask/dataframe/io/orc/utils.py (GH#9774) Elliott Sales de Andrade - Drop dask-pr9777-np1.24.patch
Dominique Leuenberger (dimstar_suse)
accepted
request 1037296
from
Markéta Machová (mcalabkova)
(revision 58)
Displaying revisions 1 - 20 of 77