fugue
An abstraction layer for distributed computation
An abstraction layer for distributed computation
To install this package, run one of the following:
Fugue is a unified interface for distributed computing that lets users execute Python, pandas, and SQL code on Spark and Dask without rewrites. It is meant for data scientists/analysts who want to focus on defining logic rather than worrying about execution. It is also suitable for SQL users wanting to use SQL to define end-to-end workflows in pandas, Spark, and Dask. Data scientists using pandas wanting to take advantage of Spark or Dask with minimal effort, as well as big data practitioners finding testing code to be costly and slow would also find Fugue useful.
Summary
An abstraction layer for distributed computation
Last Updated
Feb 20, 2026 at 22:27
License
Apache-2.0
Total Downloads
179.1K
Version Downloads
7
Supported Platforms
Documentation
https://fugue-tutorials.readthedocs.io/