Benchmark Results for Spark, Dask, DuckDB, and Polars

.. tab-set::

   .. tab-item:: Local

       .. tab-set::

          .. tab-item:: 10 GB

            .. altair-plot::
               :remove-code:

               import altair as alt
               alt.Chart.from_json(open("charts/local-10.json").read())

          .. tab-item:: 100 GB

            .. altair-plot::
               :remove-code:

               alt.Chart.from_json(open("charts/local-100.json").read())

   .. tab-item:: Cloud
      :selected:

       .. tab-set::

          .. tab-item:: 10 GB

            .. altair-plot::
               :remove-code:

               import altair as alt
               alt.Chart.from_json(open("charts/cloud-10.json").read())

          .. tab-item:: 100 GB

            .. altair-plot::
               :remove-code:

               alt.Chart.from_json(open("charts/cloud-100.json").read())

          .. tab-item:: 1 TB
            :selected:

            .. altair-plot::
               :remove-code:

               alt.Chart.from_json(open("charts/cloud-1000.json").read())

          .. tab-item:: 10 TB

            .. altair-plot::
               :remove-code:

               alt.Chart.from_json(open("charts/cloud-10000.json").read())

Source code for all queries is available in the coiled/benchmarks repo. Watch our YouTube video for more details on why TPC-H, how we attempt to address our Dask bias, where Dask can be more efficient, and how you can run these benchmarks yourself.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

index.rst

index.rst

Benchmark Results for Spark, Dask, DuckDB, and Polars

Files

index.rst

Latest commit

History

index.rst

File metadata and controls

Benchmark Results for Spark, Dask, DuckDB, and Polars