Benchmark Results for Spark, Dask, DuckDB, and Polars
=====================================================

.. tab-set::

   .. tab-item:: Local

       .. tab-set::

          .. tab-item:: 10 GB

            .. altair-plot::
               :remove-code:

               import altair as alt
               alt.Chart.from_json(open("charts/local-10.json").read())

          .. tab-item:: 100 GB

            .. altair-plot::
               :remove-code:

               alt.Chart.from_json(open("charts/local-100.json").read())

   .. tab-item:: Cloud
       :selected:

       .. tab-set::

          .. tab-item:: 10 GB

            .. altair-plot::
               :remove-code:

               import altair as alt
               alt.Chart.from_json(open("charts/cloud-10.json").read())

          .. tab-item:: 100 GB

            .. altair-plot::
               :remove-code:

               alt.Chart.from_json(open("charts/cloud-100.json").read())

          .. tab-item:: 1 TB
            :selected:

            .. altair-plot::
               :remove-code:

               alt.Chart.from_json(open("charts/cloud-1000.json").read())

          .. tab-item:: 10 TB

            .. altair-plot::
               :remove-code:

               alt.Chart.from_json(open("charts/cloud-10000.json").read())

Source code for all queries is available in the coiled/benchmarks repository. Watch our YouTube video for more details on why we chose TPC-H, how we attempt to address our Dask bias, where Dask can be more efficient, and how you can run these benchmarks yourself.