[WIP] experimental memref_stream unroll-and-jam pass using interpreter-based cost model #3724
base: main
Conversation
Codecov Report: All modified and coverable lines are covered by tests ✅

Additional details and impacted files:

```diff
@@            Coverage Diff             @@
##             main    #3724      +/-   ##
==========================================
- Coverage   91.30%   91.28%    -0.02%
==========================================
  Files         468      471        +3
  Lines       58636    58697       +61
  Branches     5656     5661        +5
==========================================
+ Hits        53535    53581       +46
- Misses       3650     3662       +12
- Partials     1451     1454        +3
```

View full report in Codecov by Sentry.
Force-pushed from d0066cc to d770896.
In the marimo notebook, when I click on

uh oh
I have a feeling it was a Python 3.12 thing; it should work now.

OK, despite the CI still failing, it should work locally for you.
It's nice. I like the "lazy pass" pattern, and I like the fact that the unroll-and-jam is confined to the linalg level (rather than materializing loops). In the case of this cost model, is it the reduction in the number of jumps/blt instructions (thanks to unrolling) that lowers the cost?
Yep, basically. It should be relatively easy to make the cost model a bit smarter and to account for instruction latencies.
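To make the discussion above concrete, here is a minimal toy sketch (not the PR's actual cost model) of why unrolling reduces cost under a branch-counting model: the per-iteration body work stays the same, but unrolling by a factor divides the number of loop-back branches (`blt`). The function names and the `branch_cost` constant are illustrative assumptions, not part of the PR.

```python
# Toy cost model, illustrative only: unrolling by `factor` divides the
# number of backward branches while leaving total body work unchanged.

def branch_count(trip_count: int, unroll_factor: int) -> int:
    """Number of loop-back branches executed after unrolling."""
    return trip_count // unroll_factor


def cost(trip_count: int, body_cost: int, unroll_factor: int,
         branch_cost: int = 3) -> int:
    """Total cost = body work + branch overhead (hypothetical weights)."""
    return (trip_count * body_cost
            + branch_count(trip_count, unroll_factor) * branch_cost)


# Unrolling by 4 removes three quarters of the branches, so cost drops.
assert cost(64, 10, 1) > cost(64, 10, 4)
```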
Force-pushed from 59871dc to 45d57ed.
Most of the juicy stuff here is in the autotune.py marimo notebook, best experienced by checking out this branch and running `uv run marimo edit docs/marimo`. The main idea here is to have a system that proposes a bunch of rewrites, and then evaluates each of these rewrites based on an additional lowering plus interpreter tracing.
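The propose-then-evaluate loop described above can be sketched as follows. This is a hypothetical outline, not the notebook's implementation: `propose_rewrites`, `lower`, and `trace_cost` are stand-ins for the candidate generator, the additional lowering, and the interpreter-based cost measurement.

```python
# Hypothetical sketch of the autotuning loop: generate candidate rewrites,
# lower each one, trace it in an interpreter to get a cost, keep the best.
from typing import Callable, Iterable, TypeVar

M = TypeVar("M")  # stands in for a compiler IR module


def autotune(
    module: M,
    propose_rewrites: Callable[[M], Iterable[M]],  # candidate generator
    lower: Callable[[M], M],                       # additional lowering
    trace_cost: Callable[[M], float],              # interpreter tracing
) -> M:
    """Return the candidate whose lowered, traced cost is lowest."""
    best, best_cost = module, trace_cost(lower(module))
    for candidate in propose_rewrites(module):
        c = trace_cost(lower(candidate))
        if c < best_cost:
            best, best_cost = candidate, c
    return best
```

With strings standing in for modules and `len` as the cost, `autotune("dddd", lambda m: ["a", "bb"], lambda m: m, len)` picks the shortest candidate.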