How to find the minimum of three double numbers? It may be surprising to you (it certainly was to me), but there is more than one way to do it, with a big difference in performance as well. It is possible to make this simple calculation significantly faster by exploiting CPU-level parallelism.
+
The phenomenon described in this blog post was observed in this
+thread of the Rust forum. I am not the one who found out what is
+going on, I am just writing it down :)
+
We will be using Rust, but the language is not important, the original program
+was in Java. What will turn out to be important is CPU architecture. The laptop
+on which the measurements are done has i7-3612QM.
We will be measuring dynamic time warping algorithm. This algorithm
+calculates a distance between two real number sequences, xs and ys. It is
+very similar to edit distance or Needleman–Wunsch,
+because it uses the same dynamic programming structure.
+
The main equation is
+
+
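For reference, the standard DTW recurrence has roughly this shape (a reconstruction, assuming absolute difference as the pointwise cost):

$$
\mathrm{dtw}(i, j) = |xs_i - ys_j| + \min\bigl(\mathrm{dtw}(i-1, j),\ \mathrm{dtw}(i, j-1),\ \mathrm{dtw}(i-1, j-1)\bigr)
$$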
+
That is, we calculate the distance between each pair of prefixes of xs and
+ys using the distances from three smaller pairs. This calculation can be
+represented as a table where each cell depends on three others:
+
+
+
It is possible to avoid storing the whole table explicitly. Each row depends
+only on the previous one, so we need to store only two rows at a time.
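A minimal sketch of this two-row scheme (function and variable names are mine, not necessarily the ones from the measured program):

```rust
// Dynamic time warping distance, keeping only two rows of the DP table.
fn dtw(xs: &[f64], ys: &[f64]) -> f64 {
    let inf = f64::INFINITY;
    let mut prev = vec![inf; ys.len() + 1];
    let mut curr = vec![inf; ys.len() + 1];
    prev[0] = 0.0;
    for &x in xs {
        curr[0] = inf;
        for (iy, &y) in ys.iter().enumerate() {
            let d = (x - y).abs();
            // minimum of the left, upper and diagonal cells
            let m = curr[iy].min(prev[iy]).min(prev[iy + 1]);
            curr[iy + 1] = d + m;
        }
        std::mem::swap(&mut prev, &mut curr);
    }
    prev[ys.len()]
}
```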
Is it fast? If we compile it in --release mode with
+
+
+
in ~/.cargo/config, it takes 435 milliseconds for two
+random sequences of length 10000.
+
What is the bottleneck? Let’s look at the instruction level profile of the main
+loop using perf annotate command:
+
+
+
perf annotate uses AT&T assembly syntax, which means that the destination register is on the right.
+
The xmm0 register holds the value of curr[iy], which was calculated on the
+previous iteration. Values of prev[iy - 1] and prev[iy] are fetched into
+xmm1 and xmm2. Note that although the original code contained three if
+expressions, the assembly does not have any jumps and instead uses two min and
+one blend instruction to select the minimum. Nevertheless, a significant
+amount of time, according to perf, is spent calculating the minimum.
This version completes in 430 milliseconds, which is a nice win of 5
+milliseconds over the first version, but is not that impressive. The assembly
+looks cleaner though:
+
+
+
Up to this point it was a rather boring blog post about Rust with some assembly
+thrown in. But let’s tweak the last variant just a little bit …
This version takes only 287 milliseconds to run, which is roughly 1.5 times
+faster than the previous one! However, the assembly looks almost the same …
+
+
+
The only difference is that two vminsd instructions are swapped.
+But it is definitely much faster.
A possible explanation is a synergy of CPU level parallelism and speculative
+execution. It was proposed by @krdln and @vitalyd. I don’t know how to
+falsify it, but it at least looks plausible to me!
+
Imagine for a second that instead of vminsd %xmm0,%xmm1,%xmm0 instruction
+in the preceding assembly there is just vmovsd %xmm1,%xmm0. That is, we don’t
+use xmm0 from the previous iteration at all! This corresponds to the following
+update rule:
+
+
+
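Concretely, the hypothetical rule would be something like this (a schematic line, with curr/prev as before and d standing for the pointwise cost):

```rust
// The freshly computed cell to the left, curr[i - 1], is not used at all.
curr[i] = d + prev[i - 1].min(prev[i]);
```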
The important property of this update rule is that CPU can calculate two cells
+simultaneously in parallel, because there is no data dependency between
+curr[i] and curr[i + 1].
+
We do have vminsd %xmm0,%xmm1,%xmm0, but it is equivalent to vmovsd %xmm1,%xmm0 if xmm1 is smaller than xmm0. And this is often the case: xmm1 holds the minimum of the upper and diagonal cells, so it is likely to be less than the single cell to the left. Also, the diagonal path is taken slightly more often than the two alternatives, which adds to the bias.
+
So it looks like the CPU is able to speculatively execute vminsd and
+parallelise the following computation based on this speculation! Isn’t that
+awesome?
Despite the fact that Rust is a high-level language, there is a strong correlation between the source code and the generated assembly. Small tweaks to the source result in small changes to the assembly, with potentially big implications for performance. Also, perf is great!
+
That’s all :)
Min of Three Part 2
It calculates dynamic time warping distance between two double
+vectors using an update rule which is structured like this:
+
+
+
This code takes 293 milliseconds to run on a particular input
+data. The speedup from 435 milliseconds stated in the previous post is
+due to Moore’s law: I’ve upgraded the CPU :)
+
We can bring run time down by tweaking how we calculate the minimum of
+three elements.
This version takes only 210 milliseconds, presumably because the
+minimum of two elements in the previous row can be calculated without
+waiting for the preceding element in the current row to be computed.
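The tweak is presumably a reassociation along these lines (a sketch; the names are mine):

```rust
// The two cells from the previous row are combined first, so this part of
// the minimum does not have to wait for the cell to the left, which is
// produced only at the end of the previous iteration.
fn min3(curr_left: f64, prev_diag: f64, prev_up: f64) -> f64 {
    prev_diag.min(prev_up).min(curr_left)
}
```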
+
The assembly for the main loop looks like this (AT&T syntax, destination register on the right).
Can we loosen dependencies between cells even more to benefit from instruction-level parallelism? What if, instead of filling the table row by row, we fill it by diagonals?
+
+
+
We’d need to remember two previous diagonals instead of one previous
+row, but all the cells on the next diagonal would be independent! In
+theory, compiler should be able to use SIMD instructions to make the
+computation truly parallel.
It takes 185 milliseconds to run. The assembly for the main loop is quite interesting:
+
+
+
First of all, we don’t see any vectorized instructions; the code does roughly the same operations as in the previous version. Also, there is a whole bunch of extra branching instructions at the top. These are bounds checks which were not eliminated this time. And this is great: if I added up all the off-by-one errors I’ve made implementing diagonal indexing, I would get an integer overflow! Nevertheless, we’ve got some speedup.
+
Can we go further and get SIMD instructions here? At the moment, Rust does not have a stable way to explicitly emit SIMD (it’s going to change some day) (UPDATE: we have SIMD on stable now!), so the only choice we have is to tweak the source code until LLVM sees an opportunity for vectorization.
How can we get the same results with safe Rust? One possible way is to use iterators, but in this case the resulting code would be rather ugly, because you’d need a lot of nested .zips. So let’s try the simple trick of hoisting the bounds checks out of the loop. The idea is to transform this:
+
+
+
into this:
+
+
+
In Rust, this is possible by explicitly slicing the buffer before the loop:
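A sketch of the pattern (the real loop is the DTW update; this only shows the slicing trick):

```rust
// Slicing up front lets the compiler prove that every index below `n` is in
// bounds, so the per-iteration bounds checks can be eliminated.
fn add_prev_row(prev: &[f64], curr: &mut [f64], n: usize) {
    let prev = &prev[..n];
    let curr = &mut curr[..n];
    for i in 0..n {
        curr[i] += prev[i];
    }
}
```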
This is definitely an improvement over the best safe version, but is
+still twice as slow as the unsafe variant. Looks like some bounds
+checks are still there! It is possible to find them by selectively
+using unsafe to replace some indexing operations.
We’ve gone from almost 300 milliseconds to only 50 in safe Rust. That
+is quite impressive! However, the resulting code is rather brittle and
+even small changes can prevent vectorization from triggering.
+
It’s also important to understand that to allow for SIMD, we had to
+change the underlying algorithm. This is not something even a very
+smart compiler could do!
I tried installing the stable 16.09 version first, but the live CD didn’t manage to start the X server properly. This was easy to fix by switching to the then-beta 17.03.
It is my first system which uses UEFI instead of BIOS, and I was pleasantly surprised by how everything just worked. The documentation contains only a short paragraph about UEFI, but it’s everything you need. The only hiccup on my side happened when I enabled GRUB together with systemd-boot: you don’t need GRUB at all, systemd-boot is a bootloader which handles everything.
After I installed everything, I was presented with a blank screen instead of my desktop environment (with the live CD everything worked). It took me ages to debug the issue, while the fix was super trivial: add videoDrivers = [ "intel" ]; to the xserver config and "nouveau" to blacklistedKernelModules.
While nix is the best way to manage a Linux desktop I am aware of, rustup is the most convenient way of managing Rust toolchains. Unfortunately it’s not easy to make rustup play nicely with NixOS (UPDATE: rustup is now packaged in nixpkgs and just works). Rustup downloads binaries of the compiler and Cargo, but it is impossible to launch unmodified binaries on NixOS because it lacks a conventional loader.
+
The fix I came up with is a horrible hack which goes against
+everything in NixOS. Here it is:
+
+
+
It makes the loader and shared libraries (rustup needs zlib) visible
+to binaries compiled for x64 Linux.
Another software which I wish to update somewhat more frequently than
+other packages is IntelliJ IDEA (I write a fair amount of Kotlin and
+Rust). NixOS has a super convenient mechanism to do this:
+packageOverrides. Here is my ~/nixpkgs/config.nix:
+
+
+
It lets me use the most recent IDEA with the stable NixOS channel.
If you are wondering how debuggers work, I suggest reading Eli Bendersky’s eli-on-debuggers. However, after having read these notes myself, I still had one question unanswered. Namely, how can a debugger show the fields of a class, if the type of the class is known only at runtime?
Consider this situation: you have a pointer of type A*, which at runtime holds
+a value of some subtype of A. Could the debugger display the fields of the
+actual type? Turns out, it can handle cases like the one below just fine!
Could it be possible that information about dynamic types is present in DWARF? If we look at the DWARF, we’ll see that there’s layout information for both the Base and Derived types, as well as an entry for the x parameter, which says that it has type Base. And this makes sense: we don’t know that x is Derived until runtime! So the debugger must somehow figure out the type of the variable dynamically.
As usual, there’s no magic. For example, LLDB has hard-coded knowledge of the C++ programming language, which allows the debugger to inspect types at runtime. Specifically, this is handled by the LanguageRuntime LLDB plugin, which has a curious function GetDynamicTypeAndAddress, whose job is to poke at the representation of a value to get its real type and adjust the pointer, if necessary (remember, with multiple inheritance, casts may change the value of the pointer).
+
The implementation of this function for the C++ language lives in ItaniumABILanguageRuntime.cpp. Although, unlike C, C++ lacks a standardized ABI, almost all compilers on all non-Windows platforms use a specific ABI, confusingly called Itanium (after a now effectively dead 64-bit CPU architecture).
Make your own make
One of my favorite features of Cargo is that it is not a general
+purpose build tool. This allows Cargo to really excel at the task of building
+Rust code, without usual Turing tarpit of build configuration files. I have yet
+to see a complicated Cargo.toml file!
+
However, once a software project grows, it’s almost inevitable that it will require some tasks besides building Rust code. For example, you might need to integrate several languages together, or to set up some elaborate testing for non-code aspects of your project, like checking the licenses, or to establish an involved release procedure.
+
For such use-cases, a general purpose task automation solution is needed. In
+this blog post I want to describe one possible approach, which leans heavily on
+Cargo’s built-in functionality.
The simplest way to automate something is to write a shell script. However there
+are few experts in the arcane art of shell scripting, and shell scripts are
+inherently platform dependent.
+
The same goes for make, with its many annoyingly similar flavors.
+
Two tools which significantly improve on the ease of use and ergonomics are
+just and cargo make. Alas, they still mostly rely on the
+shell to actually execute the tasks.
An obvious idea is to use Rust for task automation. Originally, I have proposed
+creating a special Cargo subcommand to execute build tasks, implemented as Rust
+programs, in this
+thread.
+However, since then I realized that there are built-in tools in Cargo which
+allow one to get a pretty ergonomic solution. Namely, the combination of
+workspaces, aliases and ability to define binaries seems to do the trick.
If you just want a working example, see this
+commit.
+
A typical Rust project looks like this
+
+
+
Suppose that we want to add a couple of tasks, like generating some code from
+some specification in the RON format, or
+grepping the source code for TODO marks.
+
First, create a special tools package:
+
+
+
The tools/Cargo.toml might look like this:
+
+
+
Then, we add a
+[workspace]
+to the parent package:
+
+
+
We need this section because tools is not a dependency of frobnicator, so it
+won’t be picked up automatically.
+
Then, we write code to accomplish the tasks in tools/src/bin/gen.rs and
+tools/src/bin/todo.rs.
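For example, a minimal tools/src/bin/todo.rs could look like this (a sketch; the real task can be arbitrarily involved):

```rust
use std::{fs, io, path::Path};

// Recursively walk the source tree and print lines containing TODO marks.
fn visit(path: &Path) -> io::Result<()> {
    for entry in fs::read_dir(path)? {
        let path = entry?.path();
        if path.is_dir() {
            visit(&path)?;
        } else if path.extension().map_or(false, |ext| ext == "rs") {
            let text = fs::read_to_string(&path)?;
            for (i, line) in text.lines().enumerate() {
                if line.contains("TODO") {
                    println!("{}:{}: {}", path.display(), i + 1, line.trim());
                }
            }
        }
    }
    Ok(())
}

fn main() -> io::Result<()> {
    visit(Path::new("src"))
}
```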
+
Finally, we add frobnicator/.cargo/config with the following contents:
+
+
+
Voilà! Now, running cargo gen or cargo todo will execute the tasks!
This is a small post about a specific pattern for cancellation in the Rust
+programming language. The pattern is simple and elegant, but it’s rather
+difficult to come up with it by yourself.
To be able to stop a worker, we need to have one in the first place! So, let’s implement a model program.

The task is to read input line by line, sending these lines to another thread for processing (echoing the line back, with ❤️).
My solution looks like this:
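Roughly along these lines (a sketch, not necessarily identical to the original solution):

```rust
use std::io::BufRead;
use std::sync::mpsc;
use std::thread;

enum Msg {
    Line(String),
}

fn main() {
    let (sender, receiver) = mpsc::channel();

    // The worker echoes every line it receives.
    let worker = thread::spawn(move || {
        while let Ok(Msg::Line(line)) = receiver.recv() {
            println!("{} ❤️", line);
        }
    });

    let stdin = std::io::stdin();
    for line in stdin.lock().lines() {
        let line = line.unwrap();
        sender.send(Msg::Line(line)).unwrap();
    }

    drop(sender);
    worker.join().unwrap();
}
```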
Now that we have a worker, let’s add a new requirement.
+
When the user types stop, the worker (but not the program itself) should be halted.
+
How can we do this? The most obvious way is to add a new variant, Stop, to the Msg
+enum, and break out of the worker’s loop:
+
+
+
This works, but only partially:
+
+
+
We can add more code to fix the panic, but let’s stop for a moment and try
+to invent a more elegant way to stop the worker. The answer will be below this
+beautiful Ukiyo-e print :-)
The answer is: the cleanest way to cancel something in Rust is to drop it.
+For our task, we can stop the worker by dropping the Sender:
+
+
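A sketch of what that might look like (continuing the hypothetical program above):

```rust
use std::io::BufRead;
use std::sync::mpsc;
use std::thread;

enum Msg {
    Line(String),
}

fn main() {
    let (sender, receiver) = mpsc::channel();

    let worker = thread::spawn(move || {
        while let Ok(Msg::Line(line)) = receiver.recv() {
            println!("{} ❤️", line);
        }
    });

    // Wrapping the Sender in an Option lets us drop it on demand.
    let mut sender = Some(sender);

    let stdin = std::io::stdin();
    for line in stdin.lock().lines() {
        let line = line.unwrap();
        if line.trim() == "stop" {
            // Dropping the Sender makes recv() in the worker return Err,
            // which ends its loop.
            sender.take();
            continue;
        }
        match &sender {
            Some(sender) => sender.send(Msg::Line(line)).unwrap(),
            None => println!("the worker is stopped"),
        }
    }

    drop(sender);
    worker.join().unwrap();
}
```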
+
Note the interesting parts of the solution:
+
+
+no need to invent an additional message type,
+
+
+the Sender is stored inside an Option, so that we can
+drop it with the .take method,
+
+
+the Option forces us to check if the worker is alive
+before sending a message.
+
+
+
More generally, previously the worker had two paths for termination: a normal termination via the Stop message and an abnormal termination after a panic in recv (which might happen if the parent thread panics and drops the Sender). Now there is a single code path for both cases. That means we can be more confident that if something somewhere dies with a panic, the shutdown will still proceed in an orderly fashion; it is not a special case anymore.
+
The only thing left to make this ultimately neat is to replace a hand-written while let
+with a for loop:
Recently I’ve been sending a lot of pull requests to various GitHub-hosted
+projects. It had been a lot of trial and error before I settled on the git
+workflow which doesn’t involve “Nah, I’ll just rm -rf this folder and do a
+fresh git clone” somewhere. This post documents the workflow. In a nutshell,
+it is
+
+
+do not use the master branch for pull requests
+
+
+use the master branch to track upstream repository
+
+
+automate
+
+
+
Note that the hub utility exists to handle these issues automatically. I personally haven’t used it, for no real reason; you definitely should check it out!
The natural thing to do, when sending a pull request, is to fork the upstream
+repository, git clone your fork locally, make a fix, git commit -am and
+git push it to the master branch of your fork and then send a PR.
+
It even seems to work at first, but breaks down in these two cases:
+
+
+
You want to send a second PR, and now you don’t have a clean branch
+to base your work off.
+
+
+
The upstream was updated, your PR does not merge cleanly anymore,
+you need to do a rebase, but you don’t have a clean branch to rebase
+onto.
+
+
+
Tip 1: always start with creating a feature branch for PR:
+
+
+
However it is easy to forget this step, so it is important to be able
+to move to a separate branch after you erroneously committed code to
+master. It is also crucial to reset master to clean state, otherwise
+you’ll face some bewildering merge conflicts, when you try to update
+your fork several days later.
+
Tip 2: don’t forget to reset master after a mix-up:
+
+
+
Update: I’ve learned that magit has a dedicated utility for this “create a branch and reset master to a clean state” workflow — git spinoff. My implementation is here.
If you work regularly on a particular project, you’d want to keep your
+fork in sync with upstream repository. One way to do that would be to
+add upstream repository as a git remote, and set the local master
+branch to track the master from upstream:
+
Tip 3: tracking remote repository
+
+
+
With this setup, you can easily update your pull requests if they don’t merge cleanly because of upstream changes:
+
Tip 4: updating a PR
+
+
+
Update: worth automating as well, here’s my git
+refresh
There are several steps to get the repo setup just right, and doing it
+manually every time would lead to errors and mysterious merge
+conflicts. It might be useful to define a shell function to do this
+for you! It could look like this
Bonus 1: another useful function to have is for reviewing PRs:
+
+
+
Bonus 2:
There are a lot of learning materials about Git out there. However, a lot of these materials are either comprehensive references, or just present a handful of the most useful git commands. I once accidentally stumbled upon Git from the bottom up and I highly recommend reading it: it is a moderately long article which explains the inner mechanics of Git.
Suppose you have some struct which holds some references inside. Now,
+you want to store a reference to this structure inside some larger
+struct. It could look like this:
+
+
+
The code, as written, does not compile:
+
+
+
To fix it, we need to thread Foo’s lifetime through Context as an additional parameter:
+
+
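Something like this (Foo and Context are simplified stand-ins for the real types):

```rust
struct Foo<'a> {
    buf: &'a [u8],
}

// Foo's lifetime now leaks into Context's interface,
// together with the 'a: 'f bound.
struct Context<'a, 'f>
where
    'a: 'f,
{
    foo: &'f Foo<'a>,
}
```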
+
And this is the problem which is the subject of this post. Although Foo is supposed to be an implementation detail, its lifetime, 'a, bleeds into Context’s interface, so most clients of Context would need to name this lifetime together with the 'a: 'f bound. Note that this effect is transitive: in general, a Rust struct has to name the lifetimes of its contained types, and their contained types, and their contained types, … But let’s concentrate on this two-level example!
+
The question is, can we somehow hide this 'a from users of Context? It’s
+interesting that I’ve first distilled this problem about half a year ago in this
+urlo
+post,
+and today, while refactoring some of Cargo internals in
+#5476 with
+@dwijnand, I’ve stumbled upon something, which
+could be called a solution, if you squint hard enough.
Surprisingly, it works! I’ll show a case where this approach breaks down
+in a moment, but let’s first understand why this works. The magic
+happens in the new method, which could be written more explicitly as
+
+
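In other words, something like the following sketch, where Context itself mentions only 'f:

```rust
struct Foo<'a> {
    buf: &'a [u8],
}

// No 'a in sight: Context talks only about 'f.
struct Context<'f> {
    foo: &'f Foo<'f>,
}

impl<'f> Context<'f> {
    fn new<'a: 'f>(foo: &'f Foo<'a>) -> Context<'f> {
        // Foo is covariant over 'a, so &'f Foo<'a> coerces to &'f Foo<'f>.
        let foo: &'f Foo<'f> = foo;
        Context { foo }
    }
}
```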
+
Here, we assign a &'f Foo<'a> to a variable of a different type &'f
+Foo<'f>. Why is this allowed? We use 'a lifetime in Foo only for
+a shared reference. That means that Foo is
+covariant over
+'a. And that means that the compiler can use Foo<'a> instead of
+Foo<'f> if 'a: 'f. In other words rustc is allowed to shorten the
+lifetime.
+
It’s interesting to note that the original new function didn’t say
+that 'a: 'f, although we had to add this bound to the impl block
+explicitly. For functions, the compiler infers such bounds from
+parameters.
+
Hopefully, I’ve mixed polarity an even number of times in this
+variance discussion :-)
What we want to say is that, inside the Context, there is some
+lifetime 'a which the consumers of Context need not care about,
+because it outlives 'f anyway. I think that the syntax for that
+would be something like
+
+
+
Alas, for is supported only for traits and function pointers, and
+there it has the opposite polarity of for all instead of exists,
+so using it for a struct gives
We’ve added a Push trait, which has the same interface as the Foo
+struct, but is not parametrized over the lifetime. This is
+possible because Foo’s interface doesn’t actually depend on the 'a
+lifetime. And this allows us to magically write foo: &'f mut (Push + 'f).
+This + 'f is what hides 'a as “some unknown lifetime, which outlives 'f”.
There are many problems with the previous solution: it is ugly, complicated and introduces dynamic dispatch. I don’t know how to solve those problems, so let’s talk about something I know how to deal with :-)

The Push trait duplicated the interface of the Foo struct. It wasn’t that bad, because Foo had only one method. But what if Bar has a dozen methods? Could we write a more general trait, which gives us access to Foo directly? Looks like it is possible, at least to some extent:
How does this work? Generally, we want to say that “there exists some lifetime 'a, which we know nothing about except that 'a: 'f”. Rust supports similar constructions only for functions, where for<'a> fn foo(&'a i32) means that a function works for all lifetimes 'a. The trick is to turn one into the other! The desugared type of the callback f is &mut for<'x> FnMut(&'f mut Foo<'x>). That is, it is a function which accepts Foo with any lifetime. Given that callback, we are able to feed our Foo with a particular lifetime to it.
While the code examples in the post juggled Foos and Bars, the
+core problem is real and greatly affects the design of Rust code. When
+you add a lifetime to a struct, you “poison” it, and all structs which
+contain it as a member need to declare this lifetime as well. I would
+love to know a proper solution for this problem: the described trait
+object workaround is closer to code golf than to the practical
+approach.
In this post, I’ll talk about a pattern for extracting values from a
+weakly typed map. This pattern applies to all statically typed
+languages, and even to dynamically typed ones, but the post is rather
+Rust-specific.
You have an untyped Map<String, Object> and you need to get a typed
+Foo out of it by the "foo" key. The untyped map is often some kind
+of configuration, like a JSON file, but it can be a real map with
+type-erased Any objects as well.
+
In the common case of statically known configuration, the awesome
+solution that Rust offers is serde. You stick derive(Deserialize)
+in front of the Config struct and read it from JSON, YML, TOML or
+even just environment variables!
+
+
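A sketch of that approach (the Config fields here are invented for the example):

```rust
use serde::Deserialize;

#[derive(Deserialize)]
struct Config {
    name: String,
    threads: u32,
    features: Vec<String>,
}

fn parse_config(json: &str) -> Result<Config, serde_json::Error> {
    serde_json::from_str(json)
}
```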
+
However, occasionally you can’t use serde. Some of the cases where
+this might happen are:
+
+
+
merging configuration from several sources, which requires writing a
+non-trivial serde deserializer,
+
+
+
lazy deserialization, when you don’t want to care about invalid values
+until you actually use them,
+
+
+
extensible plugin architecture, where various independent modules
+contribute options to a shared global config, and so the shape of
+the config is not known upfront.
+
+
+
you are working with Any objects or otherwise don’t do
+serialization per se.
The simplest approach here is to just grab an untyped object using a
+string literal and specify its type on the call site:
+
+
+
I actually think that this is a fine approach as long as such snippets
+are confined within a single module.
+
One possible way to make it better is to extract "foo" constant to a
+variable:
+
+
+
This does bring certain benefits:
+
+
+
fewer places to make a typo in,
+
+
+
behavior is moved from the code (.get("foo")) into data (const FOO), which makes it easier to reason about the code (at a glance, you can see all available config options and get an idea why they might be useful),
+
+
+
there’s now an obvious place to document keys: write a doc-comment for a
+constant.
+
+
+
While great in theory, I personally feel that this brings little tangible benefit in most cases, especially if some constants are used only once. This is the case where the implementation, a literal "foo", is more clear than the abstraction, a constant FOO.
However, the last pattern can become much more powerful and
+interesting if we associate types with string constants. The idea is
+to encode that the "foo" key can be used to extract an object of
+type Foo, and make it impossible to use it for, say,
+Vec<String>. To do this, we’ll need a pinch of
+PhantomData:
+
+
+
Now, we can add type knowledge to the "foo" literal:
+
+
+
And we can take advantage of this in the get method:
+
+
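Putting these pieces together, a sketch of how it might look (the Config type and error handling are simplified, and this is not the exact API of typed_key):

```rust
use std::any::Any;
use std::collections::HashMap;
use std::marker::PhantomData;

// A key is a name plus a phantom type parameter.
struct Key<T> {
    name: &'static str,
    marker: PhantomData<T>,
}

struct Foo {
    value: i32,
}

// The "foo" literal now knows that it refers to a Foo.
const FOO: Key<Foo> = Key { name: "foo", marker: PhantomData };

struct Config {
    map: HashMap<String, Box<dyn Any>>,
}

impl Config {
    // No turbofish at the call site: the key carries the type.
    // Usage: let foo: Option<&Foo> = config.get(FOO);
    fn get<T: Any>(&self, key: Key<T>) -> Option<&T> {
        self.map.get(key.name)?.downcast_ref::<T>()
    }
}
```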
+
Note how we were able to get rid of the turbofish at the call site! Moreover, the understandability aspect of the previous pattern is also enhanced: if you know both the type and the name of a config option, you can pretty reliably predict how it is going to be used.
I first encountered this pattern in IntelliJ code. It uses UserDataHolder, which is basically a Map<String, Object>, everywhere. It helps plugin authors to extend built-in objects in crazy ways, but is rather hard to reason about, and type-safety improves the situation a lot. I’ve also changed Exonum’s config to employ this pattern in this PR. It was also a case of plugin extensibility, where an upfront definition of all configuration options is impossible.
+
Finally, I’ve written a small crate for this typed_key :)
Similarly to the previous post, we will once again add types to the Rust
+code which works perfectly fine without them. This time, we’ll try to improve
+the pervasive pattern of using indexes to manage cyclic data structures.
Often one wants to work with a data structure which contains a cycle of some form: object foo references bar, which references baz, which references foo again. The textbook example here is a graph of vertices and edges. In practice, however, true graphs are a rare encounter. Instead, you are more likely to see a tree with parent pointers, which contains a lot of trivial cycles. And sometimes cyclic graphs are implicit: an Employee can be the head of a Department, and a Department has a Vec<Employee> of personnel. This is sort of a graph in disguise: in usual graphs, all vertices are of the same type, whereas here Employee and Department are different types.
+
Working with such data structures is hard in any language. To arrive
+at a situation when A points to B which points back to A, some
+form of mutability is required. Indeed, either A or B must be
+created first, and so it can not point to the other immediately after
+construction. You can paper over this mutability with let rec, as in
+OCaml, or with laziness, as in Haskell, but it is still there.
+
Rust tends to surface subtle problems in the form of compile-time
+errors, so implementing such graphs in Rust is challenging. The three
+usual approaches are:
+arena and real cyclic references, explanation by
+simonsapin (this one is really neat!),
+
+
+arena and integer indices, explanation by nikomatsakis.
+
+
+
(apparently, rewriting a Haskell monad tutorial in Rust results in a
+graphs blog post).
+
I personally like the indexing approach the most. However, it presents an interesting readability challenge. With references, you have a foo of type &Foo, and it is immediately clear what that foo is, and what you can do with it. With indexes, however, you have a foo: usize, and it is not obvious that you can somehow get a Foo out of it. Even worse, if indexes are used for two types of objects, like Foo and Bar, you may end up with a thing: usize. While writing code with usize actually works pretty well (I don’t think I’ve ever used the wrong index type), reading it later is more complicated, because usize is much less suggestive of what you could do.
One way to ameliorate this problem is to introduce a newtype wrapper
+around usize:
+
+
+
Here, “one should use FooIdx to index into Vec<Foo>” is still just
+a convention. A cool thing about Rust is that we can turn this
+convention into a property verified during type checking. By adding an
+appropriate impl, we should be able to index into Vec<Foo> with
+FooIdx directly:
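A sketch of both the newtype and the impl (Foo is a placeholder type):

```rust
use std::ops::Index;

struct Foo {
    name: String,
}

// The newtype says: this usize indexes into a Vec<Foo>.
#[derive(Clone, Copy)]
struct FooIdx(usize);

impl Index<FooIdx> for Vec<Foo> {
    type Output = Foo;

    fn index(&self, idx: FooIdx) -> &Foo {
        &self[idx.0]
    }
}
```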
It’s insightful to study why this impl is allowed. In Rust, types, traits and impls are separate. This creates room for a problem: what if there are two impl blocks for a given (trait, type) pair? The obvious choice is to forbid having two impls in the first place, and this is what Rust does.
+
Actually enforcing this restriction is tricky! The simplest rule of “error if the set of crates currently compiled contains duplicate impls” has severe drawbacks. First of all, this is a global check, which requires the knowledge of all compiled crates. This postpones the check until the later stages of compilation. It also plays awfully with dependencies, because two completely unrelated crates might fail the compilation if present simultaneously. What’s more, it doesn’t actually solve the problem, because the compiler does not necessarily know the set of all crates beforehand. For example, you may load additional code at runtime via dynamic libraries, and silent bad things might happen if your program and a dynamic library have duplicate impls.
+
To be able to combine crates freely, we want a much stronger property:
+not only the set of crates currently compiled, but all existing and
+even future crates must not violate the one impl restriction. How on
+earth is it possible to check this? Should cargo publish look for
+conflicting impls across all of the crates.io?
+
Luckily, and this is stunningly beautiful, it is possible to loosen
+this world-global property to a local one. In the simplest form, we
+can place a restriction that impl Foo for Bar can appear either in
+the crate that defines Foo, or in the one that defines
+Bar. Crucially, whichever one defines the impl has to use the other,
+which makes it possible to detect the conflict.
+
This is all really nifty, but we’ve just defined an Index impl for Vec, and both Index and Vec are from the standard library! How is it possible? The trick is that Index has a type parameter: trait Index<Idx: ?Sized>. It is a template for a trait of sorts, and we get a “real” trait when we substitute the type parameter with a type. Because FooIdx is a local type, the resulting Index<FooIdx> trait is also considered local. The precise rules here are quite tricky; this RFC explains them pretty well.
Because Index<FooIdx> and Index<BarIdx> are different traits, one
+type can implement both of them. This is convenient for containers
+which hold distinct types:
+
+
+
It’s also helpful to define arithmetic operations and conversions for
+the newtyped indexes. I’ve put together a
+typed_index_derive crate to automate this boilerplate via a
+proc macro, the end result looks like this:
Hi! During the last couple of years, I’ve spent a lot of time writing
+parsers and parser generators, and I want to write down my thoughts
+about this topic. Specifically, I want to describe some properties of
+a parser generator that I would enjoy using. Note that this is not an
+“introduction to parsing” blog post, some prior knowledge is assumed.
+
Why do I care about this at all? The broad reason is that today a lot
+of tools and even most editors use regular expressions to
+approximately parse programming languages, and I find this outright
+b҉a͡rb̢ari͞c͘. I understand
+that in practice parsing is not as easy as it is in theory:
+
+
+
However, I do believe we could do better if we use better tools!
+
The specific reason is that I care way too much about the Rust
+programming language and
+
+
+
I think today it is the best language for writing compiler-like
+stuff (yes, better than OCaml!),
+
+
+
I’d love to see an awesome parser generator written in and
+targeting Rust,
I’ve used various parser generators, implemented one,
+fall, and still haven’t met a parser generator
+that I love.
+
The post is split into three major chapters:
+
+
+
UX — how to make using a parser generator easy, enjoyable and fun?



API — what API the generated parser should have.



Parsing Techniques — how exactly do we get from text to the parsed tree?
+
+
+
I’ll be using a rather direct and assertive language in the following,
+but the fact is I am totally not sure about anything written here, and
+would love to know more about alternatives!
Although this text is written in Emacs, I strongly believe that a
+semantic-based, reliable, and fast support from tooling is a great
+boon to learnability and productivity. A great IDE support is a must
+for a modern parser generator, and this chapter talks mostly about
+IDE-related features.
+
The most important productivity boost of a parser generator is the ability to fiddle with the grammar interactively. The UI for this might look like a three-pane view, where the grammar is in the first pane, example code to parse is in the second pane and the resulting parse tree is in the third one. Editing the first two panes should reactively update the last one. This is difficult to implement with most yacc-like parser generators; I’ll talk more about it in the next section.
+
The second most important feature is inline tests: for complex grammars it could be really hard to map from a particular rule specification to the actual code that is parsed by the rule. Having a test written alongside the rule is invaluable! The test should be just a snippet of code in the target language. The “gold” value of the parse tree for the snippet should be saved in a file alongside the grammar and should be updated automatically when the grammar changes. Having inline tests allows fitting the “three-pane UI” from the previous paragraph into two panes, because you can just use the test as your second pane.
Note that even if you write your parser by hand, you should still use such “inline tests”. To do so, write them as comments with special markers, and write a small script which extracts such comments and turns them into tests proper. Here’s an example from one experimental hand-written parser of mine. Having such examples of “what does this if parse?” greatly simplifies reading the parser’s code!
+
Here’s the list of important misc IDE features, from super important to very
+important. They are not specific to parser generators, so, if you are using a
+parser generator to implement IDE support for your language, look into these
+first!
+
+
+
Extend selection to the enclosing syntactic structure (and not just
+to a braced block). A super simple feature, but this combined with
+multiple cursors is arguably more powerful than vim’s text objects,
+and most definitely easier to use.
+
+
+
Fuzzy search of symbols in the current file/in the project: super
+handy for navigation, both more important and easier to implement
+than goto definition.
+
+
+
Precise syntax highlighting. Highlighting is not a super-important
+feature and actually works ok even with regex approximations, but
+if you already have the syntax tree, then why not use it?
+
+
+
Go to definition/find references.
+
+
+
Errors and warnings inline, with fixes if available.
+
+
+
Extract rule refactoring, pairs well with extend selection.
+
+
+
Code formatting.
+
+
+
Smart typing: indenting code on Enter, adding/removing trailing
+commas when joining/splitting lines, and in general auto magically
+fixing punctuation.
+
+
+
Code completion: although for parser generators dumb word-based
+completion tends to work OK.
I want to emphasize that most of these features are ridiculously easy to implement if you have a parse tree for your language. Take, for example, “fuzzy search of symbols in the project”. This is a super awesome feature for navigation. Basically, it is CTAGS done right: first, you parse each file (in parallel) and build a list of symbols for it. Then, as the user types, you incrementally update the changed files. Using fall, I’ve implemented this feature for Rust, and it took me three small files:
+
+
+
find_symbols.rs
+to extract symbols from a single file, 21(!) lines.
+
+
+
indxr.rs,
+a generic infra to watch files for changes and recompute the index incrementally, 155 lines.
+
+
+
symbol_index.rs
+glues the previous two together, and adds
+fst by ever-awesome BurntSushi
+on top for fuzzy search, 122 lines.
+
+
+
This is actually practical: initial indexing of rust-lang/rust repo
+takes about 30 seconds using a single core and fall’s ridiculously
+slow parser, and after that everything just works:
A small note on how to pack all this IDE functionality: make a library. That
+way, anyone could use it anywhere. For example, as a web-assembly module in the
+online version. On top of the library you could implement whatever protocol you
+like, Microsoft’s LSP, or some custom one. If you go the protocol-first way,
+using your code outside of certain editors could be harder.
Traditionally, parser generators work by allowing the user to specify
+custom code for each rule, which is then copy-pasted into the
+generated parser. This is typically used to construct an abstract
+syntax tree, but could be used, for example, to evaluate arithmetic
+expressions during parsing.
+
I don’t think this is the right API for a parser generator, though, for three reasons.

It feels like a layering violation, because it allows intermixing parsing with basically everything else. You can literally do code generation during parsing. It makes things like the lexer hack possible.
+
It would be very hard to implement reactive rendering of the parse
+tree if the result of parsing is some user-defined type.
+
Most importantly, I don’t think that producing an abstract syntax tree as the result of parsing is the right choice. The problem with an AST is that it, by definition, loses information. The most commonly lost things are whitespace and comments. While they are not important for a command-line batch compiler, they are crucial for IDEs, which work very close to the original source code. Another important IDE-specific aspect is support for incomplete code. If a function is missing a body and a closing parenthesis on the parameter list, it should still be recognized as a function. It’s difficult to support such missing pieces in a traditional AST.
+
I am pretty confident that a better API for the generated parser is to
+produce a parse tree which losslessly represents both the input text
+and associated tree structure. Losslessness is a very important
+property: it guarantees that we could implement anything in principle.
+
I’ve outlined one possible design of such lossless representation in the
+libsyntax2 RFC, the simplified
+version looks like this:
+
+
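A sketch of such a homogeneous node (the design in the RFC itself is more elaborate):

```rust
// The kind of a node: a function definition, a parameter, a comment, …
#[derive(Clone, Copy, PartialEq, Eq)]
struct SyntaxKind(u16);

struct Node {
    kind: SyntaxKind,
    // Region of the source text covered by the node, as byte offsets.
    range: (usize, usize),
    children: Vec<Node>,
}
```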
+
That is, the result of parsing is a homogeneous tree, with nodes
+having two bits of information besides the children:
+
+
+
Type of a node: is it a function definition, a parameter, a
+comment?
+
+
+
Region of the source text covered by the node.
+
+
+
A cool thing about such a representation is that every language uses the same type of syntax tree. In fall, features like extend selection are implemented once and work for all languages.
+
If you need it, you can do the conversion to AST in a separate
+pass. Alternatively, it’s possible to layer AST on top of the
+homogeneous tree, using newtype wrappers like
+
+
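For example, something along these lines (reusing Node and SyntaxKind from the sketch above; FN_DEF and NAME are made-up kinds):

```rust
const FN_DEF: SyntaxKind = SyntaxKind(1);
const NAME: SyntaxKind = SyntaxKind(2);

// A typed wrapper over an untyped node.
struct FnDef(Node);

impl FnDef {
    fn cast(node: Node) -> Option<FnDef> {
        if node.kind == FN_DEF { Some(FnDef(node)) } else { None }
    }

    fn name(&self) -> Option<&Node> {
        self.0.children.iter().find(|child| child.kind == NAME)
    }
}
```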
+
The parser generator should automatically generate such AST wrappers. However, it shouldn’t directly infer them from the grammar: not every node kind needs an AST wrapper, and method names are important. It is better to let the user specify the AST structure separately, and check that the AST and the parse tree agree. As an example from fall, here is the grammar rule for Rust paths, the corresponding ast definition, and the generated code.
Another important feature for modern parser generator is support for
+incremental reparsing, which is obviously useful for IDEs.
+
One thing that greatly helps here is the split between parser and
+lexer phases.
+
It is much simpler (and more efficient) to make lexing
+incremental. When lexing, almost any change affects at most a couple
+of tokens, so in theory incremental lexing could be pretty
+efficient. Beware though that worst-case relexing still has to be
+linear, because insertion of unclosed quote changes all the following
+tokens.
+
In contrast, it is much easier to change the tree structure significantly with a small edit, which places an upper bound on the effectiveness of incremental reparsing. Besides, making parsing incremental is more complicated, because you have to deal with trees instead of a linear structure.
+
An interesting middle ground here is an incremental lexer combined
+with a fast non-incremental parser.
Traditional lex-style lexers struggle with special cases like ML-style properly nested comments or Rust raw literals, which are not even context-free. The problem is typically solved by injecting custom code into the lexer, which maintains some sort of state, like the nesting level of comments. In my experience, making this work properly is very frustrating.
+
These two tricks may make writing a lexer simpler.

Instead of supporting lexer states and injecting custom code, allow pairing a regex, which defines a token, with a function which takes a string slice and outputs a usize. If the lexer matches such an external token, it then calls the supplied function to determine the other end of the token. Here’s an example from fall: external token, custom functions.
+
Often it is better to use layered languages instead of lexer states. Parsing string literals is a great example of this. String literals usually have some notion of a well-formed escape sequence. The traditional approach to parsing string literals is to switch to a separate lexer state after ", which handles escapes. This is bad for error recovery: if there’s a typo in an escape sequence, it should still be possible to recognize the literal correctly. So an alternative approach is to parse a string literal as, basically, “anything between two quotes”, and then use a separate lexer for escapes specifically, later in the compiler pipeline.
+
Another interesting lexing problem which arises in practice is context sensitivity: things like contextual keywords or >> can represent different token types, depending on the surrounding code. To deal with this case nicely, the parser should support token remapping. While most of the tokens appear in the final parse tree as is, the parser should be able to, for example, substitute two consecutive > tokens with a single >>, so that later stages of compilation need not handle this special case.
A nice trick to make the parser more general and fast is not to construct the parse tree directly, but to emit a stream of events like “start internal node”, “eat token”, “finish internal node”. That way, parsing itself does not allocate and, for example, you can use the stream of events to patch an existing tree, doing minimal allocations. This also divorces the parser from a particular tree structure, so it is easier to plug in different tree backends.
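A sketch of such an event stream (the names mirror the description above; SyntaxKind is as in the earlier sketch):

```rust
#[derive(Clone, Copy, PartialEq, Eq)]
struct SyntaxKind(u16);

// The parser produces a flat list of these; a separate pass turns the
// list into an actual tree (or patches an existing one).
enum Event {
    StartNode { kind: SyntaxKind },
    Token { kind: SyntaxKind, len: usize },
    FinishNode,
}
```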
+
Events also help with reshuffling the tree structure. For example, during event processing we can turn left-leaning trees into right-leaning ones or flatten them into lists. Another interesting form of tree reshuffling is attachment of comments. If a comment immediately precedes some definition, it should be a part of this definition. This is not specified by the language, but it is the result that a human would expect. With events, we can hand only the significant tokens to the parser and deal with attaching comments and whitespace when reconstructing the tree from the flat list of events.
To properly implement incremental reparsing, we should start with a data structure for text which is more efficient to update than String. While we do have quite a few extremely high-quality implementations of ropes, the ecosystem is critically missing a way to talk about them generically. That is, there’s nothing like Java’s CharSequence in Rust (which would need a much more involved design in Rust to avoid unnecessary overhead).
+
Luckily, the parse tree needs to remember only the offsets, so we can
+avoid hard-coding a particular text representation, and we don’t even
+need a generic parameter for that.
+
Homogeneous trees make reactive testing of the grammar possible in
+theory because you can always produce a text representation of a tree
+from them. But in practice reactivity requires that “read grammar,
+compile parser, run it on input” loop is fast. Literally generating
+source code of the parser and then compiling it would be too slow, so
+some kind of interpreted mode is required. However, this conflicts
+with the need to be able to extend lexer with custom code. I don’t
+know of a great solution here, but something like this would work:
+
+
+
require that all lexer extensions are specified in the verbatim
+block of the grammar file and don’t have external dependencies,
+
+
+
for IDE support, compile the lexer, and only the lexer, in a temp
+dir and communicate with it via IPC.
+
+
+
A possible alternative is to use a different, approximate lexer for
+interactive testing of the grammar. In my experience this makes such
+testing almost useless because you get different results in
+interesting cases and interesting cases are what is important for this
+feature.
+
In IDEs, a surprisingly complicated problem is managing a list of open
+and modified files, synchronizing them with the file system, providing
+consistent file-system snapshots and making sure that things like
+in-memory buffers are also possible. For parser generators, all this
+complexity might be dodged by requiring that all of the grammar needs
+to be specified in a single file.
So we want to write a parser generator that produces lossless parse
+trees and which has an awesome IDE support. How do we actually parse
+a text into a tree? Unfortunately, while there are many ways to parse
+text, there’s no accepted best one. I’ll try to do a broad survey of
+various options.
+
I’d love to discuss the challenges of the textbook approach of just
+using a context-free grammar/BNF notation. However, let’s start with a
+simpler, “solved” case: regular expressions.
+
Languages which could be described by regular expressions are called
+regular. They are exactly the same languages which could be recognized
+by finite state machines. These two definition mechanisms have nice
+properties which explain the usefulness of regular languages in real
+life:
+
+
+
Regular expressions map closely to our thinking and are easy for humans to understand. Note that there are meta-languages for describing regular languages which are equivalent in power but much less “natural”: raw finite state machines or regular grammars.
+
+
+
Finite state machines are easy for computers to execute. FSM is
+just a program which is guaranteed to use constant amount of
+memory.
+
+
+
Regular languages are rather inexpressive, but they work great for
+lexers. On the opposite side of expressivity spectrum are Turing
+machines. For them, we also have a number of meta-languages (like
+Rust), which work great for humans. It’s interesting that a Turing
+machine is equivalent to a finite state machine with a pair of stacks:
+to get two stacks from a tape, cut the tape in half where the head
+is. Moving the head then corresponds to popping from one stack and
+pushing to another.
+
And the context-free languages, which are described by CFGs, are
+exactly in between languages recognized by finite state machines and
+languages recognized by Turing machines. You need a push-down
+automaton, or a state machine with one stack, to recognize a
+context-free language.
+
CFGs are powerful enough to describe arbitrary nesting structures and
+seem to be a good fit for describing programming languages. However,
+there are a couple of problems with CFGs. Let’s write a grammar for
+arithmetic expressions with additions, multiplications, parenthesis
+and numbers. The obvious answer,
+
+
+
has a problem. It is underspecified and does not tell whether 1 + 2 * 3 is (1 + 2) * 3 or 1 + (2 * 3). We need to tweak the grammar to get rid of this ambiguity:
+
+
+
I think the necessity of such transformations is a problem! Humans don’t think
+like this: it took me three or four courses in formal grammars to really
+internalize this transformation. And if we look at language references, we’ll
+typically see a
+precedence
+table instead of BNF.
+
Another problem here is that we can’t even work around ambiguity by simply forbidding it: checking whether a CFG is unambiguous is undecidable.
+
So CFGs turn out to be much less practical and simple than regular
+expressions. What options do we have then?
The first choice is to parse something, not necessarily a context-free language. A good way to do it is to write a parser by hand. A hand-written parser is usually called a recursive descent parser, but in reality it includes two crucial techniques in addition to just recursive descent. Pure recursive descent works by translating grammar rules like T -> A B into a set of recursive functions:
+
+
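Schematically (Parser, A and B are placeholders):

```rust
struct Parser {
    // token stream, current position, etc.
}

// T -> A B
fn parse_t(p: &mut Parser) {
    parse_a(p);
    parse_b(p);
}

fn parse_a(_p: &mut Parser) { /* ... */ }
fn parse_b(_p: &mut Parser) { /* ... */ }
```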
+
The theoretical problem here is that it can’t deal with left recursion. That is, rules like Statements -> Statements ';' OneStatement make a recursive descent parser loop infinitely. In theory, this problem is solved by rewriting the grammar and eliminating the left recursion. If you had a formal grammars class, you have probably done this! In practice, this is a completely non-existent problem, because we have loops:
+
+
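That is, instead of rewriting the grammar, we just write a loop (continuing the sketch above; eat_token and SEMICOLON are hypothetical helpers):

```rust
// Statements -> Statements ';' OneStatement, without left recursion.
fn parse_statements(p: &mut Parser) {
    parse_one_statement(p);
    while p.eat_token(SEMICOLON) {
        parse_one_statement(p);
    }
}
```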
+
The next problem with recursive descent is that parsing expressions with
+precedence requires that weird grammar rewriting. Luckily, there’s a simpler
+technique to deal with expressions. Suppose you want to parse 1 + 2 * 3. One
+way to do that would be to parse it with a loop as a list of atoms separated
+by operators and then reconstruct a tree separately. If you fuse these two
+stages together, you get a loop, which could recursively call itself and nest,
+a
+Pratt parser. Understanding it for the first time is hard, but you only need to
+do it once :)
+
The most important feature of hand-written parsers is a great support
+for error recovery and partial parses. It boils down to two simple
+tricks.
+
If you are parsing a homogeneous sequence of things (i.e, you are inside the
+loop), and the current token does not look like it can begin a new element, you
+just skip over it and start the next iteration of the loop. Here’s an
+example
+from Kotlin. At
+this
+line, we’ll get null if current token could not begin a class member
+declaration.
+Here
+we just skip over it.
+
If you are parsing a particular thing T, and you expect token foo,
+but see bar, then, roughly:
+
+
+if bar is not in the FOLLOW(T), you skip over it and emit error,
+
+
+if bar is in FOLLOW(T), you emit error, but don’t skip the
+token.
+
+
+
That way, parsing something like
+
+
+
would correctly recognize the incomplete function foo (again, it’s easier to represent such an incomplete function with homogeneous parse trees than with an AST), and the complete struct S. Here’s another example from Kotlin.
+
Although hand-written parsers are good at producing high-quality error messages as well, I don’t think that this is important. In the IDE context, for syntax errors it is much more important and beneficial to get a red squiggly under the error immediately after you’ve typed invalid code. Instantaneous feedback and a precise location are, in my personal experience, enough to fix syntax errors. The error message can be just “Syntax error”; more elaborate messages often make things worse, because mapping from an error message to what is actually wrong is harder than just typing and deleting stuff and checking if it works.
+
It is possible to simplify authoring of this style of parsers by
+generating all recursive functions, loop and Pratt parsers from
+declarative BNF/PEG style description. This is what Grammar Kit and
+fall do.
Another choice is to stay within the CFG class but avoid dealing with ambiguity by producing all possible parse trees for a given input. This is typically achieved with non-determinism and memoization, using GLR- and GLL-style techniques.
+
Here I’d like to call out
+tree-sitter project, which actually
+ticks quite a few boxes outlined in this blog post. In particular, it uses
+homogeneous trees, is fully incremental and has surprisingly good support for
+error recovery (though not quite as good as hand-written style parsers, at least
+when I’ve last checked it).
Yet another choice is to give up full generality and restrict the
+parser generator to a subset of unambiguous grammars, for which we
+actually could verify the absence of ambiguity. This is how traditional
+parser generators like yacc, happy, menhir or LALRPOP work.
+
The very important advantage of these parsers is that you get a strong
+guarantee that the grammar works and does not have nasty
+surprises. The price you have to pay, though, is that sometimes it is
+necessary to tweak an already unambiguous grammar to make the stupid
+tool understand that there’s no ambiguity.
+
I also haven’t seen deterministic LR parsers with great support for error recovery, but it looks like it should be possible in theory? Recursive descent parsers, which are more or less LL(1), recover from errors splendidly, and an LR(1) parser has strictly more information than an LL(1) one.
+
So, what is the best choice for writing a parser/parser generator?
+
It seems to me that the two extremes are the most promising: hand
+written parser gives you utmost control over everything, which is
+important when you need to parse some language, not designed by you,
+which is hostile to the usual parsing techniques. On the other hand,
+classical LR-style parsers give you a proof that the grammar is
+unambiguous, which is very useful if you are creating your own
+language. Ultimately, I think that being able to produce lossless
+parse trees supporting partial parses is more important than any
+particular parsing technique, so perhaps supporting both approaches
+with a single API is the right choice?
This is a post about an interesting testing technique which feels like it should
+be well known. However, I haven’t seen it mentioned anywhere. I don’t even have
+a good name for it, I’ve semi-discovered it in the wild. If you know how this
+thing is called, please leave a comment!
I was reading the Dart analysis server source code, and came across this line. Immediately I was struck as if by lightning. Well, not exactly in the same way, but you get the idea.
+
What does this line do? I actually don’t know, but I have a guess. My
+explanation is further down (to give you a chance to discover the
+trick as well!), but the general idea is that this line helps
+tremendously with making tests more maintainable.
Two tasks which programmers typically enjoy less than furiously
+cranking out new features are maintaining existing code and writing
+tests. And, as an old Russian joke says, maintaining tests is the
+worst. Here are some pain points specific to the post:
+
Negative tests. You want to check that something does not
+happen. Writing a test in this situation is tricky because the test
+might actually pass for a trivial reason instead of the intended
+one. The rule of thumb is to verify that the test actually fails if
+the specific condition which it covers is commented out. The problem
+with this rule of thumb is that it works in a single point in time. As
+the code evolves, the test might begin to pass for a trivial reason.
+
Duplicated tests. Test suites are usually append-only and grow
+indefinitely. Almost inevitably this leads to a situation where
+different tests are testing essentially the same features, or where
+one test is a superset of another.
+
Bifurcated suites. Somewhat similar to the previous point, you may
+end up in a situation where a single component has two separate
+test-suites in different parts of the code base. I’d want to say that
+this happens when two developers write tests independently, but
+practice says that me and me one month later are enough to create such
+a mess :)
+
Test discoverability. This is a problem a new contributor usually faces. Finding the piece of code where a bug fix should be applied is usually easier than locating the corresponding tests.
+
The underlying issue is that it is non-trivial to answer these two
+questions:
+
+
+
Given a line of code, where is the test for this specific line?
+
+
+
Given a test, where is the code that is being tested?
The beautiful solution to this problem (which I hypothesise the
_coverageMarker() line in Dart does) is to track code coverage on a
+test-by-test basis. That is, when running a test, verify that
+specific lines of code were covered by this test.
+
I’ve put together a small Rust library to do this, called
+uncover. It provides two macros:
+covered_by and covers.
+
The first macro is used in the code under test, like
+this:
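Here is a sketch of how the two macros might be used (the function and label names are invented for illustration, and I assume the macros are already in scope as the crate's docs describe):

```rust
// In the code under test: record that this specific branch was executed.
fn divide(dividend: u32, divisor: u32) -> Option<u32> {
    if divisor == 0 {
        covered_by!("divide_by_zero");
        return None;
    }
    Some(dividend / divisor)
}

// In the test: the guard created by `covers!` verifies, at the end of this
// block, that the `covered_by!` line above was actually executed.
#[test]
fn test_divide_by_zero() {
    covers!("divide_by_zero");
    assert_eq!(divide(92, 0), None);
}
```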
If the block where covers is used does not cause the execution of
the corresponding covered_by line, then an error is raised at
+the end of the block.
+
Under the hood, this is implemented as a global HashMap<String, u64> which
+counts how many times each line was executed. So covered_by!
+increments
+the corresponding count, and covers! returns a guard object that
+checks
+in Drop that the count was incremented. It is possible to disable these checks
+at compile time. And yes, the library actually
+exposes
+a macro which defines macros :)
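A minimal sketch of that mechanism in plain std Rust (an illustration of the idea, not the crate's actual code):

```rust
use std::collections::HashMap;
use std::sync::{Mutex, OnceLock};

// Global map from label to the number of times it was hit.
fn counters() -> &'static Mutex<HashMap<String, u64>> {
    static MAP: OnceLock<Mutex<HashMap<String, u64>>> = OnceLock::new();
    MAP.get_or_init(|| Mutex::new(HashMap::new()))
}

// Roughly what `covered_by!("label")` could expand to: bump the counter.
fn hit(label: &str) {
    *counters().lock().unwrap().entry(label.to_string()).or_insert(0) += 1;
}

// Roughly what `covers!("label")` could expand to: remember the current count
// and check in Drop that it has grown by the end of the enclosing block.
struct CoversGuard { label: String, before: u64 }

fn expect_coverage(label: &str) -> CoversGuard {
    let before = counters().lock().unwrap().get(label).copied().unwrap_or(0);
    CoversGuard { label: label.to_string(), before }
}

impl Drop for CoversGuard {
    fn drop(&mut self) {
        let after = counters().lock().unwrap().get(&self.label).copied().unwrap_or(0);
        if after == self.before && !std::thread::panicking() {
            panic!("expected the `covered_by!({:?})` line to be executed", self.label);
        }
    }
}
```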
+
I haven’t had a chance to apply this technique in large projects (and
+it is less useful for smaller ones), but it looks very promising.
+
It’s now easy to navigate between code and tests: just ripgrep the
+string literal (or write a plugin for this for your IDE). You will be
+able to find the test for the specific if-branch! This should be
+especially handy for new contributors.
+
If this technique is used pervasively, you also get an idea about the
+overall test coverage.
+
During refactorings, you become aware of tests which might be affected. Moreover, because coverage is actually checked by the tests themselves, you'll notice if some test stops exercising the code it was intended to check.
+
Once again, if you know what this thing is called, please do enlighten
+me in comments! Discussion on /r/rust.
This is partially a mild instance of xkcd://386 with
+respect to the great don’t
+panic post by
+@vorner (yes, it’s 2 am here) and partially a
+discussion of error-handling in the framework of structured concurrency, which
+was recently popularized by @njsmith.
In the blog post, @vorner argues that unwinding sometimes may do more
+harm than good, if it manages to break some unsafe invariants,
+cross FFI boundary or put the application into an impossible state. I
+fully agree that these all are indeed significant dangers of panics.
+
However, I don’t think that just disabling unwinding and using panic
+= "abort" is the proper fix to the problem for the majority of use
+cases. A lot of programs work in a series of requests and responses
+(often implicit), and I argue that for this pattern it is desirable to
+be able to handle bugs in requests gracefully.
+
I’ve spent quite some time working on an
IDE, and, although it might not be apparent at first sight, IDEs are also based on requests/responses:
+
+
a user types a character, and the IDE updates its internal data structures
+
+
the user requests completion, and the IDE runs some calculations on the data and gives back the results
+
+
+
As IDEs are large and have a huge number of features, it is inevitable
that some not-very-important lint inspection will fail due to an index
+out of bounds access on this particular macro invocation in this
+particular project. Killing the whole IDE process would definitely be
+a bad user experience. On the other hand, just showing a non-modal
+popup “Something went wrong, would you like to submit a bug report” is
+usually only a minor irritation: errors are more common in the
+numerous “additional” features, while the smaller core tends to be
+more correct.
+
I do think that this pattern of "show an error message and chug along" is
+applicable to a significant number of applications. Of course, even in
+this setting a bug in the code can in theory have dire consequences,
+but in practice this is mitigated by the following:
+
+
+
The majority of requests are read-only and can't corrupt data.
+
+
+
The low-level implementation of write requests usually has relatively bug-free transactional semantics, so bugs in write requests which lead to transaction aborts don't corrupt data either.
+
+
+
Most applications have some kind of backup/undo functionality, and even if a bug leads to a commit of invalid data, the user can often restore a good state (of course, this works only for relatively unimportant data).
+
+
+
However, @vorner identifies a very interesting specific problem with
+unwinding which I feel we should really try to solve better: if you
+have a bunch of threads running, and one of them catches fire, what
happens? It turns out that often nothing in particular happens: some more threads might die from poisoned mutexes and closed channels, but other threads might continue, and, as a result, the application will exist in a half-dead state for an indefinite period of time.
You are right! Erlang and especially OTP behaviors are great for managing errors
at scale. However, a full actor system might be overkill if all you want is
+just an OS thread.
+
If you haven’t done this already, pack some snacks, prepare lots of coffee/tea
+and do read the structured
+concurrency
blog post. The crux of the pattern is to avoid fire-and-forget concurrency:
+
+
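A sketch of the fire-and-forget shape (illustrative only):

```rust
fn main() {
    // Fire and forget: the thread is not tied to any scope, nobody joins it,
    // and a panic inside it goes unnoticed.
    std::thread::spawn(|| {
        println!("doing some work in the background");
    });
    // `main` may well finish before the thread even gets to run.
}
```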
+
Instead, each thread should be confined to some lexical scope and
+never escape it:
+
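For example, with crossbeam's scoped threads (a minimal sketch; std's std::thread::scope has the same shape):

```rust
fn main() {
    let greeting = String::from("hello from a scoped thread");
    crossbeam::scope(|s| {
        s.spawn(|_| {
            // The thread can borrow from the enclosing stack frame,
            // because it cannot outlive the scope.
            println!("{}", greeting);
        });
        // Every thread spawned on `s` is joined at the end of the scope.
    })
    .unwrap();
}
```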
+
+
The benefit of this organization is that all threads form a tree,
+which gives you greater control, because you know for sure which parts
+are sequential and which are concurrent. Concurrency is explicitly
+scoped.
And we have a really, really interesting API design problem if we
+combine structured concurrency and unwinding. What should be the
+behavior of the following program?
+
+
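Something along these lines (a sketch, again using crossbeam's scoped threads):

```rust
use std::{thread, time::Duration};

fn main() {
    crossbeam::scope(|s| {
        s.spawn(|_| {
            panic!("this thread dies almost immediately");
        });
        s.spawn(|_| {
            // ... a long-running computation ...
            thread::sleep(Duration::from_secs(60 * 60));
        });
    })
    .unwrap();
}
```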
+
Now, for crossbeam specifically there’s little choice here due to
+the boring requirement for memory safety. But let’s pretend for now
+that this is a garbage collected language.
+
So, we have two concurrent threads in a single scope, one of which is
+currently running and another one is, unfortunately, dead.
+
The most obvious choice is to wait for the running thread to finish
+(we don’t want to let it escape the scope) and then to reraise the
+panic at scope exit. The problem with this approach is that there’s a
+potentially unbounded window between the instant the panic is created,
+and its propagation.
+
This is not a theoretical concern: some time ago a friend of mine had
+a fascinating debugging session with a Python machine learning
+application. The program was processing a huge amount of data, so, to
+speed things up, it partitioned the data and spawned a thread per
+partition (actual processing was in native code, so GIL was avoided):
+
+
+
The observed behavior was that a single thread died, but no exception or stack trace was printed anywhere. This was because the executor
+was waiting for all other threads before propagating the
+exception. Although technically the exception was not lost, in
+practice you’d have to wait for several hours to actually see it!
+
The Trio library uses an
+interesting
+refinement of this strategy: when one of the tasks in scope fails, all others
are immediately cancelled and then awaited. I think this should work well
+for Trio, because it has first-class support for cancellation; any async
+operation is a cancellation point. So all children tasks will be cancelled in a
+timely manner, although I wouldn’t be surprised if there are some pathological
+cases where exception propagation is delayed.
+
Unfortunately, this solution doesn't work for native threads, because
+there are just no good cancellation points. And I don’t know of any
+approach that would work :(
+
One vague idea I have is inspired by the handling of orphaned processes in Unix: if a thread in a scope dies, the scope is torn down immediately, and all the still-running threads are attached to the value that is thrown. If anyone wants to handle the failure, they must wait for all attached threads to finish first. This way, the initial panic and all in-progress threads could be propagated to the top-level
+init scope, which then can attempt either a clean exit by waiting
+for all children, or do a process::abort.
+
However this attachment to the parent violates the property that a
+thread never leaves its original scope. Because crossbeam relies on
+this property for memory safety, this approach is just not applicable
+for threads which share stack data.
+
It’s already 4 am here, so I really should be wrapping the post up :)
+So, a challenge: design a Rust library for scoped concurrency based on
+native OS threads that:
+
+
never loses a thread or a panic,
+
+
+immediately propagates panics,
+
+
allows (optionally?) sharing stack data between the threads.
+
I've spent years looking for a good tool to make slides. I've tried LaTeX Beamer, Google Docs, Slides.com and several reveal.js offsprings, but none of them was satisfactory for me. Last year, I stumbled upon Asciidoctor.js PDF (which had something like three GitHub stars at that moment), and it is perfect.
+
At least, it is perfect for my use case, your requirements might be different.
+I make presentations for teaching programming at Computer Science Center, so my slides are full of code, bullet lists, and sometimes have moderately complex layout.
To make reviewing course material easier, slides need to have high information density.
+
If you want to cut down straight to the code, see the repository with slides for my Rust course:
My requirements are:

A source markup language: I like to keep my slides on GitHub
+
+
+Ease of styling and layout.
A good test here is a two-column layout with a code snippet on the left and a bullet list on the right.
+
+
+The final output should be a PDF.
I don't use animations, but I need the slides to look exactly the same on different computers.
+
+
+
All the tools I’ve tried don’t quite fit the bill.
+
While TeX is good for formatting formulas, LaTeX is a relatively poor language for describing the structure of a document. The awesome Emacs mode fixes the issue partially, but still, \begin{itemize} is way too complex for a bullet list. Additionally, the quality of implementation is not perfect: unicode support needs opt-in, and the build process is fiddly.
+
Google Docs and Slides.com are pretty solid choices if you want WYSIWYG. In fact, I primarily used these two tools before AsciiDoctor. However, WYSIWYG and the limited flexibility that comes with it are significant drawbacks.
+
I think I’ve never made a serious presentation in any of the JavaScript presentation frameworks.
+I’ve definitely tried reveal.js, remark and shower, but turned back to Google Docs in the end.
+The two main reasons for this were:
+
+
+Less than ideal source language:
+
+
+if it is Markdown, I struggled with creating complex layouts like the two column one;
+
+
+if it is HTML, simple things like bullet lists or emphasis are hard.
+
+
+
+
+Cross browser CSS.
+These frameworks pack a lot of JS and CSS, which I don’t really need, but which makes tweaking stuff difficult for me, as I am not a professional web developer.
+
The killer feature behind Asciidoctor.js PDF is the AsciiDoc markup language.
+Like Markdown, it’s a lightweight markup language.
+When I was translating this blog from .md to .adoc the only significant change in the syntax was for links, from
+
+
+
to
+
+
+
However, unlike Markdown and LaTeX, AsciiDoc has native support for a rich hierarchical document model. AsciiDoc source is parsed into a tree of nested elements with attributes (historically, AsciiDoc was created as an easier way to author DocBook XML). This allows expressing complex document structure without ad-hoc syntax extensions. Additionally, the concrete syntax feels very orthogonal and well rounded.
+We’ve seen the syntax for links before, and this is how one includes an image:
+
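With a made-up file name, it looks like this:

```asciidoc
image::images/ferris.png[]
```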
+
+
Or a snippet from another file:
+
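The file name and line range here are invented for illustration:

```asciidoc
include::code/main.rs[lines=1..10]
```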
+
+
A couple more examples, just to whet your appetite (Asciidoctor has extensive documentation):
+
+
+
+
This is a paragraph
+
This is a paragraph with an attribute (which translates to CSS class)
+
+
+
+
+
+
+
This is a bullet list
+
+
+
Bullet with table (+ joins blocks)
+
+
+
Are tables in lists stupid?
+
Probably!
+
+
+
+
+
+
+
+
+
+
That is, in addition to the usual syntax highlighting, the &xs[0] bit is wrapped into a <span class="hl-error">.
+This can be used to call out specific bits of code, or, like in this case, to show compiler errors:
+
Here’s an example of a complex slide:
+
+
+
+
+.two-col sets the css class for two-column flex layout.
+
+
+[.language-rust] sets css class for inline <code> element, so mut gets highlighted.
+
+
+This bullet-point contains a longer snippet of code.
+
+
+Have you noticed these circled numbered callouts? They are another useful feature of AsciiDoc!
+
AsciiDoc markup language is a powerful primitive, but how do we turn it into pixels on the screen?
+The hard part of making slides is laying out the contents: breaking paragraphs in lines, aligning images, arranging columns.
As was pointed out by the Asciidoctor maintainer, browsers are extremely powerful layout engines, and HTML + CSS is a decent way to describe the layout.
+
And here’s where Asciidoctor.js PDF comes in: it allows one to transform AsciiDoc DOM into HTML, by supplying a functional-style visitor.
+This HTML is then rendered to PDF by chromium (but you can totally use HTML slides directly if you like it more).
+
Here’s the visitor which produces the slides for my Rust course:
In contrast to reveal.js, I have full control over the resulting HTML and CSS.
+As I don’t need cross browser support or complex animations, I can write a relatively simple modern CSS, which I myself can understand.
Note that Asciidoctor.js PDF is a relatively new piece of technology (although the underlying Asciidoctor project is very mature).
For this reason, I just vendor a specific version of the tool for my slides.
+
Because the intermediate result is HTML, the development workflow is very smooth.
+It’s easy to make a live preview with a couple of editor plugins, and you can use browser’s dev-tools to debug CSS.
+I’ve also written a tiny bit of JavaScript to enable keyboard navigation for slides during preview.
+Syntax highlighting is also a bespoke pile of regexes :-)
+
One thing I am worried about is the depth of the stack of technologies of Asciidoctor.js PDF.
+
+
+Original AsciiDoc tool was written in Python.
+
+
+Asciidoctor is a modern enhanced re-implementation in Ruby.
+
+
Asciidoctor.js PDF runs on NodeJS via Opal, a Ruby -> JavaScript compiler
+
+
+It is used to produce HTML which is then fed into chromium to produce PDF!
+
+
+
Oh, and syntax highlighting on this blog is powered by pygments, so Ruby calls into Python!
+
This is quite a Zoo, but it works reliably for me!
Course slides are available under CC-BY at https://github.com/matklad/rust-course.
+See the sibling post if you want to learn more about how the slides were made
+(TL;DR: Asciidoctor is better than beamer, Google Docs, slides.com, reveal.js, remark).
+
High-quality recordings of lectures are available on YouTube:
Teaching is hard, but very rewarding.
+Teaching Rust feels especially good because the language is very well designed and the quality of the implementation is great.
+Overall, I don’t feel like this was a particularly hard course for the students.
+In the end most of the folks successfully completed all assignments, which were fairly representative of the typical Rust code.
There was one extremely hard topic and one poorly explained topic.
+
The hard one was the module system.
+Many students were completely stumped by it.
+It’s difficult to point out the specific hard aspect of the current (Rust 2018) module system: each student struggled in their own way.
+
Here’s a selection of points of confusion:
+
+
+you don’t need to wrap contents of foo.rs in mod foo { ... }
+
+
+you don’t need to add mod lib; to main.rs
+
+
+child module lives in the parent/child.rs file, unless the parent is lib.rs or main.rs
+
+
+
I feel like my explanation of modules was an OK one, it contained all the relevant details and talked about how things work under the hood.
+However, it seems like just explaining the modules is not enough: one really needs to arrange a series of exercises about modules, and make sure that all students successfully pass them.
+
I don’t think that modules are the hardest feature of the language: advanced lifetimes and unsafe subtleties are more difficult.
+However, you don’t really write mem::transmute or HRTB every day, while you face modules pretty early.
+
The poorly explained topic was Send/Sync.
I was like "the compiler infers Send/Sync automatically, and after that your code just fails to compile if it would have a data race, isn't Rust wonderful?".
+But this misses the crucial point: in generic code (both for impl T and dyn T), you’ll need to write : Sync bounds yourself.
+Of course the homework was about generic code, and there were a number of solutions with (unsound) unsafe impl<T> Sync for MyThing<T> :-)
It’s very hard to google Rust documentation at the moment, because google links
+you to redirect stubs of the old book, which creates that weird feeling that you
+are inside of a science-fiction novel.
+I know that the problem is already fixed, and we just need to wait until the new version of the old book is deployed, but I wish we could have fixed it earlier.
+
Editions are a minor annoyance as well. I’ve completely avoided talking about Rust 2015, hoping that I’ll just teach the shiny new thing.
+But of course students google for help and get outdated info.
+
+
+many used extern crate syntax
+
+
+dyn in dyn T was sometimes omitted
+
+
+there was a couple of mod.rs
+
+
+
Additionally, several students somehow ended up with edition = "2015" in Cargo.toml.
Over time I have accumulated a number of tricks and hacks that make “linux desktop” more natural for me.
+Today I’ve discovered another one: a way to minimize Firefox on close.
+This seems like a good occasion to write about things I’ve been doing!
I’ve never understood the appeal of multiple desktops, tiling window managers,
+or Mac style “full screen window is outside of your desktop”.
They do let you neatly organize several applications at once, but I never need an overview of all applications.
+What I need most of the time is switching to a specific application, like a browser.
+
Windows has a feature that fits this workflow perfectly.
+If you pin an application to start menu, then win + number will launch or focus that app.
+That is, if the app is already running, its window will be raised and focused.
+
For some reason, this is not available out of the box in any of the Linux window
+managers I’ve tried. What is easy is binding launching an application to a
shortcut, but I rarely use more than one instance of Firefox!
+
Luckily, jumpapp is exactly what is needed
+to implement this properly.
+
I use Xbindkeys for global
+shortcuts, with the following config:
+
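I won't reproduce the whole config; a minimal sketch of the xbindkeys format, with commands and keys that are only examples:

```
# ~/.xbindkeysrc: launch-or-focus Firefox on F1, a terminal on F2
"jumpapp firefox"
  F1

"jumpapp kitty"
  F2
```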
+
+
Note that I bind F? keys without any modifiers: these keys are rarely used
+by applications and are very convenient for personal use.
I’ve always liked Quake-style terminals, which you can bring to front with a
+single keypress.
+For this reason, I was stuck with
+yakuake for a really long time.
+
jumpapp allows me to use any terminal in
+this fashion, so now I use full screen kitty.
Because switching windows/applications is easy for me, I typically look at a single maximized window.
+However, sometimes I like to have two windows side-by-side, for example an editor and a browser with preview.
A full-blown tiling window manager would be overkill for this use case, but another Windows feature comes in handy.
+In Windows, Win + ← and Win + → tiles active window to the left and right side of the screen.
+Luckily, this is a built in feature in most window managers, including KWin and Openbox (the two I use the most).
This one is tricky!
+On one hand, because I use one maximized window at a time, I feel comfortable with smaller displays.
+I was even disappointed with a purchase of external display for my laptop: turns out, bigger screen doesn’t really help me!
+On the other hand, I really like when all pixels I have are utilized fully.
+
I’ve tried to work in full screen windows, but that wasn’t very convenient for two reasons:
+
+
+Tray area is useful for current time, other status information, and notifications.
+
+
+Full screen doesn’t play well with jumpapp window switching.
+
+
+
After some experiments, I’ve settled with the following setup:
+
+
+
Use of maximized, but not full screen windows.
+
+
+
When a window is maximized, its borders and title bar are hidden. To do this in KWin, add the following to ~/.config/kwinrc:
+
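To the best of my knowledge, the relevant setting is BorderlessMaximizedWindows in the [Windows] section (double-check against your KWin version):

```ini
[Windows]
BorderlessMaximizedWindows=true
```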
+
+
+
+
To still have the ability to close/minimize the window with the mouse, I use the Active Window Menu Plasmoid. It packs the window title and close/maximize/minimize buttons into the desktop panel, without spending extra pixels:
+
+
+
+
+
Another thing I've noticed is that I look at the bottom part of the screen much more often.
+For this reason, I move desktop panel to the top.
+You can imagine how inconvenient Mac’s dock is for me: it wastes so many pixels in the most important area of the display :-)
After several years of using Emacs and a number of short detours into Vim-land, I grew a profound dislike for the arrow keys.
+It’s not that they make me slower: they distract me because I need to think about moving my hands.
+
For a long time I've tried to banish arrow keys from my life by making every
+application understand ctrl+b, ctrl+f and the like.
+But that was always a whack-a-mole game without a chance to win.
+
A much better approach is Home Row Computing.
I rebind, at a low level, CapsLock + i/j/k/l to the arrow keys.
+This works in every app.
+It also works with alt and shift modifiers.
+
I use xkbcomp with this config to set this up.
+I have no idea how this actually works :-)
I used to pile up everything on the desktop.
But now my desktop is completely empty, and I enjoy an uncluttered view of
+The Hunters in the Snow
+every time I boot my laptop.
+
The trick is to realize that accreting “junk” files is totally normal, and
+“just don’t put garbage on desktop” is not a solution.
+Instead, one can create a dedicated place for hoarding.
+
I have two of those:
+
+
+~/downloads which I remove automatically on every reboot
+
+
+~/tmp which I rm -fr ~/tmp manually once in a while
+
I used to use Zsh with a bunch of plugins, hoping that I'd learn bash this way.
+I still google “How to if in bash?” every single time though.
+
For this reason, I’ve switched to fish with mostly default config.
+The killer feature for me is autosuggestions: completion of the commands based on the history.
+Zsh has something similar, via a plugin, but this crucial feature works in fish out of the box.
+
One slightly non-standard thing I do is a two-line prompt that looks like this:
+
+
+
Two line prompts are great! You can always see a full working directory, and commands are always visually in the same place.
+Having current time in the prompt is also useful in case you run a long command and forget to time it.
I don’t use a lot of desktop apps, but I keep a browser with at least five tabs for different messaging apps.
+By the way, Tree Style Tab is the best tool for taming modern “apps”!
+
The problem with this is that I automatically Alt+F4 Firefox once I am done with it, but launching it every time is slow.
Ideally, I want to minimize it on close, just as I do with qBittorrent and Telegram.
+Unfortunately, there’s no built-in feature for this in Firefox.
+
I once tried to build it with Xbindkeys and Xdotool.
The idea was to intercept Alt+F4 and minimize the active window if it is Firefox.
+That didn’t work too well: to close all other applications, I tried to forward Alt+F4, but that recursed badly :-)
+
Luckily, today I’ve realized that I can write a KWin script for this!
+This turned out to be much harder than anticipated, because the docs are thin and setup is fiddly.
+
This
+post was instrumental for me to figure this stuff out. Thanks Chris!
+
I’ve created two files:
+
+
+
+
+
After that, I've ticked a box in front of Smart Close Window in System Settings › Window Management › KWin Scripts and added a shortcut in System Settings › Shortcuts › Global Shortcuts › System Settings. The last step took a while to figure out: although it looks like we set the shortcut in the script itself, that doesn't actually work for some reason.
Finally, my life has become significantly easier since I've settled on NixOS. I had mainly used Arch and a bit of Ubuntu before, but NixOS is so much easier to control. I highly recommend checking it out!
One of my favorite blog posts about Rust is Things Rust Shipped Without by Graydon Hoare.
+To me, footguns that don’t exist in a language are usually more important than expressiveness.
+In this slightly philosophical essay, I want to tell about a missing Rust feature I especially like: constructors.
Constructors are typically found in Object Oriented languages.
+The job of a constructor is to fully initialize an object before the rest of the world sees it.
At first blush, this seems like a really good idea:
+
+
+You establish invariants in the constructor.
+
+
+Each method takes care to maintain invariants.
+
+
+Together, these two properties mean that it is possible to reason about the object in terms of coarse-grained invariants, instead of fine-grained internal state.
+
+
+
The constructor plays the role of the induction base here, as it is the only way to create a new object.
+
Unfortunately, there's a hole in this reasoning: the constructor itself observes the object in an inconsistent state, and that creates a number of problems.
When the constructor initializes the object, it starts with some dummy state.
+But how do you define a dummy state for an arbitrary object?
+
The easiest answer is to set all fields to default values: booleans to false, numbers to 0, and reference types to null.
+But this requires that every type has a default value, and forces the infamous null into the language.
+This is exactly the path that Java took: at the start of construction, all fields are zero or null.
+
It’s really hard to paper over this if you want to get rid of null afterwards.
+A good case study here is Kotlin.
Kotlin uses non-nullable types by default, but has to work with pre-existing JVM semantics.
+The language-design heroics to hide this fact are really impressive and work well in practice, but are unsound.
+That is, with constructors it is possible to circumvent Kotlin null-checking.
+
Kotlin’s main trick is to encourage usage of so-called “primary constructors”, which simultaneously declare a field and set it before any user code runs:
+
+
+
Alternatively, if the field is not declared in the constructor, the programmer is encouraged to immediately initialize it:
+
+
+
Trying to use a field before initialization is forbidden statically, on a best-effort basis:
+
+
+
But, with some creativity, one can get around these checks.
+For example, a method call would do:
+
+
+
As well as capturing this by a lambda (spelled { args -> body } in Kotlin):
+
+
+
Examples like these seem contorted (and they are), but I did hit similar issues
+in real code
+(Kolmogorov’s zero–one law of software engineering: in a sufficiently large code base, every code pattern exists almost surely, unless it is statically rejected by the compiler, in which case it almost surely doesn’t exist).
+
The reason why Kotlin can get away with this unsoundness is the same as with Java’s covariant arrays: runtime does null checks anyway.
+All in all, I wouldn’t want to complicate Kotlin’s type system to make the above cases rejected at compile time:
+given existing constraints (JVM semantics), cost/benefit ratio of a runtime check is much better than that of a static check.
+
What if the language doesn't have a reasonable default for every type? For example, in C++, where user-defined types are not necessarily references, one cannot just assign nulls to every field and call it a day! Instead, C++ invents a special kind of syntactic machinery for specifying the initial values of fields: initializer lists:
+
+
+
Being a special syntax, initializer lists don't interact completely flawlessly with the rest of the language. For example, it's hard to fit arbitrary statements into initializer lists, because C++ is not an expression-oriented language (which by itself is OK!).
+Working with exceptions from initializer lists needs yet another obscure language feature.
As Kotlin examples alluded, all hell breaks loose if one calls a method from a constructor.
+Generally, methods expect that this object is fully constructed and valid (adheres to invariants).
But, in Java or Kotlin, nothing prevents you from calling a method in the constructor, and that way a semi-alive object can "escape".
+Constructor promises to establish invariants, but is actually the easiest place to break them!
+
A particularly bizarre thing happens when the base class calls a method overridden in the subclass:
+
+
+
Just think about it: code for Derived runs before its constructor!
+Doing a similar thing in C++ leads to even curiouser results.
+Instead of calling the function from Derived, a function from Base will be called.
+This makes some sense, because Derived is not at all initialized (remember, we can’t just say that all fields are null).
+However, if the function in Base happens to be pure virtual, undefined behavior occurs.
Breaking invariants isn’t the only problem with constructors.
They also have a rigid signature: a fixed name and a fixed return type (the class itself). That makes constructor overloads confusing for humans.
+
+
The problem with return type usually comes up if construction can fail.
+You can’t return Result<MyClass, io::Error> or null from a constructor!
+
This is often used as an argument that C++ with exceptions disabled is not viable, and that using constructors forces one to use exceptions as well. I don't think that's a valid argument though: factory functions solve both problems, because they can have arbitrary names and can return arbitrary types. I actually find this to be an occasionally useful pattern in OO languages:
+
+
+
Make a single private constructor that accepts all the fields as arguments and just sets them.
+That is, this constructor acts almost like a record literal in Rust.
+It can also validate any invariants, but it shouldn’t do anything else with arguments or fields.
+
+
+
For public API, provide the necessary public factory functions, with
+appropriate naming and adjusted return types.
+
+
+
A similar problem with constructors is that, because they are a special kind of thing, it's hard to be generic over them. In C++, "default constructible" or "copy constructible" can't be expressed more directly than "certain syntax works".
+Contrast this with Rust, where these concepts have appropriate signatures:
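Simplified, the corresponding std traits look roughly like this:

```rust
// "Default constructible" is just a trait with a static method ...
pub trait Default {
    fn default() -> Self;
}

// ... and "copy constructible" is a method from an existing value to a new one.
pub trait Clone {
    fn clone(&self) -> Self;
}
```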
In Rust, there’s only one way to create a struct: providing values for all the fields.
+Factory functions, like the conventional new, play the role of constructors, but, crucially, don’t allow calling any methods until you have at least a basically valid struct instance on hand.
+
A perceived downside of this approach is that any code can create a struct, so there's no single place, like the constructor, to enforce invariants. In practice, this is easily solved by privacy: if a struct's fields are private, it can only be created inside its declaring module.
+Within a single module, it’s not at all hard to maintain a convention like “all construction must go via the new method”.
+One can even imagine a language extension that allows one to mark certain functions with a #[constructor] attribute, with the effect that the record literal syntax is available only in the marked functions.
+But, again, additional language machinery seems unnecessary: maintaining local conventions needs little effort.
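A sketch of the pattern (the type and names are invented for illustration):

```rust
mod email {
    pub struct Email {
        // The field is private, so the record literal `Email { .. }`
        // only works inside this module.
        address: String,
    }

    impl Email {
        // An ordinary function plays the role of the constructor: it can
        // validate, carry a descriptive name, and return an Option.
        pub fn parse(address: String) -> Option<Email> {
            if address.contains('@') {
                Some(Email { address })
            } else {
                None
            }
        }
    }
}

fn main() {
    assert!(email::Email::parse("ferris@example.com".to_string()).is_some());
    assert!(email::Email::parse("not an email".to_string()).is_none());
}
```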
+
I personally think that this tradeoff looks the same for first-class contract programming in general.
+Contracts like “not null” or “positive” are best encoded in types.
+For complex invariants, just writing assert!(self.validate()) in each method manually is not that hard.
+Between these two patterns there’s little room for language-level or macro-based #[pre] and #[post] conditions.
An interesting language to look at the constructor machinery is Swift.
+Like Kotlin, Swift is a null-safe language.
+Unlike Kotlin, Swift’s null-checking needs to be sound, so it employs interesting tricks to mitigate constructor-induced damage.
+
First, Swift embraces named arguments, and that helps quite a bit with “all constructors have the same name”.
+In particular, having two constructors with the same types of parameters is not a problem:
+
+
+
Second, to solve the "constructor calls a virtual function of an object whose class hasn't come into existence yet" problem, Swift uses an elaborate two-phase initialization protocol. Although there's no special syntax for initializer lists, the compiler statically checks that the constructor's body has just the right, safe and sound, form.
+For example, calling methods is only allowed after all fields of the class and its ancestors are set.
+
Third, there’s special language-level support for failable constructors.
+A constructor can be declared nullable, which makes the result of a call to a constructor an option.
A constructor can also have a throws modifier, which works somewhat nicer with Swift's semantic two-phase initialization than with C++'s syntactic initializer lists.
+
Swift manages to plug all of the holes in constructors I am ranting about.
This comes at a price, however: the initialization chapter is one of the longest in the Swift book!
However, I can think of at least two reasons why constructors can’t be easily substituted with Rust-style record literals.
+
First, inheritance more or less forces the language to have constructors.
+One can imagine extending the record syntax with support for base classes:
+
+
+
But this won't work with the object layout of a typical single-inheritance OO language!
+Usually, an object starts with a header and continues with fields of classes, from the base one to the most derived one.
+This way, a prefix of an object of a derived class forms a valid object of a base class.
+For this layout to work though, constructor needs to allocate memory for the whole object at once.
It can't allocate just enough space for the base and then append the derived fields afterwards. But such piece-wise allocation is required if we want a record syntax where we can just specify a value for the base class.
+
Second, unlike records, constructors have a placement-friendly ABI.
+Constructor acts on the this pointer, which points to a chunk of memory which a newborn object should occupy.
Crucially, a constructor can easily pass pointers to the subobjects' constructors, allowing a complex tree of values to be created in place.
+In contrast, in Rust constructing records semantically involves quite a few copies of memory, and we are at the mercy of the optimizer here.
+It’s not a coincidence that there’s still no accepted RFC for placement in Rust!
This is a short note about yet another way to look at Rust’s unsafe.
+
Today, an interesting bug was found in rustc, which made me aware of just how useful unsafe is for making code maintainable.
+The story begins a couple of months ago, when I was casually browsing through recent pull requests for rust-lang/rust.
+I was probably waiting for my code to compile at that moment :]
+Anyway, a pull request caught my attention, and, while I was reading the diff, I noticed a usage of unsafe.
+It looked roughly like this:
+
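Reconstructed from memory rather than copied from the PR, the function had roughly this shape:

```rust
use std::ptr;

fn map_in_place<T>(t: &mut T, f: impl FnOnce(T) -> T) {
    unsafe {
        let old = ptr::read(t); // move the value out of the unique reference
        let new = f(old);       // arbitrary user code runs here!
        ptr::write(t, new);     // move the result back, without dropping the old value
    }
}
```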
+
+
This function applies a T -> T function to a &mut T value, a-la take_mut crate.
+
There is a safe way to do this in Rust, by temporarily replacing the value with something useless (Jones's trick):
+
+
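For a type with a cheap placeholder value, the trick looks roughly like this (a sketch):

```rust
use std::mem;

fn map_in_place_safe<T: Default>(t: &mut T, f: impl FnOnce(T) -> T) {
    let old = mem::replace(t, T::default()); // leave a dummy value behind
    *t = f(old); // if `f` panics, `*t` still holds the (valid) dummy
}
```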
+
In map_in_place we don’t have a T: Default bound, so the trick is not applicable.
+Instead, the function uses (unsafe) ptr::read to get an owned value out of a unique reference, and then uses ptr::write to store the new value back, without calling the destructor.
+
However, the code has a particular unsafe code smell: it calls user-supplied code (f) from within an unsafe block.
+This is usually undesirable, because it makes reasoning about invariants harder: arbitrary code can do arbitrary unexpected things.
+
+
And, indeed, this function is unsound: if f panics and unwinds, the t value would be dropped twice!
+The solution here (which I know from the take_mut crate) is to just abort the process if the closure panics.
+Stern, but effective!
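One way to implement the trick, in the spirit of take_mut (this is a sketch, not the code that actually landed in rustc):

```rust
use std::{mem, process, ptr};

fn map_in_place<T>(t: &mut T, f: impl FnOnce(T) -> T) {
    // If this guard is dropped during unwinding, kill the process
    // before the double drop can be observed.
    struct AbortOnPanic;
    impl Drop for AbortOnPanic {
        fn drop(&mut self) {
            process::abort();
        }
    }

    unsafe {
        let old = ptr::read(t);
        let guard = AbortOnPanic;
        let new = f(old);   // if this unwinds, `guard` aborts the process
        mem::forget(guard); // happy path: defuse the guard
        ptr::write(t, new);
    }
}
```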
+
I felt really torn about bringing this issue up: clearly, inside the compiler we know what we are doing, and the error case seems extremely marginal.
+Nevertheless, I did leave the comment, and the abort trick was implemented.
+
And guess what?
Today a bug report came in (#62894), demonstrating that the closure does panic in some cases, and rustc aborts.
+To be clear, the abort in this case is a good thing!
+If rustc didn’t abort, it would be a use-after-free.
+
Note how cool this is: a casual code-reviewer was able to prevent a memory-safety issue by looking at just a single one-line function.
+This was possible for two reasons:
+
+
+The code was marked unsafe which made it stand out.
+
+
+The safety reasoning was purely local: I didn’t need to understand the PR (or surrounding code) as a whole to reason about the unsafe block.
+
+
+
The last bullet point is especially interesting, because it is what makes type systems [1] in general effective in large-scale software development:
+
+
+Checking types is a local (per-expression, per-function, per-module, depending on the language) procedure.
+Every step is almost trivial: verify that sub-expressions have the right type and work out the result type.
+
+
+Together, these local static checks guarantee a highly non-trivial global property:
+during runtime, actual types of all the values match inferred static types of variables.
+
+
+
Rust’s unsafe is similar: if we verify every usage of unsafe (local property!) to be correct, then we guarantee that the program as a whole does not contain undefined behavior.
+
The devil is in the details, however, so the reality is slightly more nuanced.
+
First, unsafe should be checked by humans, thus a human-assisted type system.
+The problem with humans, however, is that they make mistakes all the time.
+
Second, checking unsafe can involve a rather large chunk of code.
+For example, if you implement Vec, you can (safely) write to its length field from anywhere in the defining module.
+That means that correctness of Deref impl for Vec depends on the whole module.
+Common wisdom says that the boundary for unsafe code is a module, but I would love to see a more precise characteristic.
+For example, in map_in_place case it’s pretty clear that only a single function should be examined.
On the other hand, if Vec's fields are pub(super), the parent module should be scrutinized as well.
+
Third, it’s trivial to make all unsafe blocks technically correct by just making every function unsafe.
+That wouldn’t be a useful thing to do though!
+Similarly, if unsafe is used willy-nilly across the ecosystem, its value is decreased, because there would be many incorrect unsafe blocks, and reviewing each additional block would be harder.
+
Fourth, and probably most disturbing, correctness of two unsafe blocks in isolation does not guarantee that they together are correct!
+We shouldn’t panic though: in practice, realistic usages of unsafe do compose.
This is a note on how to make multithreaded programs more robust.
+It’s not really specific to Rust, but I get to advertise my new jod-thread micro-crate :)
+
Let's say you've created a fresh new thread with std::thread::spawn, but haven't called JoinHandle::join anywhere in your program.
+What can go wrong in this situation?
+As a reminder, join blocks until the thread represented by handle completes successfully or with a panic.
+
First, if the main function finishes earlier, some destructors on that other thread’s stack might not run.
It's not a big deal if all that the destructors do is just freeing memory: the OS cleans up after the process exits anyway.
+However, Drop could have been used for something like flushing IO buffers, and that is more problematic.
+
Second, not joining threads can lead to surprising interference between unrelated parts of the program and in general to more chaotic behavior.
+Imagine, for example, running a test suite with many tests.
+In this situation typical “singleton” threads may accumulate during a test run.
+Another scenario is spawning helper threads when processing tasks.
+If you don’t join these threads, you might end up using more resources than there are concurrent tasks, making it harder to measure the load.
+To be clear, if you don’t call join, the thread will complete at some point anyway, it won’t leak or anything.
+But this some point is non-deterministic.
+
Third, if a thread panics in a forest and no one is around to hear it, does it make a sound?
The join method returns a Result, which is an Err if the thread has panicked.
+If you don’t join the thread, you won’t get a chance to react to this event.
So, unless you are looking at stderr at that moment, you might not realize that something is wrong!
+
+
It seems like joining the threads by default is a good idea.
+However, just calling JoinHandle::join is not enough:
+
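Roughly like this (the function name is invented for the sketch):

```rust
fn do_work() -> std::io::Result<()> {
    let handle = std::thread::spawn(|| {
        println!("background thread is running");
    });
    // ... code that can return early with `?` or panic ...
    handle.join().unwrap(); // never reached on the unhappy path
    Ok(())
}
```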
+
+
The problem is, code in … might use ? (or some other form of early return), or it can panic, and in both cases the thread won’t be joined.
+As usual, the solution is to put the “cleanup” operation into a Drop impl.
+That’s exactly what my crate, jod_thread, does!
+Note that this is really a micro crate, so consider just rolling your own join on drop.
+The value is not in the code, it’s in the pattern of never leaving a loose thread behind!
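A minimal home-grown version might look like this (a sketch, not the jod_thread code itself):

```rust
use std::thread;

struct JoinOnDrop(Option<thread::JoinHandle<()>>);

impl Drop for JoinOnDrop {
    fn drop(&mut self) {
        if let Some(handle) = self.0.take() {
            let result = handle.join();
            // Propagate the child's panic, unless we are already unwinding.
            if !thread::panicking() {
                result.unwrap();
            }
        }
    }
}

fn main() {
    let _worker = JoinOnDrop(Some(thread::spawn(|| {
        println!("doing some work");
    })));
    // Whatever happens below (early return, panic), the worker is joined
    // when `_worker` goes out of scope.
}
```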
As usual, it is instructive to contrast and compare Rust and C++.
+
In C++, std::thread has the interesting peculiarity that it terminates the process in its destructor unless you call .join (which works just like in Rust) or .detach (which says "I won't be joining this thread at all").
+In other words, C++ mandates that you explicitly choose between joining and detaching.
+Why is that?
+
It’s easy to argue that detach by default is a wrong choice for C++: it can easily lead to undefined behavior if the lambda passed to the thread uses values from parent’s stack frame.
+
Or, as Scott Meyers poetically puts it in Item 37 of Effective Modern C++ (which is probably the best book to read if you are into both Rust and C++):
+
+
This also happens to be one of my favorite arguments for “why Rust?” :)
+
The reasoning behind not making join the default is less clear cut.
The book says that join by default would be counterintuitive, but that is somewhat circular: it is surprising precisely because it is not the default.
+
In Rust, unlike C++, implicit detach can’t cause undefined behavior (compiler will just refuse the code if the lambda borrows from the stack).
+I suspect this “we can, so why not?” is the reason why Rust detaches by default.
+
However, there’s a twist!
+C++ core guidelines now recommend to always use gsl::joining_thread (which does implicit join) over std::thread in CP.25.
+The following CP.26 reinforces the point by advising against .detach() method.
+The reasoning is roughly similar to my post: detached threads make the program more chaotic, as they add superfluous degrees of freedom to the runtime behavior.
+
It’s interesting that I’ve learned about these two particular guidelines only today, when refreshing my C++ for this section of the post!
+
So, it seems like both C++ and Rust picked the wrong default for the thread API in this case. But at least C++ has official guidelines recommending the better approach.
+And Rust, … well, Rust has my blog post now :-)
Of course there isn’t one!
+Joining on drop seems to be a better default, but it brings its own problems.
+The nastiest one is deadlocks: if you are joining a thread which waits for something else, you might wait forever.
+I don’t think there’s an easy solution here: not joining the thread lets you forget about the deadlock, and may even make it go away (if a child thread is blocked on the parent thread), but you’ll get a detached thread on your hands!
+The fix is to just arrange the threads in such a way that shutdown is always orderly and clean.
+Ideally, shutdown should work the same for both the happy and panicking path.
+
I want to discuss a specific instructive issue that I’ve solved in rust-analyzer.
+It was about the usual setup with a worker thread that consumes items from a channel, roughly like this:
+
+
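A condensed sketch of the shape of that code (not the actual rust-analyzer source), using a join-on-drop handle from the jod_thread crate:

```rust
use std::sync::mpsc::channel;

fn frobnicate() {
    let (sender, receiver) = channel::<u32>();
    // Join-on-drop handle: the thread is joined when `worker` goes out of scope.
    let worker = jod_thread::spawn(move || {
        // The worker stops once the channel is closed,
        // i.e. once every `Sender` is dropped.
        for item in receiver {
            println!("processing {}", item);
        }
    });
    // ... prepare some work ...
    sender.send(92).unwrap();
    // Locals are dropped in reverse order: `worker` (which joins!) is dropped
    // before `sender`, so the join waits forever.
}
```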
+
Here, the worker thread has a simple termination condition: it stops when the channel is closed.
+However, here lies the problem: we create the channel before the thread, so the sender is dropped after the worker.
+This is a deadlock: frobnicate waits for worker to exit, and worker waits for frobnicate to drop the sender!
+
There’s a straightforward fix: drop the sender first!
+
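That is, the happy path gains an explicit drop (same sketch as above):

```rust
use std::sync::mpsc::channel;

fn frobnicate() {
    let (sender, receiver) = channel::<u32>();
    let worker = jod_thread::spawn(move || {
        for item in receiver {
            println!("processing {}", item);
        }
    });
    // ... prepare some work ...
    sender.send(92).unwrap();
    drop(sender); // close the channel first, so the worker can exit ...
    // ... and only then is `worker` dropped and joined.
}
```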
+
+
This solution, while obvious, has a pretty serious problem!
The prepare some work ... bit of code can contain early returns due to error handling, or it may panic. In both cases the result is a deadlock. What is worse, now the deadlock happens only on the unhappy path!
+
There is an elegant, but tricky fix for this. Take a minute to think about it! How would you change the above snippet so that the worker thread is guaranteed to be joined, without deadlocks, regardless of how frobnicate exits (normal termination, ?, panic)?
+
The answer will be below these beautiful Ukiyo-e prints :-)
+
+
+
+
+
First of all, the problem we are seeing here is an instance of a very general setup.
+We have a bug which only manifests itself if a rare error condition arises.
+In some sense, we have a bug in the (implicit) error handling (just like 92% of critical bugs).
The solutions here are classic:
+
+
+Artificially trigger unhappy path often (“restoring from backup every night”).
+
+
+Make sure that there aren’t different happy and unhappy paths (“crash only software”).
+
+
+
We are going to do the second one. Specifically, we'll arrange the code in such a way that the compiler automatically drops the sender before the worker, without the need for an explicit drop.
+
Something like this:
+
+
+
The problem here is that we need receiver inside the worker, but moving let (sender, receiver) up brings us back to square one.
+Instead, we do this:
+
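I don't have the exact code at hand, but the shape of the trick is to bundle the sender and the handle so that declaration order gives the right drop order (a sketch; the real rust-analyzer code differs):

```rust
use std::sync::mpsc::{channel, Sender};

struct Worker {
    // Struct fields are dropped in declaration order: the sender goes first,
    // closing the channel, so the join in `_thread`'s drop can finish.
    sender: Sender<u32>,
    _thread: jod_thread::JoinHandle<()>,
}

fn frobnicate() {
    let worker = {
        let (sender, receiver) = channel::<u32>();
        let _thread = jod_thread::spawn(move || {
            for item in receiver {
                println!("processing {}", item);
            }
        });
        Worker { sender, _thread }
    };
    // ... prepare some work: early returns and panics are fine now ...
    worker.sender.send(92).unwrap();
}
```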
+
+
Beautiful, isn’t it?
+And super cryptic: the real code has a sizable comment chunk!
+
The second big issue with join by default is that, if you have many threads in the same scope, and one of them errors, you really want to not only wait until others are finished, but to actually cancel them.
+Unfortunately, cancelling a thread is a notoriously thorny problem, which I’ve explained a bit in another post.
So, yeah, join your threads, but be on guard about deadlocks!
+Note that most of the time one shouldn’t actually spawn threads manually: instead, tasks should be spawned to a common threadpool.
+This way, physical parallelism is nicely separated from logical concurrency.
+However, tasks should generally be joined for the same reason threads should be joined.
A nice additional property of tasks is that joining the threadpool itself at the end ensures, in a single place, that no tasks are leaked.
+
A part of the inspiration for this post was the fact that I once forgot to join a thread :(
+This rather embarrassingly happened in my other post.
Luckily, my current colleague Stjepan Glavina noticed this.
+Thank you, Stjepan!
If you are finding rust-analyzer useful in your work, consider talking to management about sponsoring rust-analyzer.
+We are specifically seeking sponsorship from companies that use Rust!
There are exciting projects to improve data-processing capabilities of shells, like nushell.
+However, I personally don’t use this capability of shell a lot: 90% of commands I enter are simpler than some cmd | rg pattern.
+
I primarily use shell as a way to use my system, and it is these interactive capabilities that I find lacking.
+So I want something closer in spirit to notty.
The commands I type most are cd, exa, rm, git ..., cargo ....
+I also type mg, which launches a GUI version of Emacs with Magit:
+
+
+
These tools make me productive.
+Keyboard-only input is fast and “composable” (I can press up to see previous commands, I can copy-paste paths, etc).
+Colored character-box based presentation is very clear and predictable, I can scan it very quickly.
+
+
However, there are serious gaps in the UX:
+
+
+
ctrl+c doesn’t work as it works in every other application.
+
+
+
I launch the GUI version of Emacs: the terminal one changes some keybindings, which is confusing to me.
+For example, I have splits inside emacs, and inside my terminal as well, and I just get confused as to which shortcut I should use.
+
+
+
The output of programs is colored with escape codes, which are horrible, and not flexible enough.
+When my Rust program panics and prints that it failed in my_crate::foo::bar function, I want this to be a hyperlink to the source code of the function.
+I want to cat images and PDFs in my terminal (and html, obviously).
+
+
+
My workflow after I’ve done a bunch of changes is:
+
+
+type cargo test to launch tests
+
+
+type ctrl+shift+Enter to split the terminal
+
+
+type git status or mg in the split to start making a commit in parallel to testing
+
+
+
+
+
The last step is crazy!
+
Like, cargo test is being run by my shell (fish), the split is handled by the terminal emulator (kitty), which launches a fresh instance of fish and arranges the working directory to be saved.
+
As a user, I don’t care about this terminal/terminal emulator/shell split.
+I want to launch a program, and just type commands.
Why does cargo test block my input?
+Why can’t I type cargo test, Enter, exa -l, Enter and have this program to automatically create the split?
+
+
+
Additionally, while magit is awesome, I want an option to use such an interface for all my utilities.
+Like, for tar?
+And, when I type cargo test --package, I really want completion for the set of packages which are available in the current directory.
Isn’t it Emacs that I am trying to describe?
+Well, sort-of.
+Emacs is definitely in the same class of “application containers”, but it has some severe problems, in my opinion:
+
+
+Emacs Lisp is far from the best possible language for writing extensions.
+
+
+Plugin ecosystem is not really dependable.
+
+
+It doesn’t define out-of-process plugin API (things like hyperlinking output).
+
+
+Async support is somewhere between non-existent and awkward.
+
+
+Its main focus is text editing.
+
+
+Its defaults are not really great (fish shell is a great project to learn from here).
+
+
+ctrl+c, ctrl+v do not work by default, M-x is not really remappable.
+
A “terminals are a mess” story from today.
I wanted a "kill other split" shortcut for my terminal, bound to ctrl+k, 1.
+Implementing it was easy, as kitty has a nice plugin API.
+After that I’ve realized that I need to remap kill_line from ctrl+k to ctrl+shift+k, so that it doesn’t conflict with the ctrl+k, 1 chord.
+It took me a while to realize that searching for kill_line in kitty is futile — editing is handled by the shell.
+Ok, so it looks like I can just remap the key in fish, by bind \cK kill_line, except that, no, ctrl shortcuts do not work with Shift because of some obscure terminal limitation.
+So, let’s go back to kitty and add a ctrl+shift+k shortcut that sends ^k to the fish!
+An hour wasted.
In this post, I will be expressing strong opinions about a topic I have relatively little practical experience with, so feel free to roast and educate me in comments (link at the end of the post) :-)
+
Specifically, I’ll talk about:
+
+
+spinlocks,
+
+
+spinlocks in Rust with #[no_std],
+
+
+priority inversion,
+
+
+CPU interrupts,
+
+
+and a couple of neat/horrible systemsy Rust hacks.
+
I maintain the once_cell crate, which is a synchronization primitive.
+It uses std blocking facilities under the hood (specifically, std::thread::park), and as such is not compatible with #[no_std].
+A popular request is to add a spin-lock based implementation for use in #[no_std] environments: #61.
+
More generally, this seems to be a common pattern in Rust ecosystem:
+
+
+A crate uses Mutex or other synchronization mechanism from std
+
+
+Someone asks for #[no_std] support
+
+
Mutex is swapped for some variation of a spinlock.
+
A spinlock is the simplest possible implementation of a mutex; its general form looks like this:
+
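A minimal sketch in Rust; compare_exchange_weak plays the role of the compare-and-swap, and the numbered comments correspond to the notes below:

```rust
use std::sync::atomic::{AtomicBool, Ordering};

pub struct SpinLock {
    locked: AtomicBool,
}

impl SpinLock {
    pub const fn new() -> SpinLock {
        SpinLock { locked: AtomicBool::new(false) }
    }

    pub fn lock(&self) {
        // (1) Repeatedly try to flip `locked` from `false` to `true`.
        while self
            .locked
            .compare_exchange_weak(false, true, Ordering::Acquire, Ordering::Relaxed)
            .is_err()
        {
            // (4) Hint to the CPU that we are busy-waiting.
            std::hint::spin_loop();
        }
        // (2) Only one thread at a time makes it past the loop.
    }

    pub fn unlock(&self) {
        // (3) Releasing the lock is a single atomic store.
        self.locked.store(false, Ordering::Release);
    }
}
```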
+
+
+
To grab a lock, we repeatedly execute compare_and_swap until it succeeds. The CPU "spins" in this very short loop.
+
+
+Only one thread at a time can be here.
+
+
+To release the lock, we do a single atomic store.
+
+
+Spinning is wasteful, so we use an intrinsic to instruct the CPU to enter a low-power mode.
+
+
+
Why we need Ordering::Acquire and Ordering::Release is very interesting, but beyond the scope of this article.
+
The key take-away here is that a spinlock is implemented entirely in user space: from OS point of view, a “spinning” thread looks exactly like a thread that does a heavy computation.
+
An OS-based mutex, like std::sync::Mutex or parking_lot::Mutex, uses a system call to tell the operating system that a thread needs to be blocked. In pseudo code, an implementation might look like this:
+
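In the same spirit, with park_this_thread and unpark_some_thread standing in for the real blocking system calls (pseudo code, not a usable mutex):

```rust
use std::sync::atomic::{AtomicBool, Ordering};

fn lock(locked: &AtomicBool) {
    while locked
        .compare_exchange(false, true, Ordering::Acquire, Ordering::Relaxed)
        .is_err()
    {
        // Blocking system call: take this thread off the CPU until it is woken up.
        park_this_thread();
    }
}

fn unlock(locked: &AtomicBool) {
    locked.store(false, Ordering::Release);
    // System call: wake one of the threads waiting on this lock, if any.
    unpark_some_thread();
}

// Stubs so that the sketch compiles; in reality these bottom out in
// futex-like syscalls.
fn park_this_thread() {}
fn unpark_some_thread() {}
```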
+
+
The main difference is park_this_thread, a blocking system call.
+It instructs the OS to take current thread off the CPU until it is woken up by an unpark_some_thread call.
+The kernel maintains a queue of threads waiting for a mutex.
The park call enqueues the current thread onto this queue, while unpark dequeues some thread. The park system call returns when the thread is dequeued.
+In the meantime, the thread waits off the CPU.
+
If there are several different mutexes, the kernel needs to maintain several queues.
The address of a lock can be used as a token to identify a specific queue (this is the futex API).
+
System calls are expensive, so production implementations of Mutex usually spin for several iterations before calling into OS, optimistically hoping that the Mutex will be released soon.
+However, the waiting always bottoms out in a syscall.
Because spin locks are so simple and fast, it seems to be a good idea to use them for short-lived critical sections.
+For example, if you only need to increment a couple of integers, should you really bother with complicated syscalls? In the worst case, the other thread will spin just for a couple of iterations…
+
Unfortunately, this logic is flawed!
+A thread can be preempted at any time, including during a short critical section.
+If it is preempted, that means that all other threads will need to spin until the original thread gets its share of CPU again.
And, because a spinning thread looks like a good, busy thread to the OS, the other threads will spin until they exhaust their time quanta, preventing the unlucky thread from getting back on the processor!
+
If this sounds like a series of unfortunate events, don’t worry, it gets even worse. Enter Priority Inversion. Suppose our threads have priorities, and OS tries to schedule high-priority threads over low-priority ones.
+
Now, what happens if the thread that enters a critical section is a low-priority one, but competing threads have high priority?
+It will likely get preempted: there are higher priority threads after all.
And, if the number of cores is smaller than the number of high-priority threads that try to lock the mutex, it likely won't be able to complete the critical section at all: the OS will keep scheduling all the other threads!
But wait! — you would say — we only use spin locks in #[no_std] crates, so there’s no OS to preempt our threads.
+
First, it’s not really true: it’s perfectly fine, and often even desirable, to use #[no_std] crates for usual user-space applications.
+For example, if you write a Rust replacement for a low-level C library, like zlib or openssl, you will probably make the crate #[no_std], so that non-Rust applications can link to it without pulling the whole of the Rust runtime.
+
Second, if there’s really no OS to speak about, and you are on the bare metal (or in the kernel), it gets even worse than priority inversion.
+
On bare metal, we generally don’t worry about thread preemption, but we need to worry about processor interrupts. That is, while the processor is executing some code, it might receive an interrupt from some peripheral device and temporarily switch to the interrupt handler’s code.
+
And here comes the disaster: if the main code is in the middle of the critical section when the interrupt arrives, and if the interrupt handler tries to enter the critical section as well, we get a guaranteed deadlock!
+There’s no OS to switch threads after a time slice expires.
+Here are Linux kernel docs discussing this issue.
Let’s trigger priority inversion!
+Our victim is the getrandom crate.
+I don’t pick on getrandom specifically here: the pattern is pervasive across the ecosystem.
+
The crate uses spinning in the LazyUsize utility type:
+
+
+
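I won't paste the crate's code verbatim; a simplified sketch of the pattern (names and sentinel constants are mine) looks like this:

```rust
use std::sync::atomic::{AtomicUsize, Ordering};

const UNINIT: usize = usize::MAX;
const IN_PROGRESS: usize = usize::MAX - 1;

pub struct LazyUsize(AtomicUsize);

impl LazyUsize {
    pub const fn new() -> LazyUsize {
        LazyUsize(AtomicUsize::new(UNINIT))
    }

    pub fn get_or_init(&self, init: impl FnOnce() -> usize) -> usize {
        match self.0.compare_exchange(UNINIT, IN_PROGRESS, Ordering::Acquire, Ordering::Acquire) {
            // We won the race: run the (potentially slow) initializer.
            Ok(_) => {
                let val = init();
                self.0.store(val, Ordering::Release);
                val
            }
            // Somebody else is initializing right now: spin until they finish.
            Err(mut val) => {
                while val == IN_PROGRESS {
                    std::hint::spin_loop();
                    val = self.0.load(Ordering::Acquire);
                }
                val
            }
        }
    }
}
```
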
There’s a static instance of LazyUsize which caches the file descriptor for /dev/random.
This descriptor is used when calling getrandom — the only function exported by the crate.
+
To trigger priority inversion, we will create 1 + N threads, each of which will call getrandom::getrandom.
+We arrange it so that the first thread has a low priority, and the rest are high priority.
+We stagger threads a little bit so that the first one does the initialization.
+We also make creating the file descriptor slow, so that the first thread gets preempted while in the critical section.
The setup uses a couple of systems programming hacks to make this disaster scenario easy to reproduce.
+To simulate slow /dev/random, we want to intercept the poll syscall getrandom is using to ensure that there’s enough entropy.
+We can use strace to log system calls issued by a program.
+I don’t know if strace can be used to make a syscall run slow (now, once I’ve looked at the website, I see that it can in fact be used to tamper with syscalls, sigh), but we actually don’t need to!
+getrandom does not use the syscall directly; it uses the poll function from libc.
+We can substitute this function by using LD_PRELOAD, but there’s an even simpler way!
+We can trick the static linker into using a function which we define ourselves:
+
+
+
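Something along these lines (a sketch assuming the libc crate; the actual reproduction may differ in details):

```rust
use std::{thread, time::Duration};

// Our own `poll` with C linkage: the linker resolves getrandom's call to libc's
// poll to this function instead. Sleeping simulates a slow /dev/random.
#[no_mangle]
pub extern "C" fn poll(
    fds: *mut libc::pollfd,
    nfds: libc::nfds_t,
    _timeout: libc::c_int,
) -> libc::c_int {
    thread::sleep(Duration::from_millis(500));
    unsafe {
        if !fds.is_null() && nfds > 0 {
            (*fds).revents = libc::POLLIN; // pretend the descriptor is ready
        }
    }
    1
}
```
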
The name of the function accidentally ( :) ) clashes with a well-known POSIX function.
+
However, this alone is not enough.
+getrandom tries to use the getrandom syscall first, and that code path does not use a spin lock.
+We need to fool getrandom into believing that the syscall is not available.
+Our extern "C" trick wouldn’t have worked if getrandom literally used the syscall instruction.
+However, as inline assembly (which you need to issue a syscall manually) is not available on stable Rust, getrandom goes via the syscall function from libc.
+That function we can override with the same trick.
+
However, there’s a wrinkle!
+Traditionally, libc API used errno for error reporting.
+That is, on a failure the function would return a single specific invalid value and set the errno thread-local variable to the specific error code. syscall follows this pattern.
+
The errno interface is cumbersome to use.
+The worst part of errno is that the specification requires it to be a macro, and so you can only really use it from C source code.
+Internally, on Linux the macro calls the __errno_location function to get the thread local, but this is an implementation detail (which we will gladly take advantage of, in this land of reckless systems hacking!). The irony is that the Linux syscall ABI just returns error codes directly, so libc has to do some legwork to adapt them to the awkward errno interface.
+
So, here’s a strong contender for the most cursed function I’ve written so far:
+
+
+
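My reconstruction of it would look roughly like this (again assuming the libc crate; __errno_location is a glibc implementation detail):

```rust
// The real libc `syscall` is variadic; we define a non-variadic version and rely
// on the C calling convention to simply ignore the extra arguments.
#[no_mangle]
pub extern "C" fn syscall(_number: libc::c_long) -> libc::c_long {
    unsafe {
        // "There is no such syscall", says errno.
        *libc::__errno_location() = libc::ENOSYS;
    }
    -1
}
```
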
It makes getrandom believe that there’s no getrandom syscall, which causes it to fall back to the /dev/random implementation.
+
To set thread priorities, we use thread_priority crate, which is a thin wrapper around pthread APIs.
+We will be using real time priorities, which require sudo.
+
And here are the results:
+
+
+
Note that I had to kill the program after two minutes.
+Also note the impressive system time, as well as the load average:
+
+
+
If we patch getrandom to use std::sync::Once instead, we get a much better result:
+
+
+
+
+Note how real is half a second, but user and sys are small.
+That’s because we are waiting for 500 milliseconds in our poll
+
+
+
This is because Once uses OS facilities for blocking, and so OS notices that high priority threads are actually blocked and gives the low priority thread a chance to finish its work.
First, if you only use a spin lock because “it’s faster for small critical sections”, just replace it with a mutex from std or parking_lot.
+They already do a small number of spinning iterations before calling into the kernel, so they are as fast as a spinlock in the best case, and infinitely faster in the worst case.
+
Second, it seems like most problematic uses of spinlocks come from one-time initialization (which is exactly what my once_cell crate helps with). I think it is usually possible to get away without using spinlocks. For example, instead of storing the state itself, the library may just delegate state storing to the user. For getrandom, it can expose two functions:
+
+
+
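In signatures, something like this (hypothetical names and error types, not the crate's actual API):

```rust
pub struct RandomState {
    // For the /dev/random code path this could be a cached file descriptor.
    fd: std::os::unix::io::RawFd,
}

// Performs the (potentially slow, potentially blocking) one-time setup.
pub fn init() -> std::io::Result<RandomState> {
    todo!("open /dev/random, wait for entropy, etc.")
}

// Fills `buf` with random bytes using an already initialized state.
pub fn getrandom(state: &RandomState, buf: &mut [u8]) -> std::io::Result<()> {
    let _ = (state, buf);
    todo!("read from state.fd into buf")
}
```
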
It then becomes the user’s problem to cache RandomState appropriately.
+For example, std may continue using a thread local (src) while rand, with std feature enabled, could use a global variable, protected by Once.
+
Another option, if the state fits into usize and the initializing function is idempotent and relatively quick, is to do a racy initialization:
+
+
+
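A sketch of the idea (the names are made up):

```rust
use std::sync::atomic::{AtomicUsize, Ordering};

// 0 is reserved to mean "not initialized yet".
static CACHE: AtomicUsize = AtomicUsize::new(0);

fn get(init: impl Fn() -> usize) -> usize {
    let mut val = CACHE.load(Ordering::Relaxed);
    if val == 0 {
        // Several threads may race here; because `init` is idempotent and cheap,
        // it is fine for each of them to compute and store an equivalent value.
        val = init();
        CACHE.store(val, Ordering::Relaxed);
    }
    val
}
```
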
Take a second to appreciate the absence of unsafe blocks and cross-core communication in the above example!
+At worst, init will be called as many times as there are cores (EDIT: this is wrong, thanks to /u/pcpthm for pointing this out!).
+
There’s also a nuclear option: parametrize the library by blocking behavior, and allow the user to supply their own synchronization primitive.
+
Third, sometimes you just know that there’s only a single thread in the program, and you might want to use a spinlock just to silence those annoying compiler errors about static mut.
+The primary use case here I think is WASM. A solution for this case is to assume that blocking just doesn’t happen, and panic otherwise. This is what std does for Mutex on WASM, and what is implemented for once_cell in this PR: #82.
(at least on commodity desktop Linux with stock settings)
+
This is a followup to the previous post about spinlocks.
+The gist of the previous post was that spinlocks have some pretty bad worst-case behaviors, and, for that reason, one shouldn’t blindly use a spinlock if using a sleeping mutex or avoiding blocking altogether is cumbersome.
+
In the comments, I was pointed to this interesting article, which made me realize that there’s another misconception:
+
+
Until today, I haven’t benchmarked any mutexes, so I don’t know for sure.
+However, what I know in theory about mutexes and spinlocks makes me doubt this claim, so let’s find out.
I do understand why people might think that way though.
+The simplest mutex just makes lock / unlock syscalls when entering and exiting a critical section, offloading all synchronization to the kernel.
+However, syscalls are slow and so, if the length of critical section is smaller than the length of two syscalls, spinning would be faster.
+
It’s easy to eliminate the syscall on entry in an uncontended state.
+We can try to optimistically CAS lock to the locked state, and call into kernel only if we failed and need to sleep.
+Eliminating the syscall on exit is trickier, and so I think historically many implementations did at least one syscall in practice.
+Thus, mutexes were, in fact, slower than spinlocks in some benchmarks.
+
However, modern mutex implementations avoid all syscalls if there’s no contention.
+The trick is to make the state of the mutex an enum: unlocked, locked with some waiting threads, locked without waiting threads.
+This way, we only need to call into the kernel if there are in fact waiters.
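
A sketch of the state machine (a toy model; the actual futex calls are omitted):

```rust
use std::sync::atomic::{AtomicU8, Ordering};

const UNLOCKED: u8 = 0;
const LOCKED: u8 = 1;         // locked, nobody is waiting
const LOCKED_WAITERS: u8 = 2; // locked, some threads are parked in the kernel

pub struct Mutex {
    state: AtomicU8,
}

impl Mutex {
    pub fn lock(&self) {
        // Uncontended fast path: a single CAS, no syscall.
        if self
            .state
            .compare_exchange(UNLOCKED, LOCKED, Ordering::Acquire, Ordering::Relaxed)
            .is_ok()
        {
            return;
        }
        self.lock_contended(); // set LOCKED_WAITERS and futex-wait, omitted
    }

    pub fn unlock(&self) {
        // Only call into the kernel if somebody is actually parked.
        if self.state.swap(UNLOCKED, Ordering::Release) == LOCKED_WAITERS {
            self.wake_one(); // futex-wake, omitted
        }
    }

    fn lock_contended(&self) {
        unimplemented!()
    }
    fn wake_one(&self) {
        unimplemented!()
    }
}
```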
+
Another historical benefit of spinlocks is that they are smaller in size.
+The state of a spinlock is just a single boolean variable, while for a mutex you also need a queue of waiting threads. But there’s a trick to combat this inefficiency as well.
+We can use the address of the boolean flag as a token to identify the mutex, and store non-empty queues in a side table.
+Note how this also reduces the (worst case) total number of queues from the number of mutexes to the number of threads!
+
So a modern mutex, like the one in WTF::ParkingLot, is a single boolean, which behaves more or less like a spinlock in an uncontended case but doesn’t have pathological behaviors of the spinlock.
Our hypothesis is that mutexes are faster, so we need to pick a workload which favors spinlocks.
+That is, we need to pick a very short critical section, and so we will just be incrementing a counter (1).
+
This is better than doing a dummy lock/unlock.
+At the end of the benchmark, we will assert that the counter is indeed incremented the correct number of times (2).
+This has a number of benefits:
+
+
+This is a nice smoke test which at least makes sure that we haven’t made an off-by-one error anywhere.
+
+
+As we will be benchmarking different implementations, it’s important to verify that they indeed give the same answer! More than once I’ve made some piece of code ten times faster by accidentally eliminating some essential logic :D
+
+
+We can be reasonably sure that the compiler won’t outsmart us and won’t remove empty critical sections.
+
+
+
Now, we can just make all the threads hammer a single global counter, but that would only test a situation of extreme contention.
+We need to structure the benchmark in a way that allows us to vary the contention level.
+
So instead of a single global counter, we will use an array of counters (3).
+Each thread will be incrementing random elements of this array.
+By varying the size of the array, we will be able to control the level of contention.
+To avoid false sharing between neighboring elements of the array we will use crossbeam’s CachePadded.
+To make the benchmark more reproducible, we will vendor a simple PRNG (4), which we seed manually.
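
A sketch of the benchmark skeleton (simplified: it assumes crossbeam_utils for CachePadded and hard-codes std's Mutex, while the real harness is generic over the lock implementation):

```rust
use std::sync::Mutex;
use std::thread;

use crossbeam_utils::CachePadded;

fn run(n_threads: usize, n_locks: usize, n_ops: usize) {
    let counters: Vec<CachePadded<Mutex<u32>>> =
        (0..n_locks).map(|_| CachePadded::new(Mutex::new(0))).collect();

    thread::scope(|scope| {
        for seed in 0..n_threads as u32 {
            let counters = &counters;
            scope.spawn(move || {
                let mut rng = seed + 1; // toy xorshift PRNG, seeded per thread (4)
                for _ in 0..n_ops {
                    rng ^= rng << 13;
                    rng ^= rng >> 17;
                    rng ^= rng << 5;
                    let idx = rng as usize % n_locks;
                    *counters[idx].lock().unwrap() += 1; // the whole critical section (1)
                }
            });
        }
    });

    // Sanity check (2): every increment must be accounted for.
    let total: u32 = counters.iter().map(|c| *c.lock().unwrap()).sum();
    assert_eq!(total as usize, n_threads * n_ops);
}
```
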
We are testing std::sync::Mutex, parking_lot::Mutex, spin::Mutex and a bespoke implementation of spinlock from probablydance article.
+We use 32 threads (on a 4-core/8-hyperthread CPU), and each thread increments some counter 10 000 times.
+We run each benchmark 100 times and compute average, min and max times (we are primarily measuring throughput, so average makes more sense than median this time).
+Finally, we run the whole suite twice, to sanity check that the results are reproducible.
First, we reproduce the result that the variance of spinlocks on Linux with default scheduling settings can be huge:
+
+
+
Note that these are extreme results for 100 runs, where each run does 32 * 10_000 lock operations.
+That is, individual lock/unlock operations probably have an even higher spread.
+
Second, the uncontended case looks like I expected: mutexes and spinlocks are not that different, because they essentially use the same code:
+
+
+
Third, under heavy contention mutexes annihilate spinlocks:
+
+
+
Now, this is the opposite of what I would naively expect.
+Even in the heavily contended state, the critical section is still extremely short, so, for each thread, the most efficient strategy seems to be to spin for a couple of iterations.
+
But I think I can explain why mutexes are so much better in this case.
+One reason is that with spinlocks a thread can get unlucky and be preempted in the critical section.
+The other more important reason is that, at any given moment in time, there are many threads trying to enter the same critical section.
+With spinlocks, all cores can be occupied by threads that compete for the same lock.
+With mutexes, there is a queue of sleeping threads for each lock, and the kernel generally tries to make sure that only one thread from the group is awake.
+
This is a funny example of a mechanical race to the bottom. Due to the short length of the critical section, each individual thread would spend fewer CPU cycles in total if it were spinning, but this increases the overall cost.
+
EDIT: simpler and more plausible explanation from the author of Rust’s parking lot is that it does exponential backoff when spinning, unlike the two spinlock implementations.
+
Fourth, even under heavy contention spin locks can luck out and finish almost as fast as mutexes:
+
+
+
This again shows that a good mutex is roughly equivalent to a spinlock in the best case.
+
Fifth, the amount of contention required to disrupt spinlocks seems to be small. Even if 32 threads compete for 1 000 locks, spinlocks are still considerably slower:
+
+
+
EDIT: someone on Reddit noticed that the number of threads is significantly higher than the number of cores, which is an unfortunate situation for spinlocks.
+And, although the number of threads in the benchmark is configurable, it never occurred to me to actually vary it 😅!
+Lowering the number of threads to four gives a picture similar to the “no contention” situation above: spinlocks are slightly, but not massively, faster.
+Which makes total sense: as there are more cores than threads, there’s no harm in spinning.
+And, if you can carefully architect your application such that it runs a small fixed number of threads, ideally pinned to specific CPUs (like in the seastar architecture), using spinlocks might make sense!
As usual, each benchmark exercises only a narrow slice from the space of possible configurations, so it would be wrong to draw a sweeping conclusion that mutexes are always faster.
+For example, if you are in a situation where preemption is impossible (interrupts are disabled, cooperative multitasking, realtime scheduling, etc), spinlocks might be better (or even the only!) choice.
+And there’s also a chance the benchmark doesn’t measure what I think it measures :-)
+
But I find this particular benchmark convincing enough to disprove that “spinlocks are faster than mutexes for short critical sections”.
+In particular, I find enlightening the qualitative observation that, under contention, mutexes allow for better scheduling even when critical sections are short and are not preempted in the middle.
+Efficient Userspace Optimistic Spinning Locks — a presentation about making fast-path spinning in futex-based locks even more efficient.
+The main problem with optimistic spinning is how much of it you want (that is, tweaking the number-of-iterations parameter).
+The proposal solves this in an ingenious self-tweaking way (with the help of the kernel): we spin until the holder of the lock itself goes to sleep.
+
Rust is my favorite programming language (other languages I enjoy are Kotlin and Python).
+In this post I want to explain why I, somewhat irrationally, find this language so compelling.
+The post does not try to explain why Rust is the most loved language according to
+StackOverflow survey :-)
+
Additionally, this post does not cover the actual good reasons why one might want to use Rust.
+Briefly:
+
+
+If you use C++ or C, Rust allows you to get roughly the same binary, but with compile-time guaranteed absence of undefined behavior.
+This is a big deal and the reason why Rust exists.
+
+
+If you use a statically typed managed language (Java, C#, Go, etc), the benefit of Rust is a massive simplification of multithreaded programming: data races are eliminated at compile time.
+Additionally, you get the benefits of a lower level language (less RAM, less CPU, direct access to platform libraries) without paying as much cost as you would with C++.
+This is not free: you’ll pay with compile times and cognitive complexity, but it would be “why my code does not compile” complexity, rather than “why my heap is corrupted” complexity.
+
+
+
If you’d like to hear more about the above, this post will disappoint you :-)
The reason why I irrationally like Rust is that it, subjectively, gets a lot of small details just right (or at least better than other languages I know).
+The rest of the post would be a laundry list of those things, but first I’d love to mention why I think Rust is the way it is.
+
First, it is a relatively young language, so it can have many “obviously good” things.
+For example, I feel like there’s a general consensus now that, by default, local variables should not be reassignable.
+This probably was much less obvious in the 90s, when today’s mainstream languages were designed.
+
Second, it does not try to maintain source/semantic compatibility with any existing language.
+Even if we think that const by default is a good idea, we can’t employ it in TypeScript, because it needs to stay compatible with JavaScript.
+
Third, (and this is a pure speculation on my part) I feel that the initial bunch of people who designed the language and its design principles just had an excellent taste!
To set the right mood for the rest of the discussion, let me start with claiming that snake_case is more readable than camelCase :-)
+Similarly, XmlRpcRequest is better than XMLRPCRequest.
+
I believe that readability is partially a matter of habit.
+But it also seems logical that _ is better at separating words than case change or nothing at all.
+And, subjectively, after writing a bunch of camelCase and snake_case, I much prefer _.
How would you Ctrl+F the definition of foo function in a Java file on GitHub?
+Probably just foo(, which would give you both the definition and all the calls.
+In Rust, you’d search for fn foo.
+In general, every construct is introduced by a leading keyword, which makes the code much easier for a human to read.
+When I read C++, I always have a hard time distinguishing field declarations from method declarations: they start the same.
+Leading keywords also make it easier to do stupid text searches for things.
+If you don’t find this argument compelling because “one should just use an IDE to look for methods”, well, it actually makes implementing an IDE slightly easier as well:
+
+
+Parsing has a nice LL(1) vibe to it, you just dispatch on the current token.
+
+
+Parser resilience is easy, you can synchronize on leading keywords like fn, struct etc.
+
+
+It’s easier for the IDE to guess the intention of a user.
+If you type fn, IDE recognizes that you want to add a new function and can, for example, complete function overrides for you.
+
C-family languages usually use Type name order.
+Languages with type inference, including Rust, usually go for name: Type.
+Technically, this is more convenient because in a recursive descent parser it’s easier to make the second part optional.
+It’s also more readable, because you put the most important part, the name, first.
+Because names are usually more uniform in length than types, groups of fields/local variables align better.
Many languages use if (condition) { then_branch } syntax, where parenthesis around condition are mandatory, and braces around then_branch are optional.
+Rust does the opposite, which has the following benefits:
+
+
+There’s no need for a special rule to associate else with just the right if. Instead, else if is an indivisible unambiguous bit of syntax.
+
+
+The goto fail; bug is impossible; more generally, you don’t have to decide whether it is OK to omit the braces.
+
I think “everything is an expression” is generally a good idea, because it makes things composable.
+Just the other day I tried to handle null in TypeScript in a Kotlin way, with foo() ?? return false, and failed because return is not an expression.
+
The problem with the traditional functional (Haskell/OCaml) approach is that it uses let name = expr in expression for introducing new variables, which just feels bulky.
+Specifically, the closing in keyword feels verbose, and also emphasizes the nesting of expression.
+The nesting is undoubtedly there, but usually it is very boring, and calling it out is not very helpful.
+
Rust doesn’t have a let expression per se; instead, it has flat-feeling blocks which can contain many let statements:
+
+
+
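For example (my own tiny illustration):

```rust
fn main() {
    let total = {
        let width = 10;
        let height = 20;
        width * height // a block is an expression: it evaluates to its last expression
    };
    assert_eq!(total, 200);
}
```
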
This gives, subjectively, a lighter-weight syntax for introducing bindings and side-effecting statements, as well as an ability to nicely scope local variables to sub-blocks!
In Rust, reassignable variables are declared with let mut and non-reassignable with let.
+Note how the rarer option is more verbose, and how it is expressed as a modifier, and not a separate keyword, like let and const.
In Rust, enums (sum types, algebraic data types) are namespaced.
+
You declare enums like this:
+
+
+
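A hypothetical declaration (my reconstruction, based on the Haskell comparison below):

```rust
enum Expr {
    Int(i32),
    Bool(bool),
    Sum(Box<Expr>, Box<Expr>),
}
```
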
And use them like Expr::Int, without worrying that it might collide with
+
+
+
No more repetitive data Expr = ExprInt Int | ExprBool Bool | ExprSum Expr Expr!
+
+Swift does an even nicer trick here, by using the .VariantName syntax to refer to a namespaced enum (docs).
+This makes matching less verbose and completely dodges the sad Rust ambiguity between constants and bindings:
Fields and methods are declared in separate blocks (like in Go):
+
+
+
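For example:

```rust
struct Rectangle {
    width: u32,
    height: u32,
}

impl Rectangle {
    fn area(&self) -> u32 {
        self.width * self.height
    }

    fn is_square(&self) -> bool {
        self.width == self.height
    }
}
```
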
This is a huge improvement to readability: there are usually far fewer fields than methods, but by looking at the fields you can usually understand which set of methods can exist.
u32 and i64 are shorter and clearer than unsigned int or long.
+usize and isize cover the most important use case for arch-dependent integer type, and also make it clearer at the type level which things are addresses/indices, and which are quantities.
+There’s also no question of how integer literals of various types look: it’s just 1i8 or 92u64.
+
Overflow during arithmetic operations is considered a bug: it traps in debug builds and wraps in release builds.
+However, there’s a plethora of methods like wrapping_add, saturating_sub, etc, so you can exactly specify behavior on overflow in specific cases where it is not a bug.
+In general, methods on primitives allow exposing a ton of compiler intrinsics in a systematic way, like u64::count_ones.
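
A few examples of what this looks like in practice:

```rust
fn main() {
    let x: u8 = 250;
    assert_eq!(x.wrapping_add(10), 4);      // explicitly wrap on overflow
    assert_eq!(x.saturating_add(10), 255);  // clamp at the maximum value
    assert_eq!(x.checked_add(10), None);    // report overflow as an Option
    assert_eq!(92u64.count_ones(), 4);      // a compiler intrinsic behind a method
}
```
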
Rust uses control flow analysis to check that every local variable is assigned before the first use.
+This is a much better default than making this UB, or initializing all locals to some default value.
+Additionally, Rust has first-class support for diverging control flow (the ! type and the loop {} construct), which protects it from at-a-distance changes like
+this example
+from Java.
+
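A small illustration:

```rust
fn describe(condition: bool) -> &'static str {
    let label: &'static str;
    if condition {
        label = "yes";
    } else {
        label = "no";
    }
    // Fine: the compiler proved that `label` is assigned on every path before use.
    // Drop the `else` branch and this becomes a compile-time error,
    // not undefined behavior and not a silent default value.
    label
}
```
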
Definitive initialization analysis is an interesting example of a language feature which requires relatively high-brow implementation techniques, but whose effects seem very intuitive, almost trivial, to the users of the language.
Rust libraries (“crates”) don’t have names.
+More generally, Rust doesn’t have any kind of global shared namespace.
+
This is in contrast to languages which have a concept of library path (PYTHONPATH, classpath, -I).
+If you have a library path, you are exposed to name/symbol clashes between libraries.
+While a name clash between two libraries seems pretty unlikely, there’s a special case where collision happens regularly.
+One of your dependencies can depend on libfoo v1, and another one on libfoo v2.
+Usually this means that you either can’t use the two libraries together, or need to implement some pretty horrific workarounds.
+
In Rust the name you use for a library is a property of the dependency edge between upstream and downstream crate.
+That is, a single crate can be known under different names in different dependent crates or, vice versa, two different crates might be known under equal names in different parts of the crate graph!
+This (and semver discipline, which is a social thing) is the reason why Cargo doesn’t suffer from dependency hell as much as some other ecosystems.
Related to the previous point, crates are also an important visibility boundary, which allows you to clearly delineate the public API of a library from implementation details.
+This is a major improvement over class-level visibility controls.
+
It’s interesting though that it took Rust two tries to get first-class “exported from the library” (pub) and “internal to the library” (pub(crate)) visibilities.
+That is also the reason why the more restrictive pub(crate) is unfortunately longer to write; I wish we used pub and pub* instead.
+
Before 2018 edition, Rust had a simpler and more orthogonal system, where you can only say “visible in the parent”, which happens to be “exported” if the parent is root or is itself exported.
+But the old system is less convenient in practice, because you can’t look at the declaration and immediately say if it is a part of the crate’s public API or not.
+
The next language should use these library-level visibilities from the start.
The canonical comparison function returns an enum Ordering { Less, Equal, Greater }, you don’t need to override all six comparison operators.
+Rust also manages this without introducing a separate <=> spaceship operator just for this purpose.
+And you still can implement fast path for == / != checks.
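
A sketch of what this looks like for a custom type:

```rust
use std::cmp::Ordering;

#[derive(PartialEq, Eq)]
struct Version {
    major: u32,
    minor: u32,
}

impl Ord for Version {
    fn cmp(&self, other: &Self) -> Ordering {
        // One function returning Less / Equal / Greater covers <, <=, >, >=.
        self.major
            .cmp(&other.major)
            .then(self.minor.cmp(&other.minor))
    }
}

impl PartialOrd for Version {
    fn partial_cmp(&self, other: &Self) -> Option<Ordering> {
        Some(self.cmp(other))
    }
}
```
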
Rust defines two ways to turn something into a string: Display, which is intended for user-visible strings, and Debug, which is generally intended for printf debugging.
+This is similar to Python’s __str__ and __repr__.
+
Unlike Python, the compiler derives Debug for you.
+Being able to inspect all data structures is a huge productivity boost.
+I hope some day we’ll be able to call custom user-provided Debug from a debugger.
+
A nice bonus is that you can debug-print things in two modes:
+
+
+compactly on a single-line
+
+
+verbosely, on multiple lines as an indented tree
+
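For example:

```rust
#[derive(Debug)]
struct Point {
    x: i32,
    y: i32,
}

fn main() {
    let p = Point { x: 1, y: 2 };
    println!("{:?}", p);  // compact: Point { x: 1, y: 2 }
    println!("{:#?}", p); // verbose: an indented, multi-line tree
}
```
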
Strings are represented as utf-8 byte buffers.
+The encoding is fixed, can’t be changed, and its validity is enforced.
+There’s no random access to “characters”, but you can slice a string with a byte index, provided that it doesn’t fall in the middle of a multi-byte character.
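
For example:

```rust
fn main() {
    let s = "été";
    assert_eq!(s.len(), 5);     // length in bytes, not in characters
    assert_eq!(&s[0..3], "ét"); // slicing at a character boundary is fine
    // &s[0..1] would panic: byte 1 is in the middle of a multi-byte character.
}
```
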
This post describes a simple technique for writing interners in Rust which I haven’t seen documented before.
+
String interning is a classical optimization when you have to deal with many equal strings.
+The canonical example would be a compiler: most identifiers in a program are repeated several times.
+
Interning works by ensuring that there’s only one canonical copy of each distinct string in memory.
+It can give the following benefits:
+
+
+Less memory allocated to hold strings.
+
+
+If all strings are canonicalized, comparison can be done in O(1) (instead of O(n)) by using pointer equality.
+
+
+Interned strings themselves can be represented with an index (typically u32) instead of a (ptr, len) pair.
+This makes data structures which embed strings more compact.
+
+
+
The simplest possible interner in Rust could look like this:
+
+
+
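Roughly like this (a sketch along the lines of the original snippet):

```rust
use std::collections::HashMap;

#[derive(Default)]
pub struct Interner {
    map: HashMap<String, u32>,
    vec: Vec<String>,
}

impl Interner {
    pub fn intern(&mut self, name: &str) -> u32 {
        if let Some(&idx) = self.map.get(name) {
            return idx;
        }
        let idx = self.vec.len() as u32;
        self.map.insert(name.to_owned(), idx);
        self.vec.push(name.to_owned());
        idx
    }

    pub fn lookup(&self, idx: u32) -> &str {
        self.vec[idx as usize].as_str()
    }
}
```
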
To remove duplicates, we store strings in a HashMap.
+To map from an index back to the string, we also store strings in a Vec.
+
I didn’t quite like this solution yesterday, for two reasons:
+
+
+It allocates a lot — each interned string is two separate allocations.
+
+
+Using a HashMap feels like cheating, surely there should be a better, more classical data structure!
+
+
+
So I’ve spent a part of the evening cobbling together a non-allocating trie-based interner.
+The result: the trie does indeed asymptotically reduce the number of allocations, from O(n) to O(log(n)).
+Unfortunately, it is slower, larger and way more complex than the above snippet.
+Minimizing allocations is important, but allocators are pretty fast, and that shouldn’t be done at the expense of everything else.
+Also, Rust HashMap (implemented by @Amanieu based on Swiss Table) is fast.
+
+
+For the curious, the Trie design I've used
+
The trie is built on a per-byte basis (each node has at most 256 children).
+Each internal node is marked with a single byte.
+Leaf nodes are marked with substrings, so that only the common prefix requires a node per byte.
+
To avoid allocating individual interned strings, we store them in a single long String.
+An interned string is represented by a Span (pair of indexes) inside the big buffer.
+
The trie itself is a tree structure, and we can use a standard trick of packing its nodes into an array and using indexes to avoid allocating every node separately.
+However, nodes themselves can be of varying size, as each node can have a different number of children.
+We can still array-allocate them, by rolling our own mini-allocator (using a segregated free list)!
+
Node’s children are represented as a sorted array of links.
+We use binary search for indexing and simple linear shift insertion.
+With at most 256 children per node, it shouldn’t be that bad.
+Additionally, we pre-allocate 256 nodes and use array indexing for the first transition.
+
Links are organized in layers.
+The layer n stores a number of [Link] chunks of length 2^n (in a single contiguous array).
+Each chunk represents the links for a single node (with possibly some extra capacity).
+A node can find its chunk because it knows its number of links (which determines the layer) and the first link in the layer.
+A new link for the node is added to the current chunk if there’s space.
+If the chunk is full, it is copied to a chunk twice as big first.
+The old chunk is then added to the list of free chunks for reuse.
+
Here’s the whole definition of the data structure:
+
+
+
Isn’t it incredibly cool that you can look only at the fields and understand how the thing works,
+without even seeing the remaining 150 lines of the relatively tricky implementation?
+
+
+
However, implementing a trie made me realize that there’s a simple optimization we can apply to our naive interner to get rid of extra allocations.
+In the trie, I concatenate all interned strings into one giant String and use (u32, u32) index pairs as an internal representation of string slice.
+
If we translate this idea to our naive interner, we get:
+
+
+
The problem here is that we can’t actually write implementations of Eq and Hash for Span to make this work.
+In theory, this is possible: to compare two Spans, you resolve them to &str via buf, and then compare the strings.
+However, the HashMap API does not allow expressing this idea.
+Moreover, even if HashMap allowed supplying a key closure at construction time, it wouldn’t help!
+
+
+
Such API would run afoul of the borrow checker.
+The key_fn would have to borrow from the same struct.
+What would work is supplying a key_fn at call-site for every HashMap operation, but that would hurt ergonomics and ease of use a lot.
+This exact problem requires
+slightly unusual
+design of lazy values in Rust.
+
+
However, with a bit of unsafe, we can make something similar work.
+The trick is to add strings to buf in such a way that they are never moved, even if more strings are added on top.
+That way, we can just store &str in the HashMap.
+To achieve address stability, we use another trick from the typed_arena crate.
+If the buf is full (so that adding a new string would invalidate old pointers), we allocate a new buffer, twice as large,
+without copying the contents of the old one.
+
Here’s the full implementation:
+
+
+
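A sketch of it (reconstructed from the description above, so treat it as illustrative rather than authoritative):

```rust
use std::{collections::HashMap, mem};

pub struct Interner {
    map: HashMap<&'static str, u32>,
    vec: Vec<&'static str>,
    buf: String,
    full: Vec<String>,
}

impl Interner {
    pub fn with_capacity(cap: usize) -> Interner {
        let cap = cap.next_power_of_two();
        Interner {
            map: HashMap::default(),
            vec: Vec::new(),
            buf: String::with_capacity(cap),
            full: Vec::new(),
        }
    }

    pub fn intern(&mut self, name: &str) -> u32 {
        if let Some(&id) = self.map.get(name) {
            return id;
        }
        let name = unsafe { self.alloc(name) };
        let id = self.map.len() as u32;
        self.map.insert(name, id);
        self.vec.push(name);
        id
    }

    pub fn lookup(&self, id: u32) -> &str {
        self.vec[id as usize]
    }

    // The `'static` lifetime is a lie we keep private: the returned reference is
    // only valid while `self` is alive, which `lookup` enforces by shortening
    // the lifetime back to `&self`.
    unsafe fn alloc(&mut self, name: &str) -> &'static str {
        let cap = self.buf.capacity();
        if cap < self.buf.len() + name.len() {
            // Start a new, larger buffer instead of growing in place, so that
            // previously handed-out references are never invalidated.
            let new_cap = (cap.max(name.len()) + 1).next_power_of_two();
            let new_buf = String::with_capacity(new_cap);
            let old_buf = mem::replace(&mut self.buf, new_buf);
            self.full.push(old_buf);
        }

        let interned = {
            let start = self.buf.len();
            self.buf.push_str(name); // guaranteed not to reallocate
            &self.buf[start..]
        };

        &*(interned as *const str)
    }
}
```
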
The precise rule for increasing capacity is slightly more complicated:
+
+
+
Just doubling won’t be enough; we also need to make sure that the new string actually fits.
+
We could have used a single bufs: Vec<String> in place of both buf and full.
+The benefit of splitting the last buffer into a dedicated field is that we statically guarantee that there’s at least one buffer.
+That way, we avoid a bounds check and/or .unwrap when accessing the active buffer.
+
We also use &'static str to fake interior references.
+Miri (Rust’s in-progress UB checker) is not entirely happy about this.
+I haven’t dug into this yet, it might be another instance of
+rust-lang/rust#61114.
+To be on the safe side, we can use *const str instead, with a bit of boilerplate to delegate PartialEq and Hash.
+Some kind of (hypothetical) 'unsafe lifetime could also be useful here!
+The critical detail that makes our use of fake 'static sound here is that the alloc function is private.
+The public lookup function shortens the lifetime to that of &self (via lifetime elision).
+
For the real implementation, I would change two things:
+
+
+
Use rustc_hash::FxHashMap.
+It’s a standard Rust HashMap with a faster (but not DoS-resistant) hash function — FxHash.
+Fx stands for Firefox, this is a modification of FNV hash originally used in the browser.
+
+
+
Add a newtype wrapper for string indexes:
+
+
+
+
+
That’s all I have to say about fast and simple string interning in Rust!
+Discussion on /r/rust.
Welcome to my article about Pratt parsing — the monad tutorial of syntactic analysis.
+The number of Pratt parsing articles is so large that there exists a survey post :)
+
The goals of this particular article are:
+
+
+Raising an issue that the so-called left-recursion problem is overstated.
+
+
+Complaining about inadequacy of BNF for representing infix expressions.
+
+
+Providing a description and implementation of Pratt parsing algorithm which sticks to the core and doesn’t introduce a DSL-y abstraction.
+
+
+Understanding the algorithm myself for hopefully the last time. I’ve
+implemented
+a production-grade Pratt parser once, but I no longer immediately understand that code :-)
+
+
+
This post assumes a fair bit of familiarity with parsing techniques, and, for example, does not explain what a context free grammar is.
The pinnacle of syntactic analysis theory is discovering the context free grammar
+notation (often using BNF concrete syntax) for decoding linear structures into trees:
+
+
+
I remember being fascinated by this idea, especially by parallels with natural language sentence structure.
+However, my optimism quickly waned once we got to describing expressions.
+The natural expression grammar indeed allows one to see what is an expression.
+
+
+
Although this grammar looks great, it is in fact ambiguous and imprecise, and needs to be rewritten to be amenable to automated parser generation.
+Specifically, we need to specify precedence and associativity of operators.
+The fixed grammar looks like this:
+
+
+
To me, the “shape” of expressions feels completely lost in this new formulation.
+Moreover, it took me three or four courses in formal languages before I was able to reliably create this grammar myself.
+
And that’s why I love Pratt parsing — it is an enhancement of recursive descent parsing algorithm, which uses the natural terminology of precedence and associativity for parsing expressions, instead of grammar obfuscation techniques.
The simplest technique for hand-writing a parser is recursive descent, which
+models the grammar as a set of mutually recursive functions. For example, the
+above item grammar fragment can look like this:
+
+
+
Traditionally, text-books point out left-recursive grammars as the Achilles heel
+of this approach, and use this drawback to motivate more advanced LR parsing
+techniques. An example of problematic grammar can look like this:
+
+
+
Indeed, if we naively code the sum function, it wouldn’t be too useful:
+
+
+
+
+At this point we immediately loop and overflow the stack
+
+
+
A theoretical fix to the problem involves rewriting the grammar to eliminate the left recursion.
+However, in practice, for a hand-written parser, the solution is much simpler — breaking away from the purely recursive paradigm and using a loop:
I have a confession to make: I am always confused by “high precedence” and “low precedence”. In a + b * c, addition has a lower precedence, but it is at the top of the parse tree…
+
So instead, I find thinking in terms of binding power more intuitive.
+
+
+
The * is stronger, it has more power to hold together B and C, and so the expression is parsed as
+A + (B * C).
+
What about associativity though? In A + B + C all operators seem to have the same power, and it is unclear which + to fold first.
+But this can also be modelled with power, if we make it slightly asymmetric:
+
+
+
Here, we pumped the right power of + just a little bit, so that it holds the right operand tighter.
+We also added zeros at both ends, as there are no operators to bind from the sides.
+Here, the first (and only the first) + holds both of its arguments tighter than the neighbors, so we can reduce it:
+
+
+
Now we can fold the second plus and get (A + B) + C.
+Or, in terms of the syntax tree, the second + really likes its right operand more than the left one, so it rushes to get hold of C.
+While it does that, the first + captures both A and B, as they are uncontested.
+
What Pratt parsing does is find these badass, stronger-than-their-neighbors operators by processing the string left to right.
+We are almost at a point where we finally start writing some code, but let’s first look at the other running example.
+We will use function composition operator, . (dot) as a right associative operator with a high binding power.
+That is, f . g . h is parsed as f . (g . h), or, in terms of power
We will be parsing expressions where basic atoms are single character numbers and variables, and which uses punctuation for operators.
+Let’s define a simple tokenizer:
+
+
+
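Something along these lines (close in spirit to the original, with single-character atoms and operators):

```rust
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum Token {
    Atom(char),
    Op(char),
    Eof,
}

struct Lexer {
    tokens: Vec<Token>,
}

impl Lexer {
    fn new(input: &str) -> Lexer {
        let mut tokens = input
            .chars()
            .filter(|it| !it.is_ascii_whitespace())
            .map(|c| match c {
                '0'..='9' | 'a'..='z' | 'A'..='Z' => Token::Atom(c),
                _ => Token::Op(c),
            })
            .collect::<Vec<_>>();
        tokens.reverse(); // so that `pop` yields tokens left to right
        Lexer { tokens }
    }

    fn next(&mut self) -> Token {
        self.tokens.pop().unwrap_or(Token::Eof)
    }
    fn peek(&mut self) -> Token {
        self.tokens.last().copied().unwrap_or(Token::Eof)
    }
}
```
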
To make sure that we got the precedence binding power correctly, we will be transforming infix expressions into a gold-standard (not so popular in Poland, for whatever reason) unambiguous notation — S-expressions:
+1 + 2 * 3 == (+ 1 (* 2 3)).
+
+
+
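The S-expression type and its printer might look like this:

```rust
use std::fmt;

enum S {
    Atom(char),
    Cons(char, Vec<S>),
}

impl fmt::Display for S {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        match self {
            S::Atom(i) => write!(f, "{}", i),
            S::Cons(head, rest) => {
                write!(f, "({}", head)?;
                for s in rest {
                    write!(f, " {}", s)?;
                }
                write!(f, ")")
            }
        }
    }
}
```
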
And let’s start with just this: expressions with atoms and two infix binary operators, + and *:
+
+
+
So, the general approach is roughly the one we used to deal with left recursion — start by parsing the first number, and then loop, consuming operators and doing … something?
+
+
+
+
+Note that we already can parse this simple test!
+
+
+
We want to use this power idea, so let’s compute both left and right powers of the operator.
+We’ll use u8 to represent power, so, for associativity, we’ll add 1.
+And we’ll reserve the 0 power for the end of input, so the lowest power operator can have is 1.
+
+
+
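A sketch of such a function:

```rust
fn infix_binding_power(op: char) -> (u8, u8) {
    match op {
        '+' | '-' => (1, 2), // left associative: the right side binds a bit tighter
        '*' | '/' => (3, 4),
        _ => panic!("bad op: {:?}", op),
    }
}
```
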
And now comes the tricky bit, where we introduce recursion into the picture.
+Let’s think about this example (with powers below):
+
+
+
The cursor is at the first +, we know that the left bp is 1 and the right one is 2.
+The lhs stores a.
+The next operator after + is *, so we shouldn’t add b to a.
+The problem is that we haven’t yet seen the next operator, we are just past +.
+Can we add a lookahead?
+Looks like no — we’d have to look past all of b, c and d to find the next operator with lower binding power, which sounds pretty unbounded.
+But we are onto something!
+Our current right priority is 2, and, to be able to fold the expression, we need to find the next operator with lower priority.
+So let’s recursively call expr_bp starting at b, but also tell it to stop as soon as bp drops below 2.
+This necessitates the addition of min_bp argument to the main function.
+
+And lo, we have a fully functioning minimal Pratt parser:
+
+
+
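My rendition of the core, building on the Lexer and S types sketched above (the remarks below refer to the corresponding places in the code):

```rust
fn expr(input: &str) -> S {
    let mut lexer = Lexer::new(input);
    // Start the recursion with the lowest possible binding power: there is
    // no operator to the left of the whole expression.
    expr_bp(&mut lexer, 0)
}

fn expr_bp(lexer: &mut Lexer, min_bp: u8) -> S {
    let mut lhs = match lexer.next() {
        Token::Atom(it) => S::Atom(it),
        t => panic!("bad token: {:?}", t),
    };

    loop {
        let op = match lexer.peek() {
            Token::Eof => break,
            Token::Op(op) => op,
            t => panic!("bad token: {:?}", t),
        };

        let (l_bp, r_bp) = infix_binding_power(op);
        if l_bp < min_bp {
            break; // the operator binds weaker than the one to our left: stop
        }

        lexer.next(); // bump past the operator itself
        // Recursively parse everything that binds tighter than this operator.
        let rhs = expr_bp(lexer, r_bp);

        lhs = S::Cons(op, vec![lhs, rhs]);
    }

    lhs
}
```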
+
+min_bp argument is the crucial addition. expr_bp now parses expressions with relatively high binding power. As soon as it sees something weaker than min_bp, it stops.
+
+
+This is the “it stops” point.
+
+
+And here we bump past the operator itself and make the recursive call.
+Note how we use l_bp to check against min_bp, and r_bp as the new min_bp of the recursive call.
+So, you can think about min_bp as the binding power of the operator to the left of the current expression.
+
+
+Finally, after parsing the correct right hand side, we assemble the new current expression.
+
+
+To start the recursion, we use binding power of zero.
+Remember, at the beginning the binding power of the operator to the left is the lowest possible, zero, as there’s no actual operator there.
+
+
+
So, yup, these 40 lines are the Pratt parsing algorithm.
+They are tricky, but, if you understand them, everything else is straightforward additions.
Now let’s add all kinds of weird expressions to show the power and flexibility of the algorithm.
+First, let’s add a high-priority, right associative function composition operator: .:
+
+
+
Yup, it’s a single line!
+Note how the left side of the operator binds tighter, which gives us desired right associativity:
+
+
+
Now, let’s add unary -, which binds tighter than binary arithmetic operators, but less tight than composition.
+This requires changes to how we start our loop, as we no longer can assume that the first token is an atom, and need to handle minus as well.
+But let the types drive us.
+First, we start with binding powers.
+As this is a unary operator, it really only has a right binding power, so, ahem, let’s just code this:
+
+
+
+
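A sketch, matching the priorities discussed just below:

```rust
fn prefix_binding_power(op: char) -> ((), u8) {
    match op {
        '+' | '-' => ((), 5),
        _ => panic!("bad op: {:?}", op),
    }
}
```
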
+Here, we return a dummy () to make it clear that this is a prefix, and not a postfix operator, and thus can only bind things to the right.
+
+
+Note, as we want to add unary - between . and *, we need to shift priorities of . by two.
+The general rule is that we use an odd priority as base, and bump it by one for associativity, if the operator is binary. For unary minus it doesn’t matter and we could have used either 5 or 6, but sticking to odd is more consistent.
+
+
+
Plugging this into expr_bp, we get:
+
+
+
Now, we only have r_bp and not l_bp, so let’s just copy-paste half of the code from the main loop?
+Remember, we use r_bp for recursive calls.
+
+
+
Amusingly, this purely mechanical, type-driven transformation works.
+You can also reason why it works, of course.
+The same argument applies; after we’ve consumed a prefix operator, the operand consists of operators that bind tighter, and we just so conveniently happen to have a function which can parse expressions tighter than the specified power.
+
Ok, this is getting stupid.
+If using ((), u8) “just worked” for prefix operators, can (u8, ()) deal with postfix ones?
+Well, let’s add ! for factorials. It should bind tighter than -, because -(92!) is obviously more useful than (-92)!.
+So, the familiar drill — new priority function, shifting priority of . (this bit is annoying in Pratt parsers), copy-pasting the code…
+
+
+
Wait, something’s wrong here.
+After we’ve parsed the prefix expression, we can see either a postfix or an infix operator.
+But we bail on unrecognized operators, which is not going to work…
+So, let’s make postfix_binding_power return an Option, for the case where the operator is not postfix:
+
+
+
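Roughly like this:

```rust
fn postfix_binding_power(op: char) -> Option<(u8, ())> {
    let res = match op {
        '!' => (11, ()),
        _ => return None,
    };
    Some(res)
}
```
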
Amusingly, both the old and the new tests pass.
+
Now, we are ready to add a new kind of expression: parenthesised expression.
+It is actually not that hard, and we could have done it from the start, but it makes sense to handle this here; you’ll see in a moment why.
+Parens are just primary expressions and are handled similarly to atoms:
+
+
+
Unfortunately, the following test fails:
+
+
+
The panic comes from the loop below — the only termination condition we have is reaching eof, and ) is definitely not eof.
+The easiest way to fix that is to change infix_binding_power to return None on unrecognized operators.
+That way, it’ll become similar to postfix_binding_power again!
+
+
+
And now let’s add array indexing operator: a[i].
+What kind of -fix is it?
+Around-fix?
+If it were just a[], it would clearly be postfix.
+If it were just [i], it would work exactly like parens.
+And that is the key: the i part doesn’t really participate in the whole power game, as it is unambiguously delimited. So, let’s do this:
+
+
+
+
+Note that we use the same priority for ! as for [.
+In general, for the correctness of our algorithm it’s pretty important that, when we make decisions, priorities are never equal.
+Otherwise, we might end up in a situation like the one before the tiny adjustment for associativity, where there were two equally-good candidates for reduction.
+However, we only compare right bp with left bp!
+So for two postfix operators it’s OK to have priorities the same, as they are both right.
+
+
+
Finally, the ultimate boss of all operators, the dreaded ternary:
+
+
+
Is this … an all-over-the-place-fix operator?
+Well, let’s change the syntax of ternary slightly:
+
+
+
And let’s recall that a[i] turned out to be a postfix operator + parenthesis…
+So, yeah, ? and : are actually a weird pair of parens!
+And let’s handle them as such!
+Now, what about priority and associativity?
+What associativity even is in this case?
+
+
+
To figure it out, we just squash the parens part:
+
+
+
This can be parsed as
+
+
+
or as
+
+
+
What is more useful?
+For ?-chains like this:
+
+
+
the right-associative reading is more useful.
+Priority-wise, the ternary is low priority.
+In C, only = and , have lower priority.
+While we are at it, let’s add C-style right associative = as well.
+
Here’s the most complete and perfect version of our simple Pratt parser:
This is a sequel to the previous post about Pratt parsing.
+Here, we’ll study the relationship between top-down operator precedence (Pratt parsing) and the more famous shunting yard algorithm.
+Spoiler: they are the same algorithm, the difference is implementation style with recursion (Pratt) or a manual stack (Dijkstra).
+
Unlike the previous educational post, this one is going to be an excruciatingly boring pile of technicalities — we’ll just slowly and mechanically refactor our way to victory.
+Specifically,
+
+
+We start with refactoring Pratt parser to minimize control flow variations.
+
+
+Then, having arrived at the code with only one return and only one recursive call, we replace recursion with an explicit stack.
+
+
+Finally, we streamline control in the iterative version.
+
+
+At this point, we have a bona fide shunting yard algorithm.
+
+
+
To further reveal the connection, we then verify that the original recursive and the iterative formulations produce syntax nodes in the same order.
+
Really, the most exciting bit about this post is the conclusion, and you already know it :)
Last time, we’ve ended up with the following code:
+
+
+
First, to not completely drown in minutia, we’ll simplify it by removing support for indexing operator [] and ternary operator ?:.
+We will keep parenthesis, left and right associative operators, and the unary minus (which is somewhat tricky to handle in shunting yard).
+So this is our starting point:
+
+
+
What I like about this code is how up-front it is about all special cases and control flow.
+This is a “shameless green” code!
+However, it is clear that we have a bunch of duplication between prefix, infix and postfix operators.
+Our first step would be to simplify the control flow to its core.
First, let’s merge postfix and infix cases, as they are almost the same.
+The idea is to change priorities for ! from (11, ()) to (11, 100), where 100 is a special, very strong priority, which means that the right hand side of a “binary” operator is empty.
+We’ll handle this in a pretty crude way right now, but all the hacks would go away once we refactor the rest.
+
+
+
Yup, we just check for the hard-coded 100 constant and use a bunch of unwraps all over the place.
+But the code is already smaller.
+
Let’s apply the same treatment for prefix operators.
+We’ll need to move their handling into the loop, and we also need to make lhs optional, which is now not a big deal, as the function as a whole returns an Option.
+On a happier note, this will allow us to remove the if 100 wart.
+What’s more problematic is handling priorities: minus has different binding powers depending on whether it is in an infix or a prefix position.
+We solve this problem by just adding a prefix: bool argument to the binding_power function.
+
+
+
Keen readers might have noticed that we use 99 and not 100 here for “no operand” case.
+This is not important yet, but will be during the next step.
+
We’ve unified prefix, infix and postfix operators.
+The next logical step is to treat atoms as nullary operators!
+That is, we’ll parse 92 into (92) S-expression, with None for both lhs and rhs.
+We get this by using (99, 100) binding power.
+At this stage, we can get rid of the distinction between atom tokens and operator tokens, and make the lexer return the underlying chars directly.
+We’ll also get rid of S::Atom, which gives us this somewhat large change:
+
+
+
This is the stage where it becomes important that “fake” binding power of unary - is 99.
+After parsing the first constant in 1 - 2, the r_bp is 100, and we need to avoid eating the following minus.
+
The only thing left outside the main loop are parenthesis.
+We can deal with them using (99, 0) priority — after ( we enter a new context where all operators are allowed.
+
+
+
Or, after some control flow cleanup:
+
+
+
This is still recognizably a Pratt parser, with its characteristic shape
+
+
+
What we’ll do next is mechanical replacement of recursion with a manual stack.
This is a general transformation and (I think) it can be done mechanically.
+The interesting bits during transformation are recursive calls themselves and returns.
+The underlying goal of the preceding refactorings was to reduce the number of recursive invocations to one.
+We still have two return statements there, so let’s condense that to just one as well:
+
+
+
Next, we should reify locals which are live across the recursive call into a data structure.
+If there were more than one recursive call, we’d have to reify control-flow as enum as well, but we’ve prudently removed all but one recursive invocation.
+
So let’s start with introducing a Frame struct, without actually adding a stack just yet.
+
+
+
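Something like this (the exact fields are reconstructed from what has to survive across the recursive call, so treat them as illustrative):

```rust
struct Frame {
    min_bp: u8,
    lhs: Option<S>,
    token: Option<char>, // the operator whose right-hand side we are parsing
}
```
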
And now, let’s add a stack: Vec<Frame>.
+This is the point where the magic happens.
+We’ll still keep the top local variable: representing a stack as (T, Vec<T>) and not as just Vec<T> gives us a compile-time guarantee of non-emptiness.
+We replace the expr_bp(lexer, r_bp) recursive call with pushing to the stack.
+All operations after the call are moved after return.
+return itself is replaced with popping off the stack.
+
+
+
Tada! No recursion anymore, and still passes the tests!
+Let’s cleanup this further though.
+First, let’s treat ) more like a usual operator.
+The correct binding powers here are the opposite of (: (0, 100):
+
+
+
Finally, let’s note that continue inside the match is somewhat wasteful — when we hit it, we’ll re-peek the same token again.
+So let’s repeat just the match until we know we can make progress.
+This also allows replacing peek() / next() pair with just next().
+
+
+
And guess what? This is the shunting yard algorithm, with its characteristic shape of
+
+
+
To drive the point home, let’s print the tokens we pop off the stack, to verify that we get reverse Polish notation without any kind of additional tree rearrangement, just like in the original algorithm description:
+
+
+
+
+
We actually could have done it with the original recursive formulation as well.
+Placing print statements at all points where we construct an S node prints the expression in reverse Polish notation,
+proving that the recursive algorithm does the same steps and in the same order as the shunting yard.
This is a short ad of a Rust programming language targeting experienced C++ developers.
+Being an ad, it will only whet your appetite, consult other resources for fine print.
This program creates a vector of 32-bit integers (std::vector<int32_t>), takes a reference to the first element, x, pushes one more number onto the vector and then uses x.
+The program is wrong: extending the vector may invalidate references to its elements, and *x might dereference a dangling pointer.
+
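The program in question might look like this in Rust (a sketch matching the description; the compiler's error output is shown next):

```rust
fn main() {
    let mut xs = vec![1i32, 2, 3];
    let x: &i32 = &xs[0]; // a reference into the vector's heap storage
    xs.push(92);          // may reallocate and invalidate `x`
    println!("{}", *x);   // rejected: `xs` is still borrowed by `x`
}
```
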
The beauty of this program is that it doesn’t compile:
+
+
+
Rust compiler tracks the aliasing status of every piece of data and forbids mutations of potentially aliased data.
+In this example, x and xs alias the first integer in the vector’s storage in the heap.
This program creates an integer counter protected by a mutex, spawns 10 threads, increments the counter 10 times from each thread, and prints the total.
+
The counter variable lives on the stack, and a pointer to this stack data is shared with other threads.
+The threads have to lock the mutex to do the increments.
+When printing the total, the counter is read bypassing the mutex, without any synchronization.
+
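A sketch of such a program, using std's scoped threads (the original may differ in details):

```rust
use std::sync::Mutex;
use std::thread;

fn main() {
    let counter = Mutex::new(0u32); // lives on main's stack

    thread::scope(|scope| {
        for _ in 0..10 {
            scope.spawn(|| {
                for _ in 0..10 {
                    // Child threads can only reach the data through the mutex.
                    *counter.lock().unwrap() += 1;
                }
            });
        }
    }); // all child threads are guaranteed to have finished here

    // Read the total without locking: no other thread can observe it anymore.
    let total = counter.into_inner().unwrap();
    println!("{total}");
}
```
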
The beauty of this program is that it relies on several bits of subtle reasoning for correctness, each of which is checked by compiler:
+
+
+Child threads don’t escape the main function and so can read counter from its stack.
+
+
+Child threads only access counter through the mutex.
+
+
+Child threads will have terminated by the time we read total out of counter without mutex.
+
+
+
If any of these constraints are broken, the compiler rejects the code.
+There’s no need for std::shared_ptr just to defensively make sure that the memory isn’t freed under your feet.
+
Rust allows doing dangerous, clever, and fast things without fear of introducing undefined behavior.
+
If you like what you see, here are two books I recommend for diving deeper into Rust:
Hey, unlike all other articles on this blog, this one isn’t about programming, it’s about my personal life.
+It’s nothing important, just some thoughts that have been on my mind recently.
+So, if you come here for technical content, feel free to skip this one!
+
I do, however, intentionally post this together with other articles, for two main reasons:
+
+
+
There are some things here which I wish I had understood earlier.
+So, I would have liked it if I had accidentally read about them in some technical blog.
+
+
+
+I am always casually interested in the people behind the technical blogs I read, so, again, I would have liked to read a similar article.
I think giving some background info about me would be useful.
+I come from a middle class Russian family.
+I was born in 1992, so my earliest years fell onto a rather fun historical period, of which I don’t really remember anything.
+I grew up in Stavropol — a city circa 400_000 in the southern part of Russia.
+After finishing school (I was sixteen), I moved to St. Petersburg to study at the state university there.
+I had spent 10 years in that city before moving to Berlin, the place I currently live, last year.
+
In terms of understanding “how life works”, I became somewhat actively self-conscious at about 14.
+The set of important beliefs I learned/discovered then didn’t change until about 2017 or so.
+This latter change (which I feel is still very much ongoing) gives the title to the present article.
I guess the biggest deal for me is discovering that polyamory 1) exists 2) is something I’ve been missing a lot in my interpersonal relations.
+It’s the big one because it most directly affected me, and because other stuff I’ve learned, I’ve learned from my poly partners.
+
+In a nutshell, polyamory is the idea that it is OK to love several people at the same time.
+That if you love A, and also love B, it doesn’t mean that your love for A is somehow fake or untrue.
+I find the analogy with kids illuminating — if it’s OK to love both your kids, then it should be OK to love both your partners, right?
+I highly recommend everyone to read More than Two, on the basis that it’s a rare book that directly affected my life, and that it would probably have affected it even if polyamory weren’t my thing (which is, of course, totally valid as well!).
+
A more general point is that until 2017, I didn’t have a real working model of romantic relationships.
+I am reasonably sure that a lot of people are in a similar situation: it’s hard to encounter a reasonable relationship model in society to learn from!
+(This might be biased by my culture, but I suspect that it might not).
+
We aren’t taught how to be with another person (if we are lucky enough, we are taught how to practice safe sex at least), so we have to learn on our own by observing.
+One model is the relationships of our parents, which are quite often at least somewhat broken (like in my case).
+The other model is the art, and the portrayal of romance in art is (and this is an uncomfortably strong opinion for me) actively harmful garbage.
+
What I now hold as the most important thing in romantic relations is a very clear, direct and honest communication.
+Honest with yourself and honest with your partner.
+Honesty includes the ability to feel your genuine needs and desires (as opposed to following the model of what you think you should feel).
+
An example that is near and dear to my heart is when you are in a relationship with A, but there’s also this other person B whom you find attractive.
+Honesty is accepting that “attractive” means “my body (and quite probably my consciousness) wants to have sex with this person” and acting on that observation, rather than pretending that it doesn’t exist or shaming yourself into thinking it shouldn’t exist.
+
Or a more concrete example: one of my favorite dishes (code named “the dish I find the most yummy”) is bananas mixed with sour cream and quark.
+Me and my partner O enjoyed eating this dish in the morning, and I was usually tasked with preparing it.
+There are two variates of quark — a hard grainy one and a soft one.
+O had a preference for the soft one, so, naturally, I made morning meals using the soft one, because I don’t really care, and eating the same thing is oh sooo romantic.
+This continued until one day O said “Kladov, stop bullshitting yourself and admit that you love the grainy one. Let’s buy both varieties and make two portions”.
+O was totally right.
+And the thing is, I haven’t even noticed my (useless, stupid, and most egregiously, not called for) sacrifice for the sake of the relationship until it was called out by my partner.
+(In the end, O came to the conclusion that the grainy quark is actually yummier, but that’s beside the point).
+
And the depiction of love in art is the opposite of this.
+Which is understandable — the reason why romance (and death) is featured so prominently in art is that a major component of art’s success is its capacity for evoking emotions, and there’s little as heart-wrenching as romantic drama (and death).
+And the model of “speak with words through the mouth” relationships is very good at minimizing drama.
+(Reminder: this is a non-technical post, so if I say here that something is or isn’t the case, it doesn’t mean I’ve performed due diligence to confirm that it is true).
+My relations with poly partners were more boring than my relations with monogamous partners.
+This is great for participating people, but bad for art (unless it is some kind of slow-cinema piece).
+
Recently, I re-read Anna Karenina by Leo Tolstoy.
+I highly recommend this novel if you can read Russian.
+(I am not sure if it is translatable to English, a big part of its merit is the exquisite language).
+There are two romantic lines there: a passionate, forbidden and fatal love between Anna (who is married) and Vronski (who is not the guy Anna is married to), and a homely love/family of Levin and Kity.
+The second one is portrayed in a favorable light, as a representative of the isomorphism class of happy families.
+The scene of engagement between Levin and Kity made my blood boil.
+They are sitting at the table, with a piece of chalk.
+Levin feels that it’s kind of an appropriate moment to ask Kity to marry him.
+So he takes the chalk and writes:
+
+
+
Which are the initial letters of a phrase
+
+
+
Which asks about Kity’s original rejection of Levin several years ago.
+Kity decodes this message, and answers in a likewise manner.
+This “dialog” continues for some time, at the end of which they are happily engaged, and I am enraged.
+Such implicit, subtle and ellipsis-based communication is exactly how you wreck any relationship.
+
The saddest part here is that I wasn’t enraged when I read the book for the first time, at 15 or so.
+Granted, I had a full understanding that the book is about the late XIX century, and that the models of relations there are questionable.
+But still, I think I subconsciously binned Levin and Kity’s relationship with the good ones, and this is why I find the art harmful in this respect.
+
My smaller quibble is that sex is both a fetishized and a taboo topic.
+It’s hinted at, today not so subtly, but is rarely shown or studied as a subject of art.
+Von Trier and Gaspar Noé are two great exceptions among the artists I like.
So, how did I go from a default void model of romance, to my current place, where I know what I want and can actively build my relationships as I like, and not “as they are supposed to be”?
+This is the most fascinating thing about this, and one of the primary reasons for me to write this down for other people to read.
+
I think I am a pretty introspective person — I like to think about things, form models and opinions, adjust and tweak them.
+And I did think about relationships a lot.
+And, for example, one conclusion was that I don’t really understand jealousy, and I don’t want to “own” or otherwise restrict my partner.
+I was always OK with the fact that a person I love has a relationship with someone else, both in theory and, a couple of times, in practice.
+
But I didn’t make the jump to “it’s OK for me to love more than a single person”, and I don’t really understand why.
+It feels like a very simple theorem, which you should be able to just prove yourself.
+Instead, it took me several chance encounters to get to this truth.
+(To clarify again, I don’t claim that polyamory is a universal truth, this is just something that works for me, you are likely different).
+Once I got it, it turned out obvious and self evident.
+But to get it, I needed:
+
+
+A relation with a poly person S, who was literally reading More Than Two when we were together.
+
+
+A relation with an extremely monogamous (as in, expressing a lot of distress due to jealousy) S.
+
+
+A relation with another poly person A, at which point it finally clicked that if I like 1 & 3, and don’t like 2, then maybe it makes sense for me to read that book as well.
+
+
+
So, surprise: it’s possible to carry some hugely important, but not so subtly broken, things over from childhood/early adolescence without ever reconsidering them.
+If they are pointed out, it’s clear how to fix them, but noticing them is the tricky bit…
Speaking of things which are hard to notice…
+Surprisingly, mental health exists!
+Up until very recently, my association for mental health was The Cabinet of Dr. Caligari: something which just doesn’t happen “in real life”.
+Very, very far from the truth.
+A lot of people seriously struggle with their minds.
+Major depression or borderline personality disorder (examples I am familiar with) affect the very way you think, and are not that uncommon.
+And many people struggle with smaller problems, like anxiety, self-loathing, low self-esteem, etc.
+
My own emotional responses are pretty muted.
+I’d pass a Voight Kampff test I guess. Maybe.
+My own self-esteem is adequate, and I love myself.
+
So, it was eye-opening to realize that this might not be the case for other people.
+Empathy is also not my strongest point, hehe.
+
Well, it gets even better than this.
+I suspect I might be autistic :-)
+Thanks M for pointing that out to me:
+
M: I am autistic. +
+A: Wait wat? On the contrary, you are the first person I’ve met who doesn’t seem insane. Wait a second…
+
(
+Actually, S had made a bet that I am an aspie a couple of years before that…
+Apparently, just telling me something important about myself never works?
+)
+
To clarify, I’ve never been to a counselor, so I don’t know what labels are applicable to me, if any, but I do think that I can be described as a person demonstrating certain unusual/autistic traits.
+They don’t bother me (on the contrary, having learned a bit about minds of other people, I feel super lucky about the way my brain works), so I don’t think I’d get counseling any time soon.
+However, if something in your life bothers you (or even if it doesn’t), counseling is probably a good idea to try!
+Several people I trust highly recommend it.
+Keep in mind that a lot of what is called psychology oscillates between science and, well, bullshit, so be careful with your choice.
+Check that it is indeed a science based thing (Cognitive Behavioral Therapy being one of the most properly researched approaches).
+
Anyway, I guess it makes sense to share a bit of my experiences, in case someone reads this and thinks “oh shit, that’s me” :-)
+Hypothetical me from ten years ago would have appreciated this.
+
I think the single most telling thing is that I am Meursault, from Camus’s The Stranger.
+I read a lot, but characters rarely make sense to me, even less so than people.
+Meursault is the exception — I can associate myself with him.
+Not as “he is in a similar situation to mine” but “I understand the motives of his actions in any given situation”.
Apparently, Meursault had a real-life prototype, Camus’s best friend, and it looks like that friend had Asperger’s before it even was named!
+Hey, the hypothesis that I am autistic has predictive power!
+
Another thing where I find myself different from other people is that I am introverted.
+Well, a lot of folks I know claim “I am introverted”, but the amount of social life they have gives me chills :-)
+Kladov’s radius — the minimal degree of introversion such that you are the most introverted person you know, because for any person more introverted than yourself, you two have zero chance to meet.
+
I don’t really have a need for social interactions I think — I like being by myself.
+Not uttering a single word in a day (or a weekend) is something which happens to me pretty regularly, and I enjoy that.
+By the way, did you know that Gandhi had one day in the week when he spoke to no one?
+
What do I do instead of people?
+(formerly) Mathematics, programming, watching good movies, reading good books.
+Programming is a big one for the past six years or so — I rather easily lose myself in the state of flow (although my overall productivity is super unstable, and sometimes I can’t get anything done for the whole day, just because).
+I also occasionally get mildly annoyed by the work-life balance articles on reddit (I am thinking about a specific one which contrasted having a life with building a career).
+Of course everyone should do what works best for them.
+But if someone codes at work, and then codes at home, it doesn’t necessarily mean they are optimizing their salary or are trying to get better at coding or something.
+They might just really like writing code, and sometimes practice it during working hours as well because what else would you do between the meetings?
+
Otherwise, I am pretty uninterested in stuff.
+I don’t like traveling or trying out new things.
+
I don’t have any super specific physical or psychological sensitivities.
+I don’t go outside of my apartment without headphones; music helps me to create a sort of bubble of my space around myself.
+I am pretty easily overwhelmed in groups of people (which is different from not enjoying people generally — I might get overwhelmed even among people I like to be around).
+
My interpersonal relations are funny — I always perceive myself much colder than the other person (and I project much fewer emotions in stressful situations).
+Note that “colder” here is a positive thing — I wish other people were more like me, not the other way around.
+
I am awkward and avoidant of “casual” social contact.
+As in, I don’t eat alone in cafes and such, as that means interacting with the waiter.
+I do that in company though, where I can just observe and repeat what others are doing.
+
In general, I am pretty happy to be at the place where I am.
+Well, I guess it would have helped a tiny bit if I could go to the supermarket in the next building, and not to the one three blocks away where I had already been before and where I know how to behave.
+But, really, I perceive these as small things which are not worth fixing.
The next discovery (or rather, subtle shift in the world view) is from a slightly earlier era (2014 maybe?).
+I don’t believe that people are X.
+Or rather, I believe that it’s generally unimportant that “this person is X” when explaining their actions.
+I weigh circumstances as relatively more important than personalities when explaining events.
+In other words, there are no “good” or “bad” people; the same person can display a wide range of behaviors, depending on the current (not necessarily historical) environment.
+This is what I’ve learned from The Lucifer Effect.
+
More generally, I feel that systems, mechanisms and institutions in place define the broad outlook of the world, and, if something is wrong, we should not make it right, but understand what force makes it wrong, and try to build a counter-mechanism.
+
A specific example here is that, if I see a less than polite/constructive/respectful comment on reddit making a point I disagree with, I answer with two comments.
+One is a factual comment about the point being discussed; the other is a templated response along the lines of “I read your comment as _, I find this unacceptable, please avoid antagonistic rhetorical constructs like _”.
+
That is:
+
+
+Clarify my subjective interpretation of the comment.
+
+
+State that I don’t find it appropriate.
+
+
+Point out specific ways to improve the comment.
+
+
+
The goal here is not to argue with a single specific comment, or to change the behavior of a single specific commenter so that they write better comments in the future.
+The goal is to create a culture which I think promotes healthy discussion, so that, when other people read the exchange, they get a strong signal what is ok and what is not.
A more recent development of this idea is that mechanisms rule me as well (thanks to O again for this one!).
+
Specifically, I now separate my mind from myself.
+What my mind feels/wants is not necessarily what I want.
+I am not my brain.
+
If I feel a craving for a bit of chocolate, that doesn’t mean that I actually want sweets!
+It only means that some chemistry in my brain decided that I need to experience the feeling of wanting something sweet right now.
+
An interesting aspect of this is that the “desires” part of our brain is older and more primitive than “proving theorems” part of our brain.
+As it is simpler, it is more reliable and powerful.
+So, it takes a disproportionately large amount of willpower to override your primitive wanting brain.
+
This flipped me from “If I want to stop doing X, I’d easily do that” to “ok, I should not start wanting X, otherwise getting rid of that would be a pain”.
+Somehow, I’ve never tried alcohol, tobacco or drugs before (yes, I voluntarily moved to Berlin).
+There wasn’t a strong reason for that; I am totally OK with all those things, it’s just that (I guess) I am too introverted to end up in the kind of company where I’d start.
+However, now I think I would deliberately avoid addictive substances, because I value my thinking about complicated stuff.
+And when I am dealing with a hard math-y problem, I don’t want to think “and don’t drink that extra bottle of beer” on top, as that’s too hard.
+
I am less successful with the torrent of low-quality superficial info from the internet.
+Luckily, I’ve never had any social network profiles (I guess for the same reason as with alcohol), but I started reading reddit at some point, and that eats into my attention.
+/etc/hosts and RSS help a lot here.
This discussion about mind, cognitive biases, mechanisms etc sounds a lot like something from rationalists community.
+I am somewhat superficially familiar with it, and it does sound like a good thing.
+If I were to optimize my life to better achieve my goals, I would probably dedicate some time studying https://www.lesswrong.com/.
+Perhaps even me not having any particular goals (besides locally optimizing for what I find the most desirable at any given moment) is some form of a bias?
To conclude, a small, but crisp observation.
+I often find myself in emotionally non-neutral debates about whether doing “X” is good.
+If there’s an actual disagreement, I tend to find myself on the relatively more cold/cynical side, and my interlocutor on the more empathetic one.
+Surprisingly to me, many of such disagreements are traced to a single fundamental difference in decision-making process.
+
When I make a decision (especially an ethical one), I tend to go for what I feel is “right” in some abstract sense.
+I can’t explain this any better, this is really a gut feeling (and is not categorical imperative, at least not consciously).
+
+Apparently, another mode for making ethical decisions is common — weighing the consequences of a specific action in a specific context, and making a decision based on that, without taking the “properness” of the action itself into consideration.
+
+With these two different underlying algorithms, it’s pretty easy to heatedly disagree about some specific conclusion!
+(Tip: to unearth such deep disagreements more efficiently, use the following rule: as soon as anyone notices that a debate is happening, the debate is paused, and each side explains the position the other side is arguing for).
I guess that’s it for now and the nearest future!
+If you have comments, suggestions or just want to say hello, feel free to drop me an email (it’s in my GitHub profile) or contact me on Telegram (@matklad).
This is a short note on the builder pattern, or, rather, on the builder method pattern.
+
TL;DR: if you have Foo and FooBuilder, consider adding a builder method to Foo:
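(A minimal sketch; the frobnicate field is made up to stand in for Foo’s real configuration.)

```rust
#[derive(Default)]
pub struct FooBuilder {
    frobnicate: bool,
}

pub struct Foo {
    frobnicate: bool,
}

impl Foo {
    // The builder method: it shows up right in Foo's docs, next to the type itself.
    pub fn builder() -> FooBuilder {
        FooBuilder::default()
    }
}

impl FooBuilder {
    pub fn frobnicate(mut self, yes: bool) -> FooBuilder {
        self.frobnicate = yes;
        self
    }
    pub fn build(self) -> Foo {
        Foo { frobnicate: self.frobnicate }
    }
}
```

At the call site this reads as Foo::builder().frobnicate(true).build(), without FooBuilder ever being spelled out.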
+
+
+
A more minimal solution is to rely just on FooBuilder::default or FooBuilder::new.
+There are two problems with that:
+
First, it is hard to discover.
+Nothing in the docs/signature of Foo mentions FooBuilder, you need to look elsewhere to learn how to create a Foo.
+I remember being puzzled at how to create a GlobSet for exactly this reason.
+In contrast, the builder method is right there on Foo, probably the first one.
+
+Second, it is more annoying to use, as you need to import both Foo and FooBuilder.
+With Foo::builder method often only one import suffices, as you don’t need to name the builder type.
This is a hand-wavy philosophical article about programming, without quantifiable justification, but with some actionable advice and a case study.
+
Suppose that there are two types in the program, Blorb and Gonk.
+Suppose also that they both can blag.
+
Does it make sense to add the following trait?
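Say, something like this (a sketch — assuming blag takes no arguments and returns nothing):

```rust
trait Blag {
    fn blag(&mut self);
}
```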
+
+
+
I claim that it makes sense only if you have a function like
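(A hypothetical example, reusing the Blag trait sketched above.)

```rust
fn blag_twice<T: Blag>(x: &mut T) {
    x.blag();
    x.blag();
}
```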
+
+
+
+That is, if some part of your program is generic over T: Blag.
+
If in every x.blag() the x is either Blorb or Gonk, but never a T (each usage is concrete), you don’t need this abstraction.
+“Need” is used in a literal sense here: replace a trait with two inherent methods named blag, and the code will be essentially the same.
+Using a trait here doesn’t achieve any semantic compression.
+
+Given that abstractions have costs, “don’t need” can be strengthened to “probably shouldn’t”.
+
+
+
+Not going for an abstraction often allows for a more specific interface.
+A monad in Haskell is a thing with >>=.
+Which isn’t telling much.
+Languages like Rust and OCaml can’t express a general monad, but they still have concrete monads.
+The >>= is called and_then for futures and flat_map for lists.
+These names are more specific than >>= and are easier to understand.
+The >>= is only required if you want to write code generic over type of monad itself, which happens rarely.
+
+Another example of an abstraction which is used mostly concretely is collection hierarchies.
+In Java or Scala, there’s a whole type hierarchy for things which can hold other things.
+Rust’s type system can’t express Collection trait, so we have to get by with using Vec, HashSet and BTreeSet directly.
+And it isn’t actually a problem in practice.
+Turns out, writing code which is generic over collections (and not just over iterators) is not that useful.
+The “but I can change the collection type later” argument also seems overrated — often, there’s only single collection type that makes sense.
+Moreover, swapping HashSet for BTreeSet is mostly just a change at the definition site, as the two happen to have almost identical interfaces anyway.
+The only case where I miss Java collections is when I return Vec<T>, but mean a generic unordered collection.
+In Java, the difference is captured by List<T> vs Collection<T>.
+In Rust, there’s nothing built-in for this.
+It is possible to define a VecSet<T>(Vec<T>), but it doesn’t seem worth the effort.
+
Collections also suffer from >>= problem — collapsing similar synonyms under a single name.
+Java’s
+Queue
+has add, offer, remove, and poll methods, because it needs to be a collection, but also is a special kind of collection.
+In C++, you have to spell push_back for vector’s push operation, so that it duck-types with deque, which can push at both the front and the back.
+
+
Finally, the promised case study!
+rust-analyzer needs to convert a bunch of internal types into types suitable for conversion into JSON messages of the Language Server Protocol.
+ra::Completion is converted into lsp::Completion; ra::Completion contains ra::TextRange which is converted to lsp::Range, etc.
+
The first implementation started with an abstraction for conversion:
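(Sketched from the description above; the real trait in rust-analyzer’s code base may have differed in details.)

```rust
pub trait Conv {
    type Output;
    fn conv(self) -> Self::Output;
}
```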
+
+
+
This abstraction doesn’t work for all cases — sometimes the conversion requires additional context.
+For example, to convert a rust-analyzer’s offset (a position of byte in the file) to an LSP position ((line, column) pair), a table with positions of newlines is needed.
+This is easy to handle:
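(Again a sketch: thread a context type parameter through the trait.)

```rust
pub trait ConvWith<CTX> {
    type Output;
    fn conv_with(self, ctx: CTX) -> Self::Output;
}
```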
+
+
+
Naturally, there was an intricate web of delegating impls.
+The typical one looked like this:
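(Reconstructed as a toy below, building on the ConvWith trait sketched above; LineIndex, TextRange and the lsp module are simplified stand-ins for the real rust-analyzer and LSP types.)

```rust
pub struct LineIndex;                     // table of newline positions
pub struct TextRange { start: u32, end: u32 }
pub mod lsp {
    pub struct Position(pub u32, pub u32); // (line, column)
    pub struct Range(pub Position, pub Position);
}

// offset -> (line, column); a real impl would consult the line index
impl ConvWith<&LineIndex> for u32 {
    type Output = lsp::Position;
    fn conv_with(self, _line_index: &LineIndex) -> lsp::Position {
        lsp::Position(0, self)
    }
}

// the typical delegating impl: convert a range by converting both endpoints
impl ConvWith<&LineIndex> for TextRange {
    type Output = lsp::Range;
    fn conv_with(self, line_index: &LineIndex) -> lsp::Range {
        lsp::Range(
            self.start.conv_with(line_index),
            self.end.conv_with(line_index),
        )
    }
}
```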
+
+
+
There were a couple of genuinely generic impls for converting iterators of convertible things.
+
The code was hard to understand.
+It also was hard to use: if calling .conv didn’t work immediately, it took a lot of time to find which specific impl didn’t apply.
+Finally, there were many accidental (as in “accidental complexity”) changes to the shape of code: CTX being passed by value or by reference, switching between generic parameters and associated types, etc.
+
I was really annoyed by how this conceptually simple pure boilerplate operation got expressed as clever and fancy abstraction.
+Crucially, almost all of the usages of the abstraction (besides those couple of iterator impls) were concrete.
+So I replaced the whole edifice with much simpler code, a bunch of functions:
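(A sketch with approximated signatures, reusing the stand-in types from the snippet above; the real module is rust-analyzer’s to_proto.)

```rust
mod to_proto {
    use super::{lsp, LineIndex, TextRange};

    pub(crate) fn position(_line_index: &LineIndex, offset: u32) -> lsp::Position {
        lsp::Position(0, offset) // a real impl would consult the line index
    }

    pub(crate) fn range(line_index: &LineIndex, range: TextRange) -> lsp::Range {
        lsp::Range(
            position(line_index, range.start),
            position(line_index, range.end),
        )
    }
}
```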
+
+
+
Simplicity and ease of use went up tremendously.
+Now instead of typing x.conv() and trying to figure out why an impl I think should apply doesn’t apply, I just auto-complete to_proto::range and let the compiler tell me exactly which types don’t line up.
+
I’ve lost fancy iterator impls, but the
+total diff
+for the commit was +999,-1123.
+There was some genuine code re-use in those impls, but it was not justified by the overall compression, even disregarding additional complexity tax.
+
To sum up, “is this abstraction used exclusively concretely?” is a meaningful question about the overall shape of code.
+If the answer is “Yes!”, then the abstraction can be replaced by a number of equivalent non-abstract implementations.
+As the latter tend to be simpler, shorter, and more direct, “Concrete Abstraction” can be considered a code smell.
+As usual though, any abstract programming advice can be applied only in a concrete context — don’t blindly replace abstractions with concretions, check if provided justifications work for your particular case!
This is my response for this year’s call for blog posts.
+I am writing this as a language implementor, not as a language user.
+I also don’t try to prioritize the problems.
+The two things I’ll mention are the things that worry me most without reflecting on the overall state of the project.
+They are not necessarily the most important things.
For the past several years, I’ve been a maintainer of “Sponsored Open Source” projects (rust-analyzer & IntelliJ Rust).
+These projects:
+
+
+have a small number of core developers who work full-time at company X, and whose job is to maintain the project,
+
+
+are explicitly engineered for active open source:
+
+
+a significant fraction of the maintainers’ time goes to contribution documentation, issue mentoring, etc,
+
+
+a non-trivial number of features end up being implemented by the community.
+
+
+
+
+
This experience taught me that there’s a great deal of difference between the work done by the community and the work done during paid hours.
+To put it bluntly, a small team of 2-3 people working full-time on a specific project with a long time horizon can do a lot.
+Not because paid hours == higher quality work, but because of the cumulative effect of:
+
+
+being able to focus on a single thing,
+
+
+keeping the project in a mental cache and accumulating knowledge,
+
+
+being able to “invest” into the code and do long-term planning effectively.
+
+
+
In other words, the community gives breadth of contributions, while paid hours give depth.
+Both are important, but I feel that Rust could use a lot of the latter at the moment, in two senses.
+
First, the marginal utility of adding a full-time developer to the Rust project will remain high for quite a few more full-time developers.
+
Second, perhaps more worrying, I have a nagging feeling that the imbalance between community and paid hours can affect the quality of the technical artifact, and not just the speed of development.
+The two styles of work lend themselves to different kinds of work actually getting done.
+Most of pull requests I merge are about new features, and some are about bug-fixes.
+Most of pull requests I submit are about refactoring existing code.
+Community naturally picks the work of incrementally adding new code, maintainers can refactor and rewrite existing code.
+It’s easy to see that, in the limit, this could end with an effectively immutable/append only code base.
+I think we are pretty far from the limit today, but I don’t exactly like the current dynamics.
+I keep coming back to this Rust 2019 post when I think about this issue.
+
The conclusion from this section is that we should find ways to fund teams of people to focus on improving the Rust programming language.
+Through luck, hard work of my colleagues at JetBrains and Ferrous Systems, and my own efforts it became possible to move in this direction for both IntelliJ Rust and rust-analyzer.
+This was pretty stressful, and, well, I feel that the marginal utility of one more compiler engineer is still huge in the IDE domain at least.
And now to something completely different!
+I want this:
+
+
+
That is, I want to simplify working on the compiler itself to it being just a crate.
+This section of the article expands on the comment I’ve made on the
+irlo
+a while ago.
+
For the past couple of months, I have been slowly pivoting from doing mostly green-field development in the rust-analyzer code base to refactoring rustc internals towards merging the two.
+The process has been underwhelming, and the slow and complicated build process plays a significant part in this: I feel like my own productivity is at least five times greater when I work on rust-analyzer in comparison to rustc.
+
Before I go into details about my vision here, I want to give shout-outs to
+@Mark-Simulacrum, @mark-i-m, and @jyn514
+who already did a lot of work on simplifying the build process in the recent several months.
+
Note that I am going to make a slightly deeper than “Rust in 20XX” dive into the topic, feel free to skip the rest of the post if technical details about bootstrapping process are not your cup of tea.
+
+Finally, I should also warn that I have an intern’s advantage here — I have absolutely no idea how Rust’s current build process works, so I describe how it should work from a position of ignorance. Without further ado,
rustc is a bootstrapping compiler.
+This means that, to compile rustc itself, one needs to have a previous version of rustc available.
+This could make the compiler’s build process peculiar.
+My thesis is that this doesn’t need to be the case, and that the compiler could be just a crate.
+
Bootstrapping does make this harder to see though, so, as a thought experiment, let’s imagine what rustc’s build process would look like were it not written in Rust.
+Let’s imagine the world where rustc is implemented in Go.
+How would one build and test this rust compiler?
+
First, we clone the rust-lang/rust repository.
+Then we download the latest version of the Go compiler — as we are shipping rustc binaries to the end user, it’s OK to require a cutting-edge compiler.
+But there’s probably some script or gvm config file to make getting the latest Go compiler easier.
+After that, go test builds the compiler and runs the unit tests.
+Unit tests take a snippet of Rust code as an input and check that the compiler correctly analyses the snippet: that the parse tree is correct, that diagnostics are emitted, that borrow checker correctly accepts or rejects certain problems.
+
What we can not check in this way is that the compiler is capable of producing a real binary which we can run (that is, run-pass tests).
+The reason for that is slightly subtle — to produce a binary, compiler needs to link the tested code with the standard library.
+But we’ve only compiled the compiler, we don’t have a standard library yet!
+
So, in addition to unit-tests, we also need somewhat ad-hoc integration tests, which assume that the compiler has been built already, use it to compile the standard library, and then compile, link, and run the corpus of test programs.
+Running std’s own #[test] tests is also a part of this integration testing.
+
Now, let’s see if the above setup has any bottlenecks:
+
+
+
Getting the Go compiler is fast and straightforward.
+In fact, it’s reasonable to assume that the user already has a recent Go compiler installed, and that they are familiar with standard Go workflows.
+
+
+
Compiling rustc would take a little while.
+On the one hand, Rust is a big language, and you need to spend quite a few lines of code to implement it.
+On the other hand, compilers are very straightforward programs, which don’t do a lot of IO, don’t have to deal with changing business requirements and don’t have a lot of dependencies.
+Besides, Go is a language known for fast compile times.
+So, spending something like five minutes on a quad-core machine for compiling the compiler seems reasonable.
+
+
+
After that, running unit-tests is a breeze: unit-tests do not depend on any state external to the test itself; we are testing pure functions.
+
+
+
The first integration test is compiling and #[test]ing std.
+As std is relatively small, compiling it with our compiler should be relatively fast.
+
+
+
Running tens of thousands of full integration tests will be slow.
+Each such test would need to do IO to read the source code, write the executable, and run the process.
+It is reasonable to assume that most potential failures are covered by the compiler’s and std’s unit tests.
+But it would be foolish to rely solely on those tests — a fully integrated test suite is important to make sure that the compiler indeed does what it is supposed to, and it is vital for comparing several independent implementations — who knows, maybe one day we’ll rewrite rustc from Go to Rust, and re-using the compiler’s unit tests would be much harder in that context.
+
+
+
So, it seems like, except for the final integration test suite, there are no complexity/performance bottlenecks in our setup for a from-scratch build.
+The problem with the integrated suite can be handled by running a subset of smoke tests by default, and only running the full set of integrated tests on CI.
+Testing is embarrassingly parallel, so a beefy CI fleet should handle that just fine.
+
What about incremental builds?
+Let’s say we want to contribute a change to std.
+First time around, this requires building the compiler, which is unfortunate.
+This is a one-time cost though, and it shouldn’t be prohibitive (or we will have troubles with changes to the compiler itself anyway).
+We can also cheat here, and just download some version of rustc from the internet to check std.
+This will mostly work, except for the bits where std and rustc need to know about each other (lang items and intrinsics).
+For those, we can use #[cfg(not(bootstrap))] in the std to compile different code for older versions of the compiler.
+This makes the std implementation mind-bending though, so a better alternative might be to just make CI publish the artifacts for the compiler built off the master branch.
+That is, if you only contribute to std, you download the latest compiler instead of building it yourself.
+We have a trade off between implementation complexity and compile times.
+
If we want to contribute a change to the compiler, then we are golden as long as it can be checked by the unit-tests (which, again, in theory is everything except for run-pass tests).
+If we need to run integrated tests with std, then we need to recompile std with the new compiler, after every change to the compiler.
+This is pretty unfortunate, but:
+
+
+if you fundamentally need to recompile std (for example, you change lang-items), there’s no way around this,
+
+
+if you don’t need to recompile std, then you can probably write an std-less unit-test,
+
+
+as an escape hatch, there might be some kind of KEEP_STDLIB env var, which causes integrated tests to re-use existing std, even if the compiler is newer.
+
+
+
To sum up, the compiler is just a program which does some text processing.
+In the modern world, full of distributed, highly-available, long-running systems, a compiler is actually a pretty simple program.
+It also is fairly easy to test.
+The hard bit is not the compiler itself, but the standard library: to even start building the standard library, we need to compile the compiler.
+However, most of the compiler can be tested without std, and std itself can be tested using compiler binary built from the master branch by CI.
In theory, it should be possible to replace Go from the last section with Rust, and get a similarly simple bootstrapping compiler.
+That is, we would use latest stable/beta Rust to compile rustc, then we’ll use this rustc to compile std, and we are done.
+We might add a sanity check — using the freshly built compiler & std, recompile the compiler again and check that everything works.
+This is optional, and in a sense just a subset of a crater run, where we check one specific crate — compiler itself.
+
However, today’s build is more complicated than that.
+
First, instead of using a “standard distribution” of the compiler for bootstrapping, x.py downloads a custom beta toolchain.
+This could and should be replaced with using rustup by default.
+
Second, master rustc requires master std to build.
+This is the bit which makes rustc not a simple crate.
+Remember how before the build started with just compiling the compiler as a usual program?
+Today, the rustc build starts with compiling master std using the beta compiler, then with compiling master rustc using master std and the beta compiler.
+So, there’s a requirement that std builds with both the master and beta compilers, and we also have this weird state where the versions of the compiler and std used to compile the code do not match. In other words, while #[cfg(not(bootstrap))] was an optimization in the previous section (which could be replaced with downloading a binary rustc from CI), today it is required.
+
Third, there’s not much in the way of unit tests in the compiler.
+Almost all tests require std, which means that, to test anything, one needs to rebuild everything.
+
Fourth, LLVM & linkers.
+A big part of “compilers are easy to test” is the fact that they are, in theory, closed systems interacting with the outside world in a limited well-defined way.
+In the real world, however, rustc relies on a bunch of external components to work, the biggest one of which is LLVM.
+Luckily, these external components are required only for making the final binary.
+The bulk of the compiler, analysis phases which reject invalid programs and lower valid ones, does not need them.
With all this in mind, here are specific steps which I believe would make the build process easier:
+
+
+Gear the overall build process and defaults to the “hacking on the compiler” use case.
+
+
+By default, rely on rust-toolchain file and rustup to get the beta compiler.
+
+
+Switch from x.py to something like cargo-xtask, to remove dependency on Python.
+
+
+Downgrade rustc’s libstd requirements to beta.
+Note that this refers solely to the std used to build rustc itself.
+rustc will use master std for building user’s code.
+
+
+Split compiler and std into separate Cargo workspaces.
+
+
+Make sure that, by default, rustc is using system llvm, or llvm downloaded from a CI server.
+Building llvm from source should require explicit opt-in.
+
+
+Make sure that cd compiler && cargo test just works.
+
+
+Add the ability to make a build of the compiler which can run check, but doesn’t do llvm-dependent codegen.
+
+
+Split the test suite into cross-platform codegen-less check part, and the fully-integrated part.
+
+
+Split the compiler itself into frontend and codegen parts, such that changes in frontend can be tested without linking backend, and changes in backend can be tested without recompiling the frontend.
+
+
+Stop building std with beta compiler and remove all #[cfg(bootstrap)].
+
+
+Somehow make cargo test just work in std.
+This will require some hackery to plug the logic for “build compiler from source or download from CI” somewhere.
+
+
+
At this stage, we have a compiler which is a 100% bog-standard crate, and std, which is almost a typical crate (it only requires a very recent compiler to build).
+
After this, we can start the standard procedure to optimize compile and test times, just how you would do for any other Rust project (I am planning to write a couple of posts on these topics).
+I have a suspicion that there’s a lot of low-hanging fruit there — one of the reasons why I am writing this post is that I’ve noticed that doctests in std are insanely slow, and that nobody complains about that just because everything else is even slower!
+
This post ended up being too technical for the genre, but, to recap, there seem to be two force multipliers we could leverage to develop Rust itself:
+
+
+Creating a space for small teams of people to work full-time on Rust.
+
+
+Simplifying hacking on the compiler to just cargo test.
+
This post describes my own pet theory of programming languages popularity.
+My understanding is that no one knows why some languages are popular and others aren’t, so there’s no harm done if I add my own thoughts to the overall confusion.
+Obviously, this is all wild speculation and a just-so story without any kind of data-backed research.
+
The central thesis is that the actual programming language (syntax, semantics, paradigm) doesn’t really matter.
+What matters is characteristics of the runtime — roughly, what does memory of the running process look like?
+
To start, an observation.
+A lot of software is written in vimscript and emacs lisp (magit being one example I can’t live without).
+And these languages are objectively bad.
+This happens even with less esoteric technologies, notable examples being PHP and JavaScript.
+While JavaScript is great in some aspects (it’s the first mainstream language with lambdas!), it surely isn’t hard to imagine a trivially better version of it (for example, without two different nulls).
+
This is a general rule — as soon as you have a language which is Turing-complete, and has some capabilities for building abstractions, people will just get the things done with it.
+Surely, some languages are more productive, some are less productive, but, overall, FP vs OOP vs static types vs dynamic types doesn’t seem super relevant.
+It’s always possible to overcome the language by spending some more time writing a program.
+
In contrast, overcoming language runtime is not really possible.
+If you want to extend vim, you kinda have to use vimscript.
+If you want your code to run in the browser, JavaScript is still the best bet.
+Need to embed your code anywhere? GC is probably not an option for you.
+
These two observations lead to the following hypothesis:
+
+
Let’s see some examples which can be “explained” by this theory.
+
+
C
+
+
C has a pretty spartan runtime, which is notable for two reasons.
+First, it was the first fast enough runtime for a high-level language.
+It was possible to write the OS kernel in C, which had been typically done in assembly before that for performance.
+Second, C is the language of Unix.
+(And yes, I would put C into the “easily improved upon” category of languages. Null-terminated strings are just a bad design).
+
+
JavaScript
+
+
+For quite some time, this has been the only language available in the browser.
+
+
Java
+
+
This case I think is the most interesting for the theory.
+A common explanation for Java’s popularity is “marketing by Sun”, and subsequent introduction of Java into University’s curricula.
+This doesn’t seem convincing to me.
+Let’s look at the 90’s popular languages (I am not sure about percentage and relative ranking here, but the composition seems broadly correct to me):
+
+
+
On this list, Java is the only non-dynamic cross-platform memory safe language.
+That is, Java is both memory safe (no manual error-prone memory management) and can be implemented reasonably efficiently (field access is a load and not a dictionary lookup).
+This seems like a pretty compelling reason to choose Java, irrespective of what the language itself actually looks like.
+
+
Go
+
+
One can argue whether focus on simplicity at the expense of everything else is good or bad, but statically linked zero dependency binaries definitely were a reason for Go popularity in the devops sphere.
+In a sense, Go is an upgrade over “memory safe & reasonably fast” Java runtime, when you no longer need to install JVM separately.
+
+
+
Naturally, there are also some things which are not explained by my hypothesis.
+One is scripting languages.
+A highly dynamic runtime with eval and ability to easily link C extensions indeed would be a differentiator, so we would expect a popular scripting language.
+However, it’s unclear why they are Python and PHP, and not Ruby and Perl.
+
Another one is language evolutions: C++ and TypeScript don’t innovate runtime-wise, yet they are still major languages.
+
Finally, let’s make some bold predictions using the theory.
+
First, I expect Rust to become a major language, naturally :)
+This needs some explanation — on the first blush, Rust is runtime-equivalent to C and C++, so the theory should predict just the opposite.
+But I would argue that memory safety is a runtime property, despite the fact that it is, uniquely to Rust, achieved exclusively via language machinery.
+
Second, I predict Julia to become more popular.
+It’s pretty unique, runtime-wise, with its stark rejection of Ousterhout’s Dichotomy and insisting that, yeah, we’ll just JIT a highly dynamic language to suuuper fast numeric code at runtime.
+
Third, I wouldn’t be surprised if Dart grows.
+On the one hand, it’s roughly in the same boat as Go and Java, with memory safe runtime with fixed layout of objects and pervasive dynamic dispatch.
+But the quality of implementation of the runtimes is staggering: it has first-class JIT, AOT and JS compilers.
+Moreover, it has top-notch hot-reload support.
+Nothing here is a breakthrough, but the combination is impressive.
+
Fourth, I predict that Nim, Crystal and Zig (which is very interesting, language design wise) would not become popular.
+
Fifth, I predict that Swift will be pretty popular on Apple hardware due to platform exclusivity, but won’t grow much outside of it, despite being very innovative in language design (generics in Swift are the opposite of the generics in Go).
I’ve recently read an article criticizing Rust, and, while it made a bunch of good points, I didn’t enjoy it — it was an easy to argue with piece.
+In general, I feel that I can’t recommend an article criticizing Rust.
+This is a shame — confronting drawbacks is important, and debunking low-effort/misinformed attempts at critique sadly inoculates against actually good arguments.
+
So, here’s my attempt to argue against Rust:
+
+
Not All Programming is Systems Programming
+
+
Rust is a systems programming language.
+It offers precise control over data layout and runtime behavior of the code, granting you maximal performance and flexibility.
+Unlike other systems programming languages, it also provides memory safety — buggy programs terminate in a well-defined manner, instead of unleashing (potentially security-sensitive) undefined behavior.
+
However, in many (most) cases, one doesn’t need ultimate performance or control over hardware resources.
+For these situations, modern managed languages like Kotlin or Go offer decent speed, enviable
+time to performance, and are memory safe by virtue of using a garbage collector for dynamic memory management.
+
+
Complexity
+
+
Programmer’s time is valuable, and, if you pick Rust, expect to spend some of it on learning the ropes.
+Rust community poured a lot of time into creating high-quality teaching materials, but the Rust language is big.
+Even if a Rust implementation would provide value for you, you might not have resources to invest into growing the language expertise.
+
Rust’s price for improved control is the curse of choice:
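(A rough sketch of the decision space for the simplest “Foo has a Bar” relationship; the type names are made up for illustration.)

```rust
use std::rc::Rc;
use std::sync::Arc;

struct Bar;

struct FooOwned   { bar: Bar }          // store Bar inline, by value
struct FooRef<'a> { bar: &'a Bar }      // borrow it; a lifetime shows up in the type
struct FooBoxed   { bar: Box<Bar> }     // own it, but behind a heap allocation
struct FooRc      { bar: Rc<Bar> }      // shared ownership, single-threaded
struct FooArc     { bar: Arc<Bar> }     // shared ownership, thread-safe
```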
+
+
+
In Kotlin, you write class Foo(val bar: Bar), and proceed with solving your business problem.
+In Rust, there are choices to be made, some important enough to have dedicated syntax.
+
All this complexity is there for a reason — we don’t know how to create a simpler memory safe low-level language.
+But not every task requires a low-level language to solve it.
Compile times are a multiplier for everything.
+A program written in a slower-to-run but faster-to-compile programming language can be faster to run, because the programmer will have more time to optimize!
+
Rust intentionally picked slow compilers in the generics dilemma.
+This is not necessarily the end of the world (the resulting runtime performance improvements are real), but it does mean that you’ll have to fight tooth and nail for reasonable build times in larger projects.
+
rustc implements what is probably the most advanced incremental compilation algorithm in production compilers, but this feels a bit like fighting with language compilation model.
+
+Unlike C++, the Rust build is not embarrassingly parallel; the amount of parallelism is limited by the length of the critical path in the dependency graph.
+If you have 40+ cores to compile, this shows.
+
Rust also lacks an analog for the pimpl idiom, which means that changing a crate requires recompiling (and not just relinking) all of its reverse dependencies.
+
+
Maturity
+
+
Five years old, Rust is definitely a young language.
+Even though its future looks bright, I will bet more money on “C will be around in ten years” than on “Rust will be around in ten years”
+(See Lindy Effect).
+If you are writing software to last decades, you should seriously consider risks associated with picking new technologies.
+(But keep in mind that picking Java over Cobol for banking software in 90s retrospectively turned out to be the right choice).
+
There’s only one complete implementation of Rust — the rustc compiler.
+The most advanced alternative implementation, mrustc, purposefully omits many static safety checks.
+rustc at the moment supports only a single production-ready backend — LLVM.
+Hence, its support for CPU architectures is narrower than that of C, which has GCC implementation as well as a number of vendor specific proprietary compilers.
+
Finally, Rust lacks an official specification.
+The reference is a work in progress, and does not yet document all the fine implementation details.
+
+
Alternatives
+
+
There are other languages besides Rust in systems programming space, notably, C, C++, and Ada.
+
Modern C++ provides tools and guidelines for improving safety.
+There’s even a proposal for a Rust-like lifetimes mechanism!
+Unlike Rust, using these tools does not guarantee the absence of memory safety issues.
+Modern C++ is safer, Rust is safe.
+However, if you already maintain a large body of C++ code, it makes sense to check if following best practices and using sanitizers helps with security issues.
+This is hard, but clearly is easier than rewriting in another language!
+
If you use C, you can use formal methods to prove the absence of undefined behaviors, or just exhaustively test everything.
+
Ada is memory safe if you don’t use dynamic memory (never call free).
+
Rust is an interesting point on the cost/safety curve, but is far from the only one!
+
+
Tooling
+
+
Rust tooling is a bit of a hit and miss.
+The baseline tooling, the compiler and the build system
+(cargo), are often cited as best in class.
+
But, for example, some runtime-related tools (most notably, heap profiling) are just absent — it’s hard to reflect on the runtime of the program if there’s no runtime!
+Additionally, while IDE support is decent, it is nowhere near the Java-level of reliability.
+Automated complex refactors of multi-million line programs are not possible in Rust today.
+
+
Integration
+
+
Whatever the Rust promise is, it’s a fact of life that today’s systems programming world speaks C, and is inhabited by C and C++.
+Rust intentionally doesn’t try to mimic these languages — it doesn’t use C++-style classes or C ABI.
+
That means that integration between the worlds needs explicit bridges.
+These are not seamless.
+They are unsafe, not always completely zero-cost and need to be synchronized between the languages.
+While the general promise of piece-wise integration holds up and the tooling catches up, there is accidental complexity along the way.
+
One specific gotcha is that Cargo’s opinionated world view (which is a blessing for pure Rust projects) might make it harder to integrate with a bigger build system.
+
+
Performance
+
+
“Using LLVM” is not a universal solution to all performance problems.
+While I am not aware of benchmarks comparing performance of C++ and Rust at scale, it’s not too hard to come up with a list of cases where Rust leaves some performance on the table relative to C++.
+
The biggest one is probably the fact that Rust’s move semantics is based on values (memcpy at the machine code level).
+In contrast, C++ semantics uses special references you can steal data from (pointers at the machine code level).
+In theory, the compiler should be able to see through a chain of copies; in practice, it often doesn’t: #57077.
+A related problem is the absence of placement new — Rust sometimes needs to copy bytes to/from the stack, while C++ can construct the thing in place.
+
Somewhat amusingly, Rust’s default ABI (which is not stable, to make it as efficient as possible) is sometimes worse than that of C: #26494.
+
Finally, while in theory Rust code should be more efficient due to the significantly richer aliasing information, enabling aliasing-related optimizations triggers LLVM bugs and miscompilations: #54878.
+
But, to reiterate, these are cherry-picked examples, sometimes the field is tilted the other way.
+For example, std::unique_ptr has a performance problem which Rust’s Box lacks.
+
A potentially bigger issue is that Rust, with its definition time checked generics, is less expressive than C++.
+So, some C++ template tricks for high performance are not expressible in Rust using a nice syntax.
+
+
Meaning of Unsafe
+
+
An idea which is even more core to Rust than ownership & borrowing is perhaps that of unsafe boundary.
+That, by delineating all dangerous operations behind unsafe blocks and functions and insisting on providing a safe higher-level interface to them, it is possible to create a system which is both
+
+
+sound (non-unsafe code can’t cause undefined behavior),
+
+
+and modular (different unsafe blocks can be checked separately).
+
+
+
It’s pretty clear that the promise works out in practice: fuzzing Rust code unearths panics, not buffer overruns.
+
But the theoretical outlook is not as rosy.
+
+First, there’s no definition of the Rust memory model, so it is impossible to formally check if a given unsafe block is valid or not.
+There’s an informal definition of “things rustc does or might rely on” and an in-progress runtime verifier, but the actual model is in flux.
+So there might be some unsafe code somewhere which works OK in practice today, but might be declared invalid tomorrow and broken by a new compiler optimization next year.
+
Second, there’s also an observation that unsafe blocks are not, in fact, modular.
+Sufficiently powerful unsafe blocks can, in effect, extend the language.
+Two such extensions might be fine in isolation, but lead to undefined behavior if used simultaneously:
+Observational equivalence and unsafe code.
Here are some thing I have deliberately omitted from the list:
+
+
+Economics (“it’s harder to hire Rust programmers”) — I feel that the “maturity” section captures the essence of it that is not reducible to a chicken-and-egg problem.
+
+
+Dependencies (“stdlib is too small / everything has too many deps”) — given how good Cargo and the relevant parts of the language are, I personally don’t see this as a problem.
+
+
+Dynamic linking (“Rust should have stable ABI”) — I don’t think this is a strong argument. Monomorphization is pretty fundamentally incompatible with dynamic linking and there’s C ABI if you really need to. I do think that the situation here can be improved, but I don’t think that improvement needs to be Rust-specific.
+
Rust thread-locals are slower than they could be.
+This is because they violate the zero-cost abstraction principle, specifically the “you don’t pay for what you don’t use” bit.
+
Rust’s thread-local implementation(
+1,
+2
+) comes with built-in support for laziness — thread locals are initialized on the first access.
+Sometimes this overhead is a big deal, as thread locals are a common tool for writing high-performance code.
+For example, allocator fast path often involves looking into thread-local heap.
+
There’s an unstable #[thread_local] attribute for a zero-cost implementation
+(see the tracking issue).
+
Let’s see how much the “is the thread local initialized?” check costs by comparing these two programs:
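(The Rust side looks roughly like the sketch below; the iteration count here is arbitrary, and the C version mirrors it with a _Thread_local global.)

```rust
use std::cell::Cell;
use std::time::Instant;

thread_local! {
    static ACC: Cell<u64> = Cell::new(0);
}

fn main() {
    let n: u64 = 1_000_000_000; // arbitrary iteration count for the sketch
    let start = Instant::now();
    for step in 0..n {
        let term = step.wrapping_mul(step) ^ step;
        ACC.with(|acc| acc.set(acc.get().wrapping_add(term)));
    }
    let result = ACC.with(|acc| acc.get());
    println!("{} in {:?}", result, start.elapsed());
}
```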
+
+
+
+
+
In this test, we declare an integer thread-local variable, and use it as an accumulator for the summation.
+
+We use a non-trivial summation term, (step * step) ^ step — this is to prevent LLVM from evaluating the sum at compile time.
+If a term of a summation is a polynomial (like 1, step or step * step), then the sum itself is a one degree higher polynomial, and LLVM can figure this out!
+We rely on wrapping overflow of unsigned integers in C, and use wrapping_mul and wrapping_add in Rust.
+To make sure that both programs are equivalent, we also print the result.
+
One optimization we specifically don’t protect from is caching thread-local access.
+That is, instead of doing a billion thread-local loads and stores, the compiler could generate code to compute the sum into a local variable, and do a single store at the end.
+This is because “can the compiler optimize thread-local access?” is exactly the property we want to measure.
+
There’s no standard way to get monotonic wall-clock time in C, so the C version is not cross-platform.
+
This code gives the following results on my machine:
+
+
+
This benchmark doesn’t allow us to measure the cost of thread-local access per se, but the overall time is about 2x longer for Rust.
+
Can we make Rust faster?
+I don’t know how to do that, but I know how to cheat.
+We can apply a general Rust extension trick — write some C code and link it with Rust!
+
Let’s implement a simple C library which declares a thread-local and provides access to it:
+
+
+
Link it with Rust:
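(One way to do it — a sketch of a build.rs using the cc crate, assuming the C file is named tls.c and cc is listed under [build-dependencies].)

```rust
// build.rs
fn main() {
    // Compile the C file and link the resulting static library into the crate.
    cc::Build::new().file("tls.c").compile("tls");
}
```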
+
+
+
And use it:
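(A sketch; the get_tls/set_tls names are hypothetical stand-ins for whatever the C library actually exports.)

```rust
extern "C" {
    fn get_tls() -> u64;       // hypothetical accessor exported by the C library
    fn set_tls(value: u64);    // hypothetical setter exported by the C library
}

fn main() {
    let n: u64 = 1_000_000_000; // same arbitrary iteration count as before
    for step in 0..n {
        let term = step.wrapping_mul(step) ^ step;
        unsafe { set_tls(get_tls().wrapping_add(term)) }
    }
    println!("{}", unsafe { get_tls() });
}
```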
+
+
+
The results are underwhelming:
+
+
+
This is expected — we replaced access to a thread local with a function call.
+As we are crossing the language boundary, the compiler can’t inline it, which destroys performance.
+However, there’s a way around that: Rust allows cross-language Link Time Optimization (docs).
+That is, Rust and C compilers can cooperate, to allow the linker to do inlining across the languages.
+
This requires manually aligning a bunch of stars:
+
+
+
The C compiler, the Rust compiler and the linker must use the same version of LLVM.
+As you might have noticed, this excludes gcc.
+I had luck with rustc 1.46.0, clang 10.0.0, and LLD 10.0.0.
+
+
+
-flto=thin in the C compiler flags.
+
+
+
RUSTFLAGS:
+
+
+
+
+
Now, just recompiling the old code gives the same performance for C and Rust:
+
+
+
Interestingly, this is the same performance we get without any thread-locals at all:
+
+
+
+So, either the compiler/linker was able to lift thread-local access out of the loop, or its cost is masked by the arithmetic.
+
Full code for the benchmarks is available at https://github.com/matklad/ftl.
+Note that this research only scratches the surface of the topic: thread locals are implemented differently on different OSes.
+Even on a single OS, there can be differences depending on compilation flags (dynamic libraries differ from static libraries, for example).
+Looking at the generated assembly could also be illuminating (code on Compiler Explorer).
In this article, we’ll dissect the implementation of the std::io::Error type from Rust’s standard library.
+The code in question is here:
+library/std/src/io/error.rs.
+
You can read this post as either of:
+
+
+A study of a specific bit of standard library.
+
+
+An advanced error management guide.
+
+
+A case of a beautiful API design.
+
+
+
+The article requires basic familiarity with Rust error handling.
+
+
When designing an Error type for use with Result<T, E>, the main question to ask is “how will the error be used?”.
+Usually, one of the following is true.
+
+
+
The error is handled programmatically.
+The consumer inspects the error, so its internal structure needs to be exposed to a reasonable degree.
+
+
+
The error is propagated and displayed to the user.
The consumer doesn't inspect the error beyond its fmt::Display output, so its internal structure can be encapsulated.
+
+
+
Note that there’s a tension between exposing implementation details and encapsulating them. A common anti-pattern for implementing the first case is to define a kitchen-sink enum:
+
+
+
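To make the shape concrete, such an enum might look something like this (the variants other than ConnectionDiscovery, which is discussed below, are invented for illustration):

```rust
use std::io;

// Stand-ins for error types coming from dependencies.
#[derive(Debug)]
pub struct TlsError { pub details: String }
#[derive(Debug)]
pub struct DiscoveryError { pub attempts: u32, pub endpoints: Vec<String> }

// The kitchen-sink enum: every failure mode of every layer, exposed directly.
#[derive(Debug)]
pub enum ClientError {
    Io(io::Error),
    Tls(TlsError),                       // a dependency's error in the public API
    ConnectionDiscovery(DiscoveryError), // a large variant inflating size_of::<ClientError>()
    InvalidInput(String),
}
```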
There are a number of problems with this approach.
+
First, exposing errors from underlying libraries makes them a part of your public API.
A major semver bump in a dependency would require you to make a new major version as well.
+
Second, it sets all the implementation details in stone.
+For example, if you notice that the size of ConnectionDiscovery is huge, boxing this variant would be a breaking change.
+
Third, it is usually indicative of a larger design issue.
+Kitchen sink errors pack dissimilar failure modes into one type.
+But, if failure modes vary widely, it probably isn’t reasonable to handle them!
This is an indication that the situation looks more like case two.
+
+
+
However bad the enum approach might be, it does achieve maximum inspectability of the first case.
+
The propagation-centered second case of error management is typically handled by using a boxed trait object.
+A type like Box<dyn std::error::Error> can be constructed from any specific concrete error, can be printed via Display, and can still optionally expose the underlying error via dynamic downcasting.
+The anyhow crate is a great example of this style.
+
The case of std::io::Error is interesting because it wants to be both of the above and more.
+
+
+This is std, so encapsulation and future-proofing are paramount.
+
+
+IO errors coming from the operating system often can be handled (for example, EWOULDBLOCK).
+
+
+For a systems programming language, it’s important to expose the underlying OS error exactly.
+
+
The set of potential future OS errors is unbounded.
+
+
+io::Error is also a vocabulary type, and should be able to represent some not-quite-os errors.
For example, Rust Paths can contain internal 0 bytes, and opening such a path should return an io::Error before making a syscall.
+
+
+
Here’s what std::io::Error looks like:
+
+
+
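Abridged and slightly simplified, the layout is roughly the following (consult the linked error.rs for the real definitions, which may differ in detail):

```rust
use std::io::ErrorKind;

pub struct Error {
    repr: Repr,
}

enum Repr {
    Os(i32),             // a raw OS error code
    Simple(ErrorKind),   // just a category, no payload
    Custom(Box<Custom>), // an arbitrary boxed error plus its category
}

struct Custom {
    kind: ErrorKind,
    error: Box<dyn std::error::Error + Send + Sync>,
}
```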
First thing to notice is that it’s an enum internally, but this is a well-hidden implementation detail.
To allow inspecting and handling of various error conditions, there's a separate public fieldless kind enum:
+
+
+
Although both ErrorKind and Repr are enums, publicly exposing ErrorKind is much less scary.
A #[non_exhaustive] Copy fieldless enum's design space is a point — there are no plausible alternatives or compatibility hazards.
+
Some io::Errors are just raw OS error codes:
+
+
+
The platform-specific sys::decode_error_kind function takes care of mapping error codes to the ErrorKind enum.
+All this together means that code can handle error categories in a cross-platform way by inspecting the .kind().
+However, if the need arises to handle a very specific error code in an OS-dependent way, that is also possible.
+The API carefully provides a convenient abstraction without abstracting away important low-level details.
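For example, a caller can handle the portable category via .kind() and still peek at the raw code when needed (a sketch):

```rust
use std::io::{self, ErrorKind, Read};

fn read_some(src: &mut impl Read, buf: &mut [u8]) -> io::Result<usize> {
    match src.read(buf) {
        Ok(n) => Ok(n),
        // Cross-platform handling: inspect the category via .kind().
        Err(e) if e.kind() == ErrorKind::Interrupted => Ok(0),
        Err(e) => {
            // OS-specific handling is still possible via the raw code.
            if let Some(code) = e.raw_os_error() {
                eprintln!("raw OS error: {}", code);
            }
            Err(e)
        }
    }
}
```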
+
An std::io::Error can also be constructed from an ErrorKind:
+
+
+
This provides cross-platform access to error-code style error handling.
+This is handy if you need the fastest possible errors.
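For instance (a sketch, not code from std):

```rust
use std::io::{self, ErrorKind};

fn lookup(key: &str) -> io::Result<String> {
    // Constructed from just a kind: no allocation, no OS error code.
    if key.is_empty() {
        return Err(io::Error::from(ErrorKind::InvalidInput));
    }
    Ok(format!("value for {}", key))
}
```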
+
Finally, there’s a third, fully custom variant of the representation:
+
+
+
Things to note:
+
+
+
Generic new function delegates to monomorphic _new function.
+This improves compile time, as less code needs to be duplicated during monomorphization.
+I think it also improves the runtime a bit: the _new function is not marked as inline, so a function call would be generated at the call-site.
+This is good, because error construction is the cold-path and saving instruction cache is welcome.
+
+
+
The Custom variant is boxed — this is to keep overall size_of smaller.
+On-the-stack size of errors is important: you pay for it even if there are no errors!
+
+
+
Both these types refer to a 'static error:
+
+
+
In a dyn Trait + '_, the '_ is elided to 'static, unless the trait object is behind a reference, in which case it is elided as &'a dyn Trait + 'a.
+
+
+
get_ref, get_mut and into_inner provide full access to the underlying error.
Similarly to the OS error case, the abstraction blurs details, but also provides hooks to get the underlying data as-is.
+
+
+
Similarly, Display implementation reveals the most important details about internal representation.
+
+
+
To sum up, std::io::Error:
+
+
encapsulates its internal representation and optimizes it by boxing the large enum variant,


provides a convenient way to handle errors based on category via the ErrorKind pattern,


fully exposes the underlying OS error, if any,
+
+
+can transparently wrap any other error type.
+
+
+
The last point means that io::Error can be used for ad-hoc errors, as &str and String are convertible to Box<dyn std::error::Error>:
+
+
+
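A sketch of what that enables:

```rust
use std::io;

fn parse_header(bytes: &[u8]) -> io::Result<u32> {
    if bytes.len() < 4 {
        // A &str (or String) converts into Box<dyn Error + Send + Sync>,
        // so it can be used directly as an ad-hoc error payload.
        return Err(io::Error::new(io::ErrorKind::InvalidData, "header is truncated"));
    }
    Ok(u32::from_le_bytes([bytes[0], bytes[1], bytes[2], bytes[3]]))
}
```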
It also can be used as a simple replacement for anyhow.
I think some libraries might simplify their error handling with this:
+
+
+
For example, serde_json provides the following method:
+
+
+
Read can fail with io::Error, so serde_json::Error needs to be able to represent io::Error internally.
+I think this is backwards (but I don’t know the whole context, I’d be delighted to be proven wrong!), and the signature should have been this instead:
+
+
+
Then, serde_json::Error wouldn't have an Io variant, and would instead be stashed into an io::Error with the InvalidData kind.
+
+
+
I think std::io::Error is a truly marvelous type, which manages to serve many different use-cases without much compromise.
+But can we perhaps do better?
+
The number one problem with std::io::Error is that, when a file-system operation fails, you don’t know which path it has failed for!
+This is understandable — Rust is a systems language, so it shouldn’t add much fat over what OS provides natively.
The OS returns an integer return code, and coupling that with a heap-allocated PathBuf could be an unacceptable overhead!
+
+
+
I don’t know an obviously good solution here.
One option would be to add a compile-time (once we get std-aware cargo) or runtime (a-la RUST_BACKTRACE) switch to heap-allocate all path-related IO errors.
+A similarly-shaped problem is that io::Error doesn’t carry a backtrace.
+
The other problem is that std::io::Error is not as efficient as it could be:
+
+
+
Its size is pretty big:
+
+
+
+
+
For the custom case, it incurs double indirection and an allocation:
+
+
+
+
+
I think we can fix this now!
+
First, we can get rid of double indirection by using a thin trait object, a-la
+failure or
+anyhow.
Now that GlobalAlloc exists, it's a relatively straightforward implementation.
+
Second, we can make use of the fact that pointers are aligned, and stash both Os and Simple variants into usize with the least significant bit set.
+I think we can even get creative and use the second least significant bit, leaving the first one as a niche.
+That way, even something like io::Result<i32> can be pointer-sized!
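Here is a rough sketch of that bit-packing idea (not the actual std layout, and glossing over provenance and Drop):

```rust
// A Box is at least 4-byte aligned, so a real pointer always has its two
// low bits equal to 00. The pointer-less variants can then be tagged with
// the remaining bit patterns.
struct PackedError(usize);

const TAG_OS: usize = 0b01;
const TAG_SIMPLE: usize = 0b10;

impl PackedError {
    fn os(code: i32) -> PackedError {
        PackedError(((code as u32 as usize) << 2) | TAG_OS)
    }
    fn simple(kind_discriminant: u8) -> PackedError {
        PackedError((usize::from(kind_discriminant) << 2) | TAG_SIMPLE)
    }
    fn custom(payload: Box<String>) -> PackedError {
        // Low bits 00: the untouched pointer itself.
        PackedError(Box::into_raw(payload) as usize)
    }
    fn is_os(&self) -> bool {
        self.0 & 0b11 == TAG_OS
    }
}
```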
+
And this concludes the post.
+Next time you’ll be designing an error type for your library, take a moment to peer through
+sources
+of std::io::Error, you might find something to steal!
These are my notes after learning the Paxos algorithm.
+The primary goal here is to sharpen my own understanding of the algorithm, but maybe someone will find this explanation of Paxos useful!
+This post assumes fluency with mathematical notation.
Paxos is an algorithm for implementing distributed consensus.
+Suppose you have N machines which communicate over a faulty network.
The network may delay, reorder, and lose messages (it cannot corrupt them, though).
Some machines might die, and might return later.
Due to network delays, "machine is dead" and "machine is temporarily unreachable" are indistinguishable.
What we want to do is to make machines agree on some value.
"Agree" here means that if some machine says "value is X", and another machine says "value is Y", then X is necessarily equal to Y.
It is OK for a machine to answer "I don't know yet".
+
The problem with this formulation is that Paxos is an elementary, but subtle algorithm.
+To understand it (at least for me), a precise, mathematical formulation is needed.
+So, let’s try again.
+
What is Paxos?
+Paxos is a theorem about sets!
+This is definitely mathematical, and is true (as long as you base math on set theory), but is not that helpful.
+So, let’s try again.
+
What is Paxos?
+Paxos is a theorem about nondeterministic state machines!
+
A system is characterized by a state.
+The system evolves in discrete steps: each step takes system from state to state'.
Transitions are non-deterministic: from a single current state s1, you may get to different next states s2 and s3
(non-determinism models a flaky network).
+An infinite sequence of system’s states is called a behavior:
+
+
+
Due to non-determinism, there’s a potentially infinite number of possible behaviors.
+Nonetheless, depending on the transition function, we might be able to prove that some condition is true for any state in any behavior.
+
Let’s start with a simple example, and also introduce some notation.
+I won’t use TLA+, as I don’t enjoy its concrete syntax.
+Instead, math will be set in monospaced unicode.
+
The example models an integer counter.
+Each step the counter decrements or increments (non-deterministically), but never gets too big or too small
+
+
+
The state of the system is a single variable — counter.
+It holds a natural number.
+In general, we will represent a state of any system by a fixed set of variables.
+Even if the system logically consists of several components, we model it using a single unified state.
+
The Init formula specifies the initial state, the counter is zero.
+Note that = is a mathematical equality, and not an assignment.
+Init is a predicate on states.
+
Init is true for {counter: 0}.
+Init is false for {counter: 92}.
+
Next defines a non-deterministic transition function.
+It is a predicate on pairs of states, s1 and s2.
+counter is a variable in the s1 state, counter' is the corresponding variable in the s2 state.
+In plain English, transition from s1 to s2 is valid if one of these is true:
+
+
+Value of counter in s1 is less than 9 and value of counter in s2 is greater by 1.
+
+
+Value of counter in s1 is greater than 0, and value of counter in s2 is smaller by 1.
+
+
+
Next is true for ({counter: 5}, {counter: 6}).
+Next is false for ({counter: 5}, {counter: 5}).
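To make the notation concrete, here is the same specification transliterated into executable Rust predicates (purely illustrative; the post itself sticks to monospaced math):

```rust
// The state is the single variable `counter`.
type State = i64;

fn init(s: State) -> bool {
    s == 0
}

// Next is a predicate on pairs of states: increment while below 9,
// or decrement while above 0.
fn next(s1: State, s2: State) -> bool {
    (s1 < 9 && s2 == s1 + 1) || (s1 > 0 && s2 == s1 - 1)
}

fn main() {
    assert!(init(0) && !init(92));
    assert!(next(5, 6) && !next(5, 5));
}
```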
+
Here are some behaviors of this system:
+
+
+0 → 1 → 2 → 3 → 4 → 5 → 6 → 7 → 8 → 9
+
+
+0 → 1 → 0 → 1 → 0 → 1
+
+
+0 → 1 → 2 → 3 → 2 → 1 → 0
+
+
+
Here are some non-behaviors of this system:
+
+
+1 → 2 → 3 → 4 → 5: Init does not hold for initial state
+
+
+0 → 2: Next does not hold for (0, 2) pair
+
+
+0 → 1 → 0 → -1: Next does not hold for (0, -1) pair
+
+
+
“behavior” means that the initial state satisfies Init, and each transition satisfies Next.
+
We can state and prove a theorem about this system: for every state in every behavior, the value of counter is between 0 and 9.
+Proof is by induction:
+
+
+The condition is true in the initial state.
+
+
+If the condition is true for state s1, and Next holds for (s1, s2), then the condition is true for s2.
+
+
+QED.
+
+
+
As usual with induction, sometimes we would want to prove a stronger property, because it gives us more powerful base for an induction step.
+
To sum up, we define a non-deterministic state machine using two predicates Init and Next.
+Init is a predicate on states which restricts possible initial states.
+Next is a predicate on pairs of states, which defines a non-deterministic transition function.
+Vars section describes the state as a fixed set of typed variables.
+Sets defines auxiliary fixed sets, elements of which are values of variables.
+Theorem section specifies a predicate on behaviors: sequences of steps evolving according to Init and Next.
+
The theorem does not automatically follow from Init and Next, it needs to be proven.
+Alternatively, we can simulate a range of possible behaviors on a computer and check the theorem for the specific cases.
+If the set of reachable states is small enough (finite would be a good start), we can enumerate all behaviors and produce a brute force proof.
+If there are too many reachable states, we can’t prove the theorem this way, but we often can prove it to be wrong, by finding a counter example.
+This is the idea behind model checking in general and TLA+ specifically.
Having mastered the basic vocabulary, let’s start slowly building towards Paxos.
+We begin with defining what consensus is.
+As this is math, we’ll do it using sets.
+
+
+
The state of the system is a set of chosen values.
+For this set to constitute consensus (over time) we need two conditions to hold:
+
+
+at most one value is chosen
+
+
+if we choose a value at one point in time, we stick to it (math friendly: any two chosen values are equal to each other)
+
+
+
Here’s the simplest possible implementation of consensus:
+
+
+
In the initial state, the set of chosen values is empty.
+We can make a step if the current set of chosen values is empty, in which case we select an arbitrary value.
+
This technically breaks our behavior theory: we require behaviors to be infinite, but, for this spec, we can only make a single step.
+The fix is to allow empty steps: a step which does not change the state at all is always valid.
+We call such steps “stuttering steps”.
+
The proof of the first condition of the consensus theorem is a trivial induction.
+The proof of the second part is actually non-trivial, here’s a sketch.
+Assume that i and j are indices, which violate the condition.
+They might be far from each other in state-space, so we can’t immediately apply Next.
+So let’s choose the smallestj1 ∈ [i+1;j] such that the condition is violated.
+Let i1 = j1 - 1.
+The condition is still violated for (i1, j1) pair, but this time they are subsequent steps, and we can show that Next does not hold for them, concluding the proof.
+
Yay! We have a distributed consensus algorithm which works for 1 (one) machine:
Let’s try to extend this to a truly distributed case, where we have N machines (“acceptors”).
+We start with formalizing the naive consensus algorithm: let acceptors vote for values, and select the value which gets a majority of votes.
+
+
+
The state of the system is the set of all votes cast by all acceptors.
+We represent a vote as a pair of an acceptor and the value it voted for.
+Initially, the set of votes is empty.
+On each step, some acceptor casts a vote for some value (adds (a, v) pair to the set of votes), but only if it hasn’t voted yet.
+Remember that Next is a predicate on pairs of states, so we check votes for existing vote, but add a new one to votes'.
The value is chosen if the set of acceptors which voted for the value ({a ∈ 𝔸: (a, v) ∈ votes}) is more than half as large as the set of all acceptors.
+In other words, if a majority of acceptors has voted for the value.
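Purely as an illustration, the same rules can be transliterated into Rust predicates (assuming, say, five acceptors):

```rust
use std::collections::BTreeSet;

type Acceptor = u32;
type Value = u32;
// The state: the set of all (acceptor, value) votes cast so far.
type Votes = BTreeSet<(Acceptor, Value)>;

const ACCEPTORS: u32 = 5;

fn init(votes: &Votes) -> bool {
    votes.is_empty()
}

// One acceptor casts one vote, and only if it hasn't voted before.
fn next(votes1: &Votes, votes2: &Votes) -> bool {
    if !(votes1.is_subset(votes2) && votes2.len() == votes1.len() + 1) {
        return votes1 == votes2; // otherwise, only a stuttering step is allowed
    }
    let &(a, _v) = votes2.difference(votes1).next().unwrap();
    votes1.iter().all(|&(a1, _)| a1 != a)
}

// A value is chosen if a majority of acceptors voted for it.
fn chosen(votes: &Votes, v: Value) -> bool {
    let supporters = votes.iter().filter(|&&(_, v1)| v1 == v).count() as u32;
    2 * supporters > ACCEPTORS
}
```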
+
+
+
Let’s prove consensus theorem for Majority Vote protocol.
+TYPE ERROR, DOES NOT COMPUTE.
The consensus theorem is a predicate on behaviors of states consisting of the chosen variable.
Here, chosen isn't a variable, votes is!
chosen is a function which maps the current state to the set of chosen values.
+
While it is intuitively clear what “consensus theorem” would look like for this case, let’s make this precise.
+Let’s map states with votes variable to states with chosen variable using the majority rule, f.
+This mapping naturally extends to a mapping between corresponding behaviors (sequences of steps):
+
+
+
Now we can precisely state that for every behavior B of majority voting spec, the theorem holds for f(B).
+This yields a better way to prove this!
+Instead of proving the theorem directly (which would again require i1, j1 trick), we prove that our mapping f is a homomorphism.
+That is, we prove that if votes_0 → votes_1 → ... is a behavior of the majority voting spec, then f(votes_0) → f(votes_1) → ... is a behavior of the consensus spec.
This lets us re-use the existing proof.
+
The proof for the initial step is trivial, but let's spell it out just to appreciate the amount of detail a human mind can glance through:
+
+
+
Let’s show that if Majority Vote’s Next_m holds for (votes, votes'), then Consensus’s Next_c holds for (f(votes), f(votes')).
+There’s one obstacle on our way: this claim is false!
+Consider a case with three acceptors and two values: 𝔸 = {a1, a2, a3}, 𝕍 = {v1, v2}.
+Consider these values of votes and votes':
+
+
+
If you just mechanically check Next, you see that it works!
+a3 hasn’t cast its vote, so it can do this now.
+The problem is that chosen(votes) = {v1} and chosen(votes') = {v1, v2}.
+
We are trying to prove too much!
+f works correctly only for states reachable from Init, and the bad value of votes where a1 votes twice is not reachable.
+
So, we first should prove a lemma: each acceptor votes at most once.
+After that, we can prove Next_m(votes, votes') = Next_c(f(votes), f(votes')) under the assumption of at most once voting.
Specifically, if |f(votes')| turns out to be larger than 1, then we can pick two majorities which voted for different values, which allows us to pin down a single acceptor which voted twice, which is a contradiction.
+The rest is left as an exercise for the reader :)
+
So, majority vote indeed implements consensus.
+Let’s look closer at the “majority” condition.
+It is clearly important.
+If we define chosen as
+
+
+
then it's easy to construct a behavior with several chosen values.
+The property of majority we use is that any two majorities have at least one acceptor in common.
+But any other condition with this property would work as well as majority.
For example, we can assign an integer weight to each acceptor, and require the sum of the voters' weights to be more than half of the total weight.
As a more specific example, consider a set of four acceptors {a, b, c, d}.
+
Its majorities are:
+
+
+
But the following set of sets would also satisfy non-empty intersection condition:
+
+
+
Operationally, it is strictly better, as fewer acceptors are needed to reach a decision.
+
So let’s refine the protocol to a more general form.
+
+
+
We require specifying a set of quorums — a set of subsets of acceptors such that every two quorums have at least one acceptor in common.
+The value is chosen if there exists a quorum such that its every member voted for the value.
+
There’s one curious thing worth noting here.
+Consensus is a property of the whole system, there’s no single “place” where we can point to and say “hey, this is it, this is consensus”.
+Imagine 3 acceptors, sitting on Earth, Venus, and Mars, and choosing between values v1 and v2.
+They can execute Quorum Vote algorithm without communicating with each other at all.
They will necessarily reach consensus without knowing which specific value they agreed on!
+An external observer can then travel to the three planets, collect the votes and discover the chosen value, but this feature isn’t built into the algorithm itself.
+
OK, so we’ve just described an algorithm for finding consensus among N machines, proved the consensus theorem for it, and noted that it has staggering communication efficiency: zero messages.
+Should we collect our Turing Award?
+
Well, no, there’s a big problem with Quorum Vote — it can get stuck.
+Specifically, if there are three values, and the votes are evenly split between them, then no value is chosen, and only stuttering steps are possible.
If acceptors can vote for different values, it might happen that no value receives a majority of votes.
+Voting satisfies the safety property, but not the liveness property — the algorithm can get stuck even if all machines are on-line and communication is perfect.
+
There is a simple fix to the problem, with a rich historical tradition among many “democratic” governments.
+Let’s have a vote, and let’s pick the value chosen by the majority, but let’s allow to vote only for a single candidate value:
+
+
+
The new condition says that an acceptor is only allowed to cast a vote if all other votes are for the same value.
+As a special case, if the set of votes is empty, the acceptor can vote for any value (but all other acceptors would have to vote for this value afterwards).
+
From a mathematical point of view, this algorithm is perfect.
From a practical standpoint, not so much: an acceptor casting the first vote somehow needs to make sure that it is indeed the first one.
+The obvious fix to this problem is to assign a unique integer number to each acceptor, call the highest-numbered acceptor “leader”, and allow only the leader to cast the first decisive vote.
+
So acceptors first communicate with each other to figure out who the leader is, then the leader casts the vote, and the followers follow.
+But this also violates liveness: if the leader dies, then the followers would wait indefinitely.
A fix for this problem is to let the second highest acceptor take over the leadership if the leader perishes.
But under our assumptions, it's impossible to distinguish a situation where the leader is dead from a situation where it just has a really bad internet connection.
So naively picking a successor would lead to a split vote and a standstill again (power transitions are known to be problematic for authoritarian regimes in real life too!).
+If only there were some kind of … distributed consensus algorithm for picking the leader!
This is the place where we start discussing real Paxos :-)
+It starts with a “ballot voting” algorithm.
+This algorithm, just like the ones we’ve already seen, does not define any messages.
+Rather, message passing is an implementation detail, so we’ll get to it later.
+
Recall that rigged voting requires all acceptors to vote for a single value.
+It is immune to split voting, but is susceptible to getting stuck when the leader goes offline.
+The idea behind ballot voting is to have many voting rounds, ballots.
+In each ballot, acceptors can vote only for a single value, so each ballot individually can get stuck.
+However, as we are running many ballots, some ballots will make progress.
+The value is chosen in a ballot if it is chosen by some quorum of acceptors.
+The value is chosen in an overall algorithm if it is chosen in some ballot.
+
The Turing award question is: how do we make sure that no two ballots choose different values?
+Note that it is OK if two ballots choose the same value.
+
Let’s just brute force this question, really.
+First, assume that the ballots are ordered (for example, by numbering them with natural numbers).
+And let’s say we want to pick some value v to vote for in ballot b.
When is v safe?
+Well, when no other value v1 can be chosen by any other ballot.
+Let’s tighten this up a bit.
+
Value v is safe at ballot b if any smaller ballot b1 (b1 < b) did not choose and will not choose any value other than v.
+
So yeah, easy-peasy, we just need to predict which values will be chosen in the future, and we are done!
+We’ll deal with it in a moment, but let’s first convince ourselves that, if we only select safe values for voting, we won’t violate consensus spec.
+
So, when we select a safe value v to vote for in a particular ballot, it might get chosen in this ballot.
+We need to check that it won’t conflict with any other value.
+For smaller ballots that’s easy — it’s the definition of safety condition.
+What if we conflict with some value v1 chosen in a future ballot?
+Well, that value is also safe, so whoever chose v1, was sure that it won’t conflict with v.
+
How do we tackle the precognition problem?
+We’ll ask acceptors to commit to not voting in certain ballots.
+For example, if you are looking for a safe value for ballot b and know that there’s a quorum q such that each quorum member never voted in smaller ballots, and promised to never vote in smaller ballots, you can be sure that any value is safe.
+Indeed, any quorum in smaller ballots will have at least one member which would refuse to vote for any value.
+
Ok, but what if there’s some quorum member which has already voted for some v1 in some ballot b1 < b?
+(Take a deep breath, the next sentence is the kernel of the core idea of Paxos).
+Well, that means that v1 was safe at b1, so, if there will be no votes between b1 and b, v1 is also safe at b!
+(Exhale).
+In other words, to pick a safe value at b we:
+
+
+Take some quorum q.
+
+
+Make everyone in q promise to never vote in ballots earlier than b.
+
+
+Among all of the votes already cast by the quorum members we pick the one with the highest ballot number.
+
+
+If such vote exists, its value is a safe value.
+
+
+Otherwise, any value is safe.
+
+
+
To implement the “never vote” promise, each acceptor will maintain maxBal value.
+It will never vote in ballots smaller or equal to maxBal.
+
Let’s stop hand-waving and put this algorithm in math.
+Again, we are not thinking about messages yet, and just assume that each acceptor can observe the state of the whole system.
+
+
+
Let’s unwrap this top-down.
+First, the chosen condition says that it is enough for some quorum to cast votes in some ballot for a value to be accepted.
+It’s trivial to see that, if we fix the ballot, then any two quorums would vote for the same value — quorums intersect.
+Showing that quorums vote for the same value in different ballots is the tricky bit.
+
The Init condition is simple — no votes, any acceptor can vote in any ballot (= any ballot with number larger than -1).
+
The Next consists of two cases.
On each step of the protocol, some acceptor either votes for some value in some ballot, ∃ v ∈ 𝕍: Vote(a, b, v), or declares that it won't cast additional votes in smaller ballots, AdvanceMaxBal(a, b).
Advancing the ballot just sets maxBal for this acceptor (but takes care not to rewind older decisions).
+Casting a vote is more complicated and is predicated on three conditions:
+
+
+We haven’t forfeited our right to vote in this ballot.
+
+
+If there’s some vote in this ballot already, we are voting for the same value.
+
+
+If there are no votes, then the value should be safe.
+
+
+
Note that the last two checks overlap a bit: if the set of votes cast in a ballot is not empty, we immediately know that the value is safe: somebody has proven this before.
+But it doesn’t harm to check for safety again: a safe value can not become unsafe.
+
Finally, the safety check.
It is done in relation to some quorum — if q proves that v is safe, then members of this quorum would prevent any other value from being accepted in earlier ballots.
To be able to do this, we first need to make sure that q has indeed finalized its votes for ballots less than b (maxBal is at least b - 1).
+Then, we need to find the latest vote of q.
+There are two cases
+
+
+No one in q ever voted (b1 = -1).
+In this case, there are no additional conditions on v, any value would work.
+
+
+Someone in q voted, and b1 is the last ballot when someone voted.
+Then v must be the value voted for in b1.
+This implies Safe(v, b1).
+
+
+
If all of these conditions are fulfilled, we cast our vote and advance maxBal.
+
This is the hardest part of the article.
+Take time to fully understand Ballot Vote.
+
+
+
Rigorously proving that Ballot Voting satisfies Consensus would be tedious — the specification is large, and the proof would necessarily use every single piece of the spec!
+But let’s add some hand-waving.
Again, we want to provide a homomorphism from Ballot Voting to Consensus.
Cases where the image of a step is a stuttering step (the set of chosen values is the same) are obvious.
It's also obvious that the set of chosen values never decreases (we never remove votes, so a value cannot become unchosen).
+It also increases by at most one value with each step.
+
The complex case is to prove that, if currently only v1 is chosen, no other v2 can be chosen as a result of the current step.
+Suppose the contrary, let v2 be the newly chosen value, and v1 be a different value chosen some time ago.
+v1 and v2 can’t belong to the same ballot, because every ballot contains votes only for a single value (this needs proof!).
Let's say they belong to b1 and b2, and that b1 < b2.
Note that v2 might belong to b1 — nothing prevents a smaller ballot from finishing later.
+When we chose v2 for b2, it was safe.
+This means that some quorum either promised not to vote in b1 (but then v1 couldn’t have been chosen in b1), or someone from the quorum voted for v2 in b1 (but then v1 = v2 (proving this might require repeated application of safety condition)).
+
Ok, but is this better than Majority Voting?
+Can Ballot Voting get stuck?
No — if at least one quorum of machines is online, they can bump their maxBal to a ballot bigger than any existing one.
After they do this, there will necessarily be a safe value relative to this quorum, which they can then vote for.
+
However, Ballot Voting is prone to a live lock — if acceptors continue to bump maxBal instead of voting, they’ll never select any value.
+In fact, in the current formulation one needs to be pretty lucky to not get stuck.
+To finish voting, there needs to be a quorum which can vote in ballot b, but not in any smaller ballot, and in the above spec this can only happen by luck.
+
It is impossible to completely eliminate live locks without assumptions about real time. However, when we implement Ballot Voting with real message passing, we try to reduce the probability of a live lock.
One final push left!
+Given the specification of Ballot Voting, how do we implement it using message passing?
+Specifically, how do we implement the logic for selecting the first (safe) value for the ballot?
+
The idea is to have a designated leader for each ballot.
+As there are many ballots, we don’t need a leader selection algorithm, and can just statically assign ballot leaders.
+For example, if there are N acceptors, acceptor 0 can lead ballots 0, N, 2N, …, acceptor 1 can lead 1, N + 1, 2N + 1, … etc.
+
To select a value for ballot b, the ballot’s leader broadcasts a message to initiate the ballot.
Upon receiving this message, each acceptor advances its maxBal to b - 1, and sends the leader its latest vote, unless the acceptor has already made a promise not to vote in b.
+If the leader receives replies from some quorum, it can be sure that this quorum won’t vote in smaller ballots.
+Besides, the leader knows quorum’s votes, so it can pick a safe value.
+
In other words, the practical trick for picking a safe value is to ask some quorum to abstain from voting in small ballots and to pick a value consistent with votes already cast.
+This is the first phase of Paxos, consisting of two message types, 1a and 1b.
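In Rust-flavoured pseudocode, the leader's value-picking step might look like this (a sketch; ballots and values are plain numbers here):

```rust
// Each element is one quorum member's reply to the 1a message:
// None if the acceptor never voted, Some((ballot, value)) for its latest vote.
fn pick_safe_value(quorum_replies: &[Option<(u64, u64)>], any_value: u64) -> u64 {
    quorum_replies
        .iter()
        .flatten()
        .max_by_key(|&&(ballot, _)| ballot) // highest-ballot vote among the quorum
        .map(|&(_, value)| value)           // its value is safe
        .unwrap_or(any_value)               // nobody voted: any value is safe
}
```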
+
The second phase is to ask the quorum to cast the votes.
+The leader picks a safe value and broadcasts it for the quorum.
Quorum members vote for the value, unless in the meantime they have promised a leader of a bigger ballot not to vote.
+After a member voted, it broadcasts its vote.
+When a quorum of votes is observed, the value is chosen and the consensus is reached.
+This is the second phase of Paxos with messages 2a and 2b.
+
Let’s write this in math!
To model message passing, we will use the msgs variable: a set of messages which have ever been sent.
+Sending a message is adding it to this set.
+Receiving a message is asserting that it is contained in the set.
+By not removing messages, we model reorderings and duplications.
+
The messages themselves will be represented by records. For example, phase 1a message which initiates voting in ballot b will look like this:
+
+
+
Another bit of state we'll need is lastVote — for each acceptor, the last ballot the acceptor voted in, together with the corresponding vote.
+It will be null if the acceptor hasn’t voted.
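A Rust-flavoured sketch of the four message kinds described above (the spec itself uses mathematical records):

```rust
type Ballot = u64;
type Acceptor = u32;
type Value = u64;

#[derive(Clone, PartialEq, Eq, Hash)]
enum Msg {
    // Phase 1a: the leader of ballot b initiates the ballot.
    OneA { b: Ballot },
    // Phase 1b: acceptor a promises not to vote below b and reports its last vote.
    OneB { a: Acceptor, b: Ballot, last_vote: Option<(Ballot, Value)> },
    // Phase 2a: the leader asks the quorum to vote for v in b.
    TwoA { b: Ballot, v: Value },
    // Phase 2b: acceptor a has voted for v in b.
    TwoB { a: Acceptor, b: Ballot, v: Value },
}
```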
+
Without further ado,
+
+
+
Let’s go through each of the phases.
+
Phase1a initiates ballot b.
+It is executed by the ballot’s leader, but there’s no need to model who exactly the leader is, as long as it is unique.
+This stage simply broadcasts 1a message.
+
Phase1b is executed by an acceptor a.
+If a receives 1a message for ballot b and it can vote in b, then it replies with its lastVote.
+If it can’t vote (it has already started some larger ballot), it simply doesn’t respond.
+If enough acceptors don’t respond, the ballot will get stuck, but some other ballot might succeed.
+
Phase2a is the tricky bit: it checks if the value v is safe for ballot b.
+
First, we need to make sure that we haven’t already initiated Phase2a for this ballot.
+Otherwise, we might initiate Phase2a for different values.
+Here is the bit where it is important that the ballot’s leader is stable.
+The leader needs to remember if it has already picked a safe value.
+
Then, we collect 1b messages from some quorum (we need to make sure that every quorum member has sent a 1b message for this ballot).
+Value v is safe if the whole quorum didn’t vote (vote is null), or if it is the value of the latest vote of some quorum member.
+We know that quorum members won’t vote in earlier ballots, because they had increased maxBal before sending 1b messages.
+
If the value indeed turns out to be safe, we broadcast 2a message for this ballot and value.
+
Finally, in Phase2b an acceptor a votes for this value, if its maxBal is still good.
+The bookkeeping is updating maxBal, lastVote, and sending the 2b message.
+
The set of 2b messages corresponds to the votes variable of the Ballot Voting specification.
There’s a famous result called FLP impossibility: Impossibility of Distributed Consensus with One Faulty Process.
+But we’ve just presented Paxos algorithm, which works as long as more than half of the processes are alive.
+What gives?
The FLP theorem states that there's no consensus algorithm with only finite behaviors.
+Stated in a positive way, any asynchronous distributed consensus algorithm is prone to live-lock.
+This is indeed the case for Paxos.
+
Liveness can be improved under partial synchrony assumptions.
That is, if we give each process a good enough clock, we can say things like "if no process fails, Paxos completes in t seconds".
If this is the case, we can fix live locking (ballots conflicting with each other) by using a naive leader selection algorithm to select the single acceptor which can initiate ballots.
+If we don’t reach consensus after t seconds, we can infer that someone has failed and re-run naive leader selection.
+If we are unlucky, naive leader selection will produce two leaders, but this won’t be a problem for safety.
+
Paxos requires atomicity and durability to function correctly.
For example, once the leader has picked a safe value and broadcast a 2a message, it should persist the selected value.
+Otherwise, if it goes down and then resurrects, it might choose a different value.
+How to make a choice of value atomic and durable?
+Write it to a local database!
How to make a local transaction atomic and durable?
Write it first into the write-ahead log.
+How to write something to WAL?
+Using the write syscall/DMA.
+What happens if the power goes down exactly in the middle of the write operation?
+Well, we can write a chunk of bytes with a checksum!
+Even if the write itself is not atomic, a checksummed write is!
+If we read the record from disk and checksum matches, then the record is valid.
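As a sketch of that last idea (with a hand-rolled FNV-1a checksum, purely for illustration):

```rust
// FNV-1a, just to have *some* checksum without external dependencies.
fn checksum(bytes: &[u8]) -> u64 {
    bytes.iter().fold(0xcbf29ce484222325u64, |h, &b| {
        (h ^ u64::from(b)).wrapping_mul(0x100000001b3)
    })
}

// A record is: length, payload, checksum of the payload.
fn encode_record(payload: &[u8]) -> Vec<u8> {
    let mut out = Vec::new();
    out.extend_from_slice(&(payload.len() as u32).to_le_bytes());
    out.extend_from_slice(payload);
    out.extend_from_slice(&checksum(payload).to_le_bytes());
    out
}

// A torn write almost certainly breaks the checksum, so a record that
// verifies is treated as having been written completely.
fn decode_record(bytes: &[u8]) -> Option<&[u8]> {
    let len = u32::from_le_bytes(bytes.get(..4)?.try_into().ok()?) as usize;
    let payload = bytes.get(4..4 + len)?;
    let stored = u64::from_le_bytes(bytes.get(4 + len..4 + len + 8)?.try_into().ok()?);
    (checksum(payload) == stored).then_some(payload)
}
```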
+
I use a slightly different definition of maxBal (less by one) than the one in the linked lecture, so don't get confused about this!
Some time ago I wrote a reddit comment explaining the benefits of IDEs.
+Folks refer to it from time to time, so I decided to edit it into an article form.
+Enjoy!
+
I think I have a rather balanced perspective on IDEs.
+I used to be a heavy Emacs user (old config, current config).
+I worked at JetBrains on IntelliJ Rust for several years.
+I used evil mode and vim for a bit, and tried tmux and kakoune.
+Nowadays, I primarily use VS Code to develop rust-analyzer: LSP-based editor-independent IDE backend for Rust.
+
I will be focusing on IntelliJ family of IDEs, as I believe these are the most advanced IDEs today.
+
The main distinguishing feature of IntelliJ is semantic understanding of code.
+The core of IntelliJ is a compiler which parses, type checks and otherwise understands your code.
+PostIntelliJ is the canonical post about this.
+That article also refutes the claim that “Smalltalk IDE is the best we’ve ever had”.
+
Note that “semantic understanding” is mostly unrelated to the traditional interpretation of “IDE” as Integrated Development Environment.
+I personally don’t feel that the “Integrated” bit is all that important.
+I commit&push from the command line using Julia scripts, rebase in magit, and do code reviews in a browser.
+If anything, there’s an ample room for improvement for the integration bits.
For me, the I in "IDE" stands for "intelligent", smart.
+
Keep in mind this terminology difference.
+I feel it is a common source of misunderstanding.
"Unix and command line can do anything an IDE can do" is correct about the integrated bits, but is wrong about the semantic bits.
+
Traditional editors like Vim or Emacs understand programming languages very approximately, mostly via regular expressions.
+For me, this feels very wrong.
+It’s common knowledge that HTML shall not be parsed with regex.
+Yet this is exactly what happens every time one does vim index.html with syntax highlighting on.
+I sincerely think that almost every syntax highlighter out there is wrong and we, as an industry, should do better.
+I also understand that this is a tall order, but I do my best to change the status quo here :-)
+
These are mostly theoretical concerns though.
+The question is, does semantic understanding help in practice?
+I am pretty sure that it is non-essential, especially for smaller code bases.
+My first non-trivial Rust program was written in Emacs, and it was fine.
+Most of rust-analyzer was written using pretty spartan IDE support.
+There are a lot of insanely-productive folks who are like “sometimes I type vim, sometimes I type vi, they are sufficiently similar”.
+Regex-based syntax highlighting and regex based fuzzy symbol search (ctags) get you a really long way.
+
However, I do believe that features unlocked by deep understanding of the language help.
+The funniest example here is extend/shrink selection.
This feature allows you to extend the current selection to the next encompassing syntactic construct.
+It’s the simplest feature a PostIntelliJ IDE can have, it only needs the parser.
+But it is sooo helpful when writing code, it just completely blows vim’s text objects out of the water, especially when combined with multiple cursors.
+In a sense, this is structural editing which works for text.
+
+
+
If you add further knowledge of the language into the mix, you'll get the "assists" system: micro-refactorings which are available in a particular context.
+For example, if the cursor is on a comma in a list of function arguments, you can alt+enter > “swap arguments”, and the order of arguments will be changed in the declaration and on various call-sites as well.
+(See this post to learn how assists are implemented).
+
These small dwim things add up to a really nice editing experience, where you mostly express the intention, and the IDE deals with boring syntactical aspects of code editing:
+
+
+
For larger projects, complex refactors are a huge time-saver.
+Doing project-wide renames and signature changes automatically and without thinking reduces the cost of keeping the code clean.
+
Another transformative experience is navigation.
+In IntelliJ, you generally don’t “open a file”.
Instead you think directly in terms of functions, types and modules, and navigate to those using file structure, goto symbol, goto definition/implementation/type, etc:
When I used Emacs, I really admired its buffer management facilities, because they made opening a file I want a breeze.
+When I later switched to IntelliJ, I stopped thinking in terms of a set of opened files altogether.
+I disabled editor tabs and started using editor splits less often — you don’t need bookmarks if you can just find things.
+
For me, there’s one aspect of traditional editors which is typically not matched in IDEs out of the box — basic cursor motion.
+Using arrow keys for that is slow and flow-breaking, because one needs to move the hand from the home row.
+Even Emacs’ horrific C-p, C-n are a big improvement, and vim’s hjkl go even further.
+One fix here is to configure each tool to use your favorite shortcuts, but this is a whack-a-mole game.
What I do is remap CapsLock to act as an extra modifier, such that ijkl are arrow keys.
(There are also keyboards with hardware support for this.)
This works the same way in all applications.
Easy motion / ace jump functionality for jumping to any visible character is also handy, and is usually available via a plugin.
+
Recent advancements with LSP protocol promise to give one the best of both worlds, where semantic-aware backend and light-weight editor frontend are different processes, which can be mixed and matched.
+This is nice in theory, but not as nice in practice as IntelliJ yet, mostly because IntelliJ is way more polished.
+
To give a simple example, in IntelliJ for “go to symbol by fuzzy name” functionality, I can filter the search scope by:
+
+
+is this my code/code from a dependency?
+
+
+is this test/production code?
+
+
+is a symbol a type-like thing, or a method-like thing?
+
+
+path to the module where the symbol is defined.
+
+
+
VS Code and LSP simply do not have capabilities for such filters yet, they have to be bolted on using hacks.
+Support for LSP in other editors is even more hit-and-miss.
+
LSP did achieve a significant breakthrough — it made people care about implementing IDE backends.
+Experience shows that re-engineering an existing compiler to power an IDE is often impossible, or isomorphic to a rewrite.
+How a compiler talks to an editor is the smaller problem.
+The hard one is building a compiler that can do IDE stuff in the first place.
+Check out this post for some of the technical details.
+Starting with this use-case in mind saves a lot of effort down the road.
+
This I think is a big deal.
+I hypothesize that the reason why IDEs do not completely dominate tooling landscape is the lack of good IDE backends.
+
If we look at the set of languages that have been fairly popular recently, a significant fraction of them are dynamically typed: PHP, JavaScript, Python, Ruby.
+The helpfulness of an IDE for dynamically typed languages is severely limited: while approximations and heuristics can get you a long way, you still need humans in the loop to verify IDE’s guesses.
+
There’s C++, but its templates are effectively dynamically typed, with exactly the same issues (and a very complex base language to boot).
+Curiously, C looks like a language for which implementing a near-perfect IDE is pretty feasible.
+I don’t know why it didn’t happen before CLion.
+
This leaves C# and Java.
+Indeed, these languages are dominated by IDEs.
+There’s a saying that you can’t write Java without an IDE.
+I think it gets the causation direction backwards: Java is one of the few languages for which it is possible to implement a great IDE without great pain.
+Supporting evidence here is Go.
According to survey results, text editors are steadily declining in popularity in favor of IDEs.
+
I think this is because Go actually has good IDEs.
+This is possible because the language is sufficiently statically typed for an IDE to be a marked improvement.
+Additionally, the language is very simple, so the amount of work you need to put in to make a decent IDE is much lower than for other languages.
+If you have something like JavaScript…
+Well, you first need to build an alternative language for which you can actually implement an IDE (TypeScript) and only then you can build the IDE itself (VS Code).
The Midori error model makes a sharp distinction between two kinds of errors:
+
+
+bugs in the program, like indexing an array with -92
+
+
error conditions in the program's environment (reading a file which doesn't exist)
+
+
+
In Rust, those correspond to panics and Results.
+It’s important to not mix the two.
+
std, I think, sadly does mix them in the sync API.
+The following APIs convert panics to recoverable results:
+
+
+Mutex::lock
+
+
+thread::JoinHandle::join
+
+
+mpsc::Sender::send
+
+
+
All those APIs return an error when the other thread has panicked.
This leads to people using ? with these methods, using recoverable error handling for bugs in the program.
+
In my mind, a better design would be to make those API panic by default.
Sometimes synchronization points also happen to be failure isolation boundaries.
+More verbose result-returning catching_lock, catching_join, catching_send would work for those special cases.
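Here is a sketch of what that shape could look like for Mutex (catching_lock is the hypothetical name from above, not an actual std API):

```rust
use std::sync::{Mutex, MutexGuard, PoisonError};

struct PanickingMutex<T>(Mutex<T>);

impl<T> PanickingMutex<T> {
    // The default: a poisoned lock means a bug somewhere, so propagate the panic.
    fn lock(&self) -> MutexGuard<'_, T> {
        self.0
            .lock()
            .expect("mutex poisoned: another thread panicked while holding it")
    }

    // The explicit opt-in for when the lock really is a failure isolation boundary.
    fn catching_lock(&self) -> Result<MutexGuard<'_, T>, PoisonError<MutexGuard<'_, T>>> {
        self.0.lock()
    }
}
```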
+
If std::Mutex did implement lock poisoning, but the lock method returned a LockGuard<T>, rather than Result<LockGuard<T>, PoisonError>, then we wouldn’t be discussing poisoning in the rust book, in every mutex example, and wouldn’t consider changing the status quo.
+At the same time, we’d preserve “safer” semantics of lock poisoning.
+
There’s an additional consideration here.
+In a single-threaded program, panic propagation is linear.
+One panic is unwound past a sequence of frames.
If we get a second panic in some Drop, the result is a process abort.
+
In a multi-threaded program, the stack is tree-shaped.
+What should happen if one of the three parallel threads panics?
+I believe the right semantics here is that siblings are cancelled, and then the panic is propagated to the parent.
+How to implement cancellation is an open question.
+If two children panic, we should propagate a pair of panics.
A topic closely related to lock poisoning is unwind safety — the UnwindSafe and RefUnwindSafe traits.
I want to share an amusing story about how this machinery almost, but not quite, saved my bacon.
+
rust-analyzer implements cancellation via unwinding.
+After a user types something and we have new code to process, we set a global flag.
+Long-running background tasks like syntax highlighting read this flag and, if it is set, panic with a struct Cancelled payload.
+We use resume_unwind and not panic to avoid printing backtrace.
+After the stack is unwound, we can start processing new code.
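A minimal sketch of the scheme (the names CANCELLED, Cancelled, and check_cancelled are invented for this illustration; the real machinery in rust-analyzer is more involved):

```rust
use std::panic::{catch_unwind, resume_unwind, AssertUnwindSafe};
use std::sync::atomic::{AtomicBool, Ordering};

static CANCELLED: AtomicBool = AtomicBool::new(false);

struct Cancelled;

fn check_cancelled() {
    if CANCELLED.load(Ordering::Relaxed) {
        // resume_unwind skips the panic hook, so no backtrace is printed.
        resume_unwind(Box::new(Cancelled));
    }
}

fn highlight() -> Option<Vec<String>> {
    let result = catch_unwind(AssertUnwindSafe(|| {
        let mut spans = Vec::new();
        for i in 0..1_000 {
            check_cancelled(); // long-running work polls the flag
            spans.push(format!("span {}", i));
        }
        spans
    }));
    match result {
        Ok(spans) => Some(spans),
        // Cancellation: drop the partial result and let the caller start over.
        Err(payload) if payload.is::<Cancelled>() => None,
        // Anything else is a real bug: keep unwinding.
        Err(payload) => resume_unwind(payload),
    }
}
```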
+
This means that rust-analyzer’s data, stored in the Db type, needs to be unwind safe.
+
One day, while I was idly hacking on rust-analyzer during a Rust all-hands, I noticed a weird compilation error telling me that Db doesn't implement the corresponding trait.
+What’s worse, removing the target directory fixed the bug.
+This was an instance of incorrect incremental compilation.
+
The problem stemmed from two issues:
+
+
+UnwindSafe and RefUnwindSafe are auto traits, and inference rules for those are complicated
+
+
the Db type has a curiously recurring template structure
+
+
+
With incremental compilation in the mix, something somewhere went wrong.
+
The compiler bug was fixed after several months, but, to work around it in the meantime, we’ve added a manual impl UnwindSafe for Db which masked the bug.
+
A couple more months passed, and we started integrating chalk into rust-analyzer.
At that time, chalk had its own layer of caching, in addition to the incremental compilation of rust-analyzer itself.
+So we had something like this:
+
+
+
(We used parking_lot for perf, and to share mutex impl between salsa and rust-analyzer).
+
Now, one of the differences between std::Mutex and parking_lot::Mutex is lock poisoning.
+And that means that std::Mutex is unwind safe (as it just becomes poisoned), while parking_lot::Mutex is not.
+Chalk used some RefCell’s internally, so it wasn’t unwind safe.
+So the whole Db stopped being UnwindSafe after addition of chalk.
But because we had that manual impl UnwindSafe for Db, we didn't notice this.
+
And that led to a heisenbug.
If cancellation happened during trait solving, we unwound past ChalkSolver.
And, as it didn't have strict exception safety guarantees, that messed up its internal state.
So the next trait solving query would observe really weird errors, like index out of bounds inside chalk.
+
The solution was to:
+
+
+remove the manual impl (by that time the underlying compiler bug was fixed).
+
+
+get the Db: !UnwindSafe expected error.
+
+
+replace parking_lot::Mutex with std::Mutex to get unwind-safety.
+
+
+change calls to .lock to propagate cancellation.
+
+
+
The last point is interesting, it means that we need support for recoverable poisoning in this case.
+We need to understand that the other thread was cancelled mid-operation (so that chalk’s state might be inconsistent).
+And we also need to re-raise the panic with a specific payload — the Cancelled struct.
+This is because the situation is not a bug.
This post documents call site dependency injection pattern.
+It is a rather low level specimen and has little to do with enterprise DI.
+The pattern is somewhat Rust-specific.
+
Usually, when you implement a type which needs some user-provided functionality, the first thought is to supply it in constructor:
+
+
+
In this example, we implement Engine and the caller supplies Config.
+
An alternative is to pass the dependency to every method call:
+
+
+
In Rust, the latter (call-site injection) sometimes works better with lifetimes.
+Let’s see the examples!
In the first example, we want to lazily compute a field’s value based on other fields.
+Something like this:
+
+
+
The problem with this design is that it doesn’t work in Rust.
+The closure in Lazy needs access to self, and that would create a self-referential data structure!
+
The solution is to supply the closure at the point where the Lazy is used:
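One way this can look, using the once_cell crate (an assumption on my part; the post's own example may differ in detail):

```rust
use once_cell::unsync::OnceCell;

struct Widget {
    name: String,
    // No closure is stored here, so the struct is not self-referential.
    shouting_name: OnceCell<String>,
}

impl Widget {
    fn shouting_name(&self) -> &str {
        // The closure is supplied at the call site and may freely borrow `self`.
        self.shouting_name.get_or_init(|| self.name.to_uppercase())
    }
}
```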
The next example is about plugging a custom hash function into a hash table.
+In Rust’s standard library, this is only possible on the type level, by implementing the Hash trait for a type.
+A more general design would be to parameterize the table with a hash function at run-time.
+This is what C++ does.
+However in Rust this won’t be general enough.
+
Consider a string interner, which stores strings in a vector and additionally maintains a hash-based index:
+
+
+
The set field stores the strings in a hash table, but it represents them using indices into neighboring vec.
+
Constructing the set with a closure won't work for the same reason Lazy didn't work — this creates a self-referential structure.
+In C++ there exists a work-around — it is possible to box the vec and share a stable pointer between Interner and the closure.
+In Rust, that would create aliasing, preventing the use of &mut Vec.
+
Curiously, using a sorted vec instead of a hash works with std APIs:
+
+
+
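A sketch of how that can look (simplified; the comparison closure borrows the neighboring vec right at the call site):

```rust
#[derive(Default)]
struct Interner {
    vec: Vec<String>,
    // Indices into `vec`, kept sorted by the string they point to.
    index: Vec<u32>,
}

impl Interner {
    fn intern(&mut self, s: &str) -> u32 {
        let vec = &self.vec;
        match self.index.binary_search_by(|&i| vec[i as usize].as_str().cmp(s)) {
            Ok(pos) => self.index[pos],
            Err(pos) => {
                let id = self.vec.len() as u32;
                self.vec.push(s.to_string());
                self.index.insert(pos, id);
                id
            }
        }
    }
}
```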
This is because the closure is supplied at the call site rather than at the construction site.
+
The hashbrown crate provides this style of API for hashes via RawEntry.
The third example is from the Zig programming language.
+Unlike Rust, Zig doesn’t have a blessed global allocator.
+Instead, containers in Zig come in two flavors.
+The “Managed” flavor accepts an allocator as a constructor parameter and stores it as a field
+(Source).
+The “Unmanaged” flavor adds an allocator parameter to every method
+(Source).
+
The second approach is more frugal — it is possible to use a single allocator reference with many containers.
The final example comes from the Rust language itself.
+To implement dynamic dispatch, Rust uses fat pointers, which are two words wide.
+The first word points to the object, the second one to the vtable.
+These pointers are manufactured at the point where a concrete type is used generically.
+
This is different from C++, where vtable pointer is embedded into the object itself during construction.
+
+
Having seen all these examples, I am warming up to Scala-style implicit parameters.
+Consider this hypothetical bit of Rust code with Zig-style vectors:
+
+
+
The problem here is Drop — freeing the vectors requires access to the allocator, and it's unclear how to provide one.
+Zig dodges the problem by using defer statement rather than destructors.
+In Rust with implicit parameters, I imagine the following would work:
+
+
+
+
To conclude, I want to share one last example where CSDI thinking helped me to discover a better application-level architecture.
+
A lot of rust-analyzer’s behavior is configurable.
+There are toggles for inlay hints, completion can be tweaked, and some features work differently depending on the editor.
+The first implementation was to store a global Config struct together with the rest of analysis state.
+Various subsystems then read bits of this Config.
+To avoid coupling distinct features together via this shared struct, config keys were dynamic:
+
+
+
This system worked, but felt rather awkward.
+
The current implementation is much simpler.
+Rather than storing a single Config as a part of the state, each method now accepts a specific config parameter:
+
+
+
Not only is the code simpler, it is also more flexible.
+Because configuration is no longer a part of the state, it is possible to use different configs for the same functionality depending on the context.
+For example, explicitly invoked completion might be different from the asynchronous one.
I’ve read a book about management and it helped me to solve a long-standing personal conundrum about the code review process.
+The book is “High Output Management”.
+Naturally, I recommend it (and read this “review” as well: https://apenwarr.ca/log/20190926).
+
One of the smaller ideas of the book is that of the managerial meddling.
+If my manager micro-manages me and always tells me what to do, I’ll grow accustomed to that and won’t be able to contribute without close supervision.
+This is a facet of a more general Task-Relevant Maturity framework.
+Irrespective of the overall level of seniority, a person has some expertise level for each specific task.
+The optimal quantity and quality of supervisor’s involvement depends on this level (TRM).
+When TRM grows, the management style should go from structured control to supervision to nudges and consultations.
+I don’t need a ton of support when writing Rust, I can benefit a lot from a thorough review when coding in Julia and I certainly require hand-holding when attempting to write Spanish!
+But the overarching goal is to improve my TRM, as that directly improves my productivity and frees up my supervisor’s time.
+The problem with meddling is not excessive control (it might be appropriate in low-TRM situations), it is that meddling removes the motivation to learn to take the wheel yourself.
+
Now, how on earth all this managerial gibberish relates to the pull request review?
+I now believe that there are two largely orthogonal (and even conflicting) goals to a review process.
+
One goal of a review process is good code.
+The review ensures that each change improves the overall quality of a code base.
+Without continuous betterment any code under change reverts to the default architecture: a ball of goo.
+
Another goal of a review is good coders.
+The review is a perfect mentorship opportunity, it is a way to increase contributor’s TRM.
+This is vital for community-driven open-source projects.
+
I personally always felt that the review process I use falls quite short of the proper level of quality.
+Which didn’t really square with me bootstrapping a couple of successful open source projects.
+Now I think that I just happen to optimize for the people’s aspect of the review process, while most guides
+(with a notable exception of Optimistic Merging) focus on code aspects.
+
Now, (let me stress this point), I do not claim that the second goal is inherently better (though it sounds nicer).
+It’s just that in the context of both IntelliJ Rust and rust-analyzer (green-field projects with massive scope, big uncertainties and limited payed-for hours) growing the community of contributors and maintainers was more important than maintaining perfectly clean code.
+
Reviews for quality are hard and time consuming.
+I personally can’t really review the code looking at the diff, I can give only superficial comments.
+To understand the code, most of the time I need to fetch it locally and to try to implement the change myself in a different way.
+To make a meaningful suggestion, I need to implement and run it on my machine (and the first two attempts won’t fly).
+Hence, a proper review for me takes roughly the same time as the implementation itself.
+Taking into account the fact that there are many more contributors than maintainers, this is an instant game over for reviews for quality.
+
Luckily, folks submitting PRs generally have medium/high TRM.
+They were able to introduce themselves to the codebase, find an issue to work on and come up with working code without me!
+So, instead of scrutinizing away every last bit of diff’s imperfection, my goal is to promote the contributor to an autonomous maintainer status.
+This is mostly just a matter of trust.
+I don’t read every line of code, as I trust the author of the PR to handle ifs and whiles well enough (this is the major time saver).
+I trust that people address my comments and let them merge their own PRs (bors d+).
+I trust that people can review others’ code, and share commit access (r+) liberally.
+
+
+
What new contributors don’t have and what I do talk about in reviews is the understanding of project-specific architecture and values.
+These are best demonstrated on specific issues with the diff.
+But the focus isn’t the improvement of a specific change, the focus is teaching the author of (hopefully) subsequent changes.
+I liberally digress into discussing general code philosophy issues.
+As disseminating this knowledge 1-1 is not very efficient, I also try to document it.
+Rather than writing a PR comment, I put the text into
+architecture.md or
+style.md
+and link that instead.
+I also try to do only a small fixed number of review rounds.
+Roughly, the PR is merged after two round-trips, not when there’s nothing left to improve.
+
All this definitely produces warm fuzzy feelings, but what about code quality?
+Gating PRs on quality is one way, but not the only way, to maintain clean code.
+The approach I use instead is continuous refactoring / asynchronous reviews.
+One of the (documented) values in rust-analyzer is that anyone is allowed and encouraged to refactor all the code, old and new.
+
Instead of blocking the PR, I merge it and then refactor the code in a follow-up (ccing the original author), when I touch this area next time.
+This gives me a much better context than a diff view, as I can edit the code in-place and run the tests.
+I also don’t waste time transforming the change I have in mind to a PR comment (the motivation bits go directly into comment/commit message).
+It’s also easy to do unrelated drive-by fixes!
+
I wish this asynchronous review workflow was better supported by tools.
+By default, changes are merged by the author, but the PR also goes to a review queue.
+Later, the reviewer looks at the merged code in the main branch.
+Any suggestions are submitted as a new PR, with the original author set as a reviewer.
+(The in-editor reviewing reminds me of the iron workflow.)
+
+
To conclude, let me reference another book.
+I like item 32 from “C++ Coding Standards”: be clear what kind of class you’re writing.
+A value type is not an interface is not a base class.
+All three are classes, but each needs a unique set of rules.
+
When doing/receiving a code review, understand the context and purpose.
+If this is a homework assignment, you want to share knowledge.
+In a critical crypto library, you need perfect code.
+And for a young open source project, you aim to get a co-maintainer!
If you maintain an open-source project in the range of 10k-200k lines of code, I strongly encourage you to add an ARCHITECTURE document next to README and CONTRIBUTING.
+Before going into the details of why and how, I want to emphasize that this is not another “docs are good, write more docs” advice.
+I am pretty sloppy about documentation, and, e.g., I often use just “simplify” as a commit message.
+Nonetheless, I feel strongly about the issue, even to the point of pestering you :-)
+
I have experience with both contributing to and maintaining open-source projects.
+One of the lessons I’ve learned is that the biggest difference between an occasional contributor and a core developer lies in the knowledge about the physical architecture of the project.
+Roughly, being unfamiliar with the project makes writing the patch itself 2x slower, but figuring out where to change the code 10x slower.
+This difference might be hard to perceive if you’ve been working with the project for a while.
+If I am new to a code base, I read each file as a sequence of logical chunks specified in some pseudo-random order.
+If I’ve made significant contributions before, the perception is quite different.
+I have a mental map of the code in my head, so I no longer read sequentially.
+Instead, I just jump to where the thing should be, and, if it is not there, I move it.
+One’s mental map is the source of truth.
+
I find the ARCHITECTURE file to be a low-effort high-leverage way to bridge this gap.
+As the name suggests, this file should describe the high-level architecture of the project.
+Keep it short: every recurring contributor will have to read it.
+Additionally, the shorter it is, the less likely it will be invalidated by some future change.
+This is the main rule of thumb for an ARCHITECTURE document: only specify things that are unlikely to change frequently.
+Don’t try to keep it synchronized with code.
+Instead, revisit it a couple of times a year.
+
Start with a bird’s eye overview of the problem being solved.
+Then, specify a more-or-less detailed codemap.
+Describe coarse-grained modules and how they relate to each other.
+The codemap should answer “where’s the thing that does X?”.
+It should also answer “what does the thing that I am looking at do?”.
+Avoid going into details of how each module works, pull this into separate documents or (better) inline documentation.
+A codemap is a map of a country, not an atlas of maps of its states.
+Use this as a chance to reflect on the project structure.
+Are the things you want to put near each other in the codemap adjacent when you run tree .?
+
Do name important files, modules, and types.
+Do not directly link them (links go stale).
+Instead, encourage the reader to use symbol search to find the mentioned entities by name.
+This doesn’t require maintenance and will help to discover related, similarly named things.
+
Explicitly call out architectural invariants.
+Often, important invariants are expressed as an absence of something, and it’s pretty hard to divine that from reading the code.
+Think about a common example from web development: the model layer does not depend on the views, yet no single line of code states this explicitly.
+
Point out boundaries between layers and systems as well.
+A boundary implicitly contains information about the implementation of the system behind it.
+It even constrains all possible implementations.
+But finding a boundary by just randomly looking at the code is hard — good boundaries have measure zero.
+
After finishing the codemap, add a separate section on cross-cutting concerns.
+
A good example of an ARCHITECTURE document is this one from rust-analyzer:
+architecture.md.
I want a better profiler for Rust.
+Here’s what a rust-analyzer benchmark looks like:
+
+
+
Here’s how I want to profile it:
+
+
+
First, the profiler prints to stderr:
+
+
+
Otherwise, if everything is set up correctly, the output is
+
+
+
The profile-results folder contains the following:
+
+
+report.txt with
+
+
+user, cpu, sys time
+
+
+cpu instructions
+
+
+stats for caches & branches, a la perf stat
+
+
+top ten functions by cumulative time
+
+
+top ten functions by self-time
+
+
+top ten hot-spots
+
+
+
+
+flamegraph.svg
+
+
+data.smth, which can be fed into some existing profiler UI (kcachegrind, firefox profiler, etc).
+
+
+report.html which contains a basic interactive UI.
+
+
+
To tweak settings, the following API is available:
+
+
+
Naturally, the following also works and produces an aggregate profile:
+
+
+
I don’t know how this should work.
+I think I would be happy with a perf-based Linux-only implementation.
+The perf-event crate by Jim Blandy (co-author of “Programming Rust”) is good.
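+
To make the wish a bit more concrete, here is a minimal sketch of counting CPU instructions with the perf-event crate (API recalled from its docs, so double-check there; the workload is a placeholder):
+
```rust
use perf_event::Builder;

fn main() -> std::io::Result<()> {
    // The default Builder counts retired instructions.
    let mut counter = Builder::new().build()?;

    counter.enable()?;
    let v: Vec<u64> = (0..1_000_000).collect(); // placeholder workload
    counter.disable()?;

    println!("{} instructions for {} elements", counter.read()?, v.len());
    Ok(())
}
```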
+
Have I missed something?
+Does this tool already exist?
+Or is it impossible for some reason?
I’ve been re-reading Ted Kaminski blog about software design.
+I highly recommend all the posts, especially the earlier ones
+(here’s the first).
+He manages to offer design advice which is both non-trivial and sound (a subjective judgment of course), a rare specimen!
+
Anyway, one of the insights of the series is that, when designing an abstraction, we always face the inherent tradeoff between power and properties.
+The more we can express using a particular abstraction, the less we can say about the code using it.
+Our human bias for more expressive power is not inherent, however.
+This is evident in programming language communities, where users unceasingly ask for new features while language designers say no.
+
Macros are a language feature which is very far in the “more power” side of the chart.
+Macros give you an ability to abstract over the source code.
+In exchange, you give up the ability to (automatically) reason about the surface syntax.
+As a specific example, rename refactoring doesn’t work 100% reliably in languages with powerful macro systems.
+
I do think that, in the ideal world, this is a wrong trade for a language which wants to scale to gigantic projects.
+The ability to automatically reason about and transform source code gains in importance when you add more programmers, more years, and more millions of lines of code.
+But take this with a huuuge grain of salt — I am obviously biased, having spent several years developing Rust IDEs.
+
That said, macros have a tremendous appeal — they are a language designer’s duct tape.
+Macros are rarely the best tool for the job, but they can do almost any job.
+The language design is incremental.
+A macro system relieves the design pressure by providing a ready poor man’s substitute for many features.
+
In this post, I want to explore what macros are used for in Rust.
+The intention is to find solutions which do not give up the “reasoning about source code” property.
By far, the most common use-case is the format! family of macros.
+The macro-less solution here is straightforward — a string interpolation syntax:
+
+
+
In Rust, interpolation probably shouldn’t construct a string directly.
+Instead, it can produce a value implementing Display (just like format_args!), which can avoid allocations.
+An interesting extension would be to allow iterating over format string pieces.
+That way, the interpolation syntax could be used for things like SQL statements or command line arguments, without the fear of introducing injection vulnerabilities:
+
+
+
+This post about the Julia programming language explains the issue.
+The xshell crate implements this idea for Rust.
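+
As a taste of the idea, a minimal sketch using xshell; the {message} interpolation expands to a single argument, so no shell injection is possible (error handling simplified):
+
```rust
use xshell::{cmd, Shell};

fn main() -> Result<(), Box<dyn std::error::Error>> {
    let sh = Shell::new()?;
    // The interpolated value is passed verbatim as one argument:
    // spaces, quotes and $(...) inside `message` are not interpreted by a shell.
    let message = "fix: handle spaces & $(subshells) literally";
    cmd!(sh, "git commit -m {message}").run()?;
    Ok(())
}
```
+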
I think the second most common, and probably the most important use of macros in Rust are derives.
+Rust is one of the few languages which gets equality right (and forbids comparing apples and oranges), but this crucially depends on the ability to derive(Eq).
+Common solutions in this space are special casing in the compiler (Haskell’s deriving) or runtime reflection.
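+
A tiny illustration of the “apples and oranges” point (the newtypes are made up):
+
```rust
// Distinct id types: equal only to themselves, thanks to the derived impls.
#[derive(PartialEq, Eq, Debug)]
struct AppleId(u32);

#[derive(PartialEq, Eq, Debug)]
struct OrangeId(u32);

fn main() {
    assert_eq!(AppleId(1), AppleId(1));
    // assert_eq!(AppleId(1), OrangeId(1)); // compile error: apples vs oranges
}
```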
+
+But the solution I am most excited about is C# source generators.
+They are nothing new — this is just good old (source) code generation with a nice quality of implementation.
+You can supply custom code which gets run during the build and which can read existing sources and generate additional files, which are then added back to the compilation.
+
The beauty of this solution is that it moves all the complexity out of the language and into the build system.
+This means that you get baseline tooling support for free.
+Goto definition for generated code? Just works.
+Want to step into some serialization code while debugging? There’s actual source code on disk, so feel free to!
+You are more of a printf person? Well, you’d need to convince the build system to not stomp over your changes, but, otherwise, why not?
+
Additionally, source generators turn out to be significantly more expressive.
+They can call into the Roslyn compiler to analyze the source code, so they are capable of type-directed code generation.
+
To be useful, source generators require some language level support for splitting a single entity across several files.
+In C#, partial classes play this role.
The raison d’être of macros is implementation of embedded DSLs.
+We want to introduce custom syntax within the language for succinctly modeling the program’s domain.
+For example, a macro can be used to embed HTML fragments in Rust code.
+
+To me personally, an eDSL is not a problem to be solved, but just a problem.
+Introducing a new sublanguage (even if small) spends a lot of cognitive complexity budget.
+If you need it once in a while, it’s better to stick to chaining together somewhat verbose function calls.
+If you need it a lot, it makes sense to introduce an external DSL, with a compiler, a language server, and all the tooling that makes programming productive.
+To me, macro-based DSLs just don’t feel like an interesting point on the cost-benefit curve.
+
That being said, the Kotlin programming language solves the problem of strongly-typed, tooling-friendly DSL nicely (example).
+Infuriatingly, it’s hard to pinpoint what specifically the solution is.
+It’s … just the concrete syntax mostly.
+Here are some ingredients:
+
+
+The syntax for closures is { arg -> body }, or just { body }, so closures syntactically resemble blocks.
+
+
+Extension methods (which are just sugar for static methods).
+
+
+Java style implicit this, which introduces names into scope without an explicit declaration.
+
+
+TCP-preserving inline closures (this is the single non-syntactical feature).
+
+
+
+Nonetheless, this was not enough to implement the Jetpack Compose UI DSL; it also needs a compiler plugin.
An interesting case of a DSL I want to call out is sqlx::query.
+It allows one to write code like this:
+
+
+
+This, I think, is one of the few cases where an eDSL really does pull its weight.
+I don’t know how to do this without macros.
+Using string interpolation (the advanced version to protect from injection), it is possible to specify the query.
+Using a source generator, it is possible to check the syntax of the query and verify the types, to, e.g., raise a type error in this case:
+
+
+
But this won’t be enough to generate an anonymous struct, or to get rid of dynamic casts.
Rust also uses macros for conditional compilation.
+This use case convincingly demonstrates the “lack of properties” aspect of power.
+Dealing with feature combinations is a perpetual headache for Cargo.
+Users have to repeatedly recompile large chunks of the crate graph when feature flags change.
+Catching a type error on CI with cargo test --no-default-features is pretty annoying, especially if you did run cargo test before submitting a PR.
+“Additive Features” is uncheckable wishful thinking.
+
In this case, I don’t know a good macro-less alternative.
+But, in principle, this seems doable, if conditional compilation is pushed further down the compiler pipeline, to the code generation and linking stage.
+Rather than discarding some code early during parsing, the compiler can select the platform-specific version just before producing machine code for a function.
+Before that, it checks that all conditionally-compiled versions of the function have the same interface.
+That way, platform-specific type errors are impossible.
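+
For reference, the status quo that this alternative would change, in a minimal made-up example of cfg-based conditional compilation:
+
```rust
// Today, the non-matching item is discarded before type checking, so a type
// error in, say, the Windows version stays invisible until someone actually
// builds with that configuration.
#[cfg(unix)]
fn null_device() -> &'static str {
    "/dev/null"
}

#[cfg(windows)]
fn null_device() -> &'static str {
    "NUL"
}

fn main() {
    println!("{}", null_device());
}
```
+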
The final use-case I want to cover is that of a placeholder syntax.
+Rust’s macro_call!(...) syntax carves out a well-isolated region where anything goes, syntax wise, as long as the parentheses are balanced.
+In theory, this allows language designers to experiment with provisional syntax before setting something in stone.
+In practice, it looks like this is not all that beneficial?
+There was some opposition to stabilizing postfix .await without going through an intermediate period with an await! macro.
+And, after stabilization, all syntax discussions were immediately forgotten?
+On the other hand, we did have try! -> ? transition, and I don’t think it helped to uncover any design pitfalls?
+At least, we managed to stabilize the unnecessarily restrictive desugaring on that one.
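+
For the record, here is what that transition looked like in code (a made-up example; the macro form is deprecated by now):
+
```rust
use std::fs::File;
use std::io;

// Before `?` was stabilized, error propagation went through a macro:
//
//     let file = try!(File::open("Cargo.toml"));
//
// The dedicated syntax that replaced it:
fn open_manifest() -> io::Result<File> {
    let file = File::open("Cargo.toml")?;
    Ok(file)
}

fn main() {
    if open_manifest().is_err() {
        eprintln!("no Cargo.toml here");
    }
}
```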
+
+
To conclude, I want to circle back to source generators.
+What exactly makes them easier for tooling than macros?
+I think the following three properties do.
+First, both input and output are, fundamentally, text.
+There’s no intermediate representation (like token trees) used by this meta-programming facility.
+This means that it doesn’t need to be integrated deeply with the compiler.
+Of course, internally the tool is free to parse, typecheck and transform the code however it likes.
+Second, there is a phase distinction.
+Source generators are executed once, in unordered fashion.
+There’s no back and forth between meta programming and name resolution, which, again, allows keeping the “meta” part outside.
+Third, source generators can only add code, they can not change the meaning of the existing code.
+This means that semantically sound source code transformations remain so in the presence of a code generator.
Hey, I have a short announcement to make: I am joining NEAR (sharded proof of stake public blockchain)!
+TL;DR: I’ll be spending 60% of my time on WASM runtime for smart contracts and 40% on rust-analyzer.
+
Why NEAR?
+One of the problems I have with current popular blockchain technologies is that they are not scalable.
+Every node needs to process every transaction in the network.
+For a network with N nodes that is roughly O(N^2) total work.
+NEAR aims to solve exactly this problem using the classic big data trick — sharding the data across several partitions.
+
Another aspect of NEAR I am particularly excited about is the strategic focus on the smart contract’s developer experience.
+That’s why NEAR is particularly interested in supporting rust-analyzer.
+Rust, with its top-notch WASM ecosystem and focus on correctness is a natural choice for writing contracts.
+At the same time, it is not the most approachable language there is.
+Good tooling can help a lot with surmounting the language’s inherent complexity, making writing smart contracts in Rust easy.
+
What does it mean for rust-analyzer?
+We’ll see: I will still be putting significant hours into it, although a bit less than previously.
+I’ll also help to manage rust-analyzer Open Collective.
+And, naturally, my know-how about building IDEs isn’t going anywhere :)
+At the same time, I am excited about lowering the bus factor and distributing rust-analyzer maintainership.
+I do want to take credit for initiating the effort, but it’s high time for some structured leadership rotation.
+It’s exciting to see @jonas-schievink from Ferrous System taking on more team leadership tasks.
+(I am hyped about support for inner items, kudos Jonas!)
+I am also delighted with the open source community that formed around rust-analyzer.
+@edwin0cheng,
+@flodiebold,
+@kjeremy,
+@lnicola,
+@SomeoneToIgnore,
+@Veetaha,
+@Veykril
+you are awesome, and rust-analyzer wouldn’t be possible without you ❤️
+
Finally, I can’t help but notice that IntelliJ Rust which I left completely a while ago is doing better than ever.
+Overall, I must say I am quite happy with today’s state of Rust IDE tooling.
+The basics are firmly in place.
+Let’s just finish the remaining 90%!
Any language has parametric polymorphism, eventually
+
If you start with just dynamic dispatch, you’ll end up adding generics down the road.
+This happened with C++ and Java, and is now happening with Go.
+The last one is interesting — even if you don’t carry accidental OOP baggage (inheritance), interfaces alone are not enough.
+
Why does it happen?
+Well, because generics are useful for simple things.
+Even if the language special-cases several parametric data structures, like Go does with slices, maps and channels, it is impossible to abstract over them.
+In particular, it’s impossible to write list_reverse or list_sort functions without some awkward workarounds.
+
Ok, but where’s the dilemma?
+The dilemma is that adding parametric polymorphism to the language opens floodgates of complexity.
+At least in my experience, Rust traits, Haskell type classes, and Java generics are the main reason why some libraries in those languages are hard to use.
+
It’s not that generics are inherently hard, fn reverse<T>(xs: [T]) -> [T] is simple.
+It’s that they allow creating complicated solutions, and this doesn’t play well with our human bias for complexity.
+
One thing I am wondering is whether a polymorphic language without bounded quantification would be practical?
+Again, in my anecdotal experience, cognitive complexity soars when there are bounds on type parameters: T: This<S> + That.
+But parametric polymorphism can be useful without them:
+
+
+
is equivalent to
+
+
+
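To illustrate the general idea (this is my own example, not necessarily the one elided above): a bound such as T: Ord can often be traded for passing the needed operation in as an explicit argument, leaving the type parameter itself unbounded.
+
```rust
// Unbounded parametric polymorphism: `T` can be anything, the only
// "capability" comes in as a plain function argument.
fn sort_by<T>(xs: &mut Vec<T>, less: fn(&T, &T) -> bool) {
    // Selection sort, to keep the sketch dependency-free.
    for i in 0..xs.len() {
        for j in i + 1..xs.len() {
            if less(&xs[j], &xs[i]) {
                xs.swap(i, j);
            }
        }
    }
}

fn main() {
    let mut xs = vec![3, 1, 2];
    sort_by(&mut xs, |a, b| a < b);
    assert_eq!(xs, vec![1, 2, 3]);
}
```
+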
Can we build an entire language out of this pattern?
Click bait title!
+We’ll actually look into how integration and unit tests are implemented in Cargo.
+A few guidelines for organizing test suites in large Cargo projects naturally arise out of these implementation differences.
+And, yes, one of those guidelines will turn out to be: “delete all integration tests but one”.
+
Keep in mind that this post is explicitly only about Cargo concepts.
+It doesn’t discuss relative merits of integration or unit styles of testing.
+I’d love to, but that’s going to be a loooong article some other day!
When you use Cargo, you can put #[test] functions directly next to code, in files inside src/ directory.
+Alternatively, you can put them into dedicated files inside tests/:
+
+
+
I stress that unit/integration terminology is based purely on the location of the #[test] functions, and not on what those functions actually do.
+
To build unit tests, Cargo runs
+
+
+
Rustc then compiles the library with --cfg test.
+It also injects a generated fn main(), which invokes all functions annotated with #[test].
+The result is an executable file which, when run subsequently by Cargo, executes the tests.
+
Integration tests are built differently.
+First, Cargo uses rustc to compile the library as usual, without --cfg test:
+
+
+
This produces an .rlib file — a compiled library.
+
Then, for each file in the tests directory, Cargo runs the equivalent of
+
+
+
That is, each integration test is compiled into a separate binary.
+Running those binaries executes the test functions.
Note that rustc needs to repeatedly re-link the library crate with each of the integration tests.
+This can add up to a significant compilation time blow up for tests.
+That is why I recommend that large projects should have only one integration test crate with several modules.
+That is, don’t do this:
+
+
+
Do this instead:
+
+
+
When a refactoring along these lines was applied to Cargo itself, the effects were substantial (numbers).
+The time to compile the test suite decreased 3x.
+The size of on-disk artifacts decreased 5x.
+
It can’t get better than this, right?
+Wrong!
+Rust tests by default are run in parallel.
+The main function that rustc generates spawns several threads to saturate all of the CPU cores.
+However, Cargo itself runs test binaries sequentially.
+This makes sense — otherwise, concurrently executing test binaries would oversubscribe the CPU.
+But this means that multiple integration tests leave performance on the table.
+The critical path is the sum of longest tests in each binary.
+The more binaries, the longer the path.
+For one of my projects, consolidating several integration tests into one reduced the time to run the test suite from 20 seconds to just 13.
+
A nice side-effect of a single modularized integration test is that sharing the code between separate tests becomes trivial, you just pull it into a submodule.
+There’s no need to awkwardly repeat mod common; for each integration test.
If the project I am working with is small, I don’t worry about test organization.
+There’s no need to make tests twice as fast if they are already nearly instant.
+
Conversely, if the project is large (a workspace with many crates) I worry about test organization a lot.
+Slow tests are a boiling frog kind of problem.
+If you do not proactively fix it, everything is fine up until the moment you realize you need to sink a week to untangle the mess.
+
For a library with a public API which is published to crates.io, I avoid unit tests.
+Instead, I use a single integration test, called it (short for integration test):
+
+
+
Integration tests use the library as an external crate.
+This forces the usage of the same public API that consumers use, resulting in better design feedback.
+
For an internal library, I avoid integration tests altogether.
+Instead, I use Cargo unit tests for “integration” bits:
+
+
+
That way, I avoid linking the separate integration tests binary altogether.
+I also have access to non-pub API of the crate, which is often useful.
First, documentation tests are extremely slow.
+Each doc test is linked as a separate binary.
+For this reason, avoid doc tests in internal libraries for big projects and add this to Cargo.toml:
+
+
+
Second, prefer
+
+
+
to
+
+
+
+This way, when you modify just the tests, Cargo is smart enough not to recompile the library crate.
+It knows that the contents of tests.rs only affects compilation when --test is passed to rustc.
+Learned this one from @petrochenkov, thanks!
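+
Assuming the two elided snippets contrast an out-of-line test module with an inline one (the mention of tests.rs suggests so), the preferred shape is roughly:
+
```rust
// src/lib.rs: the test module lives in its own file...
#[cfg(test)]
mod tests;

pub fn add(a: u32, b: u32) -> u32 {
    a + b
}
```
+
```rust
// src/tests.rs: ...so editing the tests only touches this file, which is
// compiled only when --test is passed.
use super::*;

#[test]
fn it_works() {
    assert_eq!(add(2, 2), 4);
}
```
+
The alternative is the same module written inline as #[cfg(test)] mod tests { … } at the bottom of src/lib.rs.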
+
Third, even if you stick to unit tests, the library is recompiled twice: once with, and once without --test.
+For this reason, folks from pernosco go even further.
+They add
+
+
+
to Cargo.toml, make all APIs they want to unit test public and have a single test crate for the whole workspace.
+This crate links everything and contains all the unit tests.
The most commonly cited drawback of OS-level threads is that they use a lot of RAM.
+This is not true on Linux.
+
Let’s compare memory footprint of 10_000 Linux threads with 10_000 goroutines.
+We spawn 10k workers, which sleep for about 10 seconds, waking up every 10 milliseconds.
+Each worker is staggered by a pseudorandom delay up to 200 milliseconds to avoid the thundering herd problem.
+
+
+
+
+
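The benchmark programs themselves are not shown here; purely as an illustration, a Rust sketch of the thread version matching the description above could look like this (the staggering uses a cheap deterministic stand-in for a real RNG):
+
```rust
use std::{thread, time::Duration};

fn main() {
    let workers: Vec<_> = (0..10_000u64)
        .map(|i| {
            thread::spawn(move || {
                // Stagger start-up by a pseudorandom delay of up to 200ms.
                thread::sleep(Duration::from_millis((i * 7919) % 200));
                // Wake up every 10 milliseconds for roughly 10 seconds.
                for _ in 0..1_000 {
                    thread::sleep(Duration::from_millis(10));
                }
            })
        })
        .collect();
    for w in workers {
        w.join().unwrap();
    }
}
```
+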
We use time utility to measure memory usage:
+
+
+
The results:
+
+
+
A thread is only 3 times as large as a goroutine.
+Absolute numbers are also significant: 10k threads require only 100 megabytes of overhead.
+If the application does 10k concurrent things, 100mb might be negligible.
+
+
+
+
Note that it is wrong to use this benchmark to compare performance of threads and goroutines.
+The workload is representative for measuring absolute memory overhead, but is not representative for time overhead.
+
That being said, it is possible to explain why threads need 21 seconds of CPU time while goroutines need only 14.
+Go runtime spawns a thread per CPU-core, and tries hard to keep each goroutine tied to specific thread (and, by extension, CPU).
+Threads by default migrate between CPUs, which incurs synchronization overhead.
+Pinning threads to cores in a round-robin fashion removes this overhead:
+
+
+
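One way to do the pinning from Rust is the core_affinity crate (an illustration, not the exact code behind the numbers below):
+
```rust
fn main() {
    // Pin each worker to a core, round-robin.
    let cores = core_affinity::get_core_ids().unwrap();
    let workers: Vec<_> = (0..10_000usize)
        .map(|i| {
            let core = cores[i % cores.len()];
            std::thread::spawn(move || {
                core_affinity::set_for_current(core);
                // ... the same sleepy workload as before ...
            })
        })
        .collect();
    for w in workers {
        w.join().unwrap();
    }
}
```
+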
The total CPU time now is approximately the same, but the distribution is different.
+On this workload, the goroutine scheduler spends roughly the same number of cycles in userspace as the thread scheduler spends in the kernel.
I don’t understand performance characteristics of “async” programming when applied to typical HTTP based web applications.
+Let’s say we have a CRUD app with a relational database, where a typical request results in N queries to the database and transfers M bytes over the network.
+How much (orders of magnitude?) faster/slower would an “async” solution be in comparison to a “threaded” solution?
+
+In this live post, I am collecting benchmarks that help shed light on this and related questions.
+Note that I am definitely not the right person to do this work, so, if there is a better resource, I’ll gladly just use that instead.
+Feel free to send pull requests with benchmarks!
+Every benchmark will be added, but some might go to the rejected section.
+
I am interested in understanding differences between several execution models, regardless of programming language:
+
+
Threads:
+
+
Good old POSIX threads, as implemented on modern Linux.
+
+
Stackful Coroutines
+
+
M:N threading, which exposes the same programming model as threads, but is implemented by multiplexing several user-space coroutines over a single OS-level thread.
+The most prominent example here is Go.
+
+
Stackless Coroutines
+
+
In this model, each concurrent computation is represented by a fixed-size state machine which reacts to events.
+This model often uses async / await syntax for describing and composing state machines using standard control flow constructs.
+
+
Threads With Cooperative Scheduling
+
+
This is a mostly hypothetical model of OS threads with an additional primitive for directly switching between two threads of the same process.
+It is not implemented on Linux (see this presentation for some old work towards that).
+It is implemented on Windows under the “fiber” branding.
+
+
+
I am also interested in Rust’s specific implementation of stackless coroutines
This is a micro benchmark comparing the cost of primitive operations of threads and of stackless coroutines as implemented in Rust.
+Findings:
+
+
+Thread creation is an order of magnitude slower.
+
+
+Threads use an order of magnitude more RAM.
+
+
+IO-related context switches take the same time
+
+
+Thread-to-thread context switches (channel sends) take the same time, if threads are pinned to one core.
+This is surprising to me.
+I’d expect channel send to be significantly more efficient for either stackful or stackless coroutines.
+
+
+Thread-to-thread context switches are an order of magnitude slower if there’s no pinning.
+
+
+Threads hit non-memory resource limitations quickly (it’s hard to spawn > 50k threads).
+
Micro benchmark which compares Rust’s implementation of stackless coroutines with a manually coded state machine.
+Rust’s async/await turns out to not be zero-cost, pure overhead is about 4x.
+The absolute numbers are still low though, and adding even a single syscall of work reduces the difference to only 10%
This is a micro benchmark comparing just the memory overhead of threads and stackful coroutines as implemented in Go.
+Threads are “times”, but not “orders of magnitude” larger.
Macro benchmark which compares many different Python web frameworks.
+The conclusion is that async is worse for both latency and throughput.
+Note two important things.
+First, the servers are run behind a reverse proxy (nginx), which drastically changes IO patterns that are observed by the server.
+Second, Python is not the fastest language, so throughput is roughly correlated with the amount of C code in the stack.
This is a macro benchmark comparing performance of sync and async Rust web servers.
+This is the kind of benchmark I want to see, and the analysis is exceptionally good.
+Sadly, a big part of the analysis is fighting with unreleased versions of software and working around bugs, so I don’t trust that the results are representative.
This is a micro benchmark that pretends to be a macro benchmark.
+The code is overly optimized to fit a very specific task.
+I don’t think the results are easily transferable to real-world applications.
+At the same time, lack of the analysis and the “macro” scale of the task itself doesn’t help with building a mental model for explaining the observed performance.
The opposite of a benchmark actually.
+This post gives a good theoretical overview of why async might lead to performance improvements.
+Sadly, it drops the ball when it comes to practice:
I am struggling with designing concurrent code.
+In this post, I want to share a model problem which exemplifies some of the issues.
+It is reminiscent of the famous expression problem in that there’s a two dimensional design grid, and a win along one dimension translates to a loss along the other.
+If you want a refresher on the expression problem (not required to understand this article), take a look at this post.
+It’s not canonical, but I like it.
+
Without further ado, concurrent expression problem:
+
+
+
I am not sure that’s exactly the right formulation, I feel like I am straining it a bit to fit the expression problem shape.
+The explanation that follows matters more.
+
I think there are two ways to code the system described.
+The first approach is to use a separate thread / goroutine / async task for each concurrent activity, with some synchronization around the access to the shared state.
+The alternative approach is to write an explicit state machine / actor loop to receive the next event and process it.
+
In the first scheme, adding new activities is easy, as you just write straight-line code with maybe some .awaits here and there.
+In the second scheme, it’s easy to check and act on invariants, as there is only a single place where the state is modified.
+
Let’s take a look at a concrete example.
+We’ll be using a pseudo code for a language with cooperative concurrency and explicit yield points (think Python with async/await).
+
The state consists of two counters.
+One activity decrements the first counter every second.
+The other activity does the same to the other counter.
+When both counters reach zero, we want to print something.
+
The first approach would look roughly like this:
+
+
+
And the second one like this:
+
+
+
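The pseudo code itself is elided; to make the second shape concrete, here is a rough Rust rendition of it using plain threads and a channel instead of async pseudo code (counter values are arbitrary):
+
```rust
use std::sync::mpsc;
use std::thread;
use std::time::Duration;

fn main() {
    // Every activity only *sends* events; a single loop owns the state.
    let (tx, rx) = mpsc::channel();
    for i in 0..2usize {
        let tx = tx.clone();
        thread::spawn(move || loop {
            thread::sleep(Duration::from_secs(1));
            if tx.send(i).is_err() {
                return; // the event loop is gone, stop ticking
            }
        });
    }
    drop(tx);

    let mut counters = [3u32, 5];
    for i in rx {
        if counters[i] > 0 {
            counters[i] -= 1;
        }
        // The invariant is checked in exactly one place.
        if counters == [0, 0] {
            println!("both counters reached zero");
            break;
        }
    }
}
```
+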
It’s much easier to see what the concurrent activities are in the first case.
+It’s more clear how the overall state evolves in the second case.
+
The second approach also gives you more control — if several events are ready, you can process them in the order of priority (usually it makes sense to prioritize writes over reads).
+You can trivially add some logging at the start and end of the loop to collect data about slow events and overall latency.
+But the hit to the programming model is big.
+If you are new to the code and don’t know which conceptual activities are there, it’s hard to figure that out just from the code.
+The core issue is that causal links between asynchronous events are not reified in the code:
These are the notes on a design pattern I noticed in several contexts.
+
Suppose, metaphorically, you have a neatly organized bookcase which categorizes the books by their topics.
+And now, suppose you’ve got a new book, which doesn’t fit clearly into any existing category.
+What would you do?
+
Here are some common solutions I’ve seen:
+
+
+Put the book somewhere in the bookcase.
+
+
+Start rearranging the shelves until you have a proper topic for this new book.
+Maybe introduce a dedicated topic just for this single book.
+
+
+Don’t store the book in the bookcase, keep it on the bedside table.
+
+
+
Here’s the “kitchen sink pattern” solution for this problem: have the “Uncategorized” shelf for books which don’t clearly fit into the existing hierarchy.
+
The idea here is that the overall organization becomes better, if you explicitly designate some place as “stuff that doesn’t fit goes here by default”.
+Let’s see the examples.
+
First, the Django web framework has a shortcuts module which contains convenience functions that don’t fit the model/view separation.
+The get_object_or_404 function looks up an object in the database and returns HTTP 404 if it is not found.
+Models (SQL) and views (HTTP) don’t know about each other, so the function doesn’t belong to either of these modules.
+Placing it in shortcuts allows this separation to be more crisp.
+
Second, I have two tricks to keep my home folder organized.
+I have a script that clears ~/downloads on every reboot, and I have a ~/tmp as my dumping ground.
+Before ~/tmp, various semi-transient things polluted my otherwise perfectly organized workspace.
+
Third, I asked my colleague recently about some testing infrastructure.
+They replied that they have an extensive document for it in their fork, because it’s unclear where it would belong in the main repo.
+In this case the absence of a “dumping ground” prevented useful work for no good reason.
+
Fourth, in rust-analyzer we have a ast::make module which is intended to contain the minimal orthogonal set of constructors for AST nodes.
+Historically, people kept adding non-minimal, non-orthogonal constructors there as well.
+Useful work was done, but it muddied the design.
+This was fixed by adding a dedicated ast::make::ext submodule for convenient shortcuts.
+
Fifth, for big projects I like having stdext modules, which fill-in missing batteries for the standard library.
+Without it, various modules tend to accumulate unrelated, and often slightly duplicated, functionality.
+
Sixth, to avoid overthinking and setup costs to start a new hobby project (of which I have a tonne), I have a single monorepo for all incomplete things.
+Adding a folder there is much easier than creating a GitHub repo.
+
To sum up, many classifications work best if there is an explicit “can’t classify this” category.
+If there’s no obvious place to put things which don’t fit, a solid design might erode with time.
+Note that for this pattern to be useful, the existence of a good solid design is a prerequisite, lest all the code end up in a utils module.
Alternative titles:
+ Unit Tests are a Scam
+ Test Features, Not Code
+ Data Driven Integrated Tests
+
+
This post describes my current approach to testing.
+When I started programming professionally, I knew how to write good code, but good tests remained a mystery for a long time.
+This is not due to the lack of advice — on the contrary, there’s abundance of information & terminology about testing.
+This celestial emporium of benevolent knowledge includes TDD, BDD, unit tests, integrated tests, integration tests, end-to-end tests, functional tests, non-functional tests, blackbox tests, glassbox tests, …
+
Knowing all this didn’t help me to create better software.
+What did help was trying out different testing approaches myself, and looking at how other people write tests.
+Keep in mind that my background is mostly in writing compiler front-ends for IDEs.
+This is a rather niche area, which is especially amenable to testing.
+Compilers are pure self-contained functions.
+I don’t know how to best test modern HTTP applications built around inter-process communication.
+
Without further ado, let’s see what I have learned.
This is something I inflicted upon myself early in my career, and something I routinely observe.
+You want to refactor some code, say add a new function parameter.
+Turns out, there are a dozen tests calling this function, so now a simple refactor also involves fixing all the tests.
+
There is a simple, mechanical fix to this problem: introduce the check function which encapsulates API under test.
+It’s easier to explain using a toy example.
+Let’s look at testing something simple, like a binary search, just to illustrate the technique.
+
We start with direct testing:
+
+
+
Some time passes, and we realize that -> bool is not the best signature for binary search.
+It’s better if it returned an insertion point (an index where element should be inserted to maintain sortedness).
+That is, we want to change the signature to
+
+
+
Now we have to change every test, because the tests are tightly coupled to the specific API.
+
My solution to this problem is making the tests data driven.
+Instead of every test interacting with the API directly, I like to define a single check function which calls the API.
+This function takes a pair of input and expected result.
+For binary search example, it will look like this:
+
+
+
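A sketch of the shape this takes, assuming the original signature was fn binary_search(xs: &[u32], x: u32) -> bool:
+
```rust
fn check(xs: &[u32], x: u32, expected: bool) {
    assert_eq!(binary_search(xs, x), expected);
}

#[test]
fn binary_search_works() {
    check(&[1, 3, 5], 3, true);
    check(&[1, 3, 5], 4, false);
}
```
+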
Now, when the API of the binary_search function changes, we only need to adjust a single place — the check function:
+
+
+
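After the switch to -> Result<usize, usize>, only the body of check changes, so the existing calls keep compiling and passing:
+
```rust
fn check(xs: &[u32], x: u32, expected: bool) {
    // Ok(index) means the element is present, Err(index) is the insertion point.
    assert_eq!(binary_search(xs, x).is_ok(), expected);
}
```
+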
To be clear, after you’ve done the refactor, you’ll need to adjust the tests to check the index as well, but this can be done separately.
+Existing test suite does not impede changes.
+
+
Keep in mind that the binary search example is artificially simple.
+The main danger here is that this is a boiling frog type of situation.
+While the project is small and the tests are few, you don’t notice that refactors are ever so slightly longer than necessary.
+Then, several tens of thousands lines of code later, you realize that to make a simple change you need to fix a hundred tests.
Almost no one likes to write tests.
+I’ve noticed many times how, upon fixing a trivial bug, I am prone to skipping the testing work.
+Specifically, if writing a test is more effort than the fix itself, testing tends to go out of the window.
+Hence,
+
+
Coming back to the binary search example, note how check function reduces the amount of typing to add a new test.
+For tests, this is a significant saving, not because typing is hard, but because it lowers the cognitive barrier to actually do the work.
The over-simplified binary search example can be stretched further.
+What if you replace the sorted array with a hash map inside your application?
+Or what if the calling code no longer needs to search at all, and wants to process all of the elements instead?
+
Good code is easy to delete.
+Tests represent an investment into existing code, and make it costlier to delete (or change).
+
The solution is to write tests for features in such a way that they are independent of the code.
+I like to use the neural network test for this:
+
+
Neural Network Test
+
+
Can you re-use the test suite if your entire software is replaced with an opaque neural network?
+
+
+
To give a real-life example this time, suppose that you are writing that part of code-completion engine which sorts potential completions according to relevance.
+(something I should probably be doing right now, instead of writing this article :-) )
+
Internally, you have a bunch of functions that compute relevance facts, like:
+
+
+Is there a direct type match (.foo has the desired type)?
+
+
+Is there an indirect type match (.foo.bar has the right type)?
+
+
+How frequently is this completion used in the current module?
+
+
+
Then, there’s the final ranking function that takes these facts and comes up with an overall rank.
+
The classical unit-test approach here would be to write a bunch of isolated tests for each of the relevance functions,
+and a separate bunch of tests which feeds the ranking function a list of relevance facts and checks the final score.
+
This approach obviously fails the neural network test.
+
An alternative approach is to write a test to check that at a given position a specific ordered list of entries is returned.
+That suite could work as a cross-validation for an ML-based implementation.
+
+In practice, it’s unlikely (but not impossible) that we’ll use actual ML here.
+But it’s highly probable that the naive independent-weights model isn’t the end of the story.
+At some point there will be special cases which would necessitate change of the interface.
+
+
Note that this advice goes directly against one common understanding of unit-testing.
+I am fairly confident that it results in better software over the long run.
There’s one talk about software engineering, which stands out for me, and which is my favorite.
+It is Boundaries by Gary Bernhardt.
+There’s a point there though, which I strongly disagree with:
+
+
Integration Tests are Superlinear?
+
+
When you use integration tests, any new feature is accompanied by a bit of new code and a new test.
+However, new code slows down all other tests, so the overall test suite becomes slow, as the total time grows super-linearly.
+
+
+
I don’t think more code under test translates to slower test suite.
+Merge sort spends more lines of code than bubble sort, but it is way faster.
+
In the abstract, yes, more code generally means more execution time, but I doubt this is the defining factor in tests execution time.
+What actually happens is usually:
+
+
+Input/Output — reading just a bit from a disk, network or another process slows down the tests significantly.
+
+
+Outliers — very often, testing time is dominated by only a couple of slow tests.
+
+
+Overly large input — throwing enough data at any software makes it slow.
+
+
+
The problem with integrated tests is not code volume per se, but the fact that they typically mean doing a lot of IO.
+But this doesn’t need to be the case.
+
+
Nonetheless, some tests are going to be slow.
+It pays off to introduce the concept of slow tests early on, arrange the skipping of such tests by default and only exercise them on CI.
+You don’t need to be fancy, just checking an environment variable at the start of the test is perfectly fine:
+
+
+
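Something along these lines (the environment variable name is arbitrary):
+
```rust
#[test]
fn slow_end_to_end_test() {
    // Skipped by default; CI opts in with `RUN_SLOW_TESTS=1 cargo test`.
    if std::env::var("RUN_SLOW_TESTS").is_err() {
        return;
    }
    // ... the actual slow test ...
}
```
+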
Definitely do not use conditional compilation to hide slow tests — it’s an obvious solution which makes your life harder
+(similar observation from the Go ecosystem).
+
To deal with outliers, print each test’s execution time by default.
+Having the numbers fly by gives you immediate feedback and incentive to improve.
All these together lead to a particular style of architecture and tests, which I call data driven testing.
+The bulk of the software is a pure function, where the state is passed in explicitly.
+Removing IO from the picture necessitates that the interface of software is specified in terms of data.
+Value in, value out.
+
One property of data is that it can be serialized and deserialized.
+That means that check style tests can easily accept arbitrarily complex input, which is specified in a structured format (JSON), an ad-hoc plain text format, or via an embedded DSL (a builder-style interface for data objects).
+
+Similarly, the “expected” argument of check is data.
+It is a result which is more-or-less directly displayed to the user.
+
A convincing example of a data driven test would be a “Goto Definition” test from rust-analyzer (source):
+
+
+
In this case, the check function has only a single argument — a string which specifies both the input and the expected result.
+The input is a rust project with three files (//- /file.rs syntax shows the boundary between the files).
+The current cursor position is also a part of the input and is specified with the $0 syntax.
+The result is the //^^^ comment which marks the target of the “Goto Definition” call.
+The check function creates an in-memory Rust project, invokes “Goto Definition” at the position signified by $0, and checks that the result is the position marked with ^^^.
+
Note that this is decidedly not a unit test.
+Nothing is stubbed or mocked.
+This test invokes the whole compilation pipeline: virtual file system, parser, macro expander, name resolution.
+It runs on top of our incremental computation engine.
+It touches a significant fraction of the IDE APIs.
+Yet, it takes 4ms in debug mode (and 500µs in release mode).
+And note that it absolutely does not depend on any internal API — if we replace our dumb compiler with sufficiently smart neural net, nothing needs to be adjusted in the tests.
+
There’s one question though: why on earth am I using a png image to display a bit of code?
+Only to show that the raw string literal (r#""#) which contains Rust code is highlighted as such.
+This is possible because we re-use the same input format (with //-, $0 and couple of other markup elements) for almost every test in rust-analyzer.
+As such, we can invest effort into building cool things on top of this format, which subsequently benefit all our tests.
The previous example had a complex data input, but a relatively simple data output — a position in the file.
+Often, the output is messy and has a complicated structure as well (a symptom of the problem).
+Worse, sometimes the output is the part that changes frequently.
+This often necessitates updating a lot of tests.
+Going back to the binary search example, the change from -> bool to -> Result<usize, usize> was an example of this effect.
+
There is a technique to make such simultaneous changes to all gold outputs easy — testing with expectations.
+You specify the expected result as a bit of data inline with the test.
+There’s a special mode of running the test suite for updating this data.
+Instead of failing the test, a mismatch between expected and actual causes the gold value to be updated in-place.
+That is, the test framework edits the code of the test itself.
+
Here’s an example of this workflow in rust-analyzer, used for testing code completion:
+
+
+
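The rust-analyzer example itself is elided; as a stand-in, here is a minimal sketch with the expect-test crate, which implements this workflow (running UPDATE_EXPECT=1 cargo test rewrites the gold values in place):
+
```rust
use expect_test::expect;

fn frobnicate(input: &str) -> String {
    input.to_uppercase() // stand-in for the real functionality
}

#[test]
fn frobnicate_works() {
    let actual = frobnicate("hello");
    // On mismatch, `UPDATE_EXPECT=1 cargo test` edits the literal below in place.
    expect![["HELLO"]].assert_eq(&actual);
}
```
+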
Often, just Debug representation of the type works well for expect tests, but you can do something more fun.
+See this post from Jane Street for a great example:
+Using ASCII waveforms to test hardware designs.
An extremely popular genre for a testing library is a collection of fluent assertions:
+
+
+
+The benefit of this style is better error messages.
+Instead of just “false is not true”, the testing framework can print values for x and y.
+
I don’t find this useful.
+Using the check style testing, there are very few assertions actually written in code.
+Usually, I start with plain asserts without messages.
+The first time I debug an actual test failure for a particular function, I spend some time to write a detailed assertion message.
+To me, fluent assertions are not an attractive point on the curve that includes plain asserts and hand-written, context-aware explanations of failures.
+A notable exception here is the pytest approach — this testing framework overrides the standard assert to provide a rich diff without ceremony.
One apparent limitation of the style of integrated testing I am describing is checking for properties which are not part of the output.
+For example, if some kind of caching is involved, you might want to check that the cache is actually being hit, and is not just sitting there.
+But, by definition, cache is not something that an outside client can observe.
+
The solution to this problem is to make this extra data a part of the system’s output by adding extra observability points.
+A good example here is Cargo’s test suite.
+It is set-up in an integrated, data-driven fashion.
+Each test starts with a succinct DSL for setting up a tree of files on disk.
+Then, a full cargo command is invoked.
+Finally, the test looks at the command’s output and the resulting state of the file system, and asserts the relevant facts.
+
Tests for caching additionally enable verbose internal logging.
+In this mode, Cargo prints information about cache hits and misses.
+These messages are then used in assertions.
+
A close idea is coverage marks.
+Sometimes, you want to check that something does not happen.
+Tests for this tend to be fragile — often the thing does not happen, but for the wrong reason.
+You can add a side channel which explains the reasoning behind particular behavior, and additionally assert this as well.
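+
A minimal sketch of the pattern (the cov_mark crate provides hit!/check! macros along these lines; the function here is made up):
+
```rust
// Production code: record *why* nothing happened.
fn shorten(input: &str, limit: usize) -> Option<String> {
    if input.len() <= limit {
        cov_mark::hit!(short_input_left_alone);
        return None;
    }
    Some(input[..limit].to_string())
}

// Test: fails unless the marked branch was actually taken.
#[test]
fn short_input_left_alone() {
    cov_mark::check!(short_input_left_alone);
    assert!(shorten("hi", 80).is_none());
}
```
+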
In the ultimate stage of data driven tests the definitions of test cases are moved out of test functions and into external files.
+That is, you don’t do this:
+
+
+
Rather, there is a single test that looks like this:
+
+
+
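In other words, something like this (paths and the check helper are illustrative):
+
```rust
#[test]
fn externalized_tests() {
    // One #[test] that walks a directory of test-data files.
    for entry in std::fs::read_dir("test_data").unwrap() {
        let path = entry.unwrap().path();
        let input = std::fs::read_to_string(&path).unwrap();
        check(&input);
    }
}
```
+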
I have a love-hate relationship with this approach.
+It has at least two attractive properties.
+First, it forces a data driven approach without any cheating.
+Second, it makes the test suite more re-usable.
+An alternative implementation in a different programming language can use the same tests.
+
But there’s a drawback as well — without literal #[test] attributes, integration with tooling suffers.
+For example, you don’t automatically get “X out of Y tests passed” at the end of test run.
+You can’t conveniently debug just a single test; there isn’t a helpful “Run” icon/shortcut you can use in an IDE.
+
When I do externalized test cases, I like to leave a trivial smoke test behind:
+
+
+
If I need to debug a failing external test, I first paste the input into this smoke test, and then get my IDE tooling back.
Reading from a file is not the most fun way to come up with a data input for a check function.
+
Here are a few other popular ones:
+
+
Property Based Testing
+
+
Generate the input at random and verify that the output makes sense.
+For a binary search, check that the needle indeed lies between the two elements at the insertion point.
+
+
Full Coverage
+
+
Better still, instead of generating some random inputs, just check that the answer is correct for all inputs.
+This is how you should be testing binary search — generate every sorted list of length at most 7 with numbers in the 0..=6 range.
+Then, for each list and for each number, check that the binary search gives the same result as a naive linear search (a sketch follows after this list).
+
+
Coverage Guided Fuzzing
+
+
Just throw random bytes at the check function.
+Random bytes probably don’t make much sense, but it’s good to verify that the program returns an error instead of summoning nasal demons.
+Instead of piling bytes completely at random, observe which branches are taken, and try to invent byte sequences which cover more branches.
+Note that this test is polymorphic in the system under test.
Use random bytes as a seed to generate “syntactically valid” inputs, then see your software crash and burn when the most hideous edge cases are uncovered.
+If you use Rust, check out wasm-smith and arbitrary crates.
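+
Here is the full-coverage sketch promised above, assuming the fn binary_search(xs: &[u32], x: u32) -> Result<usize, usize> signature from the earlier example:
+
```rust
#[test]
fn exhaustive() {
    // Recursively generate every sorted (non-decreasing) list of length <= 7
    // with elements in 0..=6, and compare membership against a naive linear search.
    fn go(xs: &mut Vec<u32>) {
        for x in 0..=6u32 {
            let naive = xs.iter().any(|&y| y == x);
            assert_eq!(binary_search(xs.as_slice(), x).is_ok(), naive, "xs = {xs:?}, x = {x}");
        }
        if xs.len() == 7 {
            return;
        }
        let lo = xs.last().copied().unwrap_or(0);
        for y in lo..=6 {
            xs.push(y);
            go(xs);
            xs.pop();
        }
    }
    go(&mut Vec::new());
}
```
+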
What if isolating IO is not possible, and the application is fundamentally built around interacting with external systems?
+In this case, my advice is to just accept that the tests are going to be slow, and might need extra effort to avoid flakiness.
+
Cargo is the perfect case study here.
+Its raison d’être is orchestrating a herd of external processes.
+Let’s look at the basic test:
+
+
+
+The project() part is a builder, which describes the state of the system.
+First, .build() writes the specified files to disk in a temporary directory.
+Then, p.cargo("build").run() executes the real cargo build command.
+Finally, a bunch of assertions are made about the end state of the file system.
+
Neural network test: this is completely independent of internal Cargo APIs, by virtue of interacting with a cargo process via IPC.
+
To give an order-of-magnitude feeling for the cost of IO, Cargo’s test suite takes around seven minutes (-j 1), while rust-analyzer finishes in less than half a minute.
+
An interesting case is the middle ground, when the IO-ing part is just big and important enough to be annoying.
+That is the case for rust-analyzer — although almost all code is pure, there’s a part which interacts with a specific editor.
+What makes this especially finicky is that, in the case of Cargo, it’s Cargo who calls external processes.
+With rust-analyzer, it’s something which we don’t control, the editor, which schedules the IO.
+This often results in hard-to-imagine bugs which are caused by particularly weird environments.
+
I don’t have good answers here, but here are the tricks I use:
+
+
+Accept that something will break during integration.
+Even if you always create perfect code and never make bugs, your upstream integration point will be buggy sometimes.
+
+
+Make integration bugs less costly:
+
+
+use release trains,
+
+
+make patch release process non-exceptional and easy,
+
+
+have a checklist for manual QA before the release.
+
+
+
+
+Separate the tricky to test bits into a separate project.
+This allows you to write slow and not 100% reliable tests for integration parts, while keeping the core test suite fast and dependable.
+
This API is fundamentally untestable.
+Can you see why?
+It spawns a concurrent computation, but it doesn’t allow waiting for this computation to be finished.
+So, any test that calls do_stuff_in_background can’t check that the “Stuff” is done.
+Worse, even tests which do not call this function might start to fail — they now can get interference from other tests.
+The concurrent computation can outlive the test that originally spawned it.
+
This problem plagues almost every concurrent application I see.
+A common symptom is adding timeouts and sleeps to test, to increase the probability of stuff getting done.
+Such timeouts are another common cause of slow test suites.
+
What makes this problem truly insidious is that there’s no work-around.
+Once broken, the causality link cannot be reforged by a layer above.
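+
A minimal illustration of the two shapes (names besides do_stuff_in_background are made up; the actual work is elided):
+
```rust
use std::thread;

// Untestable: fire-and-forget, the caller has nothing to wait on.
fn do_stuff_in_background() {
    let _detached = thread::spawn(|| {
        // ... stuff ...
    });
}

// Testable: hand the causality link back to the caller.
fn do_stuff_in_background_joinable() -> thread::JoinHandle<()> {
    thread::spawn(|| {
        // ... stuff ...
    })
}

#[test]
fn stuff_gets_done() {
    do_stuff_in_background_joinable().join().unwrap();
    // ... now it is safe to assert on the results ...
}
```
+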
Another common problem I see in complex projects is a beautifully layered architecture, which is “inverted” in tests.
+
Let’s say you have something fabulous, like L1 <- L2 <- L3 <- L4.
+To test L1, the path of least resistance is often to write tests which exercise L4.
+You might even think that this is the setup I am advocating for.
+Not exactly.
+
The problem with L1 <- L2 <- L3 <- L4 <- Tests is that working on L1 becomes slower, especially in compiled languages.
+If you make a change to L1, then, before you get to the tests, you need to recompile the whole chain of reverse dependencies.
+My “favorite” example here is rustc — when I worked on the lexer (L1), I spent a lot of time waiting for the rest of the compiler to be rebuilt to check my small change.
+
The right setup here is to write integrated tests for each layer:
+
+
+
+Note that testing L4 involves testing L1, L2 and L3.
+This is not a problem.
+Due to layering, only L4 needs to be recompiled.
+Other layers don’t affect run time meaningfully.
+Remember — it’s IO (and sleep-based synchronization) that kills performance, not just code volume.
In a nutshell, a #[test] is just a bit of code which is plugged into the build system to be executed automatically.
+Use this to your advantage, simplify the automation by moving as much as possible into tests.
+
Here are some things in rust-analyzer which are just tests:
+
+
+Code formatting (the most common one — you don't need an extra pile of YAML in CI, you can shell out to the formatter from the test; a sketch follows this list).
+
+
+Checking that the history does not contain merge commits and teaching new contributors git survival skills.
+
+
+Collecting the manual from specially-formatted doc comments across the code base.
+
+
+Checking that the code base is, in fact, reasonably well-documented.
+
+
+Ensuring that the licenses of dependencies are compatible.
+
+
+Ensuring that high-level operations are linear in the size of the input.
+Syntax-highlight a synthetic file of 1, 2, 4, 8, 16 kilobytes, run linear regression, check that result looks like a line rather than a parabola.
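
To make the first item concrete, here is a minimal sketch of a formatting check written as an ordinary test (illustrative only; rust-analyzer's real test is more involved):

```rust
use std::process::Command;

#[test]
fn code_is_formatted() {
    // Shell out to the formatter in check mode; the test fails if any
    // file would be reformatted.
    let status = Command::new("cargo")
        .args(["fmt", "--", "--check"])
        .status()
        .expect("failed to run cargo fmt");
    assert!(status.success(), "code is not formatted, run `cargo fmt`");
}
```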
+
This essay already mentioned a couple of cognitive tricks for better testing: reducing the fixed costs for adding new tests, and plotting/printing test times.
+The best trick in a similar vein is the “not rocket science” rule of software engineering.
+
The idea is to have a robot which checks that the merge commit passes all the tests, before advancing the state of the main branch.
+
Besides the evergreen master, such a bot adds pressure to keep the test suite fast and non-flaky.
+This is another boiling frog, something you need to constantly keep an eye on.
+If you have a single flaky test, it's very easy to miss when a second one is added.
This was a long essay.
+Let’s look back at some of the key points:
+
+
+There is a lot of information about testing, but it is not always helpful.
+At least, it was not helpful for me.
+
+
+The core characteristic of the test suite is how easy it makes changing the software under test.
+
+
+To that end, a good strategy is to focus on testing the features of the application, rather than on testing the code used to implement those features.
+
+
+A good test suite passes the neural network test — it is still useful if the entire application is replaced by an ML model which just comes up with the right answer.
+
+
+Corollary: good tests are not helpful for design in the small — a good test won’t tell you the best signatures for functions.
+
+
+Testing time is something worth optimizing for.
+Tests are sensitive to IO and IPC.
+Tests are relatively insensitive to the amount of code under test.
+
+
+There are useful techniques which are underused — expectation tests, coverage marks, externalized tests.
+
+
+There are not so useful techniques which are over-represented in the discourse: fluent assertions, mocks, BDD.
+
+
+The key for unlocking many of the above techniques is thinking in terms of data, rather than interfaces or objects.
+
+
+Corollary: good tests are helpful for design in the large.
+They help to crystallize the data model your application is built around.
+
There’s a lot of tribal knowledge surrounding #[inline] attribute in Rust.
+I often find myself teaching how it works, so I finally decided to write this down.
+
Caveat Emptor: this is what I know, not necessarily what is true.
+Additionally, exact semantics of #[inline] is not set in stone and may change in future Rust versions.
In other words, for an ahead-of-time compiled language, inlining is the mother of all other optimizations.
+It gives the compiler the necessary context to apply further transformations.
Inlining is at odds with another important idea in compilers — that of separate compilation.
+When compiling big programs, it is desirable to separate them into modules which can be compiled independently to:
+
+
+Process everything in parallel.
+
+
+Scope incremental recompilations to individual changed modules.
+
+
+
To achieve separate compilation, compilers expose signatures of functions, but keep function bodies invisible to other modules, preventing inlining.
+This fundamental tension is what makes #[inline] in Rust trickier than just a hint for the compiler to inline the function.
In Rust, a unit of (separate) compilation is a crate.
+If a function f is defined in a crate A, then all calls to f from within A can be inlined, as the compiler has full access to f.
+If, however, f is called from some downstream crate B, such calls can’t be inlined.
+B has access only to the signature of f, not its body.
+
That’s where the main usage of #[inline] comes from — it enables cross-crate inlining.
+Without #[inline], even the most trivial of functions can’t be inlined across the crate boundary.
+The benefit is not without a cost — the compiler implements this by compiling a separate copy of the #[inline] function with every crate it is used in, significantly increasing compile times.
+
Besides #[inline], there are two more exceptions to this.
+Generic functions are implicitly inlinable.
+Indeed, the compiler can only compile a generic function when it knows the specific type arguments it is instantiated with.
+As that is known only in the calling crate, bodies of generic functions have to be always available.
+
The other exception is link-time optimization.
+LTO opts out of separate compilation — it makes bodies of all functions available, at the cost of making compilation much slower.
Now that the underlying semantics is explained, it's possible to infer some rules of thumb for using #[inline].
+
First, it’s not a good idea to apply #[inline] indiscriminately, as that makes compile time worse.
+If you don’t care about compile times, a much better solution is to set lto = true in Cargo profile (docs).
+
Second, it usually isn’t necessary to apply #[inline] to private functions — within a crate, the compiler generally makes good inline decisions.
+There’s a joke that LLVM’s heuristic for when the function should be inlined is “yes”.
+
Third, when building an application, apply #[inline] reactively when profiling shows that a particular small function is a bottleneck.
+Consider using lto for releases.
+It might make sense to proactively #[inline] trivial public functions.
+
Fourth, when building libraries, proactively add #[inline] to small non-generic functions.
+Pay special attention to impls: Deref, AsRef and the like often benefit from inlining.
+A library can't anticipate all usages upfront, so it makes sense not to prematurely pessimize future users.
+Note that #[inline] is not transitive: if a trivial public function calls a trivial private function, you need to #[inline] both.
+See this benchmark for details.
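
A small, hypothetical illustration: for the public function to stay inlinable downstream, the private helper it delegates to needs the attribute as well.

```rust
// Somewhere in a library crate.
#[inline]
pub fn check(x: u32) -> bool {
    is_small(x)
}

#[inline] // without this, downstream crates inline `check` but still call `is_small`
fn is_small(x: u32) -> bool {
    x < 16
}
```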
+
Fifth, mind generic functions.
+It’s not too wrong to say that generic functions are implicitly inline.
+As a result, they often are a cause for code bloat.
+Generic functions, especially in libraries, should be written to minimize unwanted inlining.
+To give an example from wat:
+@alexcrichton explains inline.
+Note that, in reality, the compile time costs are worse than what I described — inline functions are compiled per codegen-unit, not per crate.
+
This is a follow up to the previous post about #[inline] in Rust specifically.
+This post is a bit more general, and a bit more ranty.
+Reader, beware!
+
When inlining optimization is discussed, the following is almost always mentioned: “inlining can also make code slower, because inlining increases the code size, blowing the instruction cache size and causing cache misses”.
+
I myself have seen this repeated in various forms many times.
+I have also seen a lot of benchmarks where judicious removal of inlining annotations did increase performance.
+However, not once have I seen the performance improvement being traced to ICache specifically.
+To me at least, this explanation doesn’t seem to be grounded — people know that ICache is to blame because other people say this, not because there’s a benchmark everyone points to.
+It doesn’t mean that the ICache explanation is wrong — just that I personally don’t have evidence to believe it is better than any other explanation.
+
Anyway, I’ve decided to look at a specific case where I know #[inline] to cause an observable slow down, and understand why it happens.
+Note that the goal here is not to explain real-world impact of #[inline], the benchmark is artificial.
+The goal is, first and foremost, to learn more about the tools to use for explaining results.
+The secondary goal is to either observe ICache effects in practice, or else to provide an alternative hypothesis for why removing inlining can speed things up.
+
The benchmark is based on my once_cell Rust library.
+The library provides a primitive, similar to double-checked locking.
+There’s a function that looks like this:
+
+
+
I know that performance improves significantly when the initialize function is not inlined.
+It’s somewhat obvious that this is the case (that’s why the benchmark is synthetic — real world examples are about cases where we don’t know if inline is needed).
+But it is unclear why, exactly, inlining initialize leads to slower code.
+
For the experiment, I wrote a simple high-level benchmark calling get_or_try_init in a loop:
+
+
+
I also added compile-time toggle to force or forbid inlining:
Running both versions shows that #[inline(never)] is indeed measurably faster:
+
+
+
+
How do we explain the difference?
+The first step is to remove cargo from the equation and make two binaries for comparison:
+
+
+
On Linux, the best tool to quickly assess the performance of any program is perf stat.
+It runs the program and shows a bunch of CPU-level performance counters, which might explain what’s going on.
+As we suspect that ICache might be to blame, let’s include the counters for caches:
+
+
+
There is some difference in L1-icache-load-misses, but there’s also a surprising difference in instructions.
+What’s more, the L1-icache-load-misses difference is hard to estimate, because it’s unclear what L1-icache-loads are.
+As a sanity check, statistics for dcache are the same, just as we expect.
+
While perf takes the real data from the CPU, an alternative approach is to run the program in a simulated environment.
+That’s what cachegrind tool does.
+Fun fact: the primary author of cachegrind is @nnethercote, whose Rust Performance Book we saw in the last post.
+Let’s see what cachegrind thinks about the benchmark.
+
+
+
Note that, because cachegrind simulates the program, it runs much slower.
+Here, we don’t see a big difference in ICache misses (I1 — first level instruction cache, LLi — last level instruction cache).
+We do see a difference in ICache references.
+Note that the number of times the CPU refers to the ICache should correspond to the number of instructions it executes.
+Cross-checking the number with perf, we see that both perf and cachegrind agree on the number of instructions executed.
+They also agree that inline_always version executes more instructions.
+It's still hard to say what perf's L1-icache-loads means.
+Judging by the name, it should correspond to cachegrind’s I refs, but it doesn’t.
+
Anyway, it seems there's one thing that bears further investigation: why does inlining change the number of instructions executed?
+Inlining doesn’t actually change the code the CPU runs, so the number of instructions should stay the same.
+Let’s look at the asm then!
+The right tool here is cargo-asm.
+
Again, here’s the function we will be looking at:
+
+
+
The call to get_or_init will be inlined, and the nested call to initialize will be inlined depending on the flag.
+
Let’s first look at the inline_never version:
+
+
+
And then at the inline_always version:
+
+
+
I’ve slightly edited the code and also highlighted the hot loop which constitutes the bulk of the benchmark.
+
Looking at the assembly, we can see the following:
+
+
+code is much larger — inlining happened!
+
+
+function prologue is bigger, compiler pushes more callee-saved registers to the stack
+
+
+function epilogue is bigger, compiler needs to restore more registers
+
+
+stack frame is larger
+
+
+compiler hoisted some of the initialize code to before the loop
+
+
+the core loop is very tight in both cases, just a handful of instructions
+
+
+the core loop counts upwards rather than downwards, adding an extra cmp instruction
+
+
+
Note that it’s highly unlikely that ICache affects the running code, as it’s a small bunch of instructions next to each other in memory.
+On the other hand, an extra cmp with a large immediate precisely accounts for the amount of extra instructions we observe (the loop is run 800_000_000 times).
It's hard enough to come up with a benchmark which demonstrates the difference between two alternatives.
+It's even harder to explain the difference — there might be many readily available explanations, but they are not necessarily true.
+Nonetheless, today we have a wealth of helpful tooling.
+Two notable examples are perf and valgrind.
+Tools are not always correct — it’s a good idea to sanity check different tools against each other and against common-sense understanding of the problem.
+
For inlining in particular, we found the following reasons why inlining S into C might cause a slow down:
+
+
+Inlining might cause C to use more registers.
+This means that prologue and epilogue grow additional push/pop instructions, which also use stack memory.
+Without inlining, these instructions are hidden in S and are only paid for when C actually calls into S, as opposed to every time C itself is called.
+
+
+Generalizing from the first point, if S is called in a loop or in an if, the compiler might hoist some instructions of S to before the branch, moving them from the cold path to the hot path.
+
+
+With more local variables and control flow in the stack frame to juggle, the compiler might accidentally pessimize the hot loop.
+
+
+
If you are curious under which conditions ICache does become an issue, there’s this excellent article about one such case.
This is an introductory article about shell injection, a security vulnerability allowing an attacker to execute arbitrary code on the user’s machine.
+This is a well-studied problem, and there are simple and efficient solutions to it.
+It's relatively easy to design a library API in such a way as to shield the application developer from the risk of shell injections.
+
There are two reasons why I am writing this post.
+First, this year I've pointed out this issue in three different libraries.
+It seems that, although the problem is well-studied, it's not well known, so just repeating some things might help.
+Second, I’ve recently reported a related problem about one of the VS Code APIs, and I want to use this piece as an extended GitHub comment :-)
Shell injection can happen when a program needs to execute another program, and one of the arguments is controlled by the user/attacker.
+As a model example, let’s write a quick script to read a list of URLs from stdin, and run curl for each one of those.
+
That’s not realistic, but small and illustrative.
+This is what the script could look like in NodeJS:
+
+
+
I would have written this in Rust, but, alas, it’s not vulnerable to this particular attack :)
+
The interesting line is this one:
+
+
+
Here, we are using the exec API from node to spawn a child curl process, passing a line of input as an argument.
+
Seems to work for simple cases?
+
+
+
But what if we use a slightly more imaginative input?
+
+
+
That feels bad — it seems that the script somehow reads the contents of my /etc/passwd.
+How did this happen? We've only invoked curl!
To understand what has just happened, we need to learn a bit about how spawning a process works in general.
+This section is somewhat UNIX-specific — things are implemented a bit differently on Windows.
+Nonetheless, the big picture conclusions hold there as well.
+
The main API to run a program with command line arguments is the exec family of functions.
+For example, here’s execve:
+
+
+
It takes the name of the program (pathname), a list of command line arguments (argv), and a list of environment variables for the new process (envp), and uses those to run the specified binary.
+How exactly this happens is a fascinating story with many forks in the plot, but it is beyond the scope of the article.
+
What is curious though, is that while the underlying system API wants an array of arguments, the child_process.exec function from node takes only a single string: exec("curl http://example.com").
+
How does the single string turn into an array of arguments? Let's find out!
+To do that, we’ll use the strace tool.
+This tool inspects (traces) all the system calls invoked by the program.
+We’ll ask strace to look for execve in particular, to understand how node’s exec maps to the underlying system’s API.
+We’ll need the --follow argument to trace all processes, and not just the top-level one.
+To reduce the amount of output and only print execve, we’ll use the --trace flag:
+
+
+
The first execve we see here is our original invocation of the node binary itself.
+The last one is what we want to do — spawn curl with a single argument, a URL.
+And the middle one is what node’s exec actually does.
+
Let’s take a closer look:
+
+
+
Here, node invokes the sh binary (system’s shell) with two arguments: -c and the string we originally passed to child_process.exec.
+-c stands for command, and instructs the shell to interpret the value as a shell command, parse it, and then run it.
+
In other words, rather than running the command directly, node asks the shell to do the heavy lifting.
+But the shell is an interpreter of the shell language, and, by carefully crafting the input to exec, we can ask it to run arbitrary code.
+In particular, that’s what we used as a payload in the bad example above:
+
+
+
After the string interpolation, the resulting command was
+
+
+
That is, first run curl, then echo, then read the /etc/passwd.
There’s an equivalent safe API in node: spawn.
+Unlike exec, it takes an array of arguments rather than a single string.
+
+
+
Internally, the API bypasses the shell and uses execve directly.
+Thus, this API is not vulnerable to shell injection — an attacker can make curl run with bad arguments, but can't run anything other than curl.
+
Note that it’s easy to implement exec in terms of spawn:
+
+
+
It’s a common pattern among many languages:
+
+
+there’s an exec-style function that takes a string and spawns /bin/sh -c under the hood,
+
+
+the docs for this function include a giant disclaimer, saying that using it with user input is a bad idea,
+
+
+there’s a safe alternative which takes arguments as an array and spawns the process directly.
+
+
+
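Rust's standard library, for instance, only exposes the argument-array form. std::process::Command takes the program and its arguments separately and never involves the shell (the fetch wrapper below is just for illustration):

```rust
use std::io;
use std::process::{Command, ExitStatus};

// `url` is passed as a single argv element; no shell ever re-parses it,
// so `$(cat /etc/passwd)` is just a useless literal argument to curl.
fn fetch(url: &str) -> io::Result<ExitStatus> {
    Command::new("curl").arg(url).status()
}
```
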
Why provide an exploitable API when a safe version is possible and more direct?
+I don’t know, but my guess is that it’s mostly just history.
+C has system, Perl’s backticks correspond directly to that, Ruby got backticks from Perl, Python just has system, node was probably influenced by all these scripting languages.
+
Note that security isn’t the only issue with /bin/sh -c based API.
+Read this other post to learn about the rest of the problems.
If you are an application developer, be aware that this issue exists.
+Read the language documentation carefully — most likely, there are two flavors of process spawning functions.
+Note how shell injection is similar to SQL injection and XSS.
+
If you develop a library for conveniently working with external processes, use and expose only the shell-less API from the underlying platform.
+
If you build a new platform, don't provide a /bin/sh -c API in the first place.
+Be like deno (and also Go, Rust, Julia), don’t be like node (and also Python, Ruby, Perl, C).
+If you have to maintain such API for legacy reasons, clearly document the issue about shell injection.
+Documenting how to do /bin/sh -c by hand might also be a good idea.
+
If you are designing a programming language, be careful with string interpolation syntax.
+It’s important that string interpolation can be used to spawn a command in a safe way.
+That mostly means that library authors should be able to deconstruct a "cmd -j $arg1 -f $arg2" literal into two (compile-time) arrays: ["cmd -j ", " -f "] and [arg1, arg2].
+If you don’t provide this feature in the language, library authors will split the interpolated string, which would be unsafe (not only for shelling out — for SQLing or HTMLing as well).
+Good examples to learn from are JavaScript’s
+tagged templates
+and Julia’s
+backticks.
I was happily hacking on some Rust library.
+At some point I pressed the “run tests” button in rust-analyzer.
+And, surprised, I accidentally pwned myself!
+
+
+
That was disappointing.
+C’mon, how come there’s a shell injection in the code I help to maintain?
+While this is not a big problem for rust-analyzer (our security model assumes trusted code, as each of rustup, cargo, and rustc can execute arbitrary code by design), it definitely was a big blow to my aesthetic sensibilities!
+
Looking at the git history, it was me who had missed “concatenate arguments into a single string” during review.
+So I was definitely a part of the problem here.
+But the other part is that the API that takes a single string exists at all.
+
Let’s look at the API:
+
+
+
So, this is exactly what I am describing — a process-spawning API that takes a single string.
+I guess, in this case this might even be justified — the API opens a literal shell in the GUI, and the user can interact with it after the command finishes.
+
Anyway, after looking around I quickly found another API, which seemed (ominous music in the background) like what I was looking for:
+
+
+
The API takes an array of strings.
+It also tries to say something about quoting, which is a good sign!
+The wording is perplexing, but it seems to struggle to explain to me that passing ["ls", ">", "out.txt"] won't actually redirect, because > will get quoted.
+This is exactly what I want!
+The absence of any kind of a security note on both APIs is concerning, but oh well.
+
So, I refactored the code to use this second constructor, and, 🥁 🥁 🥁, it still had the exact same behavior!
+Turns out that this API takes an array of arguments, and just concatenates them, unless I explicitly say that each argument needs to be escaped.
+
And this is what I am complaining about — that the API looks like it is safe for an untrusted user input, while it is not.
+This is misuse-resistance resistance.
In this article, I’ll share my experience with organizing large Rust projects.
+This is in no way authoritative — just some tips I’ve discovered through trial and error.
+
Cargo, Rust’s build system, follows convention over configuration principle.
+It provides a set of good defaults for small projects, and it is especially well-tailored for public crates.io libraries.
+The defaults are not perfect, but they are good enough.
+The resulting ecosystem-wide consistency is also welcome.
+
However, Cargo is less opinionated when it comes to large, multi-crate projects, organized as a Cargo workspace.
+Workspaces are flexible — Cargo doesn’t have a preferred layout for them.
+As a result, people try different things, with varying degrees of success.
+
To cut to the chase, I think for projects in between ten thousand and one million lines of code, the flat layout makes the most sense.
+rust-analyzer (200k lines) is a good example here.
+The repository is laid out like this:
+
+
+
In the root of the repo, Cargo.toml defines a virtual manifest:
+
+
+
Everything else (including rust-analyzer “main” crate) is nested one-level deep under crates/.
+The name of each directory is equal to the name of the crate:
+
+
+
At the time of writing, there are 32 different subfolders in crates/.
It’s interesting that this advice goes against the natural tendency to just organize everything hierarchically:
+
+
+
There are several reasons why trees are inferior in this case.
+
First, the Cargo-level namespace of crates is flat.
+It’s not possible to write hir::def in Cargo.toml, so crates typically have prefixes in their names.
+Tree layout creates an alternative hierarchy, which adds a possibility for inconsistencies.
+
Second, a comparatively large flat list is easier to understand at a glance than even a small tree.
+ls ./crates gives an immediate bird's-eye view of the project, and this view is small enough:
+
+
+
Doing the same for a tree-based layout is harder.
+Looking at a single level doesn't tell you which folders contain nested crates.
+Looking at all levels lists too many folders.
+Looking only at folders that contain a Cargo.toml gives the right result, but is not as trivial as just ls.
+
It is true that nested structure scales better than a flat one.
+But the constant matters — until you hit a million lines of code, the number of crates in the project will probably fit on one screen.
+
Finally, the last problem with hierarchical layout is that there are no perfect hierarchies.
+With a flat structure, adding or splitting the crates is trivial.
+With a tree, you need to figure out where to put the new crate, and, if there isn’t a perfect match for it already, you’ll have to either:
+
+
+add a stupid mostly empty folder near the top
+
+
+add a catch-all utils folder
+
+
+place the code in a known suboptimal directory.
+
+
+
This is a significant issue for long-lived multi-person projects — tree structure tends to deteriorate over time, while flat structure doesn’t need maintenance.
Make the root of the workspace a virtual manifest.
+It might be tempting to put the main crate into the root, but that pollutes the root with src/, requires passing --workspace to every Cargo command, and adds an exception to an otherwise consistent structure.
+
Don’t succumb to the temptation to strip common prefix from folder names.
+If each crate is named exactly as the folder it lives in, navigation and renames become easier.
+The Cargo.tomls of reverse dependencies mention both the folder and the crate name; it's useful when they are exactly the same.
+
For large projects a lot of repository bloat often comes from ad-hoc automation — Makefiles and various prepare.sh scripts here and there.
+To avoid both the bloat and proliferation of ad-hoc workflows, write all automation in Rust in a dedicated crate.
+One pattern useful for this is cargo xtask.
+
Use version = "0.0.0" for internal crates you don’t intend to publish.
+If you do want to publish a subset of crates with proper semver API, be very deliberate about them.
+It probably makes sense to extract all such crates into a separate top-level folder, libs/.
+It makes it easier to check that things in libs/ don’t use things from crates/.
+
Some crates consist of only a single file.
+For those, it is tempting to flatten out the src directory and keep lib.rs and Cargo.toml in the same directory.
+I suggest not doing that — even if a crate is a single file now, it might get expanded later.
It’s common knowledge that Rust code is slow to compile.
+But I have a strong gut feeling that most Rust code out there compiles much slower than it could.
This doesn’t make sense to me.
+rust-analyzer CI takes 8 minutes on GitHub actions.
+It is a fairly large and complex project with 200k lines of own code and 1 million lines of dependencies on top.
+
It is true that Rust is slow to compile in a rather fundamental way.
+It picked “slow compiler” in the generic dilemma, and its overall philosophy prioritizes runtime over compile time (an excellent series of posts about that:
+1,
+2,
+3,
+4).
+But rustc is not a slow compiler — it implements the most advanced incremental compilation in industrial compilers, it takes advantage of compilation model based on proper modules (crates), and it has been meticulously optimized.
+Fast to compile Rust projects are a reality, even if they are not common.
+Admittedly, some care and domain knowledge is required to do that.
+
So let's take a closer look at what it took for us to keep the compilation time within reasonable bounds for rust-analyzer!
One thing I want to make clear is that optimizing a project's build time is in some sense busy-work.
+Reducing compilation time provides very small direct benefits to the users, and is pure accidental complexity.
+
That being said, compilation time is a multiplier for basically everything.
+Whether you want to ship more features, to make code faster, to adapt to a change of requirements, or to attract new contributors, build time is a factor in that.
+
It also is a non-linear factor.
+Just waiting for the compiler is the smaller problem.
+The big one is losing the state of flow, or (worse) the mental context switch to do something else while the code is compiling.
+One minute of work for the compiler wastes more than one minute of work for the human.
+
It’s hard for me to quantify the impact, but my intuitive understanding is that, as soon as the project grows beyond several thousands lines written by a single person, build times become pretty darn important!
+
The most devilish property of build times is that they creep up on you.
+While the project is small, build times are going to be acceptable.
+As projects grow incrementally, build times start to slowly increase as well.
+And if you let them grow, it might be rather hard to get them back in check later!
+
If the project is already too slow to compile, then:
+
+
+Improving build times will be time consuming, because each iteration of "try a change, trigger the build, measure improvement" will take a long time (yes, build times are a multiplier for everything, including build times themselves!)
+
+
+There won't be easy wins: in contrast to runtime performance, the Pareto principle doesn't apply!
+If you write a thousand lines of code, maybe one hundred of them will be performance-sensitive, but each line will add to compile times!
+
+
+Small wins will seem too small until they add up: shaving off five seconds is a much bigger deal for a five minute build than for an hour-long build.
+
+
+Dually, small regressions will go unnoticed.
+
+
+
There's also a cultural aspect to it: if you join a project and its CI takes one hour, then an hour-long CI is normal, right?
+
Luckily, there’s one simple trick to solve the problem of build times …
You need to care about build times, keep an eye on them, and fix them before they become a problem.
+Build times are a fairly easy optimization problem: it’s trivial to get direct feedback (just time the build), there are a bunch of tools for profiling, and you don’t even need to come up with a representative benchmark.
+The task is to optimize a particular project’s build time, not performance of the compiler in general.
+That’s a nice property of most instances of accidental complexity — they tend to be well defined engineering problems with well understood solutions.
+
The only hard bit about compilation time is that you don’t know that it is a problem until it actually is one!
+So, the most valuable thing you can get from this post is this:
+if you are working on a Rust project, take some time to optimize its build today, and try to repeat the exercise once in a while.
+
Now, with the software engineering bits cleared, let’s finally get to some actionable programming advice!
I like to use CI time as one of the main metrics to keep an eye on.
+
Part of that is that CI time is important in itself.
+While you are not bound by CI when developing features, CI time directly affects how annoying it is to context switch when finishing one piece of work and starting the next one.
+Juggling five outstanding PRs waiting for CI to complete is not productive.
+Longer CI also creates a pressure to not split the work into independent chunks.
+If correcting a typo requires keeping a PR tab open for half an hour, it's better to just make a drive-by fix in the next feature branch, right?
+
But a bigger part is that CI gives you a standardized benchmark.
+Locally, you compile incrementally, and the time of build varies greatly with the kinds of changes you are doing.
+Often, you compile just a subset of the project.
+Due to this inherent variability, local builds give poor continuous feedback about build times.
+Standardized CI though runs for every change and gives you a time series where numbers are directly comparable.
+
To increase this standardization pressure of CI, I recommend following the not rocket science rule and setting up a merge robot which guarantees that every state of the main branch passes CI.
+bors is a particular implementation I use, but there are others.
+
While it’s by far not the biggest reason to use something like bors, it gives two benefits for healthy compile times:
+
+
+It ensures that every change goes via CI, and creates pressure to keep CI healthy overall
+
+
+The time between leaving r+ comment on the PR and receiving the “PR merged” notification gives you an always on feedback loop.
+You don’t need to specifically time the build, every PR is a build benchmark.
+
If you think about it, it’s pretty obvious how a good caching strategy for CI should work.
+It makes sense to cache stuff that changes rarely, but it’s useless to cache frequently changing things.
+That is, cache all the dependencies, but don’t cache project’s own crates.
+
Unfortunately, almost nobody does this.
+A typical example would just cache the whole of ./target directory.
+That’s wrong — the ./target is huge, and most of it is useless on CI.
+
It's not super trivial to fix though — sadly, Cargo doesn't make it too easy to figure out which parts of ./target are durable dependencies, and which parts are volatile local crates.
+So, you’ll need to write some code to clean the ./target before storing the cache.
+For GitHub actions in particular you can also use Swatinem/rust-cache.
Caching is usually the low-hanging watermelon, but there are several more things to tweak.
+
Split CI into separate cargo test --no-run and cargo test.
+It is vital to know which part of your CI is the build, and which are the tests.
+
Disable incremental compilation.
+CI builds often are closer to from-scratch builds, as changes are typically much bigger than in a local edit-compile cycle.
+For from-scratch builds, incremental adds an extra dependency-tracking overhead.
+It also significantly increases the amount of IO and the size of ./target, which make caching less effective.
+
Disable debuginfo — it makes ./target much bigger, which again harms caching.
+Depending on your preferred workflow, you might consider disabling debuginfo unconditionally, this brings some benefits for local builds as well.
+
While we are at it, add -D warnings to the RUSTFLAGS environment variable to deny warnings for all crates at the same time.
+It’s a bad idea to #![deny(warnings)] in code: you need to repeat it for every crate, it needlessly makes local development harder, and it might break your users when they upgrade their compiler.
+It might also make sense to bump cargo network retry limits.
Another obvious advice is to use fewer, smaller dependencies.
+
This is nuanced: libraries do solve actual problems, and it would be stupid to roll your own solution to something already solved by crates.io.
+And it’s not like it’s guaranteed that your solution will be smaller.
+
But it’s important to realise what problems your application is and is not solving.
+If you are building a CLI utility for thousands of people to use, you absolutely need clap with all of its features.
+If you are writing a quick script to run during CI, which only the team will be using, it’s probably fine to start with simplistic command line parsing, but faster builds.
+
One tremendously useful exercise here is to read Cargo.lock (not Cargo.toml) and for each dependency think about the actual problem this dependency solves for the person in front of your application.
+Very frequently, you'll find dependencies that just don't make sense at all in your context.
+
As an illustrative example, rust-analyzer depends on regex.
+This doesn’t make sense — we have exact parsers and lexers for Rust and Markdown, we don’t need to interpret regular expressions at runtime.
+regex is also one of the heavier dependencies — it’s a full implementation of a small language!
+The reason this dependency is there is that the logging library we use allows you to say something like:
+
+
+
where parsing of the filtering expression is done by regular expressions.
+
This is undoubtedly a very useful feature to have for some applications, but in the context of rust-analyzer we don’t need it.
+Simple env_logger-style filtering would be enough.
+
Once you identify a similar redundant dependency, it’s usually enough to tweak features field somewhere, or to send a PR upstream to make non-essential bits configurable.
+
Sometimes it is a bigger yak to shave :)
+For example, rust-analyzer optionally uses the jemalloc crate, and its build script pulls in fs_extra and (of all the things!) paste.
+The ideal solution here would be of course to have a production grade, stable, pure rust memory allocator.
Now that we’ve dealt with things which are just sensible to do, it’s time to start measuring before cutting.
+A tool to use here is the -Z timings flag for Cargo (documentation).
+Sadly, I lack the eloquence to adequately express the level of quality and polish of this feature, so let me just say ❤️ and continue with my dry prose.
+
cargo build -Z timings records profiling data during the build, and then renders it as a very legible and information-dense HTML file.
+This is a nightly feature, so you’ll need the +nightly toggle.
+This isn’t a problem in practice, as you only need to run this manually once in a while.
+
Here’s an example from rust-analyzer:
+
+
+
+
+
Not only can you see how long each crate took to compile, but you'll also see how individual compilations were scheduled, when each crate started to compile, and its critical dependency.
This last point is important — crates form a directed acyclic graph of dependencies and, on a multicore CPU, the shape of this graph affects the compilation time a lot.
+
This is slow to compile, as all the crates need to be compiled sequentially:
+
+
+
This version is much faster, as it enables significantly more parallelism:
+
+
+
There's also a connection between parallelism and incrementality.
+In the wide graph, changing B doesn’t entail recompiling C and D.
+
The first advice you get when complaining about compile times in Rust is: “split the code into crates”.
+It is not that easy — if you ended up with a graph like the first one, you are not winning much.
+It is important to architect the applications to look like the second picture — a common vocabulary crate, a number of independent features, and a leaf crate to tie everything together.
+The most important property of a crate is which crates it doesn’t (transitively) depend on.
+
Another important consideration is the number of final artifacts (most typically binaries).
+Rust is statically linked, so, if two different binaries use the same library, each binary contains a separately linked copy of the library.
+If you have n binaries and m libraries, and each binary uses each library, then the amount of work to do during the linking is m * n.
+For this reason, it’s better to minimize the number of artifacts.
+One common technique here is BusyBox-style Swiss Army knife executables.
+The idea is that you can hardlink the same executable as several files with different names.
+The program then can look at the zeroth command line argument to learn the name it was invoked with, and use it effectively as a name of a subcommand.
+One cargo-specific gotcha here is that, by default, each file in ./examples or ./tests folder creates a new executable.
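
A rough sketch of the argv[0] dispatch (the applet names are made up):

```rust
use std::env;
use std::path::Path;

fn main() {
    // The name we were invoked with is the zeroth argument; strip the
    // directory part and dispatch on it like on a subcommand.
    let argv0 = env::args().next().unwrap_or_default();
    let applet = Path::new(&argv0)
        .file_stem()
        .and_then(|it| it.to_str())
        .unwrap_or("")
        .to_string();
    match applet.as_str() {
        "frobnicate" => frobnicate(),
        "defrobnicate" => defrobnicate(),
        _ => eprintln!("unknown applet: {applet}"),
    }
}

fn frobnicate() { /* ... */ }
fn defrobnicate() { /* ... */ }
```
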
But Cargo is even smarter than that!
+It does pipelined compilation — splitting the compilation of a crate into metadata and codegen phases, and starting compilation of dependent crates as soon as the metadata phase is over.
+
This has interesting interactions with procedural macros (and build scripts).
+rustc needs to run procedural macros to compute a crate's metadata.
+That means that procedural macros can’t be pipelined, and crates using procedural macros are blocked until the proc macro is fully compiled to the binary code.
+
Separately from that, procedural macros need to parse Rust code, and that is a relatively complex task.
+The de-facto crate for this, syn, takes quite some time to compile (not because it is bloated — just because parsing Rust is hard).
+
This generally means that projects tend to have a syn/serde-shaped hole in the CPU utilization profile during compilation.
+It’s relatively important to use procedural macros only where they pull their weight, and try to push crates before syn in the cargo -Z timings graph.
+
The latter can be tricky, as proc macro dependencies can sneak up on you.
+The problem here is that they are often hidden behind feature flags, and those feature flags might be enabled by downstream crates.
+Consider this example:
+
You have a convenient utility type — for example, an SSO string, in a small_string crate.
+To implement serialization, you don’t actually need derive (just delegating to String works), so you add an (optional) dependency on serde:
+
+
+
SSO string is a rather useful abstraction, so it gets used throughout the codebase.
+Then, in some leaf crate which, e.g., needs to expose a JSON API, you add a dependency on small_string with the serde feature, as well as on serde with derive itself:
+
+
+
The problem here is that json-api enables the derive feature of serde, and that means that small_string and all of its reverse dependencies now need to wait for syn to compile!
+Similarly, if a crate depends on a subset of syn’s features, but something else in the crate graph enables all features, the original crate gets them as a bonus as well!
+
It’s not necessarily the end of the world, but it shows that dependency graph can get tricky with the presence of features.
+Luckily, cargo -Z timings makes it easy to notice that something strange is happening, even if it might not be always obvious what exactly went wrong.
+
There's also a much more direct way for procedural macros to slow down compilation — if the macro generates a lot of code, the result will take some time to compile.
+That is, some macros allow you to write just a bit of source code, which feels innocuous enough, but expands to a substantial amount of logic.
+The prime example is serialization — I've noticed that converting values to/from JSON accounts for a surprisingly big amount of compile time.
+Thinking in terms of overall crate graph helps here — you want to keep serialization at the boundary of the system, in the leaf crates.
+If you put serialization near the foundation, then all intermediate crates would have to pay its build-time costs.
+
All that being said, an interesting side-note here is that procedural macros are not inherently slow to compile.
+Rather, it’s the fact that most proc macros need to parse Rust or to generate a lot of code that makes them slow.
+Sometimes, a macro can accept a simplified syntax which can be parsed without syn, and emit a tiny bit of Rust code based on that.
+Producing valid Rust is not nearly as complicated as parsing it!
Now that we’ve covered macro issues at the level of crates, it’s time to look closer, at the code-level concerns.
+The main thing to look at here is generics.
+It’s vital to understand how they are compiled, which, in case of Rust, is achieved by monomorphization.
+Consider a run of the mill generic function:
+
+
+
When Rust compiles this function, it doesn’t actually emit machine code.
+Instead, it stores an abstract representation of function body in the library.
+The actual compilation happens when you instantiate the function with a particular type parameter.
+The C++ terminology gives the right intuition here — frobnicate is a "template", it produces an actual function when a concrete type is substituted for the parameter T.
+
In other words, in the following case
+
+
+
on the level of machine code there will be two separate copies of frobnicate, which would differ in the details of how they deal with the parameter, but would be otherwise identical.
+
Sounds pretty bad, right?
+It seems like you can write a gigantic generic function, and then write just a small bit of code to instantiate it with a bunch of types, to create a lot of load for the compiler.
+
Well, I have bad news for you — the reality is much, much worse.
+You don’t even need different types to create duplication.
+Let’s say we have four crates which form a diamond
+
+
+
The frobnicate is defined in A, and is used by B and C
+
+
+
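A sketch of that setup, one file per crate (the crate and function names follow the prose; the bodies are improvised):

```rust
// a/src/lib.rs
pub fn frobnicate<T: std::fmt::Debug>(x: T) {
    println!("{:?}", x);
}

// b/src/lib.rs
pub fn do_b() {
    a::frobnicate(String::from("b")) // B compiles its own frobnicate::<String>
}

// c/src/lib.rs
pub fn do_c() {
    a::frobnicate(String::from("c")) // C compiles frobnicate::<String> again
}

// d/src/main.rs
fn main() {
    b::do_b();
    c::do_c();
}
```
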
In this case, we only ever instantiate frobnicate with String, but it will get compiled twice, because monomorphization happens per crate.
+B and C are compiled separately, and each includes machine code for do_* functions, so they need frobnicate<String>.
+If optimizations are disabled, rustc can share template instantiations with dependencies, but that doesn’t work for sibling dependencies.
+With optimizations, rustc doesn’t share monomorphizations even with direct dependencies.
+
In other words, generics in Rust can lead to accidentally-quadratic compilation times across many crates!
+
If you are wondering whether it gets worse than that, the answer is yes.
+I think the actual unit of monomorphization is codegen unit, so duplicates are possible even within one crate.
Besides just duplication, generics add one more problem — they shift the blame for compile times to consumers.
+Most of the compile time cost of generic functions is borne by the crates that use the functionality, while the defining crate just typechecks the code without doing any code generation.
+Coupled with the fact that at times it is not at all obvious what gets instantiated where and why (example), this makes it hard to directly see the footprint of generic APIs.
+
Luckily, you don't need to figure this out by hand — there's a tool for that!
+cargo llvm-lines tells you which monomorphizations are happening in a specific crate.
It shows, for each generic function, how many copies of it were generated, and what their total size is.
+The size is measured very coarsely, in the number of LLVM IR lines it takes to encode the function.
+A useful fact: LLVM doesn't have generic functions, it's the job of rustc to turn a function template and a set of instantiations into a set of actual functions.
Now that we understand the pitfalls of monomorphization, a rule of thumb becomes obvious: do not put generic code at the boundaries between the crates.
+When designing a large system, architect it as a set of components where each component does something concrete and has a non-generic interface.
+
If you do need generic interface for better type-safety and ergonomics, make sure that the interface layer is thin, and that it immediately delegates to a non-generic implementation.
+The classical examples to internalize here are various functions from the std::fs module which operate on paths:
+
+
+
The outer function is parameterized — it is ergonomic to use, but is compiled afresh for every downstream crate.
+That’s not a problem though, because it is very small, and immediately delegates to a non-generic function that gets compiled in the std.
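
The shape of that trick, sketched outside of std (read_file is a made-up stand-in, not the actual standard library source):

```rust
use std::io;
use std::path::Path;

// Thin generic shim: monomorphized anew in every downstream crate, but it
// only converts the argument and delegates.
pub fn read_file<P: AsRef<Path>>(path: P) -> io::Result<Vec<u8>> {
    // Non-generic worker: compiled once, in the defining crate
    // (here it simply forwards to std).
    fn inner(path: &Path) -> io::Result<Vec<u8>> {
        std::fs::read(path)
    }
    inner(path.as_ref())
}
```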
+
If you are writing a function which takes a path as an argument, either use &Path, or use impl AsRef<Path> and delegate to a non-generic implementation.
+If you care about API ergonomics enough to use impl Trait, you should use the inner-function trick — compile times are as big a part of ergonomics as the syntax used to call the function.
+
A second common case here are closures: by default, prefer &dyn Fn() over impl Fn().
+Similarly to paths, a nice impl-based API might be a thin wrapper around a dyn-based implementation which does the bulk of the work.
+
Another idea along these lines is “generic, inline hotpath; concrete, outline coldpath”.
+In the once_cell crate, there’s this curious pattern (simplified, here’s the actual source):
+
+
+
Here, the initialize function is generic twice: first, the OnceCell is parametrized with the type of value being stored, and then initialize takes a generic closure parameter.
+The job of initialize is to make sure (even if it is called concurrently from many threads) that at most one f is run.
+This mutual exclusion task doesn’t actually depend on specific T and F and is implemented as non-generic synchronize_access, to improve compile time.
+One wrinkle here is that, ideally, we’d want an init: dyn FnOnce() argument, but that’s not expressible in today’s Rust.
+The let mut f = Some(f) / let f = f.take().unwrap() is a standard work-around for this case.
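
A rough sketch of that shape, with the actual synchronization elided (this is not the real once_cell source):

```rust
use std::marker::PhantomData;

struct OnceCell<T> {
    // storage and state are elided in this sketch
    _marker: PhantomData<T>,
}

impl<T> OnceCell<T> {
    // Generic over both the stored type and the closure: this thin wrapper
    // is re-instantiated for every use site...
    fn initialize<F>(&self, f: F)
    where
        F: FnOnce() -> T,
    {
        let mut f = Some(f); // the `dyn FnOnce` work-around
        synchronize_access(&mut || {
            let f = f.take().unwrap();
            let _value = f();
            // ... store `_value` into the cell ...
        });
    }
}

// ...while the tricky mutual-exclusion logic lives here, compiled only once.
fn synchronize_access(init: &mut dyn FnMut()) {
    // ... make sure `init` runs at most once, even under contention ...
    init();
}
```
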
Build times are a big factor in the overall productivity of the humans working on the project.
+Optimizing this is a straightforward engineering task — the tools are there.
+What might be hard is not letting them slowly regress.
+I hope this post provides enough motivation and inspiration for that!
+As a rough baseline, a 200k-line Rust project somewhat optimized for reasonable build times should take about 10 minutes of CI on GitHub actions.
In this post, we’ll look at one technique from property-based testing repertoire: full coverage / exhaustive testing.
+Specifically, we will learn how to conveniently enumerate any kind of combinatorial object without using recursion.
+
To start, let’s assume we have some algorithmic problem to solve.
+For example, we want to sort an array of numbers:
+
+
+
To test that the sort function works, we can write a bunch of example-based test cases.
+This approach has two flaws:
+
+
+Generating examples by hand is time consuming.
+
+
+It might be hard to come up with interesting examples — any edge cases we've thought about are probably already handled in the code.
+We want to find cases which we didn’t think of before.
+
+
+
A better approach is randomized testing: just generate a random array and check that it is sorted:
+
+
+
Here, we generated one hundred thousand completely random test cases!
+
Sadly, the result might actually be worse than a small set of hand-picked examples.
+The problem here is that, if you pick an array completely at random (sample uniformly), it will be a rather ordinary array.
+In particular, given that the elements are arbitrary u32 numbers, it’s highly unlikely that we generate an array with at least some equal elements.
+And when I write quick sort, I always have that nasty bug that it just loops infinitely when all elements are equal.
+
There are several fixes for the problem.
+The simplest one is to just make the sampling space smaller:
+
+
+
If we generate not an arbitrary u32, but a number between 0 and 10, we’ll get some short arrays where all elements are equal.
+Another trick is to use a property-based testing library, which comes with some strategies for generating interesting sequences predefined.
+Yet another approach is to combine property-based testing and coverage guided fuzzing.
+When checking a particular example, we will collect coverage information for this specific input.
+Given a set of inputs with coverage info, we can apply targeted genetic algorithms to try to cover more of the code.
+A particularly fruitful insight here is that we don’t have to invent a novel structure-aware fuzzer for this.
+We can take an existing fuzzer which emits a sequence of bytes, and use those bytes as a sequence of random numbers to generate structured input.
+Essentially, we say that the fuzzer is a random number generator.
+That way, when the fuzzer flips bits in the raw bytes array, it applies local semantically valid transformations to the random data structure.
+
But this post isn’t about those techniques :)
+Instead, it is about the idea of full coverage.
+Most of the bugs involve small, tricky examples.
+If a sorting routine breaks on some array with ten thousand elements, it's highly likely that there's a much smaller array (a handful of elements) which exposes the same bug.
+So what we can do is to just generate every array of length at most n with numbers up to m and exhaustively check them all:
+
+
+
The problem here is that implementing every_array is tricky.
+It is one of those puzzlers you know how to solve, but which are excruciatingly annoying to implement for the umpteenth time:
+
+
+
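For reference, a recursive enumerator of this kind might look as follows (a sketch; the signature is improvised from the prose):

```rust
// Call `f` for every array of length at most `n` whose elements are all
// below `m`.
fn every_array(n: usize, m: u32, f: &mut dyn FnMut(&[u32])) {
    fn go(n: usize, m: u32, buf: &mut Vec<u32>, f: &mut dyn FnMut(&[u32])) {
        f(buf.as_slice());
        if buf.len() == n {
            return;
        }
        for x in 0..m {
            buf.push(x);
            go(n, m, buf, f);
            buf.pop();
        }
    }
    go(n, m, &mut Vec::new(), f)
}

#[test]
fn exhaustive_sort_check() {
    every_array(5, 3, &mut |xs| {
        let mut ys = xs.to_vec();
        ys.sort(); // stand-in for the sort under test
        assert!(ys.windows(2).all(|w| w[0] <= w[1]));
    });
}
```
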
What’s more, for algorithms you often need to generate permutations, combinations and subsets, and they all have similar simple but tricky recursive solutions.
+
Yesterday I needed to generate a sequence of up to n segments with integer coordinates up to m, which finally pushed me to realize that there’s a relatively simple way to exhaustively enumerate arbitrary combinatorial objects.
+I don’t recall seeing it anywhere else, which is surprising, as the technique seems rather elegant.
+
+
Let’s look again at how we generate a random array:
+
+
+
This is definitely much more straightforward than the every_array function above, although it does sort of the same thing.
+The trick is to take this “generate a random thing” code and just make it generate every thing instead.
+In the above code, we base decisions on random numbers.
+Specifically, an input sequence of random numbers generates one element in the search space.
+If we enumerate all sequences of random numbers, we then explore the whole space.
+
Essentially, we’ll rig the rng to not be random, but instead to emit all finite sequences of numbers.
+By writing a single generator of such sequences, we gain an ability to enumerate arbitrary objects.
+As we are interested in generating all “small” objects, we always pass an upper bound when asking for a “random” number.
+We can use the bounds to enumerate only the sequences which fit under them.
+
So, the end result will look like this:
+
+
+
The implementation of Gen is relatively straightforward.
+On each iteration, we will remember the sequence of numbers we generated together with bounds the user requested, something like this:
+
+
+
To advance to the next iteration, we will find the smallest sequence of values which is larger than the current one, but still satisfies all the bounds.
+“Smallest” means that we’ll try to increment the rightmost number.
+In the above example, the last two fours already match the bound, so we can’t increment them.
+However, we can increment the 1 to get 3 2 4 4.
+This isn’t the smallest sequence though, 3 2 0 0 would be smaller.
+So, after incrementing the rightmost number we can increment, we zero the rest.
+
Here’s the full implementation:
+
+
+
Some notes:
+
+
+We need start field to track the first iteration, and to make while !g.done() syntax work.
+It’s a bit more natural to remove start and use a do { } while !g.done() loop, but it’s not available in Rust.
+
+
+v stores (value, bound) pairs.
+
+
+p tracks the current position in the middle of the iteration.
+
+
+v is conceptually an infinite vector with finite number of non-zero elements.
+So, when p gets past the end of v, we just materialize the implicit zero by pushing it onto v.
+
+
+As we store zeros implicitly anyway, we can just truncate the vector in done instead of zeroing-out the elements after the incremented one.
+
+
+Somewhat unusually, the bounds are treated inclusively.
+This removes the panic when the bound is zero, and allows generating the full set of numbers via gen(u32::MAX).
+
+
+
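Piecing the notes above together, the whole thing can be sketched like this (close in spirit to, though not necessarily identical with, the real implementation):

```rust
struct Gen {
    started: bool,
    /// (value, inclusive bound) pairs of the current sequence; conceptually
    /// an infinite vector with finitely many non-zero elements.
    v: Vec<(u32, u32)>,
    /// Position within `v` during the current iteration.
    p: usize,
}

impl Gen {
    fn new() -> Gen {
        Gen { started: false, v: Vec::new(), p: 0 }
    }

    /// Advance to the next sequence; returns `true` once the space is exhausted.
    fn done(&mut self) -> bool {
        if !self.started {
            self.started = true;
            return false;
        }
        self.p = 0;
        // Increment the rightmost value that is still below its bound,
        // truncating (implicitly zeroing) everything after it.
        while let Some((value, bound)) = self.v.pop() {
            if value < bound {
                self.v.push((value + 1, bound));
                return false;
            }
        }
        true
    }

    /// A "random" number in `0..=bound`, replayed from the current sequence.
    fn gen(&mut self, bound: u32) -> u32 {
        if self.p == self.v.len() {
            self.v.push((0, bound)); // materialize an implicit zero
        }
        let res = self.v[self.p].0;
        self.p += 1;
        res
    }
}

#[test]
fn all_small_arrays() {
    // Enumerate all arrays of length at most 2 with elements in 0..=1.
    let mut g = Gen::new();
    while !g.done() {
        let n = g.gen(2);
        let xs: Vec<u32> = (0..n).map(|_| g.gen(1)).collect();
        assert!(xs.len() <= 2); // stand-in for the property under test
    }
}
```
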
Let's see how our gen fares for generating random arrays of length at most n.
+We’ll count how many distinct cases were covered:
+
+
+
This test passes.
+That is, the gen approach for this case is both exhaustive (it generates all arrays) and efficient (each array is generated once).
+
As promised in the post’s title, let’s now generate all the things.
+
First case: there should be only one nothing (that’s the reason why we need start):
+
+
+
Second case: we expect to see n numbers and n^2 ordered pairs of numbers.
+
+
+
Third case: we expect to see n * (n - 1) / 2 unordered pairs of numbers.
+This one is interesting — here, our second decision is based on the first one, but we still enumerate all the cases efficiently (without duplicates).
+(Aside: did you ever realise that the number of ways to pick two objects out of n is equal to the sum of first n natural numbers?)
+
+
+
We’ve already generated all arrays, so let’s try to create all permutations.
+Still efficient:
+
+
+
Subsets:
+
+
+
Combinations:
+
+
+
Now, this one actually fails — while this code generates all combinations, some combinations are generated more than once.
+Specifically, what we are generating here are k-permutations (combinations with significant order of elements).
+While this is not efficient, it is OK for the purposes of exhaustive testing (as we still generate every combination).
+Nonetheless, there’s an efficient version as well:
+
+
+
I think this covers all standard combinatorial structures.
+What’s interesting is that this approach works for non-standard structures as well.
+For example, for https://cses.fi/problemset/task/2168, the problem which started all this, I need to generate sequences of segments:
+
+
+
Due to the .contains check there are some duplicates, but that’s not a problem as long as all sequences of segments are generated.
+Additionally, examples are strictly ordered by their complexity — earlier examples have fewer segments with smaller coordinates.
+That means that the first example which fails a property test is actually guaranteed to be the smallest counterexample! Nifty!
+
That’s all!
+Next time when you need to test something, consider if you can just exhaustively enumerate all “sufficiently small” inputs.
+If that’s feasible, you can either write the classical recursive enumerator, or use this imperative Gen thing.
+
Update(2021-11-28):
+
There are now Rust (crates.io link) and C++ (GitHub link) implementations.
+“Capturing the Future by Replaying the Past” is a related paper which includes the above technique as a special case of “simulate any monad by simulating delimited continuations via exceptions and replay” trick.
Unedited summary of what I think a better module system for a Rust-like
+language would look like.
+
Today’s Rust module system is its most exciting feature, after the borrow checker.
+Explicit separation between crates (which form a DAG) and modules (which might
+be mutually dependent) and the absence of a single global namespace (crates
+don’t have innate names; instead, the name is written on a dependency edge
+between two crates, and the same crate might be known under different names in
+two of its dependents) makes decentralized ecosystems of libraries a-la
+crates.io robust. Specifically, Rust allows linking-in several versions of the
+same crate without the fear of naming conflicts.
+
However, I feel the specific surface syntax we use to express the model is
+suboptimal. The module system is pretty confusing (in the pre-2018 surveys, it was
+by far the most confusing aspect of the language after lifetimes; the post-2018
+system is better, but there are still regular questions about the module system).
+What can we do better?
+
First, be more precise about visibilities. The single most important
+question about an item is “can it be visible outside of the CU (compilation unit)?”. Depending on the
+answer to that, you have either closed world (all usages are known) or open
+world (usages are not-knowable) assumption. This should be reflected in the
+modules system. pub is for “visible inside the whole CU, but not further”.
+export or (my favorite) pub* is for “visible to the outer world”. You sorta
+can have these in today’s Rust with pub(crate), -Dunreachable_pub and some
+tolerance for compiler false-positives.
+
I am not sure if the rest of Rust’s visibility system pulls its weight. It is OK,
+but pub(in some::path) is pretty complex and doesn’t really help —
+making visibilities more precise within a single CU doesn’t meaningfully make
+the code better, as you can control and rewrite all the code anyway. A CU doesn’t
+have internal boundaries which can be reflected in visibilities. If we go this
+way, we get a nice, simple system: fn foo() is visible in the current module
+only (not its children), pub fn foo() is visible anywhere inside the current
+crate, and pub* fn foo() is visible to other crates using ours. But then,
+again, the current tree-based visibility is OK; we can leave it in as long as
+pub/pub* is made more explicit and -Dunreachable_pub is an error by default.
+
In a similar way, the fact that use is an item (i.e., a::b can use items
+imported in a) is an unnecessary cuteness. Imports should only introduce the
+name into module’s namespace, and should be separate from intentional
+re-exports. It might make sense to ban glob re-export — this’ll give you a
+nice property that all the names existing in the module are spelled out
+explicitly, which is useful for tooling. Though, as Rust has namespaces, looking
+at pub use submod::thing doesn’t tell you whether the thing is a type or a
+value, so this might not be a meaningful property after all.
+
The second thing to change would be module tree/directory structure mapping.
+The current system creates quite some visible problems:
+
+
+
library/binary confusion. It’s common for new users to have mod foo; in both
+src/main.rs and src/lib.rs.
+
+
+
mod {} file confusion — it’s common (even for some production code I’ve
+seen) to have mod foo { stuff } inside foo.rs.
+
+
+
duplicate inclusion — again, it’s common to start every file in tests/ with
+mod common;. Rust book even recommends some awful work-around to put common
+into common/mod.rs, just so it itself isn’t treated as a test.
+
+
+
inconsistency — large projects which don’t have super-strict code style
+process end up using both the older foo/mod.rs and the newer foo.rs, foo/*
+conventions.
+
+
+
forgotten files — it is again pretty common to have some file somewhere in
+src/ which isn’t actually linked into the module tree at all by mistake.
+
+
+
A bunch of less-objective issues:
+
+
+mod.rs-less system is self-inconsistent. lib.rs and main.rs still
+behave like mod.rs, in the sense that nested modules are their direct
+siblings, and not in the lib directory.
+
+
+naming for crates roots (lib.rs and main.rs) is ad-hoc
+
+
+current system doesn’t work well for tools, which have to iteratively
+discover the module tree. You can’t process all of the crate’s files in
+parallel, because you don’t know what those files are until you process them.
+
+
+
I think a better system would say that a compilation unit is equivalent to a
+directory with Rust source files, and that (relative) file paths correspond to
+module paths. There’s neither mod foo; nor mod foo {} (yes, sometimes those
+are genuinely useful. No, the fact that something can be useful doesn’t mean
+it should be part of the language — it’s very hard to come up with a language
+feature which would be completely useless (though mod foo {} I think can be
+added back relatively painlessly)). We use mod.rs, but we name it
+_$name_of_the_module$.rs instead, to solve two issues: sort it first
+alphabetically, and generate a unique fuzzy-findable name. So, something like
+this:
+
+
+
The library there would give the following module tree:
+
+
+
To do conditional compilation, you’d do:
+
+
+
where _mutex.rs is
+
+
+
and linux_mutex.rs starts with #![cfg(linux)]. But of course we shouldn’t
+implement conditional compilation by barbarically cutting the AST, and instead
+should push conditional compilation to after the type checking, so that you at
+least can check, on Linux, that the windows version of your code wouldn’t fail
+due to some stupid typos in the name of #[cfg(windows)] functions. Alas, I
+don’t know how to design such a conditional compilation system.
+
The same re-export idiom would be used for specifying non-default visibility:
+pub* use rt; would make regex::rt a public module (yeah, this
+particular bit is sketchy :-) ).
+
I think this approach would make most of the pitfalls impossible. E.g., it wouldn’t
+be possible to mix several different crates in one source tree. Additionally,
+it’d be a great help for IDEs, as each file can be processed independently, and
+it would be clear just from the file contents and path where in the crate
+namespace the items are mounted, unlocking
+map-reduce
+style IDE.
+
While we are at it, use definitely should use exactly the same path resolution
+rules as the rest of the language, without any kind of “implicit leading ::”
+special cases. Oh, and we shouldn’t have nested use groups:
+
+
+
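For reference, by nested use groups I mean imports like this (an illustrative example):

```rust
use std::{
    collections::{hash_map::Entry, HashMap},
    sync::{
        atomic::{AtomicBool, Ordering},
        Arc,
    },
};
```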
Some projects use them, some projects don’t use them, sufficiently large
+projects inconsistently both use and don’t use them.
+
Afterword: as I’ve said in the beginning, this is unedited and not generally
+something I’ve thought very hard and long about. Please don’t take this as one
+true way to do things, my level of confidence about these ideas is about 0.5 I
+guess.
I’ve learned a thing I wish I didn’t know.
+As a revenge, I am going to write it down so that you, my dear reader, also learn about this.
+You probably want to skip this post unless you are interested and somewhat experienced in all of Rust, NixOS, and dynamic linking.
I use NixOS and Rust.
+For linking my Rust code, I would love to use lld, the LLVM linker, as it is significantly faster.
+Unfortunately, this often leads to errors when trying to run the resulting binary:
We’ll be using evdev-rs as a running example.
+It is binding to the evdev shared library on Linux.
+First, we’ll build it with the default linker, which just works (haha, nope, this is NixOS).
+
Let’s get the crate:
+
+
+
And run the example
+
+
+
This of course doesn’t just work and spits out a humongous error message, which contains one line of important information: we are missing the libevdev library.
+As this is NixOS, we are not going to barbarically install it globally.
+Let’s create an isolated environment instead, using nix-shell:
+
+
+
And activate it:
+
+
+
This environment gives us two things — the pkg-config binary and the evdev library.
+pkg-config is a sort of half of a C package manager for UNIX: it can’t install libraries, but it helps to locate them.
+Let’s ask it about libevdev:
+
+
+
Essentially, it resolved the library’s short name (libevdev) to the full path of the directory where the library resides:
+
+
+
The libevdev.so.2.3.0 file is the actual dynamic library.
+The symlinks stuff is another bit of a C package manager which implements somewhat-semver: libevdev.so.2 version requirement gets resolved to libevdev.so.2.3.0 version.
+
Anyway, this works well enough to allow us to finally run the example
+
+
+
Success!
+
Ooook, so let’s now do what we wanted to from the beginning and configure cargo to use lld, for blazingly fast linking.
The magic spell you need to put into .cargo/config is (courtesy of @lnicola):
+
+
+
To unpack this:
+
+
+-C sets the codegen option link-arg=-fuse-ld=lld.
+
+
+link-arg means that rustc will pass “-fuse-ld=lld” to the linker.
+
+
+Because linkers are not in the least confusing, the “linker” here is actually the whole gcc/clang.
+That is, rather than invoking the linker, rustc will call cc and that will then call the linker.
+
+
+So -fuse-ld (unlike -C, I think this is an atomic option, not -f use-ld) is an argument to gcc/clang,
+which asks it to use lld linker.
+
+
+And note that it’s lld rather than ldd which confusingly exists and does something completely different.
+
+
+
Anyhow, the end result is that we switch the linker from ld (default slow GNU linker) to lld (fast LLVM linker).
Ok, what now?
+Now, let’s understand why the first example, with ld rather than lld, works at all :-)
+
As a reminder, we use NixOS, so there’s no global folder a-la /usr/lib where all shared libraries are stored.
+Coming back to our pkgconfig example,
+
+
+
the libevdev.so is well-hidden behind the hash.
+So we need the pkg-config binary at compile time to get from the libevdev name to the actual location.
+
However, as this is a dynamic library, we need it not only during compilation, but during runtime as well.
+And at runtime the loader (also known as the dynamic linker (its binary name is something like ld-linux-x86-64.so, but despite the .so suffix, it’s an executable (I kid you not, this stuff is indeed this confusing))) loads the executable together with the shared libraries required by it.
+Normally, the loader looks for libraries in well-known locations, like the aforementioned /usr/lib or LD_LIBRARY_PATH.
+So we need something which would tell the loader that libevdev lives at /nix/store/$HASH/lib.
+
That something is rpath (also known as RUNPATH) — this is more or less LD_LIBRARY_PATH, just hard-coded into the executable.
+We can use readelf to inspect a program’s rpath.
+
When the binary is linked with the default linker, the result is as follows (lightly edited for clarity):
+
+
+
And sure, we see path to libevdev right there!
+
With rustflags = ["-Clink-arg=-fuse-ld=lld"], the result is different, the library is missing from rpath:
+
+
+
At this point, I think we know what’s going on.
+To recap:
+
+
+With both ld and lld, we don’t have problems at compile time, because pkg-config helps the compiler to find the library.
+
+
+At runtime, the library linked with lld fails to find the shared library, while the one linked with ld works.
+
+
+The difference between the two binaries is the value of rpath in the binary itself.
+ld somehow manages to include an rpath which contains the path to the library.
+This rpath is what allows the loader to locate the library at runtime.
+
+
+
Curious observation: dynamic linking on NixOS is not entirely dynamic.
+Because executables expect to find shared libraries in specific locations marked with hashes of the libraries themselves, it’s not possible to just upgrade .so on disk for all the binaries to pick it up.
Why do we have that magical rpath thing in one of the binaries?
+The answer is simple — to set rpath, one passes -rpath /nix/store/... flag to the linker at compile time.
+The linker then just embeds the specified string as rpath field in the executable, without really inspecting it in any way.
+
And here comes the magical/hacky bit — the thing that adds that -rpath argument to the linker’s command line is the NixOS wrapper script!
+That is, the ld on NixOS is not a proper ld, but rather a shell script which does a bit of extra fudging here and there, including the rpath:
+
+
+
+There’s a lot going on in that wrapper script, but the relevant thing for us, as far as I understand, is that everything that gets passed as -L at compile time gets embedded into the binary’s rpath, so that it can be used at runtime as well.
+
Now, let’s take a look at lld’s wrapper:
+
+
+
Haha, nope, there’s no wrapper!
+Unlike ld, lld on NixOS is an honest-to-Bosch binary file, and that’s why we can’t have great things!
+This is tracked in issue #24744 in the nixpkgs repo :)
+
Update:
+
So….. turns out there’s more than one lld on NixOS.
+There’s pkgs.lld, the thing I have been using in the post.
+And then there’s the pkgs.llvmPackages.bintools package, which also contains lld.
+And that version is actually wrapped into an rpath-setting shell script, the same way ld is.
+
That is, pkgs.lld is the wrong lld, the right one is pkgs.llvmPackages.bintools.
This post has nothing to do with JIT-like techniques for patching machine code on the fly (though they are cool!).
+Instead, it describes a cute/horrible trick/hack you can use to generate source code if you are not a huge fan of macros.
+The final technique is going to be independent of any particular programming language, but the lead-up is going to be Rust-specific.
+The pattern can be applied to a wide variety of tasks, but we’ll use a model problem to study different solutions.
I have a field-less enum representing various error conditions:
+
+
+
This is a type I expect to change fairly often.
+I predict that it will grow a lot.
+Even the initial version contains half a dozen variants already!
+For brevity, I am showing only a subset here.
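For concreteness, picture something along these lines (the variant names here are made up):

```rust
#[derive(Clone, Copy, Debug)]
pub enum Error {
    InvalidSignature,
    AccountNotFound,
    InsufficientBalance,
    // ... and many more to come.
}
```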
+
For the purposes of serialization, I would like to convert this error to and from an error code.
+One direction is easy, there’s a built-in mechanism for this in Rust:
+
+
+
The other direction is more annoying: it isn’t handled by the language automatically yet (although there’s an in-progress PR which adds just that!), so we have to write some code ourselves:
+
+
+
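A sketch of both directions, mirroring the made-up enum above:

```rust
impl Error {
    // The easy direction: a fieldless enum casts to an integer.
    pub fn code(self) -> u32 {
        self as u32
    }

    // The annoying direction: a hand-written match.
    pub fn from_code(code: u32) -> Option<Error> {
        match code {
            0 => Some(Error::InvalidSignature),
            1 => Some(Error::AccountNotFound),
            2 => Some(Error::InsufficientBalance),
            _ => None,
        }
    }
}
```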
Now, given that I expect this type to change frequently, this is asking for trouble!
+It’s very easy for the match and the enum definition to get out of sync!
Now, seasoned Rust developers are probably already thinking about macros (or maybe even about specific macro crates).
+And we’ll get there!
+But first, let’s see how I usually solve the problem, when (as I am by default) I am not keen on adding macros.
+
The idea is to trick the compiler into telling us the number of elements in the enum, which would allow us to implement some sanity checking.
+We can do this by adding a fake element at the end of the enum:
+
+
+
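A sketch of the trick, continuing with the made-up variants:

```rust
#[derive(Clone, Copy)]
pub enum Error {
    InvalidSignature,
    AccountNotFound,
    InsufficientBalance,
    __LAST,
}

// The array length is derived from __LAST, so adding a variant without
// extending ALL becomes a compile-time error.
pub const ALL: [Error; Error::__LAST as usize] = [
    Error::InvalidSignature,
    Error::AccountNotFound,
    Error::InsufficientBalance,
];
```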
Now, if we add a new error variant, but forget to update the ALL array, the code will fail to compile — exactly the reminder we need.
+The major drawback here is that __LAST variant has to exist.
+This is fine for internal stuff, but not really great for a public, clean API.
Now, let’s get to macros, and let’s start with the simplest possible one I can think of!
+
+
+
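Something like this, say (same made-up variants as before):

```rust
define_error![
    InvalidSignature,
    AccountNotFound,
    InsufficientBalance,
];
```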
Pretty simple, heh? Let’s look at the definition of define_error! though:
+
+
+
That’s … quite literally a puzzle!
+Declarative macro machinery is comparatively inexpressive, so you need to get creative to get what you want.
+Here, ideally I’d write
+
+
+
Alas, counting in macro by example is possible, but not trivial.
+It’s a subpuzzle!
+Rather than solving it, I use the following work-around:
+
+
+
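One way such a work-around can look (a reconstruction, so the details may differ from the original; the key idea is to reuse the variant names as constants that double as match patterns):

```rust
macro_rules! define_error {
    ($($variant:ident),* $(,)?) => {
        pub enum Error {
            $($variant,)*
        }

        impl Error {
            pub fn from_code(code: u32) -> Option<Error> {
                #![allow(non_upper_case_globals)]
                // A constant per variant, named exactly like the variant...
                $(const $variant: u32 = Error::$variant as u32;)*
                match code {
                    // ...so that it can be used as a pattern here.
                    $($variant => Some(Error::$variant),)*
                    _ => None,
                }
            }
        }
    };
}
```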
And then I have to #![allow(non_upper_case_globals)], to prevent the compiler from complaining.
The big problem with the macro is that it’s not only the internal implementation which is baroque!
+The call-site is pretty inscrutable as well!
+Let’s imagine we are new to a codebase, and come across the following snippet:
+
+
+
The question I would ask here would be “what is that Error thing?”.
+Luckily, we live in the age of powerful IDEs, so we can just “goto definition” to answer that, right?
+
+
+
Well, not really.
+An IDE says that the Error token is produced by something inside that macro invocation.
+That’s a correct answer, if not the most useful one!
+So I have to read the definition of the define_error macro and understand how that works internally to get the idea about public API available externally (e.g., that the Error refers to a public enum).
+And here the puzzler nature of declarative macros is exacerbated.
+It’s hard enough to figure out how to express the idea you want using the restricted language of macros.
+It’s doubly hard to understand the idea the macro’s author had when you can’t peek inside their brain and can only observe the implementation of the macro.
+
One remedy here is to make macro input look more like the code we want to produce.
+Something like this:
+
+
+
This indeed is marginally friendlier for IDEs and people to make sense of:
+
+
+
The cost for this is a more complicated macro implementation.
+Generally, a macro needs to do two things: parse arbitrary token stream input, and emit valid Rust code as output.
+Parsing is usually the more complicated task.
+That’s why in our minimal attempt we used maximally simple syntax, just a list of identifiers.
+However, if we want to make the input of the macro look more like Rust, we have to parse a subset of Rust, and that’s more involved:
+
+
+
We have to carefully deal with all those visibilities and attributes.
+Even after we do that, the connection between the input Rust-like syntax and the output Rust is skin-deep.
+This is mostly smoke and mirrors, and is not much different from, e.g., using Haskell syntax here:
We can meaningfully increase the fidelity between macro input and macro output by switching to a derive macro.
+In contrast to function-like macros, derives require that their input is syntactically and even semantically valid Rust.
+
So the result looks like this:
+
+
+
Again, the enum Error here is an honest, simple enum!
+It’s not an alien beast which just wears an enum’s skin.
+
And the implementation of the macro doesn’t look too bad either, thanks to @dtolnay’s tasteful API design:
+
+
+
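A sketch of what such a derive can look like, using syn and quote (the derive’s name and details here are assumptions, not the exact original):

```rust
use proc_macro::TokenStream;
use quote::quote;
use syn::{parse_macro_input, Data, DeriveInput};

// Lives in a dedicated crate with `proc-macro = true` in Cargo.toml.
#[proc_macro_derive(FromCode)]
pub fn from_code(input: TokenStream) -> TokenStream {
    let input = parse_macro_input!(input as DeriveInput);
    let name = &input.ident;
    let variants = match &input.data {
        Data::Enum(data) => &data.variants,
        _ => panic!("FromCode only works on fieldless enums"),
    };
    // A match over consecutive natural numbers, written out directly.
    let arms = variants.iter().enumerate().map(|(code, variant)| {
        let code = code as u32;
        let ident = &variant.ident;
        quote! { #code => Some(#name::#ident), }
    });
    let expanded = quote! {
        impl #name {
            pub fn from_code(code: u32) -> Option<#name> {
                match code {
                    #(#arms)*
                    _ => None,
                }
            }
        }
    };
    expanded.into()
}
```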
Unlike declarative macros, here we just directly express the syntax that we want to emit — a match over consecutive natural numbers.
+
The biggest drawback here is that on the call-site now we don’t have any idea about the extra API generated by the macro.
+If, with declarative macros, you can notice a pub fn from_code in the same file and guess that that’s a part of the API, with a procedural macro that string is in a completely different crate!
+While proc-macro can greatly improve the ergonomics of using and implementing macros (inflated compile times notwithstanding), for the reader, they are arguably even more opaque than declarative macros.
Finally, let’s see the promised hacky solution :)
+While, as you might have noticed, I am not a huge fan of macros, I like plain old code generation — text in, text out.
+Text manipulation is much worse-is-betterer than advanced macro systems.
+
So what we are going to do is:
+
+
+Read the file with the enum definition as a string (file!() macro will be useful here).
+
+
+“Parse” enum definition using unsophisticated string splitting (str::split_once, aka cut would be our parser).
+
+
+Generate the code we want by concatenating strings.
+
+
+Paste the resulting code into a specially marked position.
+
+
+Overwrite the file in place, if there are changes.
+
+
+And we are going to use a #[test] to drive the process!
+
+
+
+
+
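A sketch of how this can be wired up (the markers, helper names and the parsing are all illustrative; the real thing would be adapted to the actual file layout):

```rust
#[test]
fn generate_from_code() {
    // 1. Read this very file.
    let path = file!();
    let text = std::fs::read_to_string(path).unwrap();

    // 2. "Parse" the enum body with plain string splitting.
    let (_, rest) = text.split_once("pub enum Error {").unwrap();
    let (body, _) = rest.split_once('}').unwrap();
    let variants: Vec<&str> = body
        .lines()
        .map(|line| line.trim().trim_end_matches(','))
        .filter(|line| !line.is_empty() && !line.starts_with("//"))
        .collect();

    // 3. Generate the code by concatenating strings.
    let mut arms = String::new();
    for (i, variant) in variants.iter().enumerate() {
        arms += &format!("        {i} => Some(Error::{variant}),\n");
    }
    let generated = format!("match code {{\n{arms}        _ => None,\n    }}");

    // 4. Paste it between the markers (assumed to sit inside from_code).
    //    In a real setup, keep the markers in a different file than this
    //    test, or build the marker strings so they don't match themselves.
    let start = "// region:generated";
    let end = "// endregion:generated";
    let (prefix, rest) = text.split_once(start).unwrap();
    let (_, suffix) = rest.split_once(end).unwrap();
    let new_text = format!("{prefix}{start}\n    {generated}\n    {end}{suffix}");

    // 5. Overwrite the file only if something actually changed.
    if new_text != text {
        std::fs::write(path, new_text).unwrap();
        panic!("generated code was stale and has been updated; re-run the tests");
    }
}
```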
That’s the whole pattern!
+Note how, unlike every other solution, it is crystal clear how the generated code works.
+It’s just code which you can goto-definition into, or step through in a debugger.
+You can be completely oblivious about the shady #[test] machinery, and that won’t harm understanding in any way.
+
The code of the “macro” is also easy to understand — that’s literally string manipulation.
+What’s more, you can easily see how it works by just running the test!
+
The “read and update your own source code” part is a bit mind-bending!
+But the implementation is tiny and only uses the standard library, so it should be easy to understand.
+
Unlike macros, this doesn’t try to enforce at compile time that the generated code is fresh.
+If you update the Error definition, you need to re-run the test for the generated code to be updated as well.
+But this will be caught by the tests.
+Note the important detail — the test only tries to update the source code if there are, in fact, changes.
+That is, writable src/ is required only during development.
+
That’s all, hope this survey was useful! Discussion on /r/rust.
LSP (language server protocol) is fairly popular today.
+There’s a standard explanation of why that is the case.
+You probably have seen this picture before:
+
+
+
I believe that this standard explanation of LSP popularity is wrong.
+In this post, I suggest an alternative picture.
There are M editors and N languages.
+If you want to support a particular language in a particular editor, you need to write a dedicated plugin for that.
+That means M * N work, as the picture on the left vividly demonstrates.
+What LSP does is cutting that to M + N, by providing a common thin waist, as shown in the picture on the right.
The problem with this explanation is also best illustrated pictorially.
+In short, the picture above is not drawn to scale.
+Here’s a better illustration of how, for example, rust-analyzer + VS Code combo works together:
+
+
+
The (big) ball on the left is rust-analyzer — a language server.
+The similarly sized ball on the right is VS Code — an editor.
+And the small ball in the center is the code to glue them together, including LSP implementations.
+
That code is relatively and absolutely tiny.
+The codebases behind either the language server or the editor are enormous.
+
If the standard theory were correct, then, before LSP, we would have lived in a world where some languages had superb IDE support in some editors.
+For example, IntelliJ would have been great at Java, Emacs at C++, Vim at C#, etc.
+My recollection of that time is quite different.
+To get decent IDE support, you either used a language supported by JetBrains (IntelliJ or ReSharper), or you went without.
+
There was just a single editor providing meaningful semantic IDE support.
I would say that the reason for such poor IDE support in the days of yore is different.
+Rather than M * N being too big, it was too small, because N was zero and M just slightly more than that.
+
+I’d start with N — the number of language servers; this is the side I am relatively familiar with.
+Before LSP, there simply weren’t a lot of working language-server shaped things.
+The main reason for that is that building a language server is hard.
+
The essential complexity for a server is pretty high.
+It is known that compilers are complicated, and a language server is a compiler and then some.
+
First, like a compiler, a language server needs to fully understand the language, it needs to be able to distinguish between valid and invalid programs.
+However, while for invalid programs a batch compiler is allowed to emit an error message and exit promptly, a language server must analyze any invalid program as best as it can.
+Working with incomplete and invalid programs is the first complication of a language server in comparison to a compiler.
+
Second, while a batch compiler is a pure function which transforms source text into machine code, a language server has to work with a code base which is constantly being modified by the user.
+It is a compiler with a time dimension, and evolution of state over time is one of the hardest problems in programming.
+
Third, a batch compiler is optimized for maximum throughput, while a language server aims to minimize latency (while not completely forgoing throughput).
+Adding a latency requirement doesn’t mean that you need to optimize harder.
+Rather, it means that you generally need to turn the architecture on its head to have an acceptable latency at all.
+
And this brings us to a related cluster of accidental complexity surrounding language servers.
+It is well understood how to write a batch compiler.
+It’s common knowledge.
+While not everyone has read the dragon book (I didn’t meaningfully get past the parsing chapters), everyone knows that that book contains all the answers.
+So most existing compilers end up looking like a typical compiler.
+And, when compiler authors start thinking about IDE support, the first thought is “well, IDE is kinda a compiler, and we have a compiler, so problem solved, right?”.
+This is quite wrong — internally an IDE is very different from a compiler but, until very recently, this wasn’t common knowledge.
+
Language servers are a counter-example to the “never rewrite” rule.
+The majority of well-regarded language servers are rewrites or alternative implementations of batch compilers.
+
Both IntelliJ and Eclipse wrote their own compilers rather than re-using javac inside an IDE.
+To provide adequate IDE support for C#, Microsoft rewrote their C++-implemented batch compiler into an interactive, self-hosted one (project Roslyn).
+Dart, despite being a from-scratch, relatively modern language, ended up with three implementations (host AOT compiler, host IDE compiler (dart-analyzer), on-device JIT compiler).
+Rust tried both — incremental evolution of rustc (RLS) and from-scratch implementation (rust-analyzer), and rust-analyzer decisively won.
+
The two exceptions I know are C++ and OCaml.
+Curiously, both require forward declarations and header files, and I don’t think this is a coincidence.
+See the Three Architectures for a Responsive IDE post for details.
+
To sum up, on the language server’s side things were in a bad equilibrium.
+It was totally possible to implement language servers, but that required a bit of an iconoclastic approach, and it’s hard to be a pioneering iconoclast.
+
I am less certain what was happening on the editor’s side.
+Still, I do want to claim that we had no editors capable of being an IDE.
+
IDE experience consists of a host of semantic features.
+The most notable example is, of course, completion.
+If one wants to implement custom completion for VS Code, one needs to implement
+CompletionItemProvider interface:
+
+
+
This means that, in VS Code, code completion (as well as dozens of other IDE related features) is an editor’s first-class concept, with uniform user UI and developer API.
+
Contrast this with Emacs and Vim.
+They just don’t have proper completion as an editor’s extension point.
+Rather, they expose low-level cursor and screen manipulation API, and then people implement competing completion frameworks on top of that!
+
And that’s just code completion!
+What about parameter info, inlay hints, breadcrumbs, extend selection, assists, symbol search, find usages (I’ll stop here :) )?
+
To sum the above succinctly, the problem with decent IDE support was not of N * M, but rather of an inadequate equilibrium of a two-sided market.
+
Language vendors were reluctant to create language servers, because it was hard, the demand was low (= no competition from other languages), and, even if one creates a language server, one would find a dozen editors absolutely unprepared to serve as a host for a smart server.
+
On the editor’s side, there was little incentive for adding high-level APIs needed for IDEs, because there were no potential providers for those APIs.
I don’t think it was a big technical innovation (it’s obvious that you want to separate a language-agnostic editor and a language-specific server).
+I think it’s a rather bad (aka, “good enough”) technical implementation (stay tuned for “Why LSP sucks?” post I guess?).
+But it moved us from a world where not having a language IDE was normal and no one was even thinking about language servers, to a world where a language without working completion and goto definition looks unprofessional.
+
Notably, the two-sided market problem was solved by Microsoft, who were a vendor of both languages (C# and TypeScript) and editors (VS Code and Visual Studio), and who were generally losing in the IDE space to a competitor (JetBrains).
+While I may rant about particular technical details of LSP, I absolutely admire their strategic vision in this particular area.
+They:
+
+
+built an editor on web technologies.
+
+
+identified webdev as a big niche where JetBrains struggles (supporting JS in an IDE is next to impossible).
+
+
+built a language (!!!!) to make it feasible to provide IDE support for webdev.
+
+
+built an IDE platform with a very forward-looking architecture (stay tuned for a post where I explain why vscode.d.ts is a marvel of technical excellence).
+
+
+launched LSP to increase the value of their platform in other domains for free (moving the whole world to a significantly better IDE equilibrium as a collateral benefit).
+
+
+and now, with code spaces, are poised to become the dominant player in “remote first development”, should we indeed stop editing, building, and running code on our local machines.
+
+
+
Though, to be fair, I still hope that, in the end, the winner would be JetBrains with their idea of Kotlin as a universal language for any platform :-)
+While Microsoft takes full advantage of worse-is-better technologies which are dominant today (TypeScript and Electron), JetBrains tries to fix things from the bottom up (Kotlin and Compose).
Now I am just going to hammer it in that it’s really not M * N :)
+
+First, the M * N argument ignores the fact that this is an embarrassingly parallel problem.
+Language designers don’t need to write plugins for all editors, nor do editors need to add special support for all languages.
+Rather, a language should implement a server which speaks some protocol, an editor needs to implement language agnostic APIs for providing completions and such, and, if both the language and the editor are not esoteric, someone who is interested in both would just write a bit of glue code to bind the two together!
+rust-analyzer’s VS Code plugin is 3.2k lines of code, neovim plugin is 2.3k and Emacs plugin is 1.2k.
+All three are developed independently by different people.
+That’s the magic of decentralized open source development at its finest!
+If the plugins were to support a custom protocol instead of LSP (provided that the editor supports a high-level IDE API inside), I’d expect to add maybe 2k lines for that, which is still well within a hobbyist’s part-time budget.
+
+Second, for an M * N optimization you’d expect the protocol implementation to be generated from some machine-readable description.
+But until the latest release, the source of truth for LSP spec was an informal markdown document.
+Every language and client was coming up with their own way to extract the protocol out of it; many (including rust-analyzer) were just syncing the changes manually, with quite a bit of duplication.
+
Third, if M * N is a problem, you’d expect to see only one LSP implementation for each editor.
+In reality, there are two competing Emacs implementations (lsp-mode and eglot) and, I kid you not, at the time of writing rust-analyzer’s manual contains instructions for integration with 6 (six) different LSP clients for vim.
+To echo the first point, this is open source!
+The total amount of work is almost irrelevant, the thing that matters is the amount of coordination to get things done.
+
Fourth, Microsoft itself doesn’t try to take advantage of M + N.
+There’s no universal LSP implementation in VS Code.
+Instead, each language is required to have a dedicated plugin with physically independent implementations of LSP.
Please demand better IDE support!
+I think today we crossed the threshold of general availability of baseline IDE support, but there’s so much we can do beyond the basics.
+In the ideal world, it should be possible to inspect every little semantic detail about the expression at the cursor, using the same simple API one can use today to inspect the contents of the editor’s buffer.
+
+
Text Editor Authors
+
+
Pay attention to the architecture of VS Code.
+While electron delivers questionable user experience, the internal architecture has a lot of wisdom in it.
+Do orient editor’s API around presentation-agnostic high-level features.
+Basic IDE functionality should be a first-class extension point, it shouldn’t be re-invented by every plugin’s author.
+In particular, add assist/code action/💡 as a first-class UX concept already.
+It’s the single most important UX innovation of IDEs, which is very old at this point.
+It’s outright ridiculous that this isn’t a standard interface across all editors.
+
But don’t make LSP itself a first class concept.
+Surprising as it might seem, VS Code knows nothing about LSP.
+It just provides a bunch of extension points without caring in the least how they are implemented.
+LSP implementation then is just a library, which is used by language-specific plugins.
+E.g., Rust and C++ extensions for VS Code do not share the same LSP implementation at runtime, there are two copies of LSP library in memory!
+
Also, try to harness the power of open-source.
+Don’t enforce centralization of all LSP implementations!
+Make it possible for separate groups of people to independently work on perfect Go support and perfect Rust support for your editor.
+VS Code is one possible model, with a marketplace and distributed, independent plugins.
+But it probably should be possible to organize the work as a single shared repo/source tree, as long as languages can have independent maintainer sets.
+
+
Language Server Authors
+
+
You are doing a great job!
+The quality of IDE support is improving rapidly for all the languages, though I feel this is only a beginning of a long road.
+One thing to keep in mind is that LSP is an interface to a semantic info about the language, but it isn’t the interface.
+A better thing might come along.
+Even today, limitations of LSP prevent us from shipping useful features.
+So, try to treat LSP as a serialization format, not as an internal data model.
+And try to write more about how to implement language servers — I feel like there’s still not enough knowledge about this out there.
This post documents one rule of thumb I find useful when coding:
+
+
+
Being a rule-of-thumb, it naturally has exceptions, but those are relatively few.
+The primary context here is application development.
+Libraries with a semver-constrained API have other guidelines — the rules are different at the boundaries.
+
This privacy rule is a manifestation of the fact that the two most popular kinds of entities in programs are:
+
+
+Abstract data types — complex objects with opaque implementation which guard interior invariants and expose intentionally limited API to the outside world
+
+
+Data — relatively simple objects which group a bunch of related attributes together
+
+
+
If some fields of a type are private, it can’t be data.
+If some fields of a type are public, it can still be an ADT, but the abstraction boundary will be a bit awkward.
+Better to just add getters for (usually few) fields which can be public, to make it immediately obvious what role is played by the type.
+
An example of ADT would be FileSet from rust-analyzer’s virtual file system implementation.
+
+
+
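A simplified sketch of its shape (the real type uses VfsPath and FileId newtypes; String and u32 are stand-ins here):

```rust
use std::collections::HashMap;

#[derive(Default)]
pub struct FileSet {
    files: HashMap<String, u32>,
    paths: HashMap<u32, String>,
}

impl FileSet {
    pub fn insert(&mut self, file_id: u32, path: String) {
        self.files.insert(path.clone(), file_id);
        self.paths.insert(file_id, path);
    }
    pub fn file_for_path(&self, path: &str) -> Option<u32> {
        self.files.get(path).copied()
    }
    pub fn path_for_file(&self, file_id: u32) -> Option<&String> {
        self.paths.get(&file_id)
    }
}
```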
This type maintains a bidirectional mapping between string paths and integral file ids.
+How exactly the mapping is maintained (hash map, search tree, trie?) is irrelevant, this implementation detail is abstracted away.
+Additionally, there’s an invariant: the files and paths fields are consistent, complementary mappings.
+So this is the case where all fields are private and there’s a bunch of accessor functions.
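In contrast, a plain-data type from the same area might look something like this (a simplified sketch; the real type and field names differ):

```rust
use std::path::PathBuf;

pub struct Directories {
    pub extensions: Vec<String>,
    pub include: Vec<PathBuf>,
    pub exclude: Vec<PathBuf>,
}
```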
This type specifies a set of paths to include in VFS, a sort-of simplified gitignore.
+This is an inert piece of data — a bunch of extensions, include paths and exclude paths.
+Any combination of the three is valid, so there’s no need for privacy here.
This rule is very mechanical, but it reflects a deeper distinction between flavors of types.
+For a more thorough treatment of the underlying phenomenon, see “Be clear what kind of class you’re writing” chapter from Alexandrescu’s “C++ Coding Standards” and
+“The Expression Problem” from ever thought-provoking Kaminski.
In this short post, I describe and name a cousin of the builder pattern — builder lite.
+
Unlike a traditional builder, which uses a separate builder object, builder lite re-uses the object itself to provide builder functionality.
+
Here’s an illustrative example
+
+
+
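An illustrative sketch (the type and its fields are made up):

```rust
#[derive(Default)]
pub struct Shape {
    position: (f32, f32, f32),
    label: Option<String>,
    visible: bool,
}

impl Shape {
    pub fn new() -> Shape {
        Shape::default()
    }
    pub fn with_position(mut self, position: (f32, f32, f32)) -> Shape {
        self.position = position;
        self
    }
    pub fn with_label(mut self, label: impl Into<String>) -> Shape {
        self.label = Some(label.into());
        self
    }
}

// Call site: start from `new` and chain only the options you need.
// let shape = Shape::new().with_position((0.0, 9.8, 0.0)).with_label("ball");
```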
In contrast, the full builder is significantly wordier at the definition site, and requires a couple of extra invocations at the call site:
+
+
+
The primary benefit of builder-lite is that it is an incremental, zero-cost evolution from the new method.
+As such, it is especially useful in the context where the code evolves rapidly, in an uncertain direction.
+That is, when building applications rather than libraries.
+
To pull a motivational example from work, we had the following typical code:
+
+
+
Here’s a new method with a whole bunch of arguments for various dependencies.
+What we needed to do is to add yet another dependency, so that it could be overwritten in tests.
+The first attempt just added one more parameter to the new method:
+
+
+
However, this change required updating the seven call-sites where new was called, to supply the default counter.
+Switching that to builder lite allowed us to only modify a single call-site where we cared to override the counter.
+
A note on naming:
+If builder methods are to be used only occasionally, with_foo is the best naming.
+If most call-sites make use of builder methods, just .foo might work better.
+For boolean properties, sometimes it makes sense to have both:
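Continuing the Shape sketch from above:

```rust
impl Shape {
    // The short form for the common case...
    pub fn visible(self) -> Shape {
        self.with_visible(true)
    }
    // ...and the explicit form for when the flag is computed.
    pub fn with_visible(mut self, visible: bool) -> Shape {
        self.visible = visible;
        self
    }
}
```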
In this post I’ll describe how to implement caches in Rust.
+It is inspired by two recent refactors I landed at nearcore (nearcore#6549, nearcore#6811).
+Based on that experience, it seems that implementing caches wrong is rather easy, and making a mistake there risks “spilling over”, and spoiling the overall architecture of the application a bit.
+
Let’s start with an imaginary setup with an application with some configuration and a database:
+
+
+
The database is an untyped key-value store:
+
+
+
And the App encapsulates the database and provides typed access to a domain-specific Widget:
+
+
+
Now, for the sake of argument let’s assume that database access and subsequent deserialization are costly, and that we want to add a cache of Widgets in front of the database.
+Data-oriented thinking would compel us to get rid of deserialization step instead, but we will not pursue that idea this time.
+
We’ll use a simple HashMap for the cache:
+
+
+
And we need to modify get_widget method to return the value from the cache, if there is one:
+
+
+
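Putting the pieces together, a compact sketch of this path-of-least-resistance version (Db, Widget and the key handling are stand-ins):

```rust
use std::collections::HashMap;

struct Db;
impl Db {
    fn load(&self, _id: u32) -> Option<Vec<u8>> {
        unimplemented!("imagine a real database here")
    }
}

#[derive(Clone)]
struct Widget {
    title: String,
}

fn deserialize(_bytes: &[u8]) -> Widget {
    unimplemented!("imagine a costly deserialization step here")
}

struct App {
    db: Db,
    cache: HashMap<u32, Widget>,
}

impl App {
    fn get_widget(&mut self, id: u32) -> Option<&Widget> {
        if !self.cache.contains_key(&id) {
            let bytes = self.db.load(id)?;
            let widget = deserialize(&bytes);
            self.cache.insert(id, widget);
        }
        self.cache.get(&id)
    }
}
```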
The biggest change is the &mut self.
+Even when reading the widget, we need to modify the cache, and the easiest way to get that ability is to require an exclusive reference.
+
I want to argue that this path of least resistance doesn’t lead to a good place.
+There are many problems with methods of the following shape:
+
+
+
First, such methods conflict with each other.
+For example, the following code won’t work, because we’ll try to borrow the app exclusively twice.
+
+
+
Second, the &mut methods conflict even with & methods.
+Naively, it would seem that, as get_widget returns a shared reference, we should be able to call & methods.
+So, one can expect something like this to work:
+
+
+
Alas, it doesn’t.
+Rust borrow checker doesn’t distinguish between mut and non-mut lifetimes (for a good reason: doing that would be unsound).
+So, although w is just &Widget, the lifetime there is the same as on the &mut self, so the app remains mutably borrowed while the widget exists.
+
Third, perhaps the most important point, the &mut self becomes viral — most of functions in the program begin requiring &mut, and you lose type-system distinction between read-only and read-write operations.
+There’s no distinction between “this function can only modify the cache” and “this function can modify literally everything”.
+
Finally, even implementing get_widget is not pleasant.
+Seasoned rustaceans among you might twitch at the needlessly-repeated hashmap lookups.
+But trying to get rid of those with the help of the entry-API runs into current borrow checker limitations.
+
Let’s look at how we can better tackle this!
+
The general idea for this class of problems is to think what the ownership and borrowing situation should be and try to achieve that, as opposed to merely following suggestions by the compiler.
+That is, most of the time just using &mut and & as the compiler guides you is a path to success, as, it turns out, the majority of the code naturally follows simple aliasing rules.
+But there are exceptions, it’s important to recognize them as such and make use of interior mutability to implement the aliasing structure which makes sense.
+
Let’s start with a simplified case.
+Suppose that there’s only one Widget to deal with.
+In this case, we’d want something like this:
+
+
+
This doesn’t work as is — modifying the cache needs &mut which we’d very much prefer to avoid.
+However, thinking about this pattern, it feels like it should be valid — we enforce at runtime that the contents of the cache is never overwritten.
+That is, we actually do have exclusive access to cache on the highlighted line at runtime, we just can’t explain that to the type system.
+But we can reach out for unsafe for that.
+What’s more, Rust’s type system is powerful enough to encapsulate that usage of unsafe into a safe and generally re-usable API.
+So let’s pull once_cell crate for this:
+
+
+
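A sketch of that single-widget version, reusing the stand-ins from above (once_cell::unsync::OnceCell is the single-threaded flavor):

```rust
use once_cell::unsync::OnceCell;

struct App {
    db: Db,
    widget: OnceCell<Widget>,
}

impl App {
    // Note: &self, not &mut self. The cell is written at most once,
    // so handing out &Widget is fine.
    fn get_widget(&self) -> &Widget {
        self.widget.get_or_init(|| {
            let bytes = self.db.load(0).unwrap(); // placeholder key
            deserialize(&bytes)
        })
    }
}
```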
Coming back to the original hash-map example, we can apply the same logic here:
+as long as we never overwrite, delete or move values, we can safely return references to them.
+This is handled by the elsa crate:
+
+
+
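A sketch with elsa's FrozenMap, again reusing the stand-ins from above. Because entries are never removed or overwritten, the map can hand out plain &Widget through &self:

```rust
use elsa::FrozenMap;

struct App {
    db: Db,
    cache: FrozenMap<u32, Box<Widget>>,
}

impl App {
    fn get_widget(&self, id: u32) -> Option<&Widget> {
        if let Some(widget) = self.cache.get(&id) {
            return Some(widget);
        }
        let bytes = self.db.load(id)?;
        let widget = deserialize(&bytes);
        Some(self.cache.insert(id, Box::new(widget)))
    }
}
```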
The third case is that of a bounded cache.
+If you need to evict values, then the above reasoning does not apply.
+If the user of a cache gets a &T, and then the corresponding entry is evicted, the reference would dangle.
+In this situation, we want the clients of the cache to co-own the value.
+This is easily handled by an Rc:
+
+
+
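One possible shape of that, with a RefCell<HashMap> standing in for a real LRU and the same stand-ins as above:

```rust
use std::cell::RefCell;
use std::collections::HashMap;
use std::rc::Rc;

struct App {
    db: Db,
    cache: RefCell<HashMap<u32, Rc<Widget>>>,
}

impl App {
    fn get_widget(&self, id: u32) -> Option<Rc<Widget>> {
        if let Some(widget) = self.cache.borrow().get(&id) {
            return Some(Rc::clone(widget));
        }
        let bytes = self.db.load(id)?;
        let widget = Rc::new(deserialize(&bytes));
        let mut cache = self.cache.borrow_mut();
        if cache.len() >= 128 {
            // Evict something; a real implementation would use an LRU policy.
            cache.clear();
        }
        cache.insert(id, Rc::clone(&widget));
        Some(widget)
    }
}
```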
To sum up: when implementing a cache, the path of the least resistance is to come up with a signature like this:
+
+
+
This often leads to problems down the line.
+It’s usually better to employ some interior mutability and get either of these instead:
+
+
+
This is an instance of the more general effect: despite the “mutability” terminology, Rust references track not mutability, but aliasing.
+Mutability and exclusive access are correlated, but not perfectly.
+It’s important to identify instances where you need to employ interior mutability, often they are architecturally interesting.
+
To learn more about relationships between aliasing and mutability, I recommend the following two posts:
There’s a bit of discussion happening in Rust community on the generic associated types topic.
+I can not help but add my own thoughts to the pile :-)
+
I don’t intend to write a well-edited post considering all pros and cones (intentional typo to demonstrate how unedited this is).
+Rather, I just want to dump my experience as is.
+Ultimately I trust the lang team to make the right call here way more than I trust myself.
+The post could be read as a bit inflammatory, but my stated goal here is not to sway someone’s mind by the arguments, but rather expose my own thinking process.
+
This post is partially prompted by the following comment from the RFC:
+
+
+
It stuck with me, because this is very much the opposite of the experience I have.
+I’ve been using Rust extensively for a while, mostly as an application (as opposed to library) developer, and I can’t remember a single instance where I really wanted to have GATs.
+This is a consequence of my overall code style — I try to use abstraction sparingly and rarely reach out for traits.
+I don’t think I’ve ever built a meaningful abstraction which was expressed via traits?
+On the contrary, I try hard to make everything concrete and non-generic on the language level.
+
What’s more, when I do reach out for traits, most of the time this is to use trait objects, which give me a new runtime capability to use different, substitutable concrete type.
+For the static, monomorphization-based subset of traits I find that most of the time non-trait solutions seem to work.
+
And I think GATs (and associated types in general) don’t work with trait objects, which probably explains why, even when I use traits, I don’t generally need GATs.
+Though, it seems to me that lifetime-only subset of GATs actually works with trait objects?
+That is, lending iterator seems to be object safe?
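For reference, the lifetime-only GAT in question is something like this (whether this trait can actually be made object safe is exactly the open question above):

```rust
trait LendingIterator {
    type Item<'a>
    where
        Self: 'a;

    // Unlike Iterator, the returned item may borrow from the iterator itself.
    fn next(&mut self) -> Option<Self::Item<'_>>;
}
```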
+
I guess, the only place where I do, indirectly, want GATs is to make async trait work, but even then, I usually am interested in object-safe async traits, which I think don’t need and can’t use GATs?
+
+
Another disconnection between my usage of Rust and discussion surrounding the GATs is in one of the prominent examples — parser combinator library.
+In practice, for me parser combinator’s primary use-case was always a vehicle for teaching advanced types (eg, the monads paper uses parsers as one of the examples).
+For production use-cases I’ve encountered, it was always either a hand-written parser, or a full-blown parser generator.
In this post I argue that integration-vs-unit is a confused, and harmful, distinction.
+I provide a more useful two-dimensional mental model instead.
+The model is descriptive (it allows one to think more clearly about any test), but I also include my personal prescriptions (the model shows which metrics are and aren’t worth optimizing).
+
Credit for the idea goes to the SWE book.
+I always felt that integration versus unit debate is confused, the book helped me to formulate in which way exactly.
+
I won’t actually rigorously demonstrate the existing confusion — I find it self-evident.
+As just two examples:
+
+
+Unit-testing is used as a synonym for automated testing (x-unit frameworks).
+
+
+Cargo uses “unit” and “integration” terminology to describe Rust-specific properties of the compilation model, which is orthogonal to the traditional, however fuzzy, meaning of these terms.
+
+
+
Most of the time, it’s more productive to speak about just “tests”, or maybe “automated tests”, rather than argue whether something should be considered a unit or an integration test.
+
But I argue that a useful, more precise classification exists.
The first axis of classification is, broadly speaking, performance.
+“How much time would a thousand similar tests take?” is a very useful metric.
+The dependency between the time from making an edit to getting the test results and most other interesting metrics in software (performance, time to fix defects, security) is super-linear.
+Tests longer than attention span obliterate productivity.
+
It’s useful to take a closer look at what constitutes a performant test.
+One non-trivial observation here is that test speed is categorical, rather than numerical.
+Certain tests are order-of-magnitude slower than others.
+Consider the following list:
+
+
+Single-threaded pure computation
+
+
+Multi-threaded parallel computation
+
+
+Multi-threaded concurrent computation with time-based synchronization and access to disk
+
+
+Multi-process computation
+
+
+Distributed computation
+
+
+
+Each step of this ladder adds half an order of magnitude to a test’s runtime.
+
Time is not the only thing affected — the higher you go, the bigger is the fraction of flaky tests.
+It’s nigh impossible to make a test for a pure function flaky.
+If you add threads into the mix, keeping flakiness out requires some careful thinking about synchronization.
+And if the test spans several processes, it is almost bound to fail under some more unusual circumstances.
+
Yet another effect we observe along this axis is resilience to unrelated changes.
+The more of the operating system and other processes are involved in the test, the higher the probability that some upgrade somewhere breaks something.
+
I think the “purity” concept from functional programming is a good way to generalize this axis of the differences between the tests.
+Pure tests do little-to-no IO; they are independent of timings and environment.
+Less pure tests do more of the impure things.
+Purity is correlated with performance, repeatability and stability.
+Test purity is non-binary, but it is mostly discrete.
+Threads, time, file-system, network, processes are the notches to think about.
The second axis is the fraction of the code which gets exercised, potentially indirectly, by the test.
+Does the test exercise only the business logic module, or is the database API and the HTTP handling also required?
+This is distinct from performance: running more code doesn’t mean that the code will run slower.
+An infinite loop takes very little code.
+What affects performance is not whether tests for business logic touch persistence, but whether, in tests, persistence is backed by an in-memory hash-map or by an out-of-process database server.
+
The “extent” of the tests is a good indicator of the overall architecture of the application, but usually it isn’t a worthy metric to optimize by itself.
+On the contrary, artificially limiting the extent of tests by mocking your own code (as opposed to mocking impure IO) reduces fidelity of the tests, and makes the code more brittle in the face of refactors.
+
One potential exception here is the impact on compilation time.
+In a layered application A < B < C, it’s possible to test A either through its interface to B (small-extent test) or by driving A indirectly through C.
+The latter has a problem that, after changing A, running tests might require, depending on the language, rebuilding B and C as well.
+
+
Summing up:
+
+
+Don’t think about tests in terms of opposition between unit and integration, whatever that means. Instead,
+
+
+Think in terms of test’s purity and extent.
+
+
+Purity corresponds to the amount of generalized IO the test is doing and is correlated with desirable metrics, namely performance and resilience.
+
+
+Extent corresponds to the amount of code the test exercises. Extent somewhat correlates with impurity, but generally does not directly affect performance.
+
+
+
And, the prescriptive part:
+
+
+Ruthlessly optimize purity, moving one step down on the ladder of impurity gives huge impact.
+
+
+Generally, just let the tests have their natural extent. Extent isn’t worth optimizing by itself, but it can tell you something about your application’s architecture.
+
+
+
If you enjoyed this post, you might like How to Test as well.
+It goes further in the prescriptive direction, but, when writing it, I didn’t have the two dimensional purity-extent vocabulary yet.
+
+
As I’ve said, this framing is lifted from the SWE book.
+There are two differences, one small and one big.
+The small difference is that the book uses “size” terminology in place of “purity”.
+The big difference is that the second axis is different: rather than looking at which fraction code gets exercised by the test, the book talks about test “scope”: how large is the bit we are actually testing?
+
I do find scope concept useful to think about!
+And, unlike extent, keeping most tests focused is a good active prescriptive advice.
+
I however find the scope concept a bit too fuzzy for actual classification.
+
Consider this test from rust-analyzer, which checks that we can complete a method from a trait if the trait is implemented:
+
+
+
I struggle with determining the scope of this test.
+On the one hand, this clearly tests very narrow, very specific scenario.
+On the other hand, to make this work, all the layers of the system have to work just right.
+The lexer, the parser, name resolution and type checking all have to be prepared for incomplete code.
+This test tests not so much the completion logic itself, as all the underlying infrastructure for semantic analysis.
+
The test is very easy to classify in the purity/extent framework.
+It’s 100% pure — no IO, just a single thread.
+It has maximal extent — the test exercises the bulk of the rust-analyzer codebase; the only thing that isn’t touched here is the LSP itself.
+
Also, as a pitch for the How to Test post, take a second to appreciate how simple the test is, considering that it tests an error-resilient, highly incremental compiler :)
This is going to be a philosophical post, vaguely about language design, and vaguely about Rust.
+If you’ve been following this blog for a while, you know that one theme I consistently hammer at is that of boundaries.
+This article is no exception!
The most important boundary for a software project is its external interface, that which the users directly interact with and which you give backwards compatibility guarantees for.
+For a web-service, this would be the URL scheme and the shape of JSON request and responses.
+For a command line application — the set and the meaning of command-line flags.
+For an OS kernel — the set of syscalls (Linux) or the blessed user-space libraries (Mac).
+And, for a programming language, this would be the definition of the language itself, its syntax and semantics.
+
Sometimes, however, it is beneficial to install somewhat artificial, internal boundaries, a sort-of macro level layers pattern.
+Boundaries have a high cost.
+They prevent changes.
+But a skillfully placed internal (or even an artificial external) boundary can also help.
+
It cuts the system in two, and, if the cut is relatively narrow in comparison to the overall size of the system (hourglass shape), this boundary becomes a great way to understand the system.
+Understanding just the boundary allows you to imagine how the subsystem beneath it could be implemented.
+Most of the time, your imaginary version would be pretty close to what actually happens, and this mental map would help you a great deal to peel off the layers of glue code and get a gut feeling for where the core logic is.
+
Even if an internal boundary starts out in the right place, it, unlike an external one, is ever in danger of being violated.
+“Internal boundary” is a very non-physical thing, most of the time it’s just informal rules like “module A shall not import module B”.
+It’s very hard to notice that something is not being done!
+That’s why, I think, larger companies can benefit from a microservices architecture: in theory, if we just solve the human coordination problem, a monolith can be architected just as cleanly, while offering much better performance.
+In practice, at sufficient scale, maintaining good architecture across teams is hard, and becomes much easier if the intended internal boundaries are reified as processes.
+
It’s hard enough to protect from accidental breaching of internal boundaries.
+But there’s a bigger problem: often, internal boundaries stand in the way of user-visible system features, and it takes a lot of authority to protect internal system’s boundary at the cost of not shipping something.
+
In this post, I’d like to catalog some of the cases I’ve seen in the Rust programming language where I think internal boundaries were eroded with time.
It’s a somewhat obscure feature of Rust’s name resolution, but various things that inhabit Rust’s scopes (structs, modules, traits, variables) are split into three namespaces: types, values and macros.
+This makes it possible to have two things with the same name in the same scope without causing a conflict:
+
+
+
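For instance, a minimal example along these lines (my own reconstruction, not necessarily the post's original snippet):

```rust
#![allow(non_camel_case_types)]

// `x` the struct lives in the type namespace...
struct x {}

// ...while `x` the function lives in the value namespace, so the two coexist.
fn x() -> x {
    x {}
}
```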
The above is legal Rust, because the x struct lives in the types namespace, while the x function lives in the values namespace.
+The namespaces are reflected syntactically: . is used to traverse the value namespace, while :: traverses types.
+
Except that this is only almost a rule.
+There are some cases where the compiler gives up on clear syntax-driven namespacing rules and just does ad-hoc disambiguation.
+For example:
+
+
+
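A snippet matching the description below could look like this (my reconstruction, so the original may have differed in detail):

```rust
use std::str;

fn main() {
    let s: &str = "hello";
    let n = str::len(s);                       // `str` the primitive type
    let w = str::from_utf8(b"world").unwrap(); // `str` the module
    assert_eq!((n, w), (5, "world"));
}
```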
Here, the str in &str and str::len is the str type, from the type namespace.
+The two other strs are the str module.
+In other words, str::len is a method of the str type, while str::from_utf8 is a free-standing function in the str module.
+Like types, modules inhabit the types namespace, so normally the code here would cause a compilation error.
+The compiler (and rust-analyzer) just hacks around the primitive types case.
+
Another recently added case is that of const generics.
+Previously, the T in foo::<T>() was a syntactically-unambiguous reference to something from the types namespace.
+Today, it can refer either to a type or to a value.
+This begs the question: is splitting type and value namespaces a good idea?
+If we have to disambiguate anyway, perhaps we could have just a single namespace and avoid introducing a second lookup syntax?
+That is, just use std.collections.HashMap;.
+
I think these namespace aspirations re-enact similar developments from C.
+I haven’t double-checked my history here, so take the following with a grain of salt and do your own research before quoting, but I think that C, in its initial versions, used to have a very strict syntactic separation between types and values.
+That’s why you are required to write struct when declaring a local variable of struct type:
+
+
+
The struct keyword tells the parser that it is parsing a type, and, therefore, a declaration.
+But then, at a later point, typedefs were added, and so the parser was taught to disambiguate types and values via the lexer hack:
Rust has separate grammatical categories for patterns and expressions.
+It used to be the case that any utterance can be unambiguously classified, depending solely on the syntactic context, as either an expression or a pattern.
+But then a minor exception happened:
+
+
+
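Roughly this kind of code (a sketch of mine, not the original example):

```rust
fn describe(x: Option<i32>) -> &'static str {
    match x {
        None => "nothing",  // `None` resolves to the existing constant Option::None
        none => {           // `none` introduces a brand new binding
            let _ = none;
            "something"
        }
    }
}
```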
Syntactically, None and none are indistinguishable.
+But they play quite different roles: None refers to the Option::None constant, while none introduces a fresh binding into the scope.
+Swift elegantly disambiguates the two at the syntax level, by requiring a leading . for enum variants.
+Rust just hacks this at the name-resolution layer, by defaulting to a new binding unless there’s a matching constant in the scope.
+
+Recently, the scope of the hack was increased greatly: with destructuring assignment implemented, an expression can now be re-classified as a pattern:
+
+
+
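For instance (my illustration of the feature):

```rust
fn main() {
    let (mut a, mut b) = (1, 2);
    // Syntactically, the lhs `(b, a)` is a tuple expression;
    // semantically it is re-checked as a pattern.
    (b, a) = (a, b);
    assert_eq!((a, b), (2, 1));
}
```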
Syntactically, = is a binary expression, so both the left hand side and the right hand side are expressions.
+But now the lhs is re-interpreted as a pattern.
+
So perhaps the syntactic boundary between expressions and patterns is a fake one, and we should have used unified expression syntax throughout?
A boundary which stands intact is the class of the grammar.
+Rust is still an LL(k) language: it can be parsed using a straightforward single-pass algorithm which doesn’t require backtracking.
+The cost of this boundary is that we have to type .collect::<Vec<_>>() rather than .collect<Vec<_>>() (nowadays, I type just .collect() and use the light-bulb to fill-in the turbofish).
Another recent development is the erosion of the boundary between the lexer and the parser.
+Rust has tuple structs, and uses the cutesy .0 syntax to access numbered fields.
+This is problematic for nested tuple structs.
+They need syntax like foo.1.2, but to the lexer this string looks like three tokens: foo, ., 1.2.
+That is, 1.2 is a floating point number, 6/5.
+So, historically one had to write this expression as foo.1 .2, with a meaningful whitespace.
+
Today, this is hacked in the parser, which takes the 1.2 token from the lexer, inspects its text and further breaks it up into 1, . and 2 tokens.
+
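To make this concrete, a small illustration (mine, not from the original post):

```rust
struct Foo(u32, (u32, u32, u32));

fn main() {
    let foo = Foo(1, (2, 3, 4));
    // To the lexer, `foo.1.2` is `foo`, `.`, `1.2`, where `1.2` is one float token;
    // the parser re-splits that token. Historically you had to write `foo.1 .2`.
    assert_eq!(foo.0, 1);
    assert_eq!(foo.1.2, 4);
}
```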
The last example is quite interesting: in Rust, unlike many programming languages, the separation between the lexer and the parser is not an arbitrary internal boundary, but is actually a part of an external, semver protected API.
+Tokens are the input to macros, so macro behavior depends on how exactly the input text is split into tokens.
+
And there’s a second boundary violation here: in theory, “token” as seen by a macro is just its text plus hygiene info.
+In practice though, to implement captures in macros by example ($x:expr things), a token could also be a fully-formed fragment of the compiler’s internal AST data structure.
+The API is carefully future proofed such that, as soon as the macro looks at such a magic token, it gets decomposed into underlying true tokens, but there are some examples where the internal details leak via changes in observable behavior.
To end this on a more positive note, here’s one pretty important internal boundary which is holding up pretty well.
+In Rust, lifetimes don’t affect code generation.
+In fact, lifetimes are fully stripped from the data which is passed to codegen.
+This is pretty important: although the inferred lifetimes are opaque and hard to reason about, you can be sure that, for example, the exact location where a value is dropped is independent from the whims of the borrow checker.
+
+
Conclusion: not really? It seems that we are generally overly-optimistic about internal boundaries, and they seem to crumble under the pressure of feature requests, unless the boundary in question is physically reified (please don’t take this as an endorsement of microservice architecture for compilers).
This is a sequel to Notes on Paxos post.
+Similarly, the primary goal here is for me to understand in detail why the BFT consensus algorithm works.
+This might, or might not be useful for other people!
+The Paxos article is a prerequisite, best to read that now, and return to this article tomorrow :)
+
Note also that while Paxos was more or less a direct translation of Lamport’s lecture, this post is a mish-mash of the original BFT paper by Liskov and Castro, my own thinking, and a cursory glance at this formalization.
+As such, the probability that there are no mistakes here is quite low.
BFT stands for Byzantine Fault Tolerant consensus.
+Similarly to Paxos, we imagine a distributed system of computers communicating over a faulty network which can arbitrarily reorder, delay, and drop messages.
+And we want computers to agree on some specific choice of value among the set of possibilities, such that any two computers pick the same value.
+Unlike Paxos though, we also assume that computers themselves might be faulty or malicious.
+So, we add a new condition to our list of bad things.
+Besides reordering, duplication, delaying and dropping, fake messages can now also be manufactured out of thin air.
+
Of course, if absolutely arbitrary messages can be forged, then no consensus is possible — each machine lives in its own solipsistic world which might be completely unlike the world of every other machine.
+So there’s one restriction — messages are cryptographically signed by the senders, and it is assumed that it is impossible for a faulty node to impersonate non-faulty one.
+
Can we still achieve consensus?
+We can, as long as for every f faulty, malicious nodes we have at least 2f + 1 honest ones.
+
Similarly to the Paxos post, we will capture this intuition into a precise mathematical statement about trajectories of state machines.
Our plan is to start with vanilla Paxos, and then patch it to allow byzantine behavior.
+Here’s what we’ve arrived at last time:
+
+
+
Our general idea is to add some “evil” acceptors 𝔼 to the mix and allow them to send arbitrary messages, while at the same time making sure that the subset of “good” acceptors continues to run Paxos.
+What makes this complex is that we don’t know which acceptors are good and which are bad.
+So this is our setup:
+
+
+
If previously the quorum condition was “any two quorums have an acceptor in common”, it is now “any two quorums have a good acceptor in common”.
+An alternative way to say that is “a byzantine quorum is a super-set of normal quorum”, which corresponds to the intuition where we are running normal Paxos, and there are just some extra evil guys whom we try to ignore.
+For Paxos, we allowed f faulty out of 2f + 1 total nodes with f+1 quorums.
+For Byzantine Paxos, we’ll have f byzantine nodes out of 3f + 1 total, with 2f + 1 quorums.
+As I’ve said, if we forget about the byzantine folks, we get exactly the f + 1 out of 2f + 1 picture of normal Paxos.
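To double-check the arithmetic: with 3f + 1 acceptors in total and quorums of size 2f + 1, any two quorums overlap in at least (2f + 1) + (2f + 1) − (3f + 1) = f + 1 acceptors, and since at most f of those can be byzantine, the intersection always contains at least one good acceptor.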
+
The next step is to determine behavior for byzantine nodes.
+They can send any message, as long as they are the author:
+
+
+
That is, a byzantine acceptor can send any 1a or 2a message at any time, while for 1b and 2b the author should match.
+
What breaks?
+The most obvious thing is Phase2b, that is, voting.
+In Paxos, as soon as an acceptor receives a 2a message, it votes for it.
+The correctness of Paxos hinges on the Safe check before we send 2a message, but a Byzantine node can send an arbitrary 2a.
+
The solution here is natural: rather than blindly trust 2a messages, acceptors would themselves double-check the safety condition, and reject the message if it doesn’t hold:
+
+
+
Implementation wise, this means that, when a coordinator sends a 2a, it also wants to include 1b messages proving the safety of 2a.
+But in the spec we can just assume that all messages are broadcasted, for simplicity.
+Ideally, for correct modeling you also want to model how each acceptor learns new messages, to make sure that negative reasoning about a certain message not being sent doesn’t creep in, but we’ll avoid that here.
+
However, just re-checking safety doesn’t fully solve the problem.
+It might be the case that several values are safe at a particular ballot (indeed, in the first ballot any value is safe), and it is exactly the job of a coordinator / 2a message to pick one value to break the tie.
+And in our case a byzantine coordinator can send two 2a for different valid values.
+
And here we’ll make the single non-trivial modification to the algorithm.
+Just as the Safe condition is at the heart of Paxos, the Confirmed condition is the heart here.
+
So basically we expect a good coordinator to send just one 2a message, but a bad one can send many.
+And we want to somehow distinguish the two cases.
+One way to do that is to broadcast ACKs for 2a among acceptors.
+If I received a 2a message, checked that the value therein is safe, and also know that everyone else received this same 2a message, I can safely vote for the value.
+
So we introduce a new message type, 2ac, which confirms a valid 2a message:
+
+
+
Naturally, evil acceptors can confirm whatever:
+
+
+
+But, if we get a quorum of confirmations, we can be sure that no other value will be confirmed in a given ballot (each good acceptor confirms at most a single message in a ballot (and we need a bit of state for that as well))
+
+
+
Putting everything so far together, we get
+
+
+
In the above, I’ve also removed phases 1a and 2a, as byzantine acceptors are allowed to send arbitrary messages as well (we’ll need explicit 1a/2a for liveness, but we won’t discuss that here).
+
+The most important conceptual addition is Phase2ac — if an acceptor receives a new 2a message for some ballot with a safe value, it sends out the confirmation, provided that it hasn’t done that already.
+In Phase2b we can then vote for confirmed values: confirmation by a quorum guarantees both that the value is safe at this ballot, and that it is the single value that can be voted for in this ballot (two different values can’t be confirmed in the same ballot, because quorums have an honest acceptor in common).
+This almost works, but there’s still a problem.
+Can you spot it?
+
The problem is in the Safe condition.
+Recall that the goal of the Safe condition is to pick a value v for ballot b, such that, if any earlier ballot b1 concludes, the value chosen in b1 would necessarily be v.
+The way Safe works for ballot b in normal Paxos is that the coordinator asks a certain quorum to abstain from further voting in ballots earlier than b, collects existing votes, and uses those votes to pick a safe value.
+Specifically, it looks at the vote for the highest-numbered ballot in the set, and declares a value from it as safe (it is safe: it was safe at that ballot, and for all future ballots there’s a quorum which abstained from voting).
+
This procedure puts a lot of trust in that highest vote, which makes it vulnerable.
+An evil acceptor can just say that it voted in some high ballot, and force the choice of an arbitrary value.
+So, we need some independent confirmation that the vote was cast for a safe value.
+And we can re-use 2ac messages for this:
+
+
+
And … that’s it, really.
+Now we can sketch a proof that this thing indeed achieves BFT consensus, because it actually models normal Paxos among non-byzantine acceptors.
+
Phase1a messages of Paxos are modeled by Phase1a messages of BFT Paxos, as they don’t have any preconditions, the same goes for Phase1b.
+Phase2a message of Paxos is emitted when a value becomes confirmed in BFT Paxos.
+This is correct modeling, because BFT’s Safe condition models normal Paxos Safe condition (this … is a bit inexact I think, to make this exact, we want to separate “this value is safe” from “we are voting for this value” in original Paxos as well).
+Finally, Phase2b also displays direct correspondence.
+
As a final pop-quiz, I claim that the Confirmed(m.vote.bal, v) condition in Safe above can be relaxed.
+As stated, Confirmed needs a byzantine quorum of confirmations, which guarantees both that the value is safe and that it is the single confirmed value, which is a bit more than we need here.
+Do you see what would be enough?
This post is a case study of writing a Rust application using only minimal, artificially constrained API (eg, no dynamic memory allocation).
+It assumes a fair bit of familiarity with the language.
The back story here is a particular criticism of Rust and C++ from hard-core C programmers.
+This criticism is aimed at RAII— the language-defining feature of C++, which was wholesale imported to Rust as well.
+RAII makes using various resources requiring cleanups (file descriptors, memory, locks) easy — any place in the program can create a resource, and the cleanup code will be invoked automatically when needed.
+And herein lies the problem — because allocating resources becomes easy, RAII encourages a sloppy attitude to resources, where they are allocated and destroyed all over the place.
+In particular, this leads to:
+
+
+Decrease in reliability. Resources are usually limited in principle, but actual resource exhaustion happens rarely.
+If resources are allocated throughout the program, there are many virtually untested codepaths.
+
+
+Lack of predictability. It is usually impossible to predict up-front how many resources the program will consume.
+Instead, resource-consumption is observed empirically.
+
+
+Poor performance. Usually, it is significantly more efficient to allocate and free resources in batches.
+Cleanup code for individual resources is scattered throughout the codebase, increasing code bloat.
+
+
+Spaghetti architecture. Resource allocation is an architecturally salient thing.
+If all resource management is centralized to a single place, it becomes significantly easier to understand the lifecycle of resources.
+
+
+
I think this is a fair criticism.
+In fact, I think this is the same criticism that C++ and Rust programmers aim at garbage collected languages.
+This is a spectrum:
+
+
+
Rust programmers typically are not exposed to the lowest level of this pyramid.
+But there’s a relatively compact exercise to gain the relevant experience: try re-implementing your favorite Rust programs on hard mode.
+
Hard Mode means that you split your program into an std binary and a #![no_std] no-alloc library.
+Only the small binary is allowed to directly ask OS for resources.
+For the library, all resources must be injected.
+In particular, to do memory allocation, the library receives a slice of bytes of a fixed size, and should use that for all storage.
+Something like this:
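Schematically, something like this (the names below are mine, not the actual crt API):

```rust
// lib.rs: #![no_std], no allocator; every resource is injected by the caller.
#![no_std]

pub struct Error;

pub fn render(memory: &mut [u8], scene: &str) -> Result<(), Error> {
    // All storage is carved out of `memory`; the library never asks the OS for anything.
    let _ = (memory, scene);
    Ok(())
}

// main.rs (the std binary, not shown): allocates one big Vec<u8> up front,
// reads the input, and hands both to `render`.
```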
So, this is what the post is about: my experience implementing a toy hard mode ray tracer.
+You can find the code on GitHub: http://github.com/matklad/crt.
+
The task of a ray tracer is to convert a description of a 3D scene like the following one:
+
+
+
Into a rendered image like this:
+
+
+
Conceptually, this works rather intuitively.
+First, imagine the above scene, with an infinite fuchsia colored plane and a red Utah teapot hovering above that.
+Then, imagine a camera standing at 0,10,-50 (in cartesian coordinates) and aiming at the origin.
+Now, draw an imaginary rectangular 80x60 screen at a focus distance of 50 from the camera along its line of sight.
+To get a 2D picture, we shoot a ray from the camera through each “pixel” on the screen, note which object in the scene is hit (plane, teapot, background), and color the pixel accordingly.
+See PBRT Book if you feel like falling further into this particular rabbit hole (warning: it is very deep) (I apologize for “little square pixels” simplification I use throughout the post :-) ).
+
I won’t focus on specific algorithms to implement that (indeed, crt is a very naive tracer), but rather highlight Hard Mode Rust specific concerns.
Ultimately, the output of a ray tracer is a 2D buffer with 8-bit RGB pixels.
+One would typically represent it as follows:
+
+
+
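That is, something along these lines (a sketch; the field names are my guess):

```rust
#[derive(Clone, Copy)]
pub struct Color {
    pub r: u8,
    pub g: u8,
    pub b: u8,
}

pub struct Buf {
    dim: [u32; 2],
    // row-major, dim[0] * dim[1] pixels, owned by the buffer itself
    data: Box<[Color]>,
}
```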
For us, we want someone else (main) to allocate that box of colors for us, so instead we do the following:
+
+
+
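Roughly like this, reusing Color from above (again, field names are my guess):

```rust
pub struct Buf<'m> {
    dim: [u32; 2],
    // borrowed from the big slice of memory that `main` hands to the library
    data: &'m mut [Color],
}
```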
The 'm lifetime is the one we use for the abstract memory managed elsewhere.
+Note how the struct grew an extra lifetime!
+This is the extra price we have to pay for not relying on RAII to clean up resources for us:
+
+
+
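The shape of the problem, roughly (the actual fields of Ctx in crt differ; what matters is the lifetime structure):

```rust
pub struct Ctx<'a, 'm> {
    buf: &'a mut Buf<'m>,
    // ...plus whatever other borrowed state rendering needs
}
```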
Note in particular how the Ctx struct now has to include two lifetimes.
+This feels unnecessary: 'a is shorter than 'm.
+I wish it was possible to somehow abstract that away:
+
+
+
I don’t think that’s really possible (earlier post about this).
+In particular, the following would run into variance issues:
+
+
+
Ultimately, this is annoying, but not a deal breaker.
+
With this rgb::Buf<'_>, we can sketch the program:
Ray tracing is an embarrassingly parallel task — the color of each output pixel can be computed independently.
+Usually, the excellent rayon library is used to take advantage of parallelism, but for our raytracer I want to show a significantly simpler API design for taking advantage of many cores.
+I’ve seen this design in Sorbet, a type checker for Ruby.
+
Here’s how a render function with support for parallelism looks:
+
+
+
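A sketch reconstructed from the description that follows; the trait name and signatures are my guesses rather than crt's exact API, and the real function writes pixels instead of merely counting rows:

```rust
use std::sync::atomic::{AtomicUsize, Ordering};

pub trait ThreadPool {
    /// Runs `job` once on every worker thread and blocks until all of them return.
    fn in_parallel(&self, job: &(dyn Fn() + Sync));
}

pub fn render(pool: &dyn ThreadPool, n_rows: usize) {
    // A shared cursor: each thread repeatedly grabs the next unprocessed row.
    let next_row = AtomicUsize::new(0);
    pool.in_parallel(&|| loop {
        let row = next_row.fetch_add(1, Ordering::Relaxed);
        if row >= n_rows {
            break;
        }
        // ...trace every pixel of `row` here...
    });
}
```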
The interface here is the in_parallel function, which takes another function as an argument and runs it, in parallel, on all available threads.
+You typically use it like this:
+
+
+
This is similar to a typical threadpool, but different.
+Similar to a threadpool, there’s a number of threads (typically one per core) which execute arbitrary jobs.
+The first difference is that a typical threadpool sends a job to a single thread, while in this design the same job is broadcast to all threads.
+The job is Fn + Sync rather than FnOnce + Send.
+The second difference is that we block until the job is done on all threads, so we can borrow data from the stack.
+
+It’s on the caller to explicitly implement a concurrent queue to distribute specific work items.
+In my implementation, I slice the image into rows:
+
+
+
In main, we implement a concrete ThreadPool by spawning a thread per core:
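With just std and scoped threads, that can look something like this (my sketch, using the ThreadPool trait from above):

```rust
struct Threads {
    n_threads: usize, // e.g. from std::thread::available_parallelism()
}

impl ThreadPool for Threads {
    fn in_parallel(&self, job: &(dyn Fn() + Sync)) {
        std::thread::scope(|scope| {
            for _ in 0..self.n_threads {
                scope.spawn(|| job());
            }
            // leaving the scope joins all the spawned threads
        });
    }
}
```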
The scenes we are going to render are fundamentally dynamically sized.
+They can contain an arbitrary number of objects.
+So we can’t just statically allocate all the memory up-front.
+Instead, there’s a CLI argument which sets the amount of memory a ray tracer can use, and we should either manage with that, or return an error.
+So we do need to write our own allocator.
+But we’ll try very hard to only allocate the memory we actually need, so we won’t have to implement memory deallocation at all.
+So a simple bump allocator would do:
+
+
+
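In outline (the field name is mine):

```rust
pub struct Mem<'m> {
    raw: &'m mut [u8],
}

impl<'m> Mem<'m> {
    pub fn new(raw: &'m mut [u8]) -> Mem<'m> {
        Mem { raw }
    }
}
```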
We can create an allocator from a slice of bytes, and then ask it to allocate values and arrays.
+Schematically, alloc looks like this:
+
+
+
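Roughly like this; as noted right below, alignment (and graceful out-of-memory handling) is elided, so treat it as a schematic rather than the real thing:

```rust
impl<'m> Mem<'m> {
    pub fn alloc<T>(&mut self, value: T) -> &'m mut T {
        let size = core::mem::size_of::<T>();
        // Chop `size` bytes off the front of the region (panics if we run out).
        let raw = core::mem::take(&mut self.raw);
        let (chunk, rest) = raw.split_at_mut(size);
        self.raw = rest;
        // NB: a real version must also respect align_of::<T>() here.
        let ptr = chunk.as_mut_ptr().cast::<T>();
        unsafe {
            ptr.write(value);
            &mut *ptr
        }
    }
}
```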
To make this fully kosher we need to handle alignment as well, but I cut that bit out for brevity.
+
+For allocating arrays, it’s useful if the all-zeros bit pattern is a valid default instance of the type, as that allows skipping element-wise initialization.
+This condition isn’t easily expressible in today’s Rust though, so we require initializing every array member.
+
The result of an allocation is &'m T— this is how we spell Box<T> on hard mode.
The scene contains various objects, like spheres and planes:
+
+
+
Usually, we’d represent a scene as
+
+
+
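Something like this, with Sphere, Plane and Mesh standing for the shape types mentioned above (a sketch):

```rust
struct Scene {
    spheres: Vec<Sphere>,
    planes: Vec<Plane>,
    meshes: Vec<Mesh>,
}
```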
We could implement a resizable array (Vec), but doing that would require us to either leak memory, or to implement proper deallocation logic in our allocator, and add destructors to reliably trigger that.
+But destructors are exactly the thing we are trying to avoid in this exercise.
+So our scene will have to look like this instead:
+
+
+
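That is, roughly (again a sketch): sizes are fixed at allocation time and everything is borrowed from 'm.

```rust
struct Scene<'m> {
    spheres: &'m mut [Sphere],
    planes: &'m mut [Plane],
    meshes: &'m mut [Mesh<'m>],
}
```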
And that means we want to know the number of objects we’ll need upfront.
+The way we solve this problem is by doing two-pass parsing.
+In the first pass, we just count things, then we allocate them, then we actually parse them into allocated space.
+
+
+
If an error is encountered during parsing, we want to create a helpful error message.
+If the message is fully dynamic, we’d have to allocate it into 'm, but it seems simpler to just re-use bits of the input for the error message.
+Hence, Error<'i> is tied to the input lifetime 'i, rather than to the memory lifetime 'm.
One interesting type of object on the scene is a mesh of triangles (for example, the teapot is just a bunch of triangles).
+A naive way to represent a bunch of triangles is to use a vector:
+
+
+
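Something like this, with Vec3 being the usual three-component vector type (my sketch):

```rust
struct Mesh {
    // each triangle stores its three corners directly
    triangles: Vec<[Vec3; 3]>,
}
```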
This is wasteful: in a mesh, each edge is shared by two triangles.
+So a single vertex belongs to a bunch of triangles.
+If we store a vector of triangles, we are needlessly duplicating vertex data.
+A more compact representation is to store unique vertexes once, and to use indexes for sharing:
+
+
+
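For example (a sketch):

```rust
struct Mesh {
    vertices: Vec<Vec3>,
    // each triangle is three indices into `vertices`
    triangles: Vec<[u32; 3]>,
}
```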
Again, on hard mode that would be
+
+
+
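Which could look like this (my guess at the shape, not necessarily crt's exact fields):

```rust
struct Mesh<'m> {
    vertices: &'m mut [Vec3],
    triangles: &'m mut [[u32; 3]],
}
```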
+And a scene contains a bunch of meshes:
+
+
+
Note how, if the structure is recursive, we have “owned pointers” of &'m mut T<'m> shape.
+Originally I worried that that would cause problems with variance, but it seems to work fine for ownership specifically.
+During processing, you still need &'a mut T<'m> though.
+
And that’s why parsing functions hold an uncomfortable bunch of lifetimes:
+
+
+
The parser p holds &'i str input and a &'a mut Mem<'m> memory.
+It parses input into a &'b mut Mesh<'m>.
With Scene<'m> fully parsed, we can finally get to rendering the picture.
+A naive way to do this would be to iterate through each pixel, shooting a ray through it, and then do a nested iteration over every shape, looking for the closest intersection.
+That’s going to be slow!
+The teapot model contains about 1k triangles, and we have 640*480 pixels, which gives us 307_200_000 ray-triangle intersection tests, which is quite slow even with multithreading.
+
So we are going to speed this up.
+The idea is simple — just don’t intersect a ray with each triangle.
+It is possible to quickly discard batches of triangles.
+If we have a batch of triangles, we can draw a 3D box around them as a pre-processing step.
+Now if the ray doesn’t intersect the bounding box, we know that it can’t intersect any of the triangles.
+So we can use one test with a bounding box instead of many tests for each triangle.
+
This is of course one-sided — if the ray intersects the box, it might still miss all of the triangles.
+But, if we place bounding boxes smartly (small boxes which cover many adjacent triangles), we can hope to skip a lot of work.
+
We won’t go for really smart ways of doing that, and instead will use a simple divide-and-conquer scheme.
+Specifically, we’ll draw a large box around all triangles we have.
+Then, we’ll note which dimension of the resulting box is the longest.
+If, for example, the box is very tall, we’ll cut it in half horizontally, such that each half contains half of the triangles.
+Then, we’ll recursively subdivide the two halves.
+
In the end, we get a binary tree, where each node contains a bounding box and two children, whose bounding boxes are contained in the parent’s bounding box.
+Leaves contain triangles.
+This construction is called a bounding volume hierarchy, bvh.
+
To intersect the ray with bvh, we use a recursive procedure.
+Starting at the root node, we descend into children whose bounding boxes are intersected by the ray.
+Sometimes we’ll have to descend into both children, but often enough at least one child’s bounding box won’t touch the ray, allowing us to completely skip the subtree.
+
On easy mode Rust, we can code it like this:
+
+
+
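One possible easy-mode shape, with boxes for children and vectors for triangle indices (a sketch of mine, not crt's actual definitions):

```rust
struct BoundingBox {
    min: Vec3,
    max: Vec3,
}

enum Bvh {
    Split {
        bb: BoundingBox,
        children: Box<[Bvh; 2]>,
    },
    Leaf {
        bb: BoundingBox,
        // indices of the triangles covered by this leaf
        triangles: Vec<u32>,
    },
}
```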
On hard mode, we don’t really love all those separate boxes, we love arrays!
+So what we’d rather have is
+
+
+
So we want to write the following function which recursively constructs a bvh for a mesh:
+
+
+
The problem is, unlike the parser, we can’t cheaply determine the number of leaves and splits without actually building the whole tree.
So what we are going to do here is to allocate a pointer-tree structure into some scratch space, and then copy that into an &'m mut array.
+How do we find the scratch space?
+Our memory is &'m [u8].
+We allocate stuff from the start of the region.
+So we can split off some amount of scratch space from the end:
+
+
+
Stuff we allocate into the first half is allocated “permanently”.
+Stuff we allocate into the second half is allocated temporarily.
+When we drop temp buffer, we can reclaim all that space.
+
This… probably is the most sketchy part of the whole endeavor.
+It is unsafe, requires lifetime casting, and I actually can’t get it past miri.
+But it should be fine, right?
+
+So, I have the following API:
+
+
+
It can be used like this:
+
+
+
+And here’s how with_scratch is implemented:
+
+
+
With this infrastructure in place, we can finally implement bvh construction!
+We’ll do it in three steps:
+
+
+Split off half the memory into a scratch space.
+
+
+Build a dynamically-sized tree in that space, counting leaves and interior nodes.
+
+
+Allocate arrays of the right size in the permanent space, and copy data over once.
+
+
+
+
+
And that’s it!
+The thing actually works, miri complaints notwithstanding!
Actually, I am impressed.
+I was certain that this wouldn’t actually work out, and that I’d have to write copious amounts of unsafe to get the runtime behavior I want.
+Specifically, I believed that &'m mut T<'m> variance issue would force my hand to add 'm, 'mm, 'mmm and further lifetimes, but that didn’t happen.
+For “owning” pointers, &'m mut T<'m> turned out to work fine!
+It’s only during processing that you might need extra lifetimes.
+Parser<'m, 'i, 'a> is at least two lifetimes more than I am completely comfortable with, but I guess I can live with that.
+
I wonder how far this style of programming can be pushed.
+Aesthetically, I quite like that I can tell precisely how much memory the program would use!
A short post on how to create better troubleshooting documentation, prompted by me spending last evening trying to get the builtin display of my laptop working with Linux.
+
What finally fixed the blank screen for me was this advice from NixOS wiki:
+
+
+
While this particular approach worked, in contrast to a dozen different ones I tried before, I think it shares a very common flaw, which is endemic to troubleshooting documentation.
+Can you spot it?
+
The advice tells you the remedy (“add this kernel parameter”), but it doesn’t explain how to verify that this indeed is the problem.
+That is, if the potential problem is a kernel driver that isn’t loaded, it would really help me to know how to check which kernel driver is in use, so that I can do both:
+
+
+Before adding the parameter, check that 46a6 doesn’t have a driver
+
+
+After the fix, verify that i915 is indeed used.
+
+
+
If a “fix” doesn’t come with a linked “diagnostic”, a very common outcome is:
+
+
+Apply some random fix from the Internet
+
+
+Observe that the final problem (blank screen) isn’t fixed
+
+
+Wonder which of the two is the case:
+
+
+the fix is not relevant for the problem,
+
+
+the fix is relevant, but is applied wrong.
+
+
+
+
+
So, call to action: if you are writing any kind of documentation, before explaining how to fix the problem, teach the user how to diagnose it.
+
When helping with git, start with explaining git log and git status, not with git reset or git reflog.
+
+
While the post might come across as just a tiny bit angry, I want to explicitly mention that I am eternally grateful to all the people who write any kind of docs for using Linux on desktop.
+I’ve been running it for more than 10 years at this point, and I am still completely clueless as to how debug issues from the first principles.
+If not for all of the wikis, stackoverflows and random forum posts out there, I wouldn’t be able to use the OS, so thank you all!
This post contains some inconclusive musing on lightweight markup languages (Markdown, AsciiDoc, LaTeX, reStructuredText, etc).
+The overall mood is that I don’t think a genuinely great markup language exists.
+I wish it did though.
+As an appropriate disclosure, this text is written in AsciiDoctor.
+
EDIT: if you like this post, you should definitely check out https://djot.net.
+
EDIT: welp, that escalated quickly, this post is now written in Djot.
This I think is the big one.
+Very often, a particular markup language is married to a particular output format, either syntactically (markdown supports HTML syntax), or by the processor just not making a crisp enough distinction between the input document and the output (AsciiDoctor).
+
Roughly, if the markup language is for emitting HTML, or PDF, or DocBook XML, that’s bad.
+A good markup language describes an abstract hierarchical structure of the document, and lets a separate program adapt that structure to the desired output.
+
More or less, what I want from markup is to convert a text string into a document tree:
+
+
+
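In Rust-ish terms, the target data structure is something like this (a sketch; the tag and attribute names deliberately carry no output-format semantics):

```rust
struct Document {
    children: Vec<Node>,
}

enum Node {
    Text(String),
    Element {
        tag: String,
        attrs: Vec<(String, String)>,
        children: Vec<Node>,
    },
}

// and the markup processor is, conceptually, `fn parse(text: &str) -> Document`
```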
A markup language which nails this perfectly is HTML.
+It directly expresses this tree structure.
+Various viewers for HTML can then render the document in a particular fashion.
+HTML’s syntax itself doesn’t really care about tag names and semantics: you can imagine authoring HTML documents using an alternative set of tag names.
+
A markup language which completely falls over here is Markdown.
+There’s no way to express generic tree structure, conversion to HTML with specific browser tags is hard-coded.
+
A language which does this half-well is AsciiDoctor.
+
In AsciiDoctor, it is possible to express genuine nesting.
+Here’s a bunch of nested blocks with some inline content and attributes:
+
+
+
The problem with AsciiDoctor is that generic blocks come off as a bit of an implementation detail, not as the foundation.
+It is difficult to untangle presentation-specific semantics of particular blocks (examples, admonitions, etc) from the generic document structure.
+As a fun consequence, a semantic-neutral block (equivalent of a </div>) is the only kind of block which can’t actually nest in AsciiDoctor, due to syntactic ambiguity.
Syntax matters.
+For lightweight text markup languages, syntax is of utmost importance.
+
The only right way to spell a list is
+
+
+
Not
+
+
+
And most definitely not
+
+
+
Similarly, you lose if you spell links like this:
+
+
+
Markdown is the trailblazer here, it picked a lot of great concrete syntaxes.
+Though, some choices are questionable, like trailing double space rule, or the syntax for including images.
+
AsciiDoctor is the treasure trove of tasteful syntactic decisions.
For example http://example.com[] gets parsed as a link to http://example.com, and the converter knows basic url schemes.
+And of course there’s a generic link syntax for corner cases where a URL syntax isn’t a valid AsciiDoctor syntax:
+
+
+
(image: produces an inline element, while image:: emits a block. Again, this isn’t hard-coded to images, it is a generic syntax for whatever::).
To convert our nice, sweet syntax into a general tree and then into the final output, we need some kind of a tool.
+One way to do that is by direct translation from our source document to, eg, html.
+
Such one-step translation is convenient for all-inclusive tools, but is a barrier for extensibility.
+Amusingly, AsciiDoctor is both a positive and a negative example here.
+
On the negative side of things, classical AsciiDoctor is an extensible Ruby processor.
+To extend it, you essentially write a “compiler plugin” — a bit of Ruby code which hooks into the main processor and gets invoked as a callback when certain “tags” are parsed.
+This plugin interacts with the Ruby API of the processor itself, and is tied to a particular toolchain.
+
In contrast, asciidoctor-web-pdf, a newer thing (which nonetheless uses the same Ruby core), approaches the task a bit differently.
+There’s no API to extend the processor itself.
+Rather, the processor produces an abstract document tree, and then a user-supplied JavaScript function can convert that piece of data into whatever html it needs, by following a lightweight visitor pattern.
+I think this is the key to a rich ecosystem: strictly separate converting input text to an abstract document model from rendering the model through some template.
+The two parts could be done by two separate processes which exchange serialized data.
+It’s even possible to imagine some canonical JSON encoding of the parsed document.
+
+There’s one more behavior where the all-inclusive approach of AsciiDoctor gets in the way of doing the right thing.
+AsciiDoctor supports includes, and they are textual, preprocessor includes, meaning that syntax of the included file affects what follows afterwards.
+A much cleaner solution would have been to keep includes in the document tree as distinct nodes (with the path to the included file as an attribute), and leave it to the output layer to interpret those as either verbatim text, or subdocuments.
+
Another aspect of composability is that the parsing part of the processing should have, at minimum, a lightweight, embeddable implementation.
+Ideally, of course, there’s a spec and an array of implementations to choose from.
+
+Markdown fares fairly well here: there never was a shortage of implementations, and today we even have a bunch of different specs!
+
AsciiDoctor…
+Well, I am amazed.
+The original implementation of AsciiDoc was in Python.
+AsciiDoctor, the current tool, is in Ruby.
+Neither is too embeddable.
+But! AsciiDoctor folks are crazy, they compiled Ruby to JavaScript (and Java), and so the toolchain is available on JVM and Node.
+At least for Node, I can confidently say that that’s a real production-ready thing which is quite convenient to use!
+Still, I’d prefer a Rust library or a small WebAssembly blob instead.
+
A different aspect of composability is extensibility.
+In Markdown land, when Markdown doesn’t quite do everything needed (i.e., in 90% of cases), the usual answer is to extend the concrete syntax.
+This is quite unfortunate, changing syntax is hard.
+A much better avenue I think is to take advantage of the generic tree structure, and extend the output layer instead.
+Tree-with-attributes should be enough to express whatever structure is needed, and then it’s up to the converter to pattern-match this structure and emit its special thing.
+
Do you remember the fancy two-column rendering above with source-code on the left, and rendered document on the right?
+This is how I’ve done it:
+
+
+
That is, a generic block, with .two-col attribute and two children — a listing block and a list.
+Then there’s a separate css which assigns an appropriate flexbox layout for .two-col elements.
+There’s no need for special “two column layout” extension.
+It would be perhaps nice to have a dedicated syntax here, but just re-using generic -- block is quite ok!
Not quite there, I would think!
+AsciiDoctor at least half-ticks quite a few of the checkboxes, but it is still not perfect.
+
There is a specification in progress, I have high hopes that it’ll spur alternative implementations (and most of AsciiDoctor problems are implementation issues).
+At the same time, I am not overly-optimistic.
+The overriding goal for AsciiDoctor is compatibility, and rightfully so.
+There’s a lot of content already written, and I would hate to migrate this blog, for example :)
+
At the same time, there are quite a few rough edges in AsciiDoctor:
+
+
+includes
+
+
+non-nestable generic blocks
+
+
+many ways to do certain things (AsciiDoctor essentially supports the union of Markdown and AsciiDoc concrete syntaxes)
+
+
+lack of some concrete sugar (reference-style links are notably better in Markdown)
+
+
+
It feels like there’s a smaller, simpler language somewhere (no, I will not link that xkcd for once (though xkcd:927[] would be a nice use of AsciiDoctor extensibility))
+
On the positive side of things, it seems that in the recent years we built a lot of infrastructure to make these kinds of projects more feasible.
+
Rust is just about the perfect language to take a String from a user and parse it into some sort of a tree, while packaging the whole thing into a self-contained zero-dependency, highly
+embeddable, reliable, and reusable library.
+
WebAssembly greatly extends reusability of low-level libraries: between a static library with a C ABI, and a .wasm module, you got all important platforms covered.
+
True extensibility fundamentally requires taking code as input data.
+A converter from a great markup language to HTML should accept some user-written script file as an argument, to do fine tweaking of the conversion process.
+WebAssembly can be a part of the solution, it is a toolchain-neutral way of expressing computation.
+But we have something even more appropriate.
+Deno, with its friendly scripting language, nice template literals, and a capabilities-based security model, is just about the perfect runtime for implementing a static site generator which takes a bunch of input documents and a custom conversion script, and outputs a bunch of HTML files.
+
If I didn’t have anything else to do, I’d certainly be writing my own lightweight markup language today!
The genre of this post is: “I am having opinions on something I am not an expert at, so hopefully the Internet would correct me”.
+
The specific question in question is:
+
+
+
I am not a web developer, but I do have a blog where I write CSS myself, and I very much want to do the right thing.
+I was researching and agonizing over this question for years, as I wasn’t able to find a conclusive argument one way or another.
+So I am writing one.
+
This isn’t ideal, but I am lazy, so this post assumes that you already did the research and understand the mechanics of and the difference between px, em, and rem.
+And so, you position is probably:
+
+
+
Although there are buts:
+
But the default font-size is 16px, and that’s just too small.
+If you just roll with the intended defaults, then the text will be painful to read even for folks with great vision!
+
But default font-size of x pixels just doesn’t make sense: the actual perceived font size very much depends on the font itself.
+At 16px, some fonts will be small, some tiny, and some maybe even just about right.
+
But the recommended way to actually use rem boils down to setting a percentage font-size for the root element, such that 1rem is not the intended “font size of the root element”, but is equal to 1px (under default settings).
+Which, at this point, sounds like using pixels, just with more steps?
+After all, the modern browsers can zoom the pixels just fine?
+
So, yeah, lingering doubts…
+If you are like me, you painstakingly used rem’s everywhere, and then html { font-size: 22px } because default is unusable, and percentage of default is stupidly ugly :-)
+
+
So lets settle the question then.
+
The practical data we want is what do the users actually do in practice?
+Do they zoom or do they change default font size?
+I have spent 10 minutes googling that, didn’t find the answer.
+
After that, I decided to just check how it actually works.
+So, I opened browser’s settings, cranked the font size to the max, and opened Google.
+
To be honest, that was the moment where the question was mentally settled for me.
+If Google’s search page doesn’t respect the user-agent’s default font-size, that is indirect, but also very strong, evidence that that’s not a meaningful thing to do.
+
The result of my ad-hoc survey:
+
+
+
Don’t care:
+
+
+
+Google
+
+
+Lobsters
+
+
+Hackernews
+
+
+Substack
+
+
+antirez.com
+
+
+tonsky.me
+
+
+New Reddit
+
+
+
+
+
+
+
+
Embiggen:
+
+
+
+Wikipedia
+
+
+Discourse
+
+
+Old Reddit
+
+
+
+
+
+
Google versus Wikipedia it is, eh?
+But this is actually quite informative: if you adjust your browser’s default font-size, you are in an “Alice in the Wonderland” version of the web which alternates between too large and too small.
+
The next useful question is: what about mobile?
+After some testing and googling, it seems that changing browser’s default font-size is just not possible on the iPhone?
+That the only option is page zoom?
+
Again, I don’t actually have the data on whether users rely on zoom or on font size.
+But so far it looks like the user doesn’t really have a choice?
+Only zoom seems to actually work in practice?
+
The final bit of evidence which completely settled the question in my mind comes from this post:
My reading of the above text: it’s on me, as an author, to ensure that my readers can scale the content using whatever method their user agent employs.
+If the UA can zoom, that’s perfect, we are done.
+
If the reader’s actual UA can’t zoom, but it can change default font size (eg, IE 6), then I need to support that.
+
That’s … most reasonable I guess?
+Just make sure that your actual users, in their actual use, can read stuff.
+And I am pretty sure my target audience doesn’t use IE 6, which I don’t support anyway.
+
TL;DR for the whole post:
+
Use pixels.
+The goal is not to check the “I suffered pain to make my website accessible” checkbox, the goal is to make the site accessible to real users.
+There’s an explicit guideline about that.
+There’s a strong evidence that, barring highly unusual circumstances, real users zoom, and pixels zoom just fine.
+
+
As a nice bonus, if you don’t use rem, you make browser’s font size setting more useful, because it can control the scale of the browser’s own chrome (which is fixed) independently from the scale of websites (which vary).
+
+
+
+
+
+
+
If a Tree Falls in a Forest, Does It Overflow the Stack?
A well-known pitfall when implementing a linked list in Rust is that the default recursive drop implementation causes a stack overflow for long lists.
+A similar problem exists for tree data structures as well.
+This post describes a couple of possible solutions for trees.
+This is a rather esoteric problem, so the article is denser than is appropriate for a tutorial.
+
Let’s start with our beloved linked list:
+
+
+
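For concreteness, a list along these lines (my sketch of the usual definition):

```rust
struct Node<T> {
    value: T,
    next: Option<Box<Node<T>>>,
}

type List<T> = Option<Box<Node<T>>>;
```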
It’s easy to cause this code to crash:
+
+
+
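A long enough list is all it takes, for example:

```rust
fn main() {
    let mut list: List<u64> = None;
    for i in 0..10_000_000 {
        list = Some(Box::new(Node { value: i, next: list }));
    }
    // Dropping the head drops the next node, which drops the next node, ...
    // one stack frame per element: stack overflow.
    drop(list);
}
```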
The crash happens in the automatically generated recursive drop function.
+The fix is to write drop manually, in a non-recursive way:
+
+
+
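One way to write it (a sketch):

```rust
impl<T> Drop for Node<T> {
    fn drop(&mut self) {
        // Detach the tail and walk it iteratively; every node dropped inside the
        // loop already has `next == None`, so its own drop doesn't recurse.
        let mut next = self.next.take();
        while let Some(mut node) = next {
            next = node.next.take();
        }
    }
}
```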
What about trees?
+
+
+
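Say, a binary tree like this (again my sketch):

```rust
struct Tree<T> {
    value: T,
    left: Option<Box<Tree<T>>>,
    right: Option<Box<Tree<T>>>,
}
```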
If the tree is guaranteed to be balanced, the automatically generated drop is actually fine, because the height of the tree will be logarithmic.
+If the tree is unbalanced though, the same stack overflow might happen.
+
Let’s write an iterative Drop to fix this.
+The problem though is that the “swap with self” trick we used for the list doesn’t work, as we have two children to recur into.
+The standard solution would be to replace the call stack with an explicit vector of work items:
+
+
+
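For example (my take on it):

```rust
impl<T> Drop for Tree<T> {
    fn drop(&mut self) {
        // An explicit stack of subtrees to free, instead of call-stack recursion.
        let mut work: Vec<Box<Tree<T>>> = Vec::new();
        work.extend(self.left.take());
        work.extend(self.right.take());
        while let Some(mut node) = work.pop() {
            work.extend(node.left.take());
            work.extend(node.right.take());
            // `node` is dropped here with both children already detached.
        }
    }
}
```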
This works, but also makes my internal C programmer scream: we allocate a vector to free memory!
+Can we do better?
+
+One approach would be to build on the balanced-trees observation.
+If we recur into the shorter branch, and iteratively drop the longer one, we should be fine:
+
+
+
This requires maintaining the depths though.
+Can we make do without?
+My C instinct (not that I wrote any substantial amount of C though) would be to go down the tree, and stash the parent links into the nodes themselves.
+And we actually can do something like that:
+
+
+If the current node has only a single child, we can descend into the node
+
+
+If there are two children, we can rotate the tree. If we always rotate into a
+single direction, eventually we’ll get into the single-child situation.
+
+
+
Here’s how a single rotation could look:
+
+
+
Or, in code,
+
+
+
Ok, what if we have an n-ary tree?
+
+
+
I think the same approach works: we can treat the first child as left, and the last child as right, and do essentially the same rotations.
+Though, we will rotate in the other direction (as removing the right child is cheaper), and we’ll also check that we have at least two grandchildren (to avoid allocation when pushing to an empty vector).
+
Which gives something like this:
+
+
+
I am not sure this works, and I am not sure this works in linear time, but I am fairly certain that something like this could be made to work if need be.
+
Though, practically, if something like this is a concern, you probably want to re-design the tree structure to be something like this instead:
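Perhaps something along these lines: an index-based tree where all nodes live in one flat vector, so dropping the whole thing is just dropping two Vecs, with no recursion at all (my sketch):

```rust
struct Tree<T> {
    nodes: Vec<Node<T>>,
}

struct Node<T> {
    value: T,
    // indices into `Tree::nodes` instead of owned boxes
    children: Vec<u32>,
}
```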
Ray or path tracing is an algorithm for getting a 2D picture out of a 3D virtual scene, by simulating a trajectory of a particle of light which hits the camera.
+It’s one of the fundamental techniques of computer graphics, but that’s not why it is the topic for today’s blog post.
+Implementing a toy ray tracer is one of the best exercises for learning a particular programming language (and a great deal about software architecture in general as well), and that’s the “why?” for this text.
+My goal here is to teach you to learn new programming languages better, by giving a particularly good exercise for that.
Learning a programming language consists of learning the theory (knowledge) and the set of tricks to actually make computer do things (skills).
+For me, the best way to learn skills is to practice them.
+Ray tracer is an exceptionally good practice dummy, because:
+
+
+It is a project of an appropriate scale: a couple of weekends.
+
+
+It is a project with a flexible scale — if you get carried away, you can sink a lot of weekends before you hit diminishing returns on effort.
+
+
+Ray tracer can make use of a lot of aspects of the language — modules, static and runtime polymorphism, parallelism, operator overloading, IO, string parsing, performance optimization, custom data structures.
+Really, I think the project doesn’t touch only a couple of big things, namely networking and evented programming.
+
+
+It is a very visual and feedback-friendly project — a bug is not some constraint violation deep in the guts of the database, it’s a picture upside-down!
+
+
+
I want to stress once again that here I view ray tracer as a learning exercise.
+We aren’t going to draw any beautiful photorealistic pictures here, we’ll settle for ugly things with artifacts.
+
Eg, this “beauty” is the final result of my last exercise:
+
+
+
And, to maximize learning, I think its better to do everything yourself from scratch.
+A crappy teapot which you did from the first principles is full to the brim with knowledge, while a beautiful landscape which you got by following step-by-step instructions is hollow.
+
And that’s the gist of the post: I’ll try to teach you as little about ray tracing as possible, to give you just enough clues to get some pixels to the screen.
+To be more poetic, you’ll draw the rest of the proverbial owl.
+
+This is in contrast to Ray Tracing in One Weekend which does a splendid job teaching ray tracing, but contains way too many spoilers if you want to learn software architecture (rather than graphics programming).
+In particular, it contains snippets of code.
+We won’t see that here — as a corollary, all the code you’ll write is fully your invention!
+
Sadly, there’s one caveat to the plan: as the fundamental task is tracing a ray as it gets reflected through the 3D scene, we’ll need a hefty amount of math.
+Not an insurmountable amount — everything is going to be pretty visual and logical.
+But still, we’ll need some of the more advanced stuff, such as vectors and cross product.
+
If you are very comfortable with that, you can approach the math parts the same way as the programming parts — grab a pencil and a stack of paper and try to work out formulas yourself.
+If solving math puzzlers is not your cup of tea, feel absolutely free to just look up formulas online.
+https://avikdas.com/build-your-own-raytracer is a great resource for that.
+If, however, linear algebra is your worst nightmare, you might want to look for a more step-by-step tutorial (or maybe pick a different problem altogether! Another good exercise is a small chat server, for example).
So, what exactly is ray tracing?
+Imagine a 3D scene with different kinds of objects: an infinite plane, a sphere, a bunch of small triangles which resemble a teapot from afar.
+The scene is illuminated by some distant light source, and so objects cast shadows and reflect each other.
+We observe the scene from a particular view point.
+Roughly, a ray of light is emitted by a light source, bounces off scene objects and eventually, if it gets into our eye, we perceive a sensation of color, which is mixed from the light’s original color, as well as the colors of all the objects the ray reflected from.
+
Now, we are going to crudely simplify the picture.
+Rather than casting rays from the light source, we’ll cast rays from the point of view.
+Whatever is intersected by the ray will be painted as a pixel in the resulting image.
The ultimate result of our ray tracer is an image.
+A straightforward way to represent an image is to use a 2D grid of pixels, where each pixel is a “red, green, blue” triple whose color values vary from 0 to 255.
+How do we display the image?
+One can reach out for graphics libraries like OpenGL, or image formats like BMP or PNG.
+
But, in the spirit of simplifying the problem so that we can do everything ourselves, we will simplify the problem!
+As a first step, we’ll display image as text in the terminal.
+That is, we’ll print . for “white” pixels and x for “black” pixels.
+
So, as the very first step, let’s write some code to display such image by just printing it.
+A good example image would be 64 by 48 pixels, with a 5 pixel large circle in the center.
+And here’s the first encounter with math: to do this, we want to iterate over all (x, y) pixels and fill them in if they are inside the circle.
+It’s useful to recall the equation of a circle centered at the origin: x^2 + y^2 = r^2, where r is the radius.
+
🎉 we got hello-world working!
+Now, let’s go for more image-y images.
+We can roll our own “real” format like BMP (I think that one is comparatively simple), but there’s a cheat code here.
+There are text-based image formats!
+In particular, PPM is the one especially convenient.
+Wikipedia Article should be enough to write our own impl.
+I suggest using P3 variation, but P6 is also nice if you want something less offensively inefficient.
+
So, rewrite your image outputting code to produce a .ppm file, and also make sure that you have an image viewer that can actually display it.
+Spend some time viewing your circle in its colorful glory (can you color it with a gradient?).
+
If you made it this far, I think you understand the spirit of the exercise — you’ve just implemented an encoder for a real image format, using nothing but a Wikipedia article.
+It might not be the fastest encoder out there, but it’s the thing you did yourself.
+You probably want to encapsulate it in a module or something, and do a nice API over it.
+Go for it! Experiment with various abstractions in the language.
Now that we can display stuff, let’s do an absolutely basic ray tracer.
+We’ll use a very simple scene: just a single sphere with the camera looking directly at it.
+And we’ll use a trivial ray tracing algorithm: shoot the ray from the camera, if it hit the sphere, paint black, else, paint white.
+If you do this as a mental experiment, you’ll realize that the end result is going to be exactly what we’ve got so far: a picture with a circle in it.
+Except now, it’s going to be in 3D!
+
This is going to be the most annoying part, as there are a lot of fiddly details to get this right, while the result is, ahem, underwhelming.
+Let’s do this though.
+
First, the sphere.
+For simplicity, let’s assume that its center is at the origin and it has radius 5, and so its equation is
+
+
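x^2 + y^2 + z^2 = 5^2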
+
Or, in vector form:
+
+
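v̅ ⋅ v̅ = r^2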
+
Here, v̅ is a point on a sphere (an (x, y, z) vector) and ⋅ is the dot product.
+As a bit of foreshadowing, if you are brave enough to take a stab at deriving various formulas, keeping to vector notation might be simpler.
+
Now, let’s place the camera.
+It is convenient to orient axes such that Y points up, X points to the right, and Z points at the viewer (ie, Z is depth).
+So let’s say that camera is at (0, 0, -20) and it looks at (0, 0, 0) (so, directly at the sphere’s center).
+
Now, the fiddly bit.
+It’s somewhat obvious how to cast a ray from the camera. If camera’s position is C̅, and we cast the ray in the direction d̅, then the equation of points on the ray is
+
+
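C̅ + t d̅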
+
where t is a scalar parameter.
+Or, in the cartesian form,
+
+
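(x, y, z) = (Cx + t dx, Cy + t dy, Cz + t dz)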
+
where (dx, dy, dz) is the direction vector for a particular ray.
+For example, for a ray which goes straight to the center of the sphere, that would be (0, 0, 1).
+
What is not obvious is how do we pick direction d?
+We’ll figure that out later.
+For now, assume that we have some magical box, which, given (x, y) position of the pixel in the image, gives us the (dx, dy, dz) of the corresponding ray.
+With that, we can use the following algorithm:
+
Iterate through all (x, y) pixels of our 64x48 image.
+From the (x, y) of each pixel, compute the corresponding ray’s (dx, dy, dz).
+Check if the ray intersects the sphere.
+If it does, paint the (x, y) pixel black.
+
To check for intersection, we can plug the ray equation, C̅ + t d̅, into the sphere equation, v̅ ⋅ v̅ = r^2.
+That is, we can substitute C̅ + t d̅ for v̅.
+As C̅, d̅ and r are specific numbers, the resulting equation would have only a single variable, t, and we could solve for that.
+For details, either apply pencil and paper, or look up “ray sphere intersection”.
+
But how do we find d̅ for each pixel?
+To do that, we actually need to add the screen to the scene.
+Our image is a 64x48 rectangle.
+So let’s place that between the camera and the sphere.
+
We have camera at (0, 0, -20) our rectangular screen at, say, (0, 0, -10) and a sphere at (0, 0, 0).
+Now, each pixel in our 2D image has a corresponding point in our 3D scene, and we’ll cast the ray from camera’s position through this point.
+
The full list of parameters to define the scene is:
+
+
+
Focal distance is the distance from the camera to the screen.
+If we know the direction the camera is looking along and the focal distance, we can calculate the position of the center of the screen, but that’s not enough.
+The screen can rotate, as we haven’t fixed which side is up, so we need an extra parameter for that.
+We also add a parameter for the direction to the right for convenience, though it’s possible to derive “right” from the “up” and “forward” directions.
+
Given this set of parameters, how do we calculate the ray corresponding to, say, (10, 20) pixel?
+Well, I’ll leave that up to you, but one hint I’ll give is that you can calculate the middle of the screen (camera position + view direction × focal distance).
+If you have the middle of the screen, you can get to (x, y) pixel by stepping x steps up (and we know up!) and y steps right (and we know right!).
+Once we know the coordinates of the point of the screen through which the ray shoots, we can compute ray’s direction as the difference between that point and camera’s origin.
+
Again, this is super fiddly and frustrating!
+My suggestion would be:
+
+
+Draw some illustrations to understand relation between camera, screen, sphere, and rays.
+
+
+Try to write the code which, given (x, y) position of the pixel in the image, gives (dx, dy, dz) coordinates of the direction of the ray from the camera through the pixel.
+
Coding wise, we obviously want to introduce some machinery here.
+The basic unit we need is a 3D vector — a triple of three real numbers (x, y, z).
+It should support all the expected operations — addition, subtraction, multiplication by scalar, dot product, etc.
+If your language supports operator overloading, you might look that up now.
+Is it a good idea to overload operator for dot product?
+You won’t know unless you try!
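For reference, a minimal Rust version of such a vector type might look like this (the dot product is a plain method here; whether an overloaded operator reads better is exactly the experiment worth running):

```rust
use std::ops::{Add, Mul, Sub};

#[derive(Clone, Copy, Debug, PartialEq)]
struct Vec3 {
    x: f64,
    y: f64,
    z: f64,
}

impl Vec3 {
    fn new(x: f64, y: f64, z: f64) -> Vec3 { Vec3 { x, y, z } }
    fn dot(self, other: Vec3) -> f64 { self.x * other.x + self.y * other.y + self.z * other.z }
    fn length(self) -> f64 { self.dot(self).sqrt() }
    fn normalized(self) -> Vec3 { self * (1.0 / self.length()) }
}

impl Add for Vec3 {
    type Output = Vec3;
    fn add(self, o: Vec3) -> Vec3 { Vec3::new(self.x + o.x, self.y + o.y, self.z + o.z) }
}

impl Sub for Vec3 {
    type Output = Vec3;
    fn sub(self, o: Vec3) -> Vec3 { Vec3::new(self.x - o.x, self.y - o.y, self.z - o.z) }
}

// Multiplication by a scalar.
impl Mul<f64> for Vec3 {
    type Output = Vec3;
    fn mul(self, k: f64) -> Vec3 { Vec3::new(self.x * k, self.y * k, self.z * k) }
}
```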
+
We also need something to hold the info about sphere, camera and the screen and to do the ray casting.
+
If everything works, you should get a familiar image of the circle.
+But it’s now powered by a real ray tracer and it’s real, honest-to-god 3D, even if it doesn’t look like it!
+Indeed, with ray casting and ray-sphere intersection code, all the essential aspects are in place; from now on, everything else is just bells and whistles.
Ok, now that we can see one sphere, let’s add the second one.
+We need to solve two subproblems for this to make sense.
+First, we need to parameterize our single sphere with the color (so that the second one looks different once we add it).
+Second, we should no longer hard-code (0, 0, 0) as a center of the sphere, and make that a parameter, adjusting the formulas accordingly.
+This is a good place to debug the code.
+If you think you moved the sphere up, does it actually move up in the image?
+
Now, the second sphere can be added with different radius, position and color.
+The ray casting code now needs to be adjusted to say which sphere intersected the ray.
+Additionally, it needs to handle the case where the ray intersects both spheres and figure out which one is closer.
+
With this machinery in hand, we can now create some true 3D scenes.
+If one sphere is fully in front of the other, that’s just concentric circles.
+But if the spheres intersect, the picture is somewhat more interesting.
The next step is going to be comparatively easy implementation wise, but it will fill our spheres with vibrant colors and make them spring out in their full 3D glory.
+We will add light to the scene.
+
Light source will be parameterized by two values:
+
+
+Position of the light source.
+
+
+Color and intensity of light.
+
+
+
For the latter, we can use a vector with three components (red, green, blue), where each component varies from 0.0 (no light) to 1.0 (maximally bright light).
+We can use a similar vector to describe a color of the object.
+Now, when the light hits the object, the resulting color would be a componentwise product of the light’s color and the object’s color.
+
Another contributor is the direction of light.
+If the light falls straight at the object, it seems bright.
+If the light falls obliquely, it is more dull.
+
Let’s get more specific:
+
+
+P̅ is a point on our sphere where the light falls.
+
+
+N̅ is the normal vector at P̅.
+That is, it’s a vector with length 1, which is locally perpendicular to the surface at P̅
+
+
+L̅ is the position of the light source
+
+
+R̅ is a vector of length one from P̅ to L̅: R̅ = (L̅ - P̅) / |L̅ - P̅|
+
+
+
Then, R̅ ⋅ N̅ gives us this “is the light falling straight at the surface?” coefficient between 0 and 1 (if the dot product comes out negative, the light is behind the surface, and we clamp the coefficient to 0).
+Dot product between two unit vectors measures how similar their direction is (it is 0 for perpendicular vectors, and 1 for collinear ones).
+So “does the light fall straight at the surface” is the same question as “is the direction to the light collinear with the normal”, which is exactly what the dot product measures.
+
The final color will be the componentwise product of the light’s color and the sphere’s color, multiplied by this attenuating coefficient.
+Putting it all together:
+
For each pixel (x, y) we cast a C̅ + t d̅ ray through it.
+If the ray hits the sphere, we calculate the point P̅ where it happens, as well as the sphere’s normal at P̅.
+For a sphere, the normal is the vector from the sphere’s center to P̅, scaled to length one.
+Then we cast a ray from P̅ to the light source L̅.
+If this ray hits the other sphere, the point is occluded and the pixel remains dark.
+Otherwise, we compute the color using the angle between the normal and the direction to the light.
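A sketch of the shading part of that logic, reusing the Vec3 type from the earlier sketch (a color here is also just a Vec3 with components in 0.0..=1.0):

```rust
// Diffuse shading at a point on the sphere: componentwise product of the
// light's color and the object's color, attenuated by how directly the light
// falls on the surface.
fn diffuse_color(point: Vec3, normal: Vec3, light_pos: Vec3, light_color: Vec3, object_color: Vec3) -> Vec3 {
    let to_light = (light_pos - point).normalized();
    // 1.0 when the light falls straight on the surface, 0.0 when it merely
    // grazes it (or comes from behind).
    let attenuation = normal.dot(to_light).max(0.0);
    Vec3::new(
        light_color.x * object_color.x,
        light_color.y * object_color.y,
        light_color.z * object_color.z,
    ) * attenuation
}
```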
+
With this logic in place, the picture now should display two 3D-looking spheres, rather than a pair of circles.
+In particular, our spheres now cast shadows!
+
What we implemented here is a part of Phong reflection model, specifically, the diffuse part.
+Extending the code to include ambient and specular parts is a good way to get some nicer looking pictures!
At this point, we accumulated quite a few parameters: camera config, positions of the spheres, their colors, light sources (you totally can have many of them!).
+Specifying all those things as constants in the code makes experimentation hard, so a next logical step is to devise some kind of textual format which describes the scene.
+That way, our ray tracer reads a textual scene description as an input, and renders a .ppm as an output.
+
One obvious choice is to use JSON, though it’s not too convenient to edit by hand, and bringing in a JSON parser is contrary to our “do it yourself” approach.
+So I would suggest designing your own small language to specify the scene.
+You might want to take a look at https://kdl.dev for inspiration.
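Purely as an illustration, a hand-rolled scene description can be a line-oriented format along these lines (the keywords and layout are entirely made up; the only real requirement is that it stays trivial to parse):

```
camera   position 0 0 -20   look_at 0 0 0   focal 10
sphere   center 0 0 0       radius 5        color 1.0 0.2 0.2
sphere   center 8 0 3       radius 2        color 0.2 0.2 1.0
light    position -10 10 -10                color 1.0 1.0 1.0
```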
+
Note how the program grows bigger — there are now distinctive parts for input parsing, output formatting, rendering per-se, as well as the underlying nascent 3D geometry library.
+As usual, if you feel like organizing all that somewhat better, go for it!
So far, we’ve only rendered spheres.
+There’s a huge variety of other shapes we can add, and it makes sense to tackle at least a couple.
+A good candidate is a plane.
+To specify a plane, we need a normal, and a point on a plane.
+For example, N̅ ⋅ v̅ = 0 is the equation of the plane which goes through the origin and is orthogonal to N̅.
+We can plug our ray equation instead of v̅ and solve for t as usual.
+
The second shape to add is a triangle.
+A triangle can be naturally specified using its three vertexes.
+One of the more advanced math exercises would be to derive a formula for ray-triangle intersection.
+As usual, math isn’t the point of the exercise, so feel free to just look that up!
+
With spheres, planes and triangles which are all shapes, there clearly is some amount of polymorphism going on!
+You might want to play with various ways to best express that in your language of choice!
Triangles are interesting, because there are a lot of existing 3D models specified as a bunch of triangles.
+If you download such a model and put it into the scene, you can render somewhat impressive images.
+
There are many formats for storing 3D meshes, but for our purposes .obj files are the best.
+Again, this is a plain text format which you can parse by hand.
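For the first pass, the relevant subset of the format is tiny: v lines are vertex positions, and f lines are faces with 1-based indices into the vertex list (possibly written as i/j/k). A hedged sketch of a reader for just that subset, assuming the mesh is already triangulated:

```rust
// A minimal .obj reader: only `v` (vertex position) and `f` (face) records.
// Face indices in .obj are 1-based and may look like `7//3` or `7/2/3`;
// we only keep the first (position) index for now.
fn parse_obj(text: &str) -> (Vec<[f64; 3]>, Vec<[usize; 3]>) {
    let mut vertices = Vec::new();
    let mut triangles = Vec::new();
    for line in text.lines() {
        let mut words = line.split_whitespace();
        match words.next() {
            Some("v") => {
                let mut coord = || words.next().unwrap().parse::<f64>().unwrap();
                vertices.push([coord(), coord(), coord()]);
            }
            Some("f") => {
                let mut index = || {
                    let word = words.next().unwrap();
                    let first = word.split('/').next().unwrap();
                    first.parse::<usize>().unwrap() - 1 // .obj indices are 1-based
                };
                triangles.push([index(), index(), index()]);
            }
            _ => {} // ignore vn, vt, comments, etc. for now
        }
    }
    (vertices, triangles)
}
```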
+
There are plenty of .obj models to download, with the Utah teapot being the most famous one.
+
Note that the model specifies three parameters for each triangle’s vertex:
+
+
+coordinate (v)
+
+
+normal (vn)
+
+
+texture (vt)
+
+
+
For the first implementation, you’d want to ignore vn and vt, and aim at getting a highly polygonal teapot on the screen.
+Note that the model contains thousands of triangles, and would take significantly more time to render.
+You might want to downscale the resolution a bit until we start optimizing performance.
+
To make the picture less polygony, you’d want to look at those vn normals.
+The idea here is that, instead of using the triangle’s true normal when calculating light, we use a fake normal, as if the triangle weren’t actually flat.
+To do that, the .obj files specifies “fake” normals for each vertex of a triangle.
+If a ray intersects a triangle somewhere in the middle, you can compute a fake normal at that point by taking a weighted average of the three normals at the vertexes.
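If your ray-triangle intersection hands you the barycentric coordinates of the hit point (most derivations do), the fake normal is just a weighted blend of the three vertex normals; a sketch, again reusing the earlier Vec3 type:

```rust
// "Smooth" normal at a point inside a triangle: blend the three vertex normals
// from the .obj file, weighted by the barycentric coordinates (u, v, w) of the
// hit point (u + v + w == 1), and re-normalize the result.
fn interpolated_normal(n0: Vec3, n1: Vec3, n2: Vec3, u: f64, v: f64, w: f64) -> Vec3 {
    (n0 * u + n1 * v + n2 * w).normalized()
}
```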
+
At this point, you should get a picture roughly comparable to the one at the start of the article!
With all bells and whistles, our ray tracer should be rather slow, especially for larger images.
+There are three tricks I suggest to make it faster (and also to learn a bunch of stuff).
+
First, ray tracing is an embarrassingly parallel task: each pixel is independent of the others.
+So, as a quick win, make sure that your program uses all the cores for rendering.
+Did you manage to get a linear speedup?
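One low-tech way to use all the cores with just the standard library is to hand each thread its own horizontal band of the image via scoped threads. A sketch, where render_pixel stands for whatever your existing per-pixel code is:

```rust
use std::thread;

// Render `height` rows of `width` pixels each, splitting the rows between threads.
fn render_parallel<F>(width: usize, height: usize, threads: usize, render_pixel: F) -> Vec<[u8; 3]>
where
    F: Fn(usize, usize) -> [u8; 3] + Sync,
{
    let mut image = vec![[0u8; 3]; width * height];
    let rows_per_thread = (height + threads - 1) / threads;
    thread::scope(|scope| {
        // Each thread gets an exclusive chunk of the output buffer.
        for (chunk_index, chunk) in image.chunks_mut(rows_per_thread * width).enumerate() {
            let render_pixel = &render_pixel;
            scope.spawn(move || {
                let first_row = chunk_index * rows_per_thread;
                for (i, pixel) in chunk.iter_mut().enumerate() {
                    let (x, y) = (i % width, first_row + i / width);
                    *pixel = render_pixel(x, y);
                }
            });
        }
    });
    image
}
```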
+
Second, it’s a good opportunity to look into profiling tools.
+Can you figure out what specifically is the slowest part?
+Can you make it faster?
+
Third, our implementation which loops over each shape to find the closest intersection is a bit naive.
+It would be cool if we had something like a binary search tree, which would show us the closest shape automatically.
+As far as I know, there isn’t a general algorithmically optimal index data structure for doing spatial lookups.
+However, there’s a bunch of somewhat heuristic data structures which tend to work well in practice.
+
One that I suggest implementing is the bounding volume hierarchy.
+The crux of the idea is that we can take a bunch of triangles and place them inside a bigger object (eg, a gigantic sphere).
+Then, if a ray doesn’t intersect this bigger object, we don’t need to check any triangles contained within.
+There’s a certain freedom in how one picks such bounding objects.
+
For our BVH, we will use axis-aligned bounding boxes (AABBs) as the bounding volumes.
+An AABB is a cuboid whose edges are parallel to the coordinate axes.
+You can parametrize an AABB with two points — the one with the lowest coordinates, and the one with the highest.
+It’s also easy to construct an AABB which bounds a set of shapes — take the minimum and maximum coordinates of all vertexes.
+Similarly, intersecting an AABB with a ray is fast.
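A sketch of the AABB type, a helper to merge two boxes, and the standard “slab” ray-box test, again with bare [f64; 3] points:

```rust
#[derive(Clone, Copy)]
struct Aabb {
    min: [f64; 3],
    max: [f64; 3],
}

impl Aabb {
    // Smallest box containing both `a` and `b` -- used to merge boxes bottom-up.
    fn union(a: Aabb, b: Aabb) -> Aabb {
        let mut result = a;
        for axis in 0..3 {
            result.min[axis] = result.min[axis].min(b.min[axis]);
            result.max[axis] = result.max[axis].max(b.max[axis]);
        }
        result
    }

    // "Slab" test: intersect the ray with the three pairs of parallel planes
    // and check that the three parameter intervals overlap.
    fn hit_by(&self, origin: [f64; 3], dir: [f64; 3]) -> bool {
        let (mut t_min, mut t_max) = (0.0_f64, f64::INFINITY);
        for axis in 0..3 {
            let inv = 1.0 / dir[axis]; // +-inf for axis-parallel rays mostly does the right thing
            let mut t0 = (self.min[axis] - origin[axis]) * inv;
            let mut t1 = (self.max[axis] - origin[axis]) * inv;
            if t0 > t1 {
                std::mem::swap(&mut t0, &mut t1);
            }
            t_min = t_min.max(t0);
            t_max = t_max.min(t1);
            if t_min > t_max {
                return false;
            }
        }
        true
    }
}
```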
+
The next idea is to define a hierarchy of AABBs.
+First, we define a root AABB for the whole scene.
+If the ray doesn’t hit it, we are done.
+The root box is then subdivided into two smaller boxes.
+The ray can hit one or two of them, and we recur into each box that got hit.
+Worst case, we are recurring into both subdivisions, which isn’t any faster, but in the common case we can skip at least a half.
+For simplicity, we also start with computing an AABB for each triangle we have in a scene, so we can think uniformly about a bunch of AABBs.
+
Putting everything together, we start with a bunch of small AABBs for our primitives.
+As a first step, we compute their common AABB.
+This will be the basis of our recursion step: a bunch of small AABBs, and a huge AABB encompassing all of them.
+We want to subdivide the big box.
+To do that, we select its longest axis (eg, if the big box is very tall, we aim to cut it in two horizontally), and find a midpoint.
+Then, we sort the small AABBs into those whose center is before the midpoint along this axis and those whose center is after it.
+Finally, for each of the two subsets we compute a pair of new AABBs, and then recur.
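Putting that recursion into code, the build step might look roughly like this (Aabb and Aabb::union are from the previous sketch; leaves keep indices into your flat list of primitives):

```rust
// A node either stores a handful of primitive indices (leaf), or two children.
enum Bvh {
    Leaf { bounds: Aabb, primitives: Vec<usize> },
    Node { bounds: Aabb, left: Box<Bvh>, right: Box<Bvh> },
}

// `boxes[i]` is the AABB of primitive `i`; `items` is the subset we are splitting.
fn build_bvh(boxes: &[Aabb], mut items: Vec<usize>) -> Bvh {
    let bounds = items
        .iter()
        .map(|&i| boxes[i])
        .reduce(Aabb::union)
        .expect("building a BVH of zero primitives");

    if items.len() <= 4 {
        return Bvh::Leaf { bounds, primitives: items };
    }

    // Split along the longest axis of the bounding box, at its midpoint.
    let mut axis = 0;
    for a in 1..3 {
        if bounds.max[a] - bounds.min[a] > bounds.max[axis] - bounds.min[axis] {
            axis = a;
        }
    }
    let midpoint = (bounds.min[axis] + bounds.max[axis]) / 2.0;

    let center = |i: usize| (boxes[i].min[axis] + boxes[i].max[axis]) / 2.0;
    let (left, right): (Vec<usize>, Vec<usize>) =
        items.drain(..).partition(|&i| center(i) < midpoint);

    // Degenerate split (everything landed on one side): just make a leaf.
    if left.is_empty() || right.is_empty() {
        let primitives = if left.is_empty() { right } else { left };
        return Bvh::Leaf { bounds, primitives };
    }
    Bvh::Node {
        bounds,
        left: Box::new(build_bvh(boxes, left)),
        right: Box::new(build_bvh(boxes, right)),
    }
}
```

Traversal then mirrors the structure: test the ray against a node’s box and recur into the children only on a hit.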
+
Crucially, the two new bounding boxes might intersect.
+We can’t just cut the root box in two and unambiguously assign the small AABBs to the two halves, as they might not be entirely within one.
+But, we can expect the intersection to be pretty small in practice.
If you’ve made it this far, you have a pretty amazing piece of software!
+While it probably clocks in at only a couple of thousand lines of code, it covers a pretty broad range of topics, from text file parsing to advanced data structures for spatial data.
+I deliberately spent no time explaining how best to fit all these pieces into a single box; that’s the main thing for you to experiment with and to learn.
+
There are two paths one can take from here:
+
+
+If you liked the graphics programming aspect of the exercise, there’s a lot you can do to improve the quality of the output.
+https://pbrt.org is the canonical book on the topic.
+
+
+If you liked the software engineering side of the project, you can try to re-implement it in different programming languages, to get a specific benchmark to compare different programming paradigms.
+Alternatively, you might want to look for other similar self-contained hand-made projects.
+Some options include:
+
+
+Software rasterizer: rather than simulating a path of a ray, we can project triangles onto the screen.
+This is potentially much faster, and should allow for real-time rendering.
+
+
+A highly concurrent chat server: a program which listens on a TCP port, allows clients to connect to it and exchange messages.
+
+
+A toy programming language: going the full road from a text file to an executable .wasm. Bonus points if you also do an LSP server for your language.
+
+
+A distributed key-value store based on Paxos or Raft.
+
For cryptographic purposes (eg, generating a key pair for public key cryptography), you want to use real random numbers, derived from genuinely stochastic physical signals
+(hardware random number generator, keyboard input, etc).
+The shape of the API here is:
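The elided snippet boils down to “fill this buffer with unpredictable bytes, courtesy of the OS”. Roughly this shape, with a made-up function name:

```rust
// "Give me real randomness": the OS fills the buffer with unpredictable bytes.
// The name is made up; in practice this is what the getrandom crate wraps.
fn fill_from_os_entropy(buf: &mut [u8]) -> std::io::Result<()> {
    let _ = buf; // the real thing would hand this buffer to the OS
    unimplemented!("getrandom(2), /dev/urandom, BCryptGenRandom, ...")
}
```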
+
+
+
As this fundamentally requires talking to some physical devices, this task is handled by the operating system.
+Different operating systems provide different APIs, covering which is beyond the scope of this article (and my own knowledge).
+
In Rust, getrandom crate provides a cross-platform wrapper for this functionality.
+
It is a major deficiency of Rust standard library that this functionality is not exposed there.
+Getting cryptographically secure random data is in the same class of OS services as getting the current time or reading standard input.
+Arguably, it’s even more important, as most applications for this functionality are security-critical.
For various non-cryptographic randomized algorithms, you want to start with a fixed, deterministic seed, and generate a stream of numbers, statistically indistinguishable from random.
+The shape of the API here is:
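Here, in contrast, the shape is a pure state machine: a seed goes in, a deterministic stream of numbers comes out. Again with made-up names:

```rust
// "Give me statistically random-looking numbers": no OS involved, fully
// deterministic given the seed.
struct Prng {
    state: u64,
}

impl Prng {
    fn new(seed: u64) -> Prng {
        Prng { state: seed }
    }
    fn next_u32(&mut self) -> u32 {
        unimplemented!("a few arithmetic operations mixing self.state")
    }
}
```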
+
+
+
There are many different algorithms to do that.
+fastrand crate implements something sufficiently close to the state of the art.
+
Alternatively, a good-enough PRNG can be implemented in 9 lines of code:
+
+
+
This code was lifted from Rust’s standard library (source).
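That particular snippet isn’t reproduced here. Purely as an illustration of how little code such a generator needs, here is a classic xorshift*-style generator of about the same size (this is not the code from std):

```rust
// A tiny xorshift* generator: an illustration, not the snippet from std.
// Fine for everyday randomized algorithms, completely unsuitable for anything
// security-related. The state must be initialized to a non-zero value.
struct XorShift64Star {
    state: u64,
}

impl XorShift64Star {
    fn next_u64(&mut self) -> u64 {
        self.state ^= self.state >> 12;
        self.state ^= self.state << 25;
        self.state ^= self.state >> 27;
        self.state.wrapping_mul(0x2545_F491_4F6C_DD1D)
    }
}
```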
+
The best way to seed a PRNG is usually by using a fixed constant.
+If you absolutely need some amount of randomness in the seed, you can use the following hack:
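One way to spell that hack using only the standard library (a sketch; it leans on the fact that std’s RandomState picks a random key once per process):

```rust
use std::collections::hash_map::RandomState;
use std::hash::{BuildHasher, Hasher};

// Extract a few random bits from std's HashMap seed. Good enough to randomize
// a PRNG seed, not good enough for anything cryptographic.
fn random_seed() -> u64 {
    RandomState::new().build_hasher().finish()
}
```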
+
+
+
In Rust, hash maps include some amount of randomization to avoid exploitable pathological behavior due to collisions.
+The above snippet extracts that randomness.
Good PRNG gives you a sequence of u32 numbers where each number is as likely as every other one.
+You can convert that to a number from 0 to 10 with random_u32() % 10.
+This will be good enough for most purposes, but will fail rigorous statistical tests.
+Because 2^32 isn’t evenly divisible by 10, 0 would be ever so slightly more frequent than 9.
+There is an algorithm to do this correctly (if random_u32() is very large, and falls into the literal remainder after dividing 2^32 by 10, throw it away and try again).
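In code, the unbiased version is a small rejection loop on top of whatever random_u32 you already have (a sketch):

```rust
// Unbiased `random value in 0..n` (n must be non-zero): reject the topmost
// values of the u32 range that don't form a complete group of `n`, then take
// the remainder.
fn random_below(mut random_u32: impl FnMut() -> u32, n: u32) -> u32 {
    // Largest multiple of n representable as a count of accepted values;
    // anything at or above it would bias the result.
    let limit = u32::MAX - u32::MAX % n;
    loop {
        let value = random_u32();
        if value < limit {
            return value % n;
        }
    }
}
```

The loop almost never runs more than once: for small n, only a handful of the 2^32 possible values get rejected.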
+
Sometimes you want to use random_u32() to generate other kinds of random things, like a random point on a 3D sphere, or a random permutation.
+There are also algorithms for that.
+
Sphere: generate random point in the unit cube; if it is also in the unit ball, project it onto the surface, otherwise throw it away and try again.
+
Permutation: naive algorithm of selecting a random element to be the first, then selecting a random element among the rest to be the second, etc, works.
+
There are libraries which provide collections of such algorithms.
+For example, fastrand includes most common ones, like generating numbers in range, generating floating point numbers or shuffling slices.
+
rand includes more esoteric cases like the aforementioned point on a sphere or a normal distribution.
It is customary to expect the existence of a global random number generator, seeded for you.
+This is an anti-pattern — in the overwhelming majority of cases, passing a random number generator explicitly leads to better software.
+In particular, this is a requirement for deterministic tests.
+
+In any case, this functionality can be achieved by storing the state of a PRNG in a thread local:
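A sketch of that, reusing the xorshift generator and the random_seed hack from the sketches above:

```rust
use std::cell::Cell;

// A lazily seeded PRNG, one instance per thread. Roughly what the "global"
// convenience APIs of crates like fastrand do under the hood.
thread_local! {
    static RNG_STATE: Cell<u64> = Cell::new(random_seed() | 1); // keep it non-zero
}

fn global_random_u64() -> u64 {
    RNG_STATE.with(|state| {
        let mut rng = XorShift64Star { state: state.get() };
        let value = rng.next_u64();
        state.set(rng.state);
        value
    })
}
```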
rand is an umbrella crate which includes all of the above.
+rand also provides flexible trait-based “plugin” interface, allowing you to mix and match different combinations of PRNGs and algorithms.
+User interface of rand is formed primarily by extension traits.
Circling back to the beginning of the post, it is very important to distinguish between the two use-cases:
+
+
+using unpredictable data for cryptography
+
+
+using statistically uniform random data for stochastic algorithms
+
+
+
Although the two use-cases both have “randomness” in their name, they are disjoint, and underlying algorithms and APIs don’t have anything in common.
+They are physically different: one is a syscall, another is a pure function mapping integers to integers.
In his “Rust in 2023” post, @nrc floated an idea of a Rust compiler rewrite.
+As my hobby is writing Rust compiler frontends (1, 2), I have some (but not very many) thoughts here!
+The post consists of two parts, covering organizational and technical aspects.
Writing a production-grade compiler is not a small endeavor.
+The questions of who writes the code, who pays the people writing the code, and what’s the economic incentive to fund the work in the first place are quite important.
+
+My naive guesstimate is that Rust is currently at that stage of its life where it’s clear that the language won’t die and will be deployed quite widely, but where, at the same time, said deployment hasn’t quite happened to the full extent yet.
+From within the Rust community, it seems like Rust is everywhere.
+My guess is that from the outside it looks like there’s Rust in at least some places.
+
In other words, it’s high time to invest substantially into Rust ecosystem, as the risk that the investment sinks completely is relatively low, but the expected growth is still quite high.
+This makes me think that a next-gen rust compiler isn’t too unlikely: I feel that rustc is stuck in a local optimum, and that, with some boldness, it is possible to deliver something more awesome.
Here’s what I think an awesome rust compiler would do:
+
+
rust-native compilation model
+
+
Like C++, Rust (ab)uses the C compilation model — compilation units are separately compiled into object files, which are then linked into a single executable by the linker.
+This model is at odds with how the language works.
+In particular, compiling a generic function isn’t actually possible until you know specific type parameters at the call-site.
+Rust and C++ hack around that by compiling a separate copy for every call-site (C++ even re-type-checks every call-site), and deduplicating instantiations during the link step.
+This creates a lot of wasted work, which is only there because we try to follow “compile to object files then link” model of operation.
+It would be significantly more efficient to merge compiler and linker, such that only the minimal amount of code is compiled, compiled code is fully aware about surrounding context and can be inlined across crates, and where the compilation makes the optimal use of all available CPU and RAM.
+
+
intra-crate parallelism
+
+
C compilation model is not stupid — it is the way it is to enable separate compilation.
+Back in the day, compiling whole programs was simply not possible due to the limitations of the hardware.
+Rather, a program had to be compiled in separate parts, and then the parts linked together into the final artifact.
+With bigger computers today, we don’t think about separate compilation as much.
+It is still important though: not only are our computers more powerful, our programs are also much bigger.
+Moreover, computing power comes not from increasing clock speeds, but from a larger number of cores.
+
Rust’s DAG of anonymous crates with well-defined declaration-site checked interfaces is actually quite great for compiling Rust in parallel (especially if we get rid of completely accidental interactions between monomorphization and existing linkers).
+However, even a single crate can be quite large, and is compiled sequentially.
+For example, in the recent compile time benchmark, a significant chunk of time was spent compiling just this file with a bunch of functions.
+Intuitively, as all these functions are completely independent, compiler should be able to process them in parallel.
+In reality, Rust doesn’t actually make that as easy as it seems, but it definitely is possible to do better than the current compiler.
+
+
open-world compiling; stable MIR
+
+
Today, Rust tooling is a black box: you feed it source text, and you get an executable binary as output.
+This solves the problem of producing executable binaries quite well!
+
However, for more complex projects you want to have more direct relationship with the code.
+You want tools other than compiler to understand the meaning of the code, and to act on it.
+For example automated large scale refactors and code analysis, project-specific linting rules or formal proofs of correctness all could benefit from having an access to semantically rich model of the language.
+
Providing such a semantic model, where the AST is annotated with resolved names and inferred types, and bodies are converted to a simple and precise IR, is a huge ask.
+Not because it is technically hard to implement, but because this adds an entirely new stable API to the language.
+Nonetheless, such an API would unlock quite a few use cases, so the tradeoff is worth it.
+
+
hermetic deterministic compilation
+
+
It is increasingly common to want reproducible builds.
+With NixOS and Guix, whole Linux distros are built in a deterministic fashion.
+It is possible to achieve reproducibility by carefully freezing whatever mess you are currently in, the docker way.
+But a better approach is to start with inherently pure and hermetic components, and assemble them into a larger system.
+
Today, Rust has some amount of determinism in its compilation, but it is achieved by plugging loopholes, rather than by not admitting impurities into the system in the first place.
+For example, the env! macro literally looks up a value in compiler’s environment, without any attempt at restricting or at least enumerating available inputs.
+Procedural macros are an unrestricted RCE.
+
It feels like we can do better, and that we should do better, if the goal is still less mess.
+
+
lazy and error-resilient compilation
+
+
For the task of providing immediate feedback right in the editor when the user types the code, compilation “pipeline” needs to be changed significantly.
+It should be lazy (so that only the minimal amount of code is inspected and re-analyzed on typing) and resilient and robust to errors (IDE job mostly ends when the code is error free).
+rust-analyzer shows one possible way to do that, with the only drawback of being a completely separate tool for IDE, and only IDE.
+There’s no technical limitation why the full compiler can’t be like that, just the organizational limitation of it being very hard to re-architect existing entrenched code, perfected for its local optimum.
+
+
cargo install rust-compiler
+
+
Finally, for the benefit of compiler writers themselves, a compiler should be a simple rust crate, which builds with stable Rust and is otherwise a very boring text processing utility.
+Again, rust-analyzer shows that it is possible, and that the benefits for development velocity are enormous.
+I am glad to see a recent movement to making the build process for the compiler simpler!
People complain about Rust syntax.
+I think that most of the time when people think they have an issue with Rust’s syntax, they actually object to Rust’s semantics.
+In this slightly whimsical post, I’ll try to disentangle the two.
+
Let’s start with an example of an ugly Rust syntax:
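The function in question is std::fs::read; approximately (imports added, body slightly simplified), it looks like this:

```rust
use std::fs::File;
use std::io::{self, Read};
use std::path::Path;

pub fn read<P: AsRef<Path>>(path: P) -> io::Result<Vec<u8>> {
    fn inner(path: &Path) -> io::Result<Vec<u8>> {
        let mut file = File::open(path)?;
        let mut bytes = Vec::new();
        file.read_to_end(&mut bytes)?;
        Ok(bytes)
    }
    inner(path.as_ref())
}
```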
+
+
+
This function reads contents of a given binary file.
+This is lifted straight from the standard library, so it is very much not a strawman example.
+And, at least to me, it’s definitely not a pretty one!
+
Let’s try to imagine what this same function would look like if Rust had a better syntax.
+Any resemblance to real programming languages, living or dead, is purely coincidental!
+
Let’s start with Rs++:
+
+
+
A Rhodes variant:
+
+
+
Typical RhodesScript:
+
+
+
Rattlesnake:
+
+
+
And, to conclude, CrabML:
+
+
+
As a slightly more serious and useful exercise, let’s do the opposite — keep the Rust syntax, but try to simplify semantics until the end result looks presentable.
+
Here’s our starting point:
+
+
+
The biggest source of noise here is the nested function.
+The motivation for it is somewhat esoteric.
+The outer function is generic, while the inner function isn’t.
+With the current compilation model, that means that the outer function is compiled together with the user’s code, gets inlined and is optimized down to nothing.
+In contrast, the inner function is compiled when the std itself is being compiled, saving time when compiling user’s code.
+One way to simplify this (losing a bit of performance) is to say that generic functions are always separately compiled, but accept an extra runtime argument under the hood which describes the physical dimensions (size and alignment) of their type parameters.
+
With that, we get
+
+
+
The next noisy element is the <P: AsRef<Path>> constraint.
+It is needed because Rust loves exposing physical layout of bytes in memory as an interface, specifically for cases where that brings performance.
+In particular, the meaning of Path is not that it is some abstract representation of a file path, but that it is just literally a bunch of contiguous bytes in memory.
+So we need AsRef to make this work with any abstraction which is capable of representing such a slice of bytes.
+But if we don’t care about performance, we can require that all interfaces are fairly abstract and mediated via virtual function calls, rather than direct memory access.
+Then we won’t need AsRef at all:
+
+
+
Having done this, we can actually get rid of Vec<u8> as well — we can no longer use generics to express efficient growable array of bytes in the language itself.
+We’d have to use some opaque Bytes type provided by the runtime:
+
+
+
Technically, we are still carrying ownership and borrowing system with us, but, without direct control over memory layout of types, it no longer brings massive performance benefits.
+It still helps to avoid GC, prevent iterator invalidation, and statically check that non-thread-safe code isn’t actually used across threads.
+Still, we can easily get rid of those &-pretzels if we just switch to GC.
+We don’t even need to worry about concurrency much — as our objects are separately allocated and always behind a pointer, we can hand-wave data races away by noticing that operations with pointer-sized things are atomic on x86 anyway.
+
+
+
+Finally, we are being overly pedantic with error handling here: not only do we mention the possibility of failure in the return type, we even use ? to highlight each specific expression that might fail.
+It would be much simpler to not think about error handling at all, and let some top-level
+try { } catch (...) { /* intentionally empty */ }
+handler deal with it:
+
+
+
Much better now!
+
Zig is a very interesting language from an IDE point of view.
+Some aspects of it are friendly to IDEs, like a very minimal and simple-to-parse syntax
+(Zig can even be correctly lexed line-by-line, very cool!),
+the absence of syntactic macros, and ability to do a great deal of semantic analysis on a file-by-file basis, in parallel.
+On the other hand, comptime.
+I accidentally spent some time yesterday thinking about how to build an IDE for it; this post is the result.
It’s useful to discuss a bit how the compiler works today.
+For something more thorough, refer to this excellent series of posts: https://mitchellh.com/zig.
+
First, each Zig file is parsed into an AST.
+Delightfully, parsing doesn’t require any context whatsoever, it’s a pure []const u8 -> Ast function, and the resulting Ast is just a piece of data.
+
After parsing, the Ast is converted to an intermediate representation, Zir.
+This is where Zig diverges a bit from more typical statically compiled languages.
+Zir actually resembles something like Python’s bytecode — an intermediate representation that an interpreter for a dynamically-typed language would use.
+That’s because it is an interpreter’s IR — the next stage would use Zir to evaluate comptime.
+
Let’s look at an example:
+
+
+
Here, the Zir for generic_add would encode addition as a typeless operation, because we don’t know types at this point.
+In particular, T can be whatever.
+When the compiler instantiates generic_add with different Ts, like generic_add(u32, ...), generic_add(f64, ...), it will re-use the same Zir for the different instantiations.
+That’s the two purposes of Zir: to directly evaluate code at compile time, and to serve as a template for monomorphisation.
+
The next stage is where the magic happens — the compiler partially evaluates dynamically typed Zir to convert it into a fairly standard statically typed IR.
+The process starts at the main function.
+The compiler more or less tries to evaluate the Zir.
+If it sees something like 90 + 2, it directly evaluates that to 92.
+For something which can’t be evaluated at compile time, like a + 2 where a is a runtime variable, the compiler generates typed IR for addition (as, at this point, we already know the type of a).
+
When the compiler sees something like
+
+
+
the compiler monomorphises the generic call.
+It checks that all comptime arguments (T) are fully evaluated, and starts partial evaluation of the called function, with comptime parameters fixed to particular values (this of course is memoized).
+
The whole process is lazy — only things transitively used from main are analyzed.
+Compiler won’t complain about something like
+
+
+
This looks perfectly fine at the Zir level, and the compiler will not move beyond Zir unless the function is actually called somewhere.
So much for the compiler; what does an IDE need? Unlike a batch compiler, an IDE:
+
+
+works with code which rapidly changes over time
+
+
+gives results immediately, there is no edit/compile cycle
+
+
+provides source to source transformations
+
+
+
The hard bit is the combination of rapid changes and immediate results.
+This is usually achieved using some smart, language-specific combination of
+
+
+
Incrementality: although changes are frequent and plentiful, they are local, and it is often possible to re-use large chunks of previous analysis.
+
+
+
Laziness: unlike a compiler, an IDE does not need full analysis results for the entirety of the codebase.
+Usually, analysis of the function which is currently being edited is the only time-critical part, everything else can be done asynchronously, later.
+
+
+
This post gives an overview of some specific fruitful combinations of the two ideas:
It’s useful to start with a pedantically correct approach.
+Let’s run our usual compilation (recursively monomorphising called functions starting from the main).
+The result would contain a bunch of different monomorphisations of guinea_pig, for different values of T.
+For each specific monomorphisation it’s now clear what is the correct answer.
+For the unspecialized case as written in the source code, the IDE can now show something reasonable by combining partial results from each monomorphisation.
+
There are several issues with this approach.
+
First, collecting the full set of monomorphisations is not well-defined in the presence of conditional compilation.
+Even if you run the “full” compilation starting from main, today the compiler assumes some particular environment (eg, Windows or Linux), which doesn’t give you a full picture.
+There’s a fascinating issue about multibuilds — making the compiler process all combinations of conditional compilation flags at the same time: zig#3028.
+With my IDE writer hat on, I really hope it gets in, as it will move IDE support from inherently heuristic territory to something where, in principle, there’s a correct result (even if it might not be particularly easy to compute).
+
The second problem is that this probably is going to be much too slow.
+If you think about IDE support for the first time, a very tantalizing idea is to try to lean just into incremental compilation.
+Specifically, you can imagine a compiler that maintains fully type-checked and resolved view of the code at all times.
+If a user edits something, the compiler just incrementally changes what needs to be changed.
+So the trick for IDE-grade interactive performance is just to implement sufficiently advanced incremental compilation.
+
+The problem with a sufficiently incremental compiler is that even perfect incrementality, which does the minimal required amount of work, will be slow in a non-negligible number of cases.
+The nature of code is that a small change to the source in a single place might lead to a large change to resolved types all over the project.
+For example, changing the name of some popular type invalidates all the code that uses this type.
+That’s the fundamental reason why IDEs try hard to maintain the ability to not analyze everything.
+
On the other hand, at the end of the day you’ll have to do this work at least by the time you run the tests.
+And Zig’s compiler is written from the ground up to be very incremental and very fast, so perhaps this will be good enough?
+My current gut feeling is that the answer is no — even if you can re-analyze everything in, say, 100ms, that’ll still require burning the battery for essentially useless work.
+Usually, there are many small atomic edits for every single test run.
+
+The third problem with the approach of collecting all monomorphisations is that it simply does not work if the function isn’t actually called, yet.
+Which is common in incomplete code that is being written, exactly the use-case where the IDE is most useful!
Thinking about the “full” approach more, it feels like it could be, at least in theory, optimized somewhat.
+Recall that in this approach we have a graph of function instantiations, which starts at the root (main), and contains various monomorphisations of guinea_pig on paths reachable from the root.
+
It is clear we actually don’t need the full graph to answer queries about instantiations of guinea_pig.
+For example, if we have something like
+
+
+
and the helper does not (transitively) call guinea_pig, we can avoid looking into its body, as the signature is enough to analyze everything else.
+
More precisely, given the graph of monomorphisations, we can select minimal subgraph which includes all paths from main to guinea_pig instantiations, as well as all the functions whose bodies we need to process to understand their signatures.
+My intuition is that the size of that subgraph is going to be much smaller than the whole thing, and, in principle, an algorithm which would analyze only that subgraph should be speedy enough in practice.
+
The problem though is that, as far as I know, it’s not possible to understand what belongs to the subgraph without analysing the whole thing!
+In particular, using compile-time reflection our guinea_pig can be called through something like comptime "guinea" ++ "_pig".
+It’s impossible to infer the call graph just from Zir.
+
And of course this does not help the case where the function isn’t called at all.
Let’s approach the problem from a different direction.
+What if we just treat this function as the root of our graph?
+We can’t do that exactly, because it has some comptime parameters.
+But we can say that we have some opaque values for the parameters: T = <opaque>.
+Of course, we won’t be able to fully evaluate everything and things like if (T == int) would probably need to propagate opaqueness.
+At the same time, something like the result of BoundedArray(opaque) would still be pretty useful for an IDE.
+
I am wondering if there’s even perhaps some compilation-time savings in this approach?
+My understanding (which might be very wrong!) is that if a generic function contains something like 90 + 2, this expression would be comptime-evaluated anew for every instantiation.
+In theory, what we could do is to partially evaluate this function substituting opaque values for comptime parameters, and then, for any specific instantiation, we can use the result of this partial evaluation as a template.
+Not sure what that would mean precisely though: it definitely would be more complicated than just substituting Ts in the result.
Ast and Zir infra is good.
+It is per-file, so it naturally just works in an IDE.
+
Multibuilds are important.
+I am somewhat skeptical that they’ll actually fly, and it’s not a complete game over if they don’t
+(Rust has the same problem with conditional compilation, and it does create fundamental problems for both the users and authors of IDEs, but the end result is still pretty useful).
+Still, if Zig does ship multibuilds, that’d be awesome.
+
Given the unused function problem, I think it’s impossible to avoid at least some amount of abstract interpretation, so Sema has to learn to deal with opaque values.
+
With abstract interpretation machinery in place, it can be used as a first, responsive layer of IDE support.
+
+Computing the full set of monomorphisations in the background can be used to augment these limited synchronous features with precise results asynchronously.
+Though, this might be tough to express in existing editor UIs.
+Eg, the goto definition result is now an asynchronous stream of values.
Deno is a relatively new JavaScript runtime.
+I find it quite interesting and aesthetically appealing, in line with the recent trend of reining in the worse-is-better law of software evolution.
+This post explains why.
+
The way I see it, the primary goal of Deno is to simplify development of software, relative to the status quo.
+Simplifying means removing the accidental complexity.
+To me, a big source of accidental complexity in today’s software is implicit dependencies.
+Software is built of many components, and while some components are relatively well-defined (Linux syscall interface, amd64 ISA), others are much less so.
+Example: upgrading OpenSSL for your Rust project from 1.1.1 to 3.0.0 works on your machine, but breaks on CI, because 3.0.0 now needs some new perl module, which is expected to usually be there together with the perl installation, but that is not universally so.
+One way to solve these kinds of problems is by putting an abstraction boundary (read: a docker container) around them.
+But a different approach is to very carefully avoid creating the issues.
+Deno, in the general sense, picks this second noble hard path.
+
One of the first problems in this area is bootstrapping.
+In general, you can paper over quite a bit of complexity by writing some custom script to do all the grunt work.
+But how do you run it?
+
One answer is to use a shell script, as the shell is already installed.
+Which shell? Bash, sh, powershell?
+Probably POSIX sh is a sane choice; Windows users can just run a docker container (er, a Linux in their subsystem).
+You’ll also want to install shellcheck to make sure you don’t accidentally use bashisms.
+At some point your script grows too large, and you rewrite it in Python.
+You now have to install Python, I’ve heard it’s much easier these days on Windows.
+Of course, you’ll run that inside a docker container (er, a virtual environment).
+And you would be careful to use python3 -m pip rather than pip3 to make sure you use the right thing.
+
+Although scripting and plumbing should be a way to combat complexity, just getting to the point where every contributor to your software can run scripts requires a docker container (er, a great deal of futzing with the environment)!
+
Deno doesn’t solve the problem of just being already there on every imaginable machine.
+However, it strives very hard to not create additional problems once you get the deno binary onto the machine.
+Some manifestations of that:
+
Deno comes with a code formatter (deno fmt) and an LSP server (deno lsp) out of the box.
+The high order bit here is not that these are high-value features which drive productivity (though that is so), but that you don’t need to pull extra deps to get these features.
+Similarly, Deno is a TypeScript runtime — there’s no transpilation step involved, you just deno main.ts.
+
Deno does not rely on system’s shell.
+Most scripting environments, including node, python, and ruby, make a grave mistake of adding an API to spawn a process intermediated by the shell.
+This is slow, insecure, and brittle (which shell was that, again?).
+I have a longer post about the issue.
+Deno doesn’t have this vulnerable API.
+Not that “not having an API” is a particularly challenging technical achievement, but it is better than the current default.
+
Deno has a correctly designed tasks system.
+Whenever you do a non-trivial software project, there inevitably comes a point where you need to write some software to orchestrate your software.
+Accidental complexity creeps in the form of a Makefile (which make is that?) or a ./scripts/*.sh directory.
+Node (as far as I know) pioneered a great idea to treat these as a first-class concern of the project, by including a scripts field in the package.json.
+It then botched the execution by running the scripts through system’s shell, which downgrades it to ./scripts directory with more indirection.
+In contrast, Deno runs the scripts in deno_task_shell, a purpose-built small cross-platform shell.
+You no longer need to worry that rm might behave differently depending on which rm it is, because it’s a shell’s built-in now.
+
These are all engineering nice-to-haves.
+They don’t necessarily matter much in isolation, but together they point at project values which align very well with my own.
+But there are a couple of innovative, bigger features as well.
+
The first big feature is the permissions system.
+When you run a Deno program, you need to specify explicitly which OS resources it can access.
+Pinging google.com would require an explicit opt-in.
+You can safely run
+
+
+
and be sure that this won’t steal your secrets.
+Of course, it can still burn the CPU indefinitely or fill out.txt with garbage, but it won’t be able to read anything beyond explicitly passed input.
+For many, if not most, scripting tasks this is a nice extra protection from supply chain attacks.
+
The second big feature is Deno’s interesting, minimal, while still practical, take on dependency management.
+First, it goes without saying that there are no global dependencies.
+Everything is scoped to the current project.
+Naturally, there are also lockfiles with checksums.
+
However, there’s no package registry or even a separate package manager.
+In Deno, a dependency is always a URL.
+The runtime itself understands URLs, downloads their contents and loads the resulting TypeScript or JavaScript.
+Surprisingly, it feels like this is enough to express various dependency patterns.
+For example, if you need a centralized registry, like https://deno.land/x, you can use URLs pointing to that!
+URLs can also express semver, with foo@1 redirecting to foo@1.2.3.
+Import maps are a standard, flexible way to remap dependencies, for when you need to tweak something deep in the tree.
+Crucially, in addition to lockfiles Deno comes with a built in deno vendor command, which fetches all of the dependencies of the current project and puts them into a subfolder, making production deployments immune to dependencies’ hosting failures.
+
Deno’s approach to built-in APIs beautifully bootstraps from its url-based dependency management.
+First, Deno provides a set of runtime APIs.
+These APIs are absolutely stable, follow existing standards (eg, fetch for doing networking), and play the role of providing cross-platform interface for the underlying OS.
+Then there’s the standard library.
+There’s an ambition to provide a comprehensive batteries included standard library, which is vetted by core developers, a-la Go.
+At the same time, huge stdlib requires a lot of work over many years.
+So, as a companion to the stable 1.30.3 runtime APIs, which are a part of the deno binary, there’s the 0.177.0 version of the stdlib, which is downloaded just like any other dependency.
+I am fairly certain that in time this will culminate in actually stable, comprehensive, and high quality stdlib.
+
All these together mean that you can be sure that, if you got deno --version working, then deno run your-script.ts will always work, as the surface area for things to go wrong due to differences in the environment is drastically cut.
+
The only big drawback of Deno is the language — all this runtime awesomeness is tied to TypeScript.
+JavaScript is a curious beast — post ES6, it is actually quite pleasant to use, and has some really good parts, like injection-proof template literal semantics.
+But all the old WATs like
+
+
+
are still there.
+TypeScript does an admirable job with typing JavaScript, as it exists in the wild, but the resulting type system is not simple.
+It seems that, linguistically, something substantially better than TypeScript is possible in theory.
+But among the actually existing languages, TypeScript seems like a solid choice.
+
To sum up, historically the domain of “scripting” and “glue code” was plagued by the problem of accidentally supergluing oneself to a particular UNIX flavor at hand.
+Deno finally seems like a technology that tries to solve this issue of implicit dependencies by not having the said dependencies instead of putting everything in a docker container.
Usually, when discussing stability of the APIs (in a broad sense; databases and programming languages are also APIs), only two states are mentioned:
+
+
+an API is stable if there’s a promise that all future changes would be backwards compatible
+
+
+otherwise, it is unstable
+
+
+
This is reflected in, e.g., SemVer: before 1.0 anything goes; after 1.0, breaking the API is only allowed if you bump the major version.
+
I think the actual situation in the real world is a bit more nuanced than that.
+In addition to clearly stable or clearly unstable, there’s often a poorly defined third category.
+It often manifests as either:
+
+
+some technically non-stable version of the project (e.g., 0.2) becoming widely used and de facto stable
+
+
+some minor but technically breaking change quietly slipping in shortly after 1.0
+
+
+
Here’s what I think happens over a lifetime of a typical API:
+
In the first phase, the API is actively evolving.
+There is a promise of anti-stability — there’s constant change and a lot of experimentation.
+Almost no one is using the project seriously:
+
+
+the API is simply incomplete, there are large gaps in functionality
+
+
+chasing upstream requires continuous, large effort
+
+
+there’s no certainty that the project will, in fact, ship a stable version, rather than die
+
+
+
In the second phase, the API is mostly settled.
+It does everything it needs to do, and the shape feels mostly right.
+Transition to this state happens when the API maintainers feel like they nailed down everything.
+However, no wide deployment has happened yet, so there might still be minor but backwards-incompatible adjustments waiting to be made.
+It makes sense to use the API for all active projects (though it costs you an innovation token).
+The thing basically works, you might need to adjust your code from time to time, occasionally an adjustment is not trivial, but the overall expected effort is low.
+The API is fully production ready, and has everything except stability.
+If you write a program on top of the API today, and try to run it ten years later, it will fail.
+But if you are making your own releases a couple of times a year, you should be fine.
+
In the third phase, the API is fully stable, and no backwards-incompatible changes are expected.
+Otherwise, it is identical to the second phase.
+Transition to this phase happens after:
+
+
+early adopters empirically stop uncovering deficiencies in the API
+
+
+API maintainers make a commitment to maintain stability.
+
+
+
In other words, it is not unstable -> stable, it is rather:
+
+
+experimental (unstable, not fit for production)
+
+
+production ready (still unstable, but you can budget-in a bounded amount of upgrade work)
+
+
+stable (no maintenance work is required)
+
+
+
We don’t have great, catchy terms to describe the second bullet, so it gets lumped together with the first or the last one.
An introductory post about complexity theory today!
+It is relatively well-known that there exist so-called NP-complete problems — particularly hard problems, such that, if you solve one of them efficiently, you can solve all of them efficiently.
+I think I’ve learned relatively early that, e.g., SAT is such a hard problem.
+I’ve similarly learned a bunch of specific examples of equally hard problems, where solving one solves the other.
+However, why SAT is at least as hard as any other NP problem remained a mystery to me for a rather long time.
+It is a shame — this fact is rather intuitive and easy to understand.
+This post is my attempt at an explanation.
+It assumes some familiarity with the space, but it’s not going to be too technical or thorough.
Let’s say you are solving some search problem, like “find a path that visits every vertex in a graph once”.
+It is often possible to write a naive algorithm for it, where we exhaustively check every possible prospective solution:
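For concreteness, here is what such a naive “search and check” could look like for the path problem above (a sketch; the graph representation is arbitrary):

```rust
// Brute force "search and check" for the Hamiltonian path problem: try every
// permutation of the vertices and check whether consecutive vertices are
// connected. Checking one candidate is cheap; the problem is that there are
// n! candidates.
fn has_hamiltonian_path(n: usize, edges: &[(usize, usize)]) -> bool {
    let mut adj = vec![vec![false; n]; n];
    for &(u, v) in edges {
        adj[u][v] = true;
        adj[v][u] = true;
    }
    let mut vertices: Vec<usize> = (0..n).collect();
    let mut check = |candidate: &[usize]| candidate.windows(2).all(|w| adj[w[0]][w[1]]);
    search(&mut vertices, 0, &mut check)
}

// Enumerate permutations of `items[start..]`, stopping early if `check` passes.
fn search(items: &mut Vec<usize>, start: usize, check: &mut dyn FnMut(&[usize]) -> bool) -> bool {
    if start == items.len() {
        return check(items.as_slice());
    }
    for i in start..items.len() {
        items.swap(start, i);
        if search(items, start + 1, check) {
            return true;
        }
        items.swap(start, i);
    }
    false
}

fn main() {
    // The path graph 0 - 1 - 2 - 3 has a Hamiltonian path...
    assert!(has_hamiltonian_path(4, &[(0, 1), (1, 2), (2, 3)]));
    // ...while the star graph with center 0 does not.
    assert!(!has_hamiltonian_path(4, &[(0, 1), (0, 2), (0, 3)]));
}
```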
+
+
+
Although checking each specific candidate is pretty fast, the whole algorithm is exponential, because there are exponentially many candidates.
+Turns out, it is possible to write “check if solution fits” part as a SAT formula!
+And, if you have a magic algorithm which solves SAT, you can use that to find a candidate solution which would work instead of enumerating all solutions!
+
In other words, solving SAT removes “search” from “search and check”.
+
That’s more or less everything I wanted to say today, but let’s make this a tiny bit more formal.
We will be discussing algorithms and their runtime.
+Big-O notation is a standard instrument for describing performance of algorithms, as it erases small differences which depend on a particular implementation of the algorithm.
+Both 2N + 1000 and 100N are O(N), linear.
+
In this post we will be even less precise.
+We will talk about polynomial time: an algorithm is polynomial if it is O(N^k) for some k.
+For example, N^100 is polynomial, while 2^N is not.
+
We will also be thinking about Turing machines (TMs) as our implementation device.
+Programming algorithms directly on Turing machines is cumbersome, but TMs have two advantages for our use case:
+
+
+it’s natural to define runtime of TM
+
+
+it’s easy to simulate a TM as a part of some larger algorithm (an interpreter for a TM is a small program)
+
+
+
+Finally, we will only think about problems with binary answers (decision problems).
+“Is there a solution to this formula?” rather than “what is the solution to this formula?”.
+“Is there a path in the graph of length at least N?” rather than “what is the longest path in this graph?”.
Intuitively, a problem is NP if it’s easy to check that a solution is valid (even if finding the solution might be hard).
+This intuition doesn’t exactly work for yes/no problems we are considering.
+To fix this, we will also provide a “hint” for the checker.
+For example, if the problem is “is there a path of length N in a given graph?” the hint will be a path.
+
A decision problem is NP, if there’s an algorithm that can verify a “yes” answer in polynomial time, given a suitable hint.
+
That is, for every input where the answer is “yes” (and only for those inputs) there should be a hint that makes our verifying algorithm answer “yes”.
+
Boolean satisfiability, or SAT is a decision problem where an input is a boolean formula like
+
+
+
and the answer is “yes” if the formula evaluates to true for some variable assignment.
+
+It’s easy to see that SAT is NP: the hint is a variable assignment which satisfies the formula, and the verifier evaluates the formula.
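To make “the verifier evaluates the formula” concrete, here is a sketch of such a verifier for formulas in CNF (a conjunction of clauses, each clause a disjunction of possibly negated variables):

```rust
// The polynomial-time verifier for SAT, for formulas in CNF. The "hint" is an
// assignment of booleans to variables; verification is two nested loops,
// clearly polynomial in the size of the formula.
struct Literal {
    var: usize,
    negated: bool,
}

type Clause = Vec<Literal>;
type Formula = Vec<Clause>;

fn verify(formula: &Formula, assignment: &[bool]) -> bool {
    formula.iter().all(|clause| {
        clause.iter().any(|lit| assignment[lit.var] != lit.negated)
    })
}

fn main() {
    // (x0 or not x1) and (not x0 or x1)
    let formula: Formula = vec![
        vec![Literal { var: 0, negated: false }, Literal { var: 1, negated: true }],
        vec![Literal { var: 0, negated: true }, Literal { var: 1, negated: false }],
    ];
    assert!(verify(&formula, &[true, true]));
    assert!(!verify(&formula, &[true, false]));
}
```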
Turns out, there is the “hardest” problem in NP — solving just that single problem in polynomial time automatically solves every other NP problem in polynomial time (we call such problems NP-complete).
+Moreover, there’s actually a bunch of such problems, and SAT is one of them.
+Let’s see why!
+
First, let’s define a (somewhat artificial) problem which is trivially NP-complete.
+
+Let’s start with this one: “Given a Turing machine and an input for it of length N, will the machine output “yes” after N^k steps?”
+(here k is a fixed parameter; pedantically, I describe a family of problems, one for each k)
+
+This is very similar to the halting problem, but also much easier.
+We explicitly bound the runtime of the Turing machine by a polynomial, so we don’t need to worry about “looping forever” case — that would be a “no” for us.
+The naive algorithm here works: we just run the given machine on a given input for a given amount of steps and look at the answer.
+
+Now, if we formulate the problem as “Is there an input I for a given Turing machine M such that M(I) answers “yes” after N^k steps?” we get our NP-complete problem.
+It’s trivially NP: the hint is the input that makes the machine answer “yes”, and the verifier just runs our TM with this input for N^k steps.
+It can also be used to efficiently solve any other NP problem (e.g. SAT).
+Indeed, we can use the verifying TM as M, and that way find if there’s any hint that makes it answer “yes”.
+
+This is a bit circular and hard to wrap one’s head around, but, at the same time, trivial.
+We essentially just carefully stare at the definition of an NP problem, specifically produce an algorithm that can solve any NP problem by directly using the definition, and notice that the resulting algorithm is also NP.
+Now there’s no surprise that there exists the hardest NP problem — we essentially defined NP such that this is the case.
+
What is still a bit mysterious is why non-weird problems like SAT also turn out to be NP-complete?
+This is because SAT is powerful enough to encode a Turing machine!
+
First, note that we can encode a state of a Turing machine as a set of boolean variables.
+We’ll need a boolean variable T_i for each position on the tape.
+The tape is in general infinite, but all our Turing machines run for polynomial (finite) time, so they use only a finite number of cells, and it’s enough to create variables only for those cells.
+The position of the head can also be described by a set of boolean variables.
+For example, we can have a P_i (“is the head at cell i?”) variable for each cell.
+Similarly, we can encode the finite number of states our machine can be in as a set of S_i variables (“is the machine in state i?”).
+
Second, we can write a set of boolean equations which describe a single transition of our Turing machine.
+For example, the value of cell i at the second step, T^2_i, will depend on its value at the previous step, T^1_i, on whether the head was at i (P^1_i), and on the rules of our specific states.
+For example, if our machine flips bits in state 0 and keeps them in state 1, then the formula we get for each cell is
+
+T^2_i = (¬P^1_i ∧ T^1_i) ∨ (P^1_i ∧ S^1_1 ∧ T^1_i) ∨ (P^1_i ∧ S^1_0 ∧ ¬T^1_i)
+
We can write similar formulas for changes of P and S families of variables.
+
Third, after we wrote the transition formula for a single step, we can stack several such formulas on top of each other to get a formula for N steps.
+
+Now let’s come back to our universal problem: “is there an input which makes a given Turing machine answer “yes” in N^k steps?”.
+At this point, it’s clear that we can replace a “Turing machine with N^k steps” with our transition formula duplicated N^k times.
+So, the question of existence of an input for a Turing machine reduces to the question of existence of a solution to a (big, but still polynomial) SAT formula.
SAT is hard, because it allows encoding Turing machine transitions.
+We can’t encode loops in SAT, but we can encode “N steps of a Turing machine” by repeating the same formula N times with small variations.
+So, if we know that a particular Turing machine runs in polynomial time, we can encode it by a polynomially-sized formula.
+(see also pure meson ray-tracer for a significantly more practical application of a similar idea).
+
And that means that every problem that can be solved by a brute-force search over all solutions can be reduced to a SAT instance, by encoding the body of the search loop as a SAT formula!
A common trope is how, if one wants to build a game, one should build a game, rather than a game engine, because it is all too easy to fall into a trap of building a generic solution, without getting to the game proper.
+It seems to me that the situation with code editors is the opposite — many people build editors, but few are building “editor engines”.
+What’s an “editor engine”? A made-up term I use to denote the thin waist the editor is built upon: the set of core concepts, entities, and APIs which power the variety of the editor’s components.
+In this post, I will highlight Emacs’ thin waist, which I think is worthy of imitation!
+
Before we get to Emacs, let’s survey various APIs for building interactive programs.
+
+
Plain text
+
+
The simplest possible thing, the UNIX way of programs as filters, reading input from stdin and writing data to stdout.
+The language here is just plain text.
+
+
ANSI escape sequences
+
+
Adding escape codes to plain text (and a bunch of ioctls) allows changing colors and clearing the screen.
+The language becomes a sequence of commands for the terminal (with “print text” being a fairly frequent one).
+This already is rich enough to power a variety of terminal applications, such as vim!
+
+
HTML
+
+
With more structure, we can disentangle ourselves from text, and say that all the stuff is made of trees of attributed elements (whose content might be text).
+That turns out to be enough to express basically whatever, as the world of modern web apps testifies.
+
+
Canvas
+
+
Finally, to achieve maximal flexibility, we can start with a clean 2d canvas with pixels and an event stream, and let the app draw however it likes.
+Desktop GUIs usually work that way (using some particular widget library to encapsulate common patterns of presentation and event handling).
+
+
+
+
Emacs is different.
+Its thin waist consists of (using idiosyncratic olden editor terminology) frames, windows, buffers and attributed text.
+This is less general than canvas or HTML, but more general (and way more principled) than ANSI escapes.
+Crucially, this also retains most of plain text’s composability.
+
The foundation is a text with attributes — a pair of a string and a map from string’s subranges to key-value dictionaries.
+Attributes express presentation (color, font, text decoration), but also semantics.
+A range of text can be designated as clickable.
+Or it can specify a custom keymap, which is only active when the cursor is on this range.
+
I find this to be a sweet spot for building efficient user interfaces.
+Consider magit:
+
+
+
The interface is built from text, but it is more discoverable, more readable, and more efficient than GUI solutions.
+
Text is surprisingly good at communicating with humans!
+Forgoing arbitrary widgets and restricting oneself to a grid of characters greatly constrains the set of possible designs, but designs which come out of these constraints tend to be better.
+
+
The rest (buffers, windows, and frames) serve to present attributed strings to the user.
+A Buffer holds a piece of text and stores the position of the cursor (and the rest of the editor’s state for this particular piece of text).
+A tiling window manager displays buffers:
+
+
+there’s a set of floating windows (frames in Emacs terminology) managed by a desktop environment
+
+
+each floating window is subdivided into a tree of vertical and horizontal splits (windows) managed by Emacs
+
+
+each split displays a buffer, although some buffers might not have a corresponding split
+
+
+
There’s also a tasteful selection of extras outside this orthogonal model.
+A buffer holds a status bar at the bottom and a set of fringe decorations at the left edge.
+Each floating window has a minibuffer — an area to type commands into (minibuffer is a buffer though — only presentation is slightly unusual).
+
But the vast majority of everything else is not special — every significant thing is a buffer.
+So, a ./main.rs file, the ./src file tree, and a terminal session where you type cargo build are all displayed as attributed text.
+All use the same tools for navigation and manipulation.
+
Universality is the power of the model.
+Good old UNIX pipes, except interactive.
+With a GUI file manager, mass-renaming files requires a dedicated utility.
+In Emacs, file manager’s state is text, so you can use standard text-manipulation tools (regexes, multiple cursors, vim’s .) for the same task.
Pay more attention to the editor’s thin waist.
+Don’t take it as a given that an editor should be a terminal, HTML, or GUI app — there might be a better vocabulary.
+In particular, Emacs seems to hit the sweet spot with its language of attributed strings and buffers.
+
I am not sure that Emacs is the best we can do, but having a Rust library which implements Emacs model more or less as is would be nice!
+The two best resources to learn about this model are
This post will be a bit all over the place.
+Several months ago, I wrote Hard Mode Rust, exploring an allocation-conscious style of programming.
+In the ensuing discussion, @jamii name-dropped TigerBeetle, a reliable, distributed, fast, and small database written in Zig in a similar style, and, well, I now find myself writing Zig full-time, after more than seven years of Rust.
+This post is a hand-wavy answer to the “why?” question.
+It is emphatically not a balanced and thorough comparison of the two languages.
+I haven’t yet written my 100k lines of Zig to do that.
+(if you are looking for a more general “what the heck is Zig”, I can recommend @jamii’s post).
+In fact, this post is going to be less about languages, and more about styles of writing software (but pre-existing knowledge of Rust and Zig would be very helpful).
+Without further caveats, let’s get started.
To the first approximation, we all strive to write bug-free programs.
+But I think a closer look reveals that we don’t actually care about programs being correct 100% of the time, at least in the majority of the domains.
+Empirically, almost every program has bugs, and yet it somehow works out OK.
+To pick one specific example, most programs use the stack, but almost no programs understand what their stack usage is exactly, and how far they can go.
+When we call malloc, we just hope that we have enough stack space for it; we almost never check.
+Similarly, all Rust programs abort on OOM, and can’t state their memory requirements up-front.
+Certainly good enough, but not perfect.
+
The second approximation is that we strive to balance program usefulness with the effort to develop the program.
+Bugs reduce usefulness a lot, and there are two styles of software engineering to deal with them:
+
Erlang style, where we embrace failability of both hardware and software and explicitly design programs to be resilient to partial faults.
+
SQLite style, where we overcome an unreliable environment at the cost of rigorous engineering.
+
rust-analyzer and TigerBeetle are perfect specimens of the two approaches; let me describe them.
rust-analyzer is an LSP server for the Rust programming language.
+By its nature, it’s expansive.
+Great developer tools usually have a feature for every niche use-case.
+It also is a fast-moving open source project which has to play catch-up with the rustc compiler.
+Finally, the nature of IDE dev tooling makes availability significantly more important than correctness.
+An erroneous completion option would cause a smirk (if it is noticed at all), while the server crashing and all syntax highlighting turning off will be noticed immediately.
+
For this cluster of reasons, rust-analyzer is shifted far towards the “embrace software imperfections” side of the spectrum.
+rust-analyzer is designed around having bugs.
+All the various features are carefully compartmentalized at runtime, such that panicking code in just a single feature can’t bring down the whole process.
+Critically, almost no code has access to any mutable state, so usage of catch_unwind can’t lead to a rotten state.
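+A hand-wavy Rust sketch of that shape (not rust-analyzer’s actual code): each feature runs over an immutable snapshot behind catch_unwind, so a panic is downgraded to a missing result.
+
+```rust
+use std::panic::{catch_unwind, AssertUnwindSafe};
+
+/// An immutable view of the analysis state; features cannot mutate it.
+struct Snapshot;
+
+/// Run a single feature; a panic inside it yields None instead of
+/// killing the process or leaving shared state half-updated.
+fn run_feature<T>(snapshot: &Snapshot, feature: impl FnOnce(&Snapshot) -> T) -> Option<T> {
+    catch_unwind(AssertUnwindSafe(|| feature(snapshot))).ok()
+}
+```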
+
Development process itself is informed by this calculus.
+For example, PRs with new features land when there’s a reasonable certainty that the happy case works correctly.
+If some weird incomplete code would cause the feature to crash, that’s OK.
+It might even be a benefit — fixing a well-reproducible bug in an isolated feature is a gateway drug to heavy contribution to rust-analyzer.
+Our tight weekly release schedule (and the nightly release) help to get bug fixes out there faster.
+
Overall, the philosophy is to maximize provided value by focusing on the common case.
+Edge cases become eventually correct over time.
TigerBeetle is a database, with a domain model fixed at compile time (we currently do double-entry bookkeeping).
+The database is distributed, meaning that there are six TigerBeetle replicas running on different geographically and operationally isolated machines, which together implement a replicated state machine.
+That is, TigerBeetle replicas exchange messages to make sure every replica processes the same set of transactions, in the same order.
+That’s a surprisingly hard problem if you allow machines to fail (the whole point of using many machines for redundancy), so we use a smart consensus algorithm (non-byzantine) for this.
+Traditionally, consensus algorithms assume reliable storage — data once written to disk can be always retrieved later.
+In reality, storage is unreliable, nearly byzantine — a disk can return bogus data without signaling an error, and even a single such error can break consensus.
+TigerBeetle combats that by allowing a replica to repair its local storage using data from other replicas.
+
On the engineering side of things, we are building a reliable, predictable system.
+And predictable means really predictable.
+Rather than reining in sources of non-determinism, we build the whole system from the ground up from a set of fully deterministic, hand crafted components.
+Here are some of our unconventional choices (design doc):
+
It’s hard mode!
+We allocate all the memory at startup, and there’s zero allocation after that.
+This removes all the uncertainty about allocation.
+
The code is architected with brutal simplicity.
+As a single example, we don’t use JSON, or ProtoBuf, or Cap’n’Proto for serialization.
+Rather, we just cast the bytes we received from the network to a desired type.
+The motivation here is not so much performance, as reduction of the number of moving parts.
+Parsing is hard, but, if you control both sides of the communication channel, you don’t need to do it, you can send checksummed data as is.
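+For illustration only (TigerBeetle itself is Zig, and its actual wire format and field names differ), reading a fixed-layout header without a parsing step might look like this in Rust:
+
+```rust
+use std::mem::{size_of, MaybeUninit};
+
+/// A fixed-layout header; the fields here are made up.
+#[repr(C)]
+#[derive(Clone, Copy)]
+struct Header {
+    checksum: u64,
+    size: u32,
+    command: u32,
+}
+
+/// "Parse" by copying the bytes into a properly aligned Header;
+/// there is no field-by-field decoding.
+fn read_header(bytes: &[u8]) -> Option<Header> {
+    if bytes.len() < size_of::<Header>() {
+        return None;
+    }
+    let mut header = MaybeUninit::<Header>::uninit();
+    unsafe {
+        std::ptr::copy_nonoverlapping(bytes.as_ptr(), header.as_mut_ptr().cast::<u8>(), size_of::<Header>());
+        Some(header.assume_init())
+    }
+}
+```
+
+In the real system the checksum would of course be verified before the rest of the header is trusted.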
+
We aggressively minimize all dependencies.
+We know exactly the system calls our system is making, because all IO is our own code (on Linux, our main production platform, we don’t link libc).
+
There’s little abstraction between components — all parts of TigerBeetle work in concert.
+For example, one of our core types, Message, is used throughout the stack:
+
+
+network receives bytes from a TCP connection directly into a Message
+
+
+consensus processes and sends Messages
+
+
+similarly, storage writes Messages to disk
+
+
+
This naturally leads to very simple and fast code.
+We don’t need to do anything special to be zero copy — given that we allocate everything up-front, we simply don’t have any extra memory to copy the data to!
+(A separate issue is that, arguably, you just can’t treat storage as a separate black box in a fault-tolerant distributed system, because storage is also faulty).
+
Everything in TigerBeetle has an explicit upper-bound.
+There’s not a thing which is just a u32 — all data is checked to meet specific numeric limits at the edges of the system.
+
This includes Messages.
+We just upper-bound how many messages can be in-memory at the same time, and allocate precisely that amount of messages (source).
+Getting a new message from the message pool can’t allocate and can’t fail.
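+A toy Rust sketch of such a pool (names and sizes invented): everything is allocated once, and acquiring never allocates.
+
+```rust
+struct Message {
+    buffer: Box<[u8; 4096]>,
+}
+
+struct MessagePool {
+    /// Filled to capacity at startup; never grows afterwards.
+    free: Vec<Message>,
+}
+
+impl MessagePool {
+    fn new(capacity: usize) -> MessagePool {
+        let free = (0..capacity)
+            .map(|_| Message { buffer: Box::new([0; 4096]) })
+            .collect();
+        MessagePool { free }
+    }
+
+    /// The protocol bounds the number of in-flight messages, so by
+    /// construction the pool is never empty when this is called.
+    fn acquire(&mut self) -> Message {
+        self.free.pop().expect("message bound violated")
+    }
+
+    fn release(&mut self, message: Message) {
+        self.free.push(message);
+    }
+}
+```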
+
With all that strictness and explicitness about resources, of course we also fully externalize any IO, including time.
+All inputs are passed in explicitly; there are no ambient influences from the environment.
+And that means that the bulk of our testing consists of trying all possible permutations of effects of the environment.
+Deterministic randomized simulation is very effective at uncovering issues in real implementations of distributed systems.
+
What I am getting at is that TigerBeetle isn’t really a normal “program” program.
+It strictly is a finite state machine, explicitly coded as such.
I find myself often returning to the first Rust slide deck.
+A lot of core things are different (Rust no longer uses only the old ideas), but a lot is the same.
+To be a bit snarky, while Rust “is not for lone genius hackers”, Zig … kinda is.
+On more peaceable terms, while Rust is a language for building modular software, Zig is in some sense anti-modular.
That’s the core of what Rust is doing: it provides you with a language to precisely express the contracts between components, such that components can be integrated in a machine-checkable way.
+
Zig doesn’t do that. It isn’t even memory safe. My first experience writing a non-trivial Zig program went like this:
+
+
+
However!
+Zig is a much smaller language than Rust.
+Although you’ll have to keep the entirety of the program in your head, and control heaven and earth so as not to mess up resource management, doing that could be easier.
+
It’s not true that rewriting a Rust program in Zig would make it simpler.
+On the contrary, I expect the result to be significantly more complex (and segfaulty).
+I noticed that a lot of Zig code written in “let’s replace RAII with defer” style has resource-management bugs.
+
But it often is possible to architect the software such that there’s little resource management to do (eg, allocating everything up-front, like TigerBeetle, or even at compile time, like many smaller embedded systems).
+It’s hard — simplicity is always hard.
+But, if you go this way, I feel like Zig can provide substantial benefits.
+
Zig has just a single feature, dynamically-typed comptime, which subsumes most of the special-cased Rust machinery.
+It is definitely a tradeoff: instantiation-time errors are much worse for complex cases.
+But a lot more of the cases are simple, because there’s no need for programming in the language of types.
+Zig is very spartan when it comes to the language.
+There are no closures — if you want them, you’ll have to pack a wide-pointer yourself.
+Zig’s expressiveness is aimed at producing just the right assembly, not at allowing maximally concise and abstract source code.
+In the words of Andrew Kelley, Zig is a DSL for emitting machine code.
+
Zig strongly prefers explicit resource management.
+A lot of Rust programs are web-servers.
+Most web servers have a very specific execution pattern of processing multiple independent short-lived requests concurrently.
+The most natural way to code this would be to give each request a dedicated bump allocator, which turns drops into no-ops and “frees” the memory in bulk after each request by resetting the offset to zero.
+This would be pretty efficient, and would provide per-request memory profiling and limiting out of the box.
+I don’t think any popular Rust frameworks do this — using the global allocator is convenient enough and creates a strong local optimum.
+Zig forces you to pass the allocator in, so you might as well think about the most appropriate one!
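+A minimal bump-arena sketch in Rust (my own illustration; it ignores alignment and hands out raw byte slices):
+
+```rust
+struct Bump {
+    buffer: Vec<u8>,
+    offset: usize,
+}
+
+impl Bump {
+    fn with_capacity(capacity: usize) -> Bump {
+        Bump { buffer: vec![0; capacity], offset: 0 }
+    }
+
+    /// Hand out `len` bytes of the request's budget, or None if it is exhausted.
+    fn alloc(&mut self, len: usize) -> Option<&mut [u8]> {
+        let start = self.offset;
+        let end = start.checked_add(len)?;
+        if end > self.buffer.len() {
+            return None;
+        }
+        self.offset = end;
+        Some(&mut self.buffer[start..end])
+    }
+
+    /// Called at the end of each request: individual "drops" are no-ops,
+    /// and the whole arena is freed at once by resetting the offset.
+    fn reset(&mut self) {
+        self.offset = 0;
+    }
+}
+```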
+
Similarly, the standard library is very conscious about allocation, more so than Rust’s.
+Collections are not parametrized by an allocator, like in C++ or (future) Rust.
+Rather, an allocator is passed in explicitly to every method which actually needs to allocate.
+This is Call Site Dependency Injection, and it is more flexible.
+For example in TigerBeetle we need a couple of hash maps.
+These maps are sized at startup time to hold just the right number of elements, and are never resized.
+So we pass an allocator to the init method, but we don’t pass it to the event loop.
+We get to both use the standard hash-map, and to feel confident that there’s no way we can allocate in the actual event loop, because it doesn’t have access to an allocator.
First, I think Zig’s strength lies strictly in the realm of writing “perfect” systems software.
+It is a relatively thin slice of the market, but it is important.
+One of the problems with Rust is that we don’t have a reliability-oriented high-level programming language with a good quality of implementation (modern ML, if you will).
+This is a blessing for Rust, because it makes its niche bigger, increasing the amount of community momentum behind the language.
+This is also a curse, because a bigger niche makes it harder to maintain focus.
+For Zig, Rust already plays this role of “modern ML”, which creates bigger pressure to specialize.
+
+Second, my biggest worry about Zig is its semantics around the aliasing, provenance, mutability, and self-reference ball of problems.
+I don’t worry all that much about this creating “iterator invalidation” style of UB.
+TigerBeetle runs in -DReleaseSafe, which mostly solves spatial memory safety; it doesn’t really do dynamic memory allocation, which unasks the question of temporal memory safety;
+and it has a very thorough fuzzer-driven test suite, which squashes the remaining bugs.
+I do worry about the semantics of the language itself.
+My current understanding is that, to correctly compile a C-like low-level language, one really needs to nail down semantics of pointers.
+I am not sure “portable assembly” is really a thing: it is possible to create a compiler which does little optimization and “works as expected” most of the time, but I am doubtful that it’s possible to correctly describe the behavior of such a compiler.
+If you start asking questions about what are pointers, and what is memory, you end up in a fairly complicated land, where bytes are poison.
+Rust tries to define that precisely, but writing code which abides by the Rust rules without a borrow-checker isn’t really possible — the rules are too subtle.
+Zig’s implementation today is very fuzzy around potentially aliased pointers, copies of structs with interior-pointers and the like.
+I wish that Zig had a clear answer to what the desired semantics is.
+
Third, IDE support.
+I’ve written about that before on this blog.
+As of today, developing Zig is quite pleasant — the language server is pretty spartan, but already quite helpful, and, for the rest, Zig is exceptionally greppable.
+But, with the lazy compilation model and the absence of out-of-the-language meta programming, I feel like Zig could be more ambitious here.
+To position itself well for the future in terms of IDE support, I think it would be nice if the compiler got a basic data model for the IDE use-case.
+That is, there should be an API to create a persistent analyzer process, which ingests a stream of code edits, and produces a continuously updated model of the code without explicit compilation requests.
+The model can be very simple, just “give me an AST of this file at this point in time” would do — all the fancy IDE features can be filled in later.
+What matters is a shape of data flow through the compiler — not an edit-compile cycle, but rather a continuously updated view of the world.
+
Fourth, one of the values of Zig which resonates with me a lot is a preference for low-dependency, self-contained processes.
+Ideally, you get yourself a ./zig binary, and go from there.
+The preference, at this time of changes, is to bundle a particular version of ./zig with a project, instead of using a system-wide zig.
+There are two aspects that could be better.
+
“Getting yourself a Zig” is a finicky problem, because it requires bootstrapping.
+To do that, you need to run some code that will download the binary for your platform, but each platform has its own way to “run code”.
+I wish that Zig provided a blessed set of scripts, get_zig.sh, get_zig.bat, etc (or maybe a small actually portable binary?), which projects could just vendor, so that the contribution experience becomes fully project-local and self-contained:
+
+
+
Once you have ./zig, you can use that to drive the rest of the automation.
+You already can ./zig build to drive the build, but there’s more to software than just building.
+There’s always a long tail of small things which traditionally get solved with a pile of platform-dependent bash scripts.
+I wish that Zig pushed the users harder towards specifying all that automation in Zig.
+A picture is worth a thousand words, so
+
+
+
Attempting to summarize,
+
+
+Rust is about compositional safety, it’s a more scalable language than Scala.
+
+
+Zig is about perfection.
+It is a very sharp, dangerous, but, ultimately, more flexible tool.
+
Rust is vertically scalable, in that you can write all kinds of software in it.
+You can write an advanced zero-alloc image compression library, build a web server exposing the library to the world as an HTTP SAAS, and cobble together a “script” for building, testing, and deploying it to wherever people deploy software these days.
+And you would only need Rust — while it excels in the lowest half of the stack, it’s pretty ok everywhere else too.
Rust is horizontally scalable, in that you can easily parallelize development of large software artifacts across many people and teams.
+Rust itself moves at breakneck speed, which is surprising for such a loosely coordinated and chronically understaffed open source project of this scale.
+The relatively small community managed to put together a comprehensive ecosystem of composable high-quality crates on short notice.
+Rust is so easy to compose reliably that even the stdlib itself does not shy away from pulling dependencies from crates.io.
+
Steve Klabnik wrote about Rust’s Golden Rule,
+how function signatures are mandatory and authoritative and explicitly define the interface both for the callers of the function and for the function’s body.
+This thinking extends to other parts of the language.
+
My second most favorite feature of Rust (after safety) is its module system.
+It has first-class support for the concept of a library.
+A library is called a crate and is a tree of modules, a unit of compilation, and a principal visibility boundary.
+Modules can contain circular dependencies, but libraries always form a directed acyclic graph.
+There’s no global namespace of symbols — libraries are anonymous, names only appear on dependency edges between two libraries, and are local to the downstream crate.
+
The benefits of this core compilation model are then greatly amplified by Cargo, which is not a generalized task runner, but rather a rigid specification for what is a package of Rust code:
+
+
+a (library) crate,
+
+
+a manifest, which defines dependencies between packages in a declarative way, using semver,
+
+
+an ecosystem-wide agreement on the semantics of dependency specification, and accompanied dependency resolution algorithm.
+
+
+
Crucially, there’s absolutely no way in Cargo to control the actual build process.
+The build.rs file can be used to provide extra runtime inputs, but it’s cargo who calls rustc.
+
Again, Cargo defines a rigid interface for a reusable piece of Rust code.
+Both producers and consumers must abide by these rules; there is no way around them.
+As a reward, they get a super-power of working together by working apart.
+I don’t need to ping dtolnay in Slack when I want to use serde-json because we implicitly pre-agreed to a shared golden rule.
+
+
+
+
+
+
+
+
diff --git a/2023/04/02/ub-might-be-the-wrong-term-for-newer-languages.html b/2023/04/02/ub-might-be-the-wrong-term-for-newer-languages.html
new file mode 100644
index 00000000..7e8c9cbd
--- /dev/null
+++ b/2023/04/02/ub-might-be-the-wrong-term-for-newer-languages.html
@@ -0,0 +1,132 @@
+
+
+
+
+
+
+ UB Might Be a Wrong Term for Newer Languages
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
A short note on undefined behavior, which assumes familiarity with the subject (see this article for the introduction).
+The TL;DR is that I think that carrying the wording from the C standard into newer languages, like Zig and Rust, might be a mistake.
+This is strictly about the word choice, a “lexical syntax of the comments” kind of argument.
+
The C standard leaves many behaviors undefined.
+However, it allows any particular implementation to fill in the gaps and define some of undefined-in-the-standard behaviors.
+For example, C23 makes realloc(ptr, 0) into an undefined behavior, so that POSIX can further refine it without interfering with the standard (source).
+
It’s also valid for an implementation to leave UB undefined.
+If a program compiled with this implementation hits this UB path, the behavior of the program as a whole is undefined
+(or rather, bounded by the execution environment. It is not actually possible to summon nasal daemons, because a user-space process can not escape its memory space other than by calling syscalls, and there are no nasal daemons summoning syscalls).
+
C implementations are not required to but may define behaviors left undefined by the standard.
+A C program written for a specific implementation may rely on undefined-in-the-standard but defined-in-the-implementation behavior.
+
Modern languages like Rust and Zig re-use the “undefined behavior” term.
+However, the intended semantics is subtly different.
+A program exhibiting UB is always considered invalid.
+Even if an alternative implementation of Rust defines some of Rust’s UB, the programs hitting those behaviors would still be incorrect.
+
For this reason, I think it would be better to use a different term here.
+I am not ready to suggest a specific wording, but a couple of reasonable options would be “non-trapping programming error” or “invalid behavior”.
+The intended semantics being that any program execution containing illegal behavior is invalid under any implementation.
+
Curiously, C++ is ahead of the pack here, as it has an explicit notion of “ill-formed, no diagnostic required”.
+
Update: I’ve since learned that Zig is updating its terminology.
+The new term is illegal behavior.
+This is perfect, “illegal” has just the right connotation of being explicitly declared incorrect by a written specification.
+
+
+
+
+
+
+
diff --git a/2023/04/09/can-you-trust-a-compiler-to-optimize-your-code.html b/2023/04/09/can-you-trust-a-compiler-to-optimize-your-code.html
new file mode 100644
index 00000000..78c9f09f
--- /dev/null
+++ b/2023/04/09/can-you-trust-a-compiler-to-optimize-your-code.html
@@ -0,0 +1,557 @@
+
+
+
+
+
+
+ Can You Trust a Compiler to Optimize Your Code?
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
More or less the title this time, but first, a story about SIMD. There are three
+levels of understanding how SIMD works (well, at least I am level 3 at the moment):
+
+
+
Compilers are smart! They will auto-vectorize all the code!
+
+
+
Compilers are dumb, auto-vectorization is fragile, it’s very easy to break it
+by unrelated changes to the code. It’s always better to manually write
+explicit SIMD instructions.
+
+
+
Writing SIMD by hand is really hard — you’ll need to re-do the work for
+every different CPU architecture. Also, you probably think that, for scalar
+code, a compiler writes better assembly than you. What makes you think that
+you’d beat the compiler at SIMD, where there are more funky instructions and
+constraints? Compilers are tools. They can reliably vectorize code if it is
+written in an amenable-to-vectorization form.
+
+
+
+I’ve recently moved from the second level to the third one, and that is when the model used by a compiler for optimization clicked in my head.
+In this post, I want to explain the general framework for reasoning about compiler optimizations for static languages such as Rust or C++.
+After that, I’ll apply that framework to auto-vectorization.
+
I haven’t worked on backends of production optimizing compilers, so the following will not be academically correct, but these models are definitely helpful at least to me!
The first bit of a puzzle is understanding how a compiler views code. Some useful references here include
+The SSA Book or LLVM’s
+Language Reference.
+
Another interesting choice would be WebAssembly Specification.
+While WASM would be a poor IR for an optimizing compiler, it has a lot of structural similarities, and the core spec is exceptionally readable.
+
A unit of optimization is a function.
+Let’s take a simple function like the following:
+
+
+
In some pseudo-IR, it would look like this:
+
+
+
The most important characteristic here is that there are two kinds of entities:
+
First, there is program memory, very roughly an array of bytes.
+Compilers generally can not reason about the contents of the memory very well, because it is shared by all the functions, and different functions might interpret the contents of the memory differently.
+
Second, there are local variables.
+Local variables are not bytes — they are integers, they obey mathematical properties which a compiler can reason about.
+
For example, if a compiler sees a loop like
+
+
+
It can reason that on each iteration tmp holds i * 4 and optimize the code to
+
+
+
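+A hedged Rust rendering of that kind of rewrite (the post's own IR snippets are elided): the multiplication is replaced by an accumulator which the compiler knows equals i * 4 on every iteration.
+
+```rust
+// Before: `i * 4` is recomputed on every iteration.
+fn before(n: usize) -> usize {
+    let mut total = 0;
+    for i in 0..n {
+        let tmp = i * 4;
+        total += tmp;
+    }
+    total
+}
+
+// After: `tmp` is threaded through the loop and bumped by 4 instead.
+fn after(n: usize) -> usize {
+    let mut total = 0;
+    let mut tmp = 0;
+    for _ in 0..n {
+        total += tmp;
+        tmp += 4;
+    }
+    total
+}
+```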
This works, because all locals are just numbers.
+If we did the same computation, but all numbers were located in memory, it would be significantly harder for a compiler to reason that the transformation is actually correct.
+What if the storage for n and total actually overlaps?
+What if tmp overlaps with something which isn’t even in the current function?
+
+However, there’s a bridge between the worlds of mathematical local variables and the world of memory bytes — load and store instructions.
+The load instruction takes a range of bytes in memory, interprets the bytes as an integer, and stores that integer into a local variable.
+The store instruction does the opposite.
+By loading something from memory into a local, a compiler gains the ability to reason about it precisely.
+Thus, the compiler doesn’t need to track the general contents of memory.
+It only needs to check that it would be correct to load from memory at a specific point in time.
+
So, a compiler really doesn’t see all that well — it can only really reason about a single function at a time, and only about the local variables in that function.
Compilers are myopic.
+This can be fixed by giving more context to the compiler, which is the task of two core optimizations.
+
The first core optimization is inlining.
+It substitutes callee’s body for a specific call.
+The benefit here is not that we eliminate function call overhead; that’s relatively minor.
+The big thing is that locals of both the caller and the callee are now in the same frame, and a compiler can optimize them together.
+
Let’s look again at that Rust code:
+
+
+
The xs[i] expression there is actually a function call.
+The indexing function does a bounds check before accessing the element of an array.
+After inlining it into the sum, the compiler can see that the bounds check is dead code and eliminate it.
+
+If you look at various standard optimizations, they often look like getting rid of dumb things, which no one would actually write in the first place, so it’s not immediately clear whether it is worth implementing such optimizations.
+But the thing is, after inlining a lot of dumb things appear, because functions tend to handle the general case, and, at a specific call-site, there are usually enough constraints to dismiss many edge cases.
+
The second core optimization is scalar replacement of aggregates.
+It is a generalization of the “let’s use load to avoid reasoning about memory and reason about a local instead” idea we’ve already seen.
+
If you have a function like
+
+
+
it’s pretty difficult for the compiler to reason about it.
+It receives a pointer to some memory which holds a complex struct (ptr, len, capacity triple), so reasoning about evolution of this struct is hard.
+What the compiler can do is to load this struct from memory, replacing the aggregate with a bunch of scalar local variables:
+
+
+
This way, a compiler again gains reasoning power.
+SROA is like inlining, but for memory rather than code.
+
+To sum up, a compiler:
+
+
+is great at noticing relations between local variables and rearranging the code based on that,
+
+
+is capable of limited reasoning about the memory (namely, deciding when it’s safe to load or store).
+
+
+
+With this model in mind, we can describe which code is reliably optimizable, and which code prevents optimizations, explaining zero cost abstractions.
+
To enable inlining, a compiler needs to know which function is actually called.
+If a function is called directly, it’s pretty much guaranteed that a compiler would try to inline it.
+If the call is indirect (via function pointer, or via a table of virtual functions), in the general case a compiler won’t be able to inline that.
+Even for indirect calls, sometimes the compiler can reason about the value of the pointer and de-virtualize the call, but that relies on successful optimization elsewhere.
+
This is the reason why, in Rust, every function has a unique, zero-sized type with no runtime representation.
+It statically guarantees that the compiler could always inline the code, and makes this abstraction zero cost, because any decent optimizing compiler will melt it to nothing.
+
A higher level language might choose to always represent functions with function pointers.
+In practice, in many cases the resulting code would be equivalently optimizable.
+But there won’t be any indication in the source whether this is an optimizable case (the actual pointer is knowable at compile time) or a genuinely dynamic call.
+With Rust, the difference between guaranteed to be optimizable and potentially optimizable is reflected in the source language:
+
+
+
So, the first rule is to make most of the calls statically resolvable, to allow inlining.
+Function pointers and dynamic dispatch prevent inlining.
+Separate compilation might also get in a way of inlining, see this separate essay on the topic.
+
Similarly, indirection in memory can cause troubles for the compiler.
+
For something like this
+
+
+
the Foo struct is completely transparent for the compiler.
+
While here:
+
+
+
it is not clear cut.
+Proving something about the memory occupied by Foo does not in general transfer to the memory occupied by Bar.
+Again, in many cases a compiler can reason through boxes thanks to uniqueness, but this is not guaranteed.
+
A good homework at this point is to look at Rust’s iterators and understand why they look the way they do.
Another important point about memory is that, in general, a compiler can’t change the overall layout of stuff.
+SROA can load some data structure into a bunch of local variables, which then can, eg, replace “a pointer and an index” representation with “a pair of pointers”.
+But at the end of the day SROA would have to materialize “a pointer and an index” back and store that representation back into the memory.
+This is because memory layout is shared across all functions, so a function can not unilaterally dictate a more optimal representation.
+
Together, these observations give a basic rule for the baseline of performant code.
Let’s apply this general framework of giving a compiler optimizable code to work with to auto-vectorization.
+We will be optimizing the function which computes the longest common prefix between two slices of bytes (thanks @nkkarpov for the example).
+
A direct implementation would look like this:
+
+
+
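+The snippet itself is elided in this rendering; a plausible reconstruction of the direct version is:
+
+```rust
+/// Length of the longest common prefix, one byte at a time.
+fn common_prefix(xs: &[u8], ys: &[u8]) -> usize {
+    let mut result = 0;
+    for (x, y) in xs.iter().zip(ys.iter()) {
+        if x != y {
+            break;
+        }
+        result += 1;
+    }
+    result
+}
+```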
If you already have a mental model for auto-vectorization, or if you look at the assembly output, you can realize that the function as written works one byte at a time, which is much slower than it needs to be.
+Let’s fix that!
+
SIMD works on many values simultaneously.
+Intuitively, we want the compiler to compare a bunch of bytes at the same time, but our current code does not express that.
+Let’s make the structure explicit, by processing 16 bytes at a time, and then handling remainder separately:
+
+
+
Amusingly, this is already a bit faster, but not quite there yet.
+Specifically, SIMD needs to process all values in the chunk in parallel in the same way.
+In our code above, we have a break, which means that processing of the nth pair of bytes depends on the (n-1)st pair.
+Let’s fix that by disabling short-circuiting.
+We will check if the whole chunk of bytes matches or not, but we won’t care which specific byte is a mismatch:
+
+
+
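+Again a reconstruction rather than the post's exact code: 16-byte chunks, one whole-chunk comparison, and a byte-by-byte scan only once a chunk (or the remainder) disagrees.
+
+```rust
+fn common_prefix_chunks(xs: &[u8], ys: &[u8]) -> usize {
+    const CHUNK: usize = 16;
+    let mut result = 0;
+    for (xs_chunk, ys_chunk) in xs.chunks_exact(CHUNK).zip(ys.chunks_exact(CHUNK)) {
+        // No per-byte break inside the chunk: every pair of bytes is
+        // compared in exactly the same way.
+        if xs_chunk != ys_chunk {
+            break;
+        }
+        result += CHUNK;
+    }
+    // Finish off the mismatching chunk and the remainder one byte at a time.
+    let limit = xs.len().min(ys.len());
+    while result < limit && xs[result] == ys[result] {
+        result += 1;
+    }
+    result
+}
+```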
And this version finally lets vectorization kick in, reducing the runtime almost by an order of magnitude.
+We can now compress this version using iterators.
+
+
+
Note how the code is meaningfully different from our starting point.
+We do not blindly rely on the compiler’s optimization.
+Rather, we are aware about specific optimizations we need in this case, and write the code in a way that triggers them.
+
Specifically, for SIMD:
+
+
+we express the algorithm in terms of processing chunks of elements,
+
+
+within each chunk, we make sure that there’s no branching and all elements are processed in the same way.
+
Compilers are tools.
+While there’s a fair share of “optimistic” transformations which sometimes kick in, the bulk of the impact of an optimizing compiler comes from guaranteed optimizations with specific preconditions.
+Compilers are myopic — they have a hard time reasoning about code outside of the current function and values not held in the local variables.
+Inlining and scalar replacement of aggregates are two optimizations to remedy the situation.
+Zero cost abstractions work by expressing opportunities for guaranteed optimizations in the language’s type system.
Compilers for systems programming languages (C, C++, Rust, Zig) tend to be implemented in the languages themselves.
+The idea being that the current version of the compiler is built using some previous version.
+But how can you get a working compiler if you start out from nothing?
+
The traditional answer has been “via bootstrap chain”.
+You start with the first version of the compiler implemented in assembly, use that to compile the latest version of the compiler it is capable of compiling, then repeat.
+This historically worked OK because older versions of GCC were implemented in C (and C is easy to provide a compiler for) and, even today, GCC itself is very conservative in using language features.
+I believe GCC 10.4 released in 2022 can be built with just a C++98 compiler.
+So, if you start with a C compiler, it’s not too many hops to get to the latest GCC.
+
This doesn’t feel entirely satisfactory, as this approach requires artificially constraining the compiler itself to be very conservative.
+Rust does the opposite of that.
+Rust requires that rustc 1.x.0 is built by rustc 1.x-1.0, and there’s a new rustc version every six weeks.
+This seems like a very reasonable way to build compilers, but it also is incompatible with chain bootstrapping.
+In the limit, one would need infinite time to compile modern rustc ex nihilo!
+
I think there’s a better way if the goal is to compile the world from nothing.
+To cut to the chase, the minimal bootstrap seed for Rust could be:
+
+
+source code for current version of the compiler
+
+
+this source code compiled to core WebAssembly
+
+
+
Bootstrapping from this should be easy.
+WebAssembly is a very small language, so a runtime for it can be built out of nothing.
+Using this runtime and rustc-compiled-to-wasm, we can re-compile rustc itself.
+Then we can cross-compile it to the architecture we need, if that architecture is supported by rustc.
+If the architecture is not supported, we can implement a new backend for that arch in Rust, compile our modified compiler to wasm, and then cross-compile to the desired target.
+
More complete bootstrap seed would include:
+
+
+Informal specification of the Rust language, to make sense of the source code.
+
+
+Rust source code for the compiler, which also doubles as a formal specification of the language.
+
+
+Informal specification of WebAssembly, to make sense of .wasm parts of the bootstrap seed.
+
+
+.wasm code for the rust compiler, which triple-checks the Rust specification.
+
+
+Rust implementation of a WebAssembly interpreter, which doubles as a formal spec for WebAssembly.
+
+
+
And this seed is provided for every version of a language.
+This way, it is possible to bootstrap, in constant time, any version of Rust.
+
Specific properties we use for this setup:
+
+
+Compilation is deterministic.
+Compiling bootstrap sources with bootstrap .wasm blob should result in a byte-for-byte identical wasm blob.
+
+
+WebAssembly is target-agnostic.
+It describes abstract computation, which is completely independent from the host architecture.
+
+
+WebAssembly is simple.
+Implementing a WebAssembly interpreter is easy in whatever computation substrate you have.
+
+
+Compiler is a cross compiler.
+We don’t want to bootstrap just the WebAssembly backend, we want to bootstrap everything.
+This requires that the WebAssembly version of the compiler can generate the code for arbitrary architectures.
+
+
+
This setup does not prevent the trusting trust attack.
+However, it is possible to rebuild the bootstrap seed using a different compiler.
+Using that compiler to compile rustc to .wasm will produce a different blob.
+But using that .wasm to recompile rustc again should produce the blob from the seed (unless, of course, there’s a trojan in the seed).
+
This setup does not minimize the size of opaque binary blobs in the seed.
+The size of the .wasm would be substantial.
+This setup, however, does minimize the total size of the seed.
+In the traditional bootstrap, source code for rustc 1.0.0, rustc 1.1.0, rustc 1.2.0, etc would also have to be part of the seed.
+For the suggested approach, you need only one version, at the cost of a bigger binary blob.
+
This idea is not new.
+I think it was popularized by Pascal with p-code.
+OCaml uses a similar strategy.
+Finally, Zig makes an important observation that we no longer need to implement language-specific virtual machines, because WebAssembly is a good fit for the job.
In this post, I will present a theoretical design for an interner.
+It should be fast, but there will be no benchmarks as I haven’t implemented the thing.
+So it might actually be completely broken or super slow for one reason or another.
+Still, I think there are a couple of neat ideas, which I would love to call out.
+
The context for the post is this talk by Andrew Kelley, which notices that it’s hard to reconcile interning and parallel compilation.
+This is something I have been thinking about a lot in the context of rust-analyzer, which relies heavily on pointers, atomic reference counting and indirection to make incremental and parallel computation possible.
+
And yes, interning (or, more generally, assigning unique identities to things) is a big part of that.
+
Usually, compilers intern strings, but we will be interning trees today.
+Specifically, we will be looking at something like a Value type from the Zig compiler.
+In a simplified RAII style it could look like this:
+
+
+
Such values are individually heap-allocated and in general are held behind pointers.
+Zig’s compiler adds a couple of extra tricks to this structure, like not overallocating for small enum variants:
+
+
+
But how do we intern this stuff, such that:
+
+
+values are just u32 rather than full pointers,
+
+
+values are deduplicated,
+
+
+and this whole construct works efficiently even if there are multiple threads
+using our interner simultaneously?
+
+
+
Let’s start with concurrent SegmentedList:
+
+
+
Segmented list is like ArrayList with an extra super power that pushing new items does not move/invalidate old ones.
+In normal ArrayList, when the backing storage fills up, you allocate a slice twice as long, copy over the elements from the old slice and then destroy it.
+In SegmentList, you leave the old slice where it is, and just allocate a new one.
+
Now, as we are writing an interner and want to use u32 for an index, we know that we need to store 1<<32 items max.
+But that means that we’ll need at most 31 segments for our SegmentList:
+
+
+
+So we can just “pre-allocate” an array of 31 pointers to the segments, hence
+
+
+
If we want to be more precise with types, we can even use a tuple whose elements are nullable pointers to arrays of power-of-two sizes:
+
+
+
Indexing into such an echeloned array is still O(1).
+Here’s how echelons look in terms of indexes
+
+
+
The first n echelons hold 2**n - 1 elements.
+So, if we want to find the ith item, we first find the echelon it is in, by computing the nearest smaller power of two of i + 1, and then index into the echelon with i - (2**n - 1), give or take a +1 here or there.
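+In Rust for illustration (the post's actual code is Zig), and assuming echelon n holds 2^n items, the index math could look like this:
+
+```rust
+/// Map a flat index to (echelon, offset within that echelon).
+/// Assumes i < u32::MAX, so that i + 1 does not overflow.
+fn locate(i: u32) -> (u32, u32) {
+    // Echelon number = position of the highest set bit of i + 1.
+    let echelon = 31 - (i + 1).leading_zeros();
+    // The first `echelon` echelons hold 2^echelon - 1 items in total.
+    let offset = i - ((1u32 << echelon) - 1);
+    (echelon, offset)
+}
+```
+
+For example, locate(0) is (0, 0), and locate(6) is (2, 3): the last slot of the four-element echelon.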
+
+
+
Note that we pre-allocate an array of pointers to segments, but not the segments themselves.
+Pointers are nullable, and we allocate new segments lazily, when we actually write to the corresponding indexes.
+This structure is very friendly to parallel code.
+Reading items works because items are never reallocated.
+Lazily allocating new echelons is easy, because the position of the pointer is fixed.
+That is, we can do something like this to insert an item at position i:
+
+
+compute the echelon index
+
+
+@atomicLoad(.Acquire) the pointer
+
+
+if the pointer is null
+
+
+allocate the echelon
+
+
+@cmpxchgStrong(.Acquire, .Release) the pointer
+
+
+free the redundant echelon if exchange failed
+
+
+
+
+insert the item
+
+
+
Notice how we don’t need any locks or even complicated atomics, at the price of sometimes doing a second redundant allocation.
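+Here is what that lazy initialization might look like, sketched in Rust with AtomicPtr (the original is Zig, and the echelon size here is arbitrary):
+
+```rust
+use std::ptr;
+use std::sync::atomic::{AtomicPtr, Ordering};
+
+const ECHELON_LEN: usize = 1024; // stand-in size for one particular echelon
+
+fn get_or_init_echelon(slot: &AtomicPtr<[u64; ECHELON_LEN]>) -> *mut [u64; ECHELON_LEN] {
+    let existing = slot.load(Ordering::Acquire);
+    if !existing.is_null() {
+        return existing;
+    }
+    // Optimistically allocate; a concurrent thread may win the race.
+    let fresh = Box::into_raw(Box::new([0u64; ECHELON_LEN]));
+    match slot.compare_exchange(ptr::null_mut(), fresh, Ordering::Release, Ordering::Acquire) {
+        Ok(_) => fresh,
+        // Lost the race: free our copy and use the winner's echelon.
+        Err(winner) => {
+            unsafe { drop(Box::from_raw(fresh)) };
+            winner
+        }
+    }
+}
+```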
+
One thing this data structure is bad at is doing bounds checks and tracking which items are actually initialized.
+For the interner use-case, we will rely on an invariant that we always use indexes provided to us by someone else, such that possession of the index signifies that:
+
+
+the echelon holding the item is allocated
+
+
+the item itself is initialized
+
+
+there’s the relevant happens-before established
+
+
+
If, instead, we manufacture an index out of thin air, we might hit all kinds of nasty behavior without any bullet-proof way to check that.
+
Okay, now that we have this SegmentList, how would we use them?
+
Recall that our simplified value is
+
+
+
Of course we will struct-of-array it now, to arrive at something like this:
+
+
+
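+Transposed to Rust for illustration (the post's code is Zig, and it would use the SegmentLists above rather than Vec):
+
+```rust
+#[repr(u8)]
+#[derive(Clone, Copy)]
+enum Tag {
+    U64,
+    Aggregate,
+}
+
+/// A value is just an index into the `tag` and `data` columns.
+#[derive(Clone, Copy)]
+struct Value(u32);
+
+struct ValueTable {
+    tag: Vec<Tag>,               // one tag byte per value
+    data: Vec<u32>,              // small payload, or an index into a payload column
+    u64s: Vec<u64>,              // payloads for Tag::U64
+    aggregates: Vec<Vec<Value>>, // payloads for Tag::Aggregate
+}
+```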
A Value is now an index.
+This index works for two fields of ValueTable, tag and data.
+That is, the index addresses five bytes of payload, which is all that is needed for small values.
+For large tags like aggregate, the data field stores an index into the corresponding payload SegmentList.
+
+That is, every value allocates tag and data elements, but only actual u64s occupy a slot in the u64 SegmentList.
+
So now we can write a lookup function which takes a value index and reconstructs a value from pieces:
+
+
+
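+Continuing the Rust sketch from above (still purely illustrative):
+
+```rust
+/// A borrowed, non-owning view of a value.
+enum ValueFull<'a> {
+    U64(u64),
+    Aggregate(&'a [Value]),
+}
+
+impl ValueTable {
+    fn lookup(&self, value: Value) -> ValueFull<'_> {
+        let i = value.0 as usize;
+        match self.tag[i] {
+            Tag::U64 => ValueFull::U64(self.u64s[self.data[i] as usize]),
+            Tag::Aggregate => ValueFull::Aggregate(self.aggregates[self.data[i] as usize].as_slice()),
+        }
+    }
+}
+```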
+Note that here ValueFull is a non-owning type; it is a reference into the actual data.
+Note as well that aggregates now store a slice of indexes, rather than a slice of pointers.
+
Now let’s deal with creating and interning values.
+We start by creating a ValueFull using data owned by us
+(e.g. if we are creating an aggregate, we may use a stack-allocated array as a backing store for []Value slice).
+Then we ask ValueTable to intern the data:
+
+
+
If the table already contains an equal value, its index is returned.
+Otherwise, the table copies the ValueFull data such that it is owned by the table itself, and returns a freshly allocated index.
+
For bookkeeping, we’ll need a hash table with existing values and a counter to use for a fresh index, something like this:
+
+
+
Pay attention to _count fields — we have value_count guarding the tag and index, and separate counts for specific kinds of values, as we don’t want to allocate, e.g. an u64 for every value.
+
Our hashmap is actually a set which stores u32 integers, but uses ValueFull to do a lookup: when we consider interning a new ValueFull, we don’t know its index yet.
+Luckily, getOrPutAdapted API provides the required flexibility.
+We can use it to compare a Value (index) and a ValueFull by hashing a ValueFull and doing component-wise comparisons in the case of a collision.
+
Note that, because of interning, we can also hash ValueFull efficiently!
+As any subvalues in ValueFull are guaranteed to be already interned, we can rely on shallow hash and hash only child value’s indexes, rather than their data.
+
This is a nice design for a single thread, but how do we make it thread safe?
+The straightforward solution would be to slap a mutex around the logic in intern.
+
This actually is not as bad as it seems, as we’d need a lock only in intern, and lookup would work without any synchronization whatsoever.
+Recall that obtaining an index of a value is a proof that the value was properly published.
+Still, we expect to intern a lot of values, and that mutex is all but guaranteed to become a point of contention.
+And some amount of contention is inevitable here — if two threads try to intern two identical values, we want them to clash, communicate, and end up with a single, shared value.
+
There’s a rather universal recipe for dealing with contention — you can shard the data.
+In our case, rather than using something like
+
+
+
we can do
+
+
+
That is, we create not one, but sixteen hashmaps, and use, e.g., lower 4 bits of the hash to decide which mutex and hashmap to use.
+Depending on the structure of the hashmap, such locks could even be pushed as far as individual buckets.
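+Sketching the shape of it in Rust (the shard count and hashing details are arbitrary here):
+
+```rust
+use std::collections::hash_map::DefaultHasher;
+use std::collections::HashMap;
+use std::hash::{Hash, Hasher};
+use std::sync::Mutex;
+
+struct Sharded<K, V> {
+    shards: [Mutex<HashMap<K, V>>; 16],
+}
+
+impl<K: Hash + Eq, V> Sharded<K, V> {
+    fn new() -> Sharded<K, V> {
+        Sharded { shards: std::array::from_fn(|_| Mutex::new(HashMap::new())) }
+    }
+
+    /// Pick the (mutex, map) pair by the low 4 bits of the key's hash.
+    fn shard(&self, key: &K) -> &Mutex<HashMap<K, V>> {
+        let mut hasher = DefaultHasher::new();
+        key.hash(&mut hasher);
+        &self.shards[(hasher.finish() & 0xF) as usize]
+    }
+}
+```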
+
This doesn’t solve all our contention problems — now that several threads can simultaneously intern values (as long as they are hashed into different shards) we have to make all count variables atomic.
+So we essentially moved the single global point of contention from a mutex to value_count field, which is incremented for every interned value.
+
We can apply the sharding trick again, and shard all our SegmentLists.
+But that would mean that we have to dedicate some bits from Value index to the shard number, and to waste some extra space for non-perfectly balanced shards.
+
There’s a better way — we can amortize atomic increments by allowing each thread to bulk-allocate indexes.
+That is, if a thread wants to allocate a new value, it atomically increments value_count by, say, 1024, and uses those indexes for the next thousand allocations.
+In addition to ValueTable, each thread now gets its own distinct LocalTable:
+
+
+
An attentive reader would notice a bonus here: in this setup, a thread allocates a contiguous chunk of values.
+It is reasonable to assume that values allocated together would also be used together, so we potentially increase future spatial locality here.
+
Putting everything together, the pseudo-code for interning would look like this:
+
+
+
+Note that it is important that we don’t release the mutex immediately after assigning the index for a value, but rather keep it locked all the way until we have fully copied the value into the ValueTable.
+If we release the lock earlier, a different thread which tries to intern the same value would get the correct index, but would risk accessing partially-initialized data.
+This can be optimized a bit by adding a value-specific lock (or rather, a Once).
+So we use the shard lock to assign an index, then release the shard lock, and use value-specific lock to do the actual (potentially slow) initialization.
+
And that’s all I have for today!
+Again, I haven’t implemented this, so I have no idea how fast or slow it actually is.
+But the end result looks rather beautiful, and builds upon many interesting ideas:
+
+
+
+SegmentList allows maintaining index stability despite insertions.
+
+
+
+There will be at most 31 echelons in a SegmentList, so you can put pointers to them into an array, removing the need to synchronize to read an echelon.
+
+
+
With this setup, it becomes easy to initialize a new echelon with a single CAS.
+
+
+
Synchronization is required only when creating a new item.
+If you trust indexes, you can use them to carry happens-before.
+
+
+
In a struct-of-arrays setup for enums, you can save space by requiring that an array for a specific variant is just as long as it needs to be.
+
+
+
One benefit of interning trees is that hash function becomes a shallow operation.
An amateur note on language design which explores two important questions:
+
+
+How to do polymorphism?
+
+
+How to do anything at all?
+
+
+
Let’s start with the second question.
+What is the basic stuff that everything else is made of?
+
+Not so long ago, the most popular answer to that question was “objects” — blobs of mutable state with references to other blobs.
+This turned out to be problematic — local mutation of an object might accidentally cause unwanted changes elsewhere.
+Defensive copying of collections at the API boundary was a common pattern.
+
Another answer to the question of basic stuff is “immutable values”, as exemplified by functional programming.
+This fixes the ability to reason about programs locally at the cost of developer ergonomics and expressiveness.
+A lot of code is naturally formulated in terms of “let’s mutate this little thing”, and functionally threading the update through all the layers is tiresome.
+
The C answer is that everything is made of “memory (*)”.
+It is almost as if memory is an array of bytes.
+Almost, but not quite — to write portable programs amenable to optimization, certain restrictions must be placed on the ways memory is accessed and manipulated, hence (*).
+These restrictions not being checked by the compiler (and not even visible in the source code) create a fertile ground for subtle bugs.
+
Rust takes this basic C model and:
+
+
+Makes the (*) explicit:
+
+
+pointers always carry the size of addressed memory, possibly at runtime (slices),
+
+
+pointers carry lifetime, accessing the data past the end of the lifetime is forbidden.
+
+
+
+
+Adds aliasing information to the type system, such that it becomes possible to tell if there are other pointers pointing at a particular piece of memory.
+
+
+
+Curiously, this approach allows Rust to have an “immutable values” feel, without requiring the user to thread updates manually,
+“In Rust, Ordinary Vectors are Values”.
+But the cognitive cost for this approach is pretty high, as the universe of values is now forked by different flavors of owning/referencing.
+
Let’s go back to the pure FP model.
+Can we just locally fix it?
+Let’s take a look at an example:
+
+
+
It is pretty clear that we can allow mutation of local variables via a simple rewrite, as that won’t compromise local reasoning:
+
+
+
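+The post's own snippets are elided here; a hedged Rust stand-in for the kind of rewrite meant is:
+
+```rust
+// Pure version: the running total is threaded through a fold.
+fn total_pure(xs: &[i32]) -> i32 {
+    xs.iter().fold(0, |total, x| total + x)
+}
+
+// Rewritten version: mutation of a local variable, invisible from the
+// outside, so local reasoning is not compromised.
+fn total_mutable(xs: &[i32]) -> i32 {
+    let mut total = 0;
+    for x in xs {
+        total += x;
+    }
+    total
+}
+```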
Similarly, we can introduce a rewrite rule for the ubiquitous x = f(x) pattern, such that the code looks like this:
+
+
+
Does this actually work?
+Yes, it does, as popularized by Swift and distilled in its pure form by Val.
+
Formalizing the rewriting reasoning, we introduce second-class references, which can only appear in function arguments (inout parameters), but, eg, can’t be stored as fields.
+With these restrictions, “borrow checking” becomes fairly simple — at each function call it suffices to check that no two inout arguments overlap.
+
Now, let’s switch gears and explore the second question — polymorphism.
+
Starting again with OOP, you can use subtyping with its familiar class Dog extends Triangle, but that is not very flexible.
+In particular, expressing something like “sorting a list of items” with pure subtyping is not too natural.
+What works better is parametric polymorphism, where you add type parameters to your data structures:
+
+
+
+Except that it doesn’t quite work, as we also need to specify how to sort the Ts.
+One approach here would be to introduce some sort of type-of-types, to group types with similar traits into a class:
+
+
+
A somewhat simpler approach is to just explicitly pass in a comparison function:
+
+
+
How does this relate to value oriented programming?
+It happens that, when programming with values, a very common pattern is to use indexes to express relationships.
+For example, to model parent-child relations (or arbitrary graphs), the following setup works:
+
+
+
Using direct references hits language limitations:
+
+
+
Another good use-case is interning, where you have something like this:
+
+
+
How do we sort a Vec<Name>?
+We can’t use the type class approach here, as knowing the type of Name isn’t enough to sort names lexicographically; an instance of NameTable is also required to fetch the actual string data.
+The approach with just passing in a comparison function works, as it can close over the correct NameTable in scope.
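+A small Rust sketch of that approach (Name and NameTable are the post's hypothetical types, fleshed out here with assumed fields):
+
+```rust
+struct Name(u32);
+
+struct NameTable {
+    strings: Vec<String>, // Name(i) refers to strings[i]
+}
+
+impl NameTable {
+    fn resolve(&self, name: &Name) -> &str {
+        &self.strings[name.0 as usize]
+    }
+}
+
+/// The comparison closure closes over the NameTable in scope; the type
+/// of Name alone would not be enough to sort lexicographically.
+fn sort_names(names: &mut [Name], table: &NameTable) {
+    names.sort_by(|a, b| table.resolve(a).cmp(table.resolve(b)));
+}
+```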
+
The problem with “just pass a function” is that it gets tedious quickly.
+Rather than xs.print() you now need to say xs.print(Int::print).
+Luckily, similarly to how the compiler infers the type parameter T by default, we can allow limited inference of value parameters, which should remove most of the boilerplate.
+So, something which looks like names.print() would desugar to Vec::print_vec(self.name_table.print, names).
+
This could also synergize well with compile-time evaluation.
+If (as is the common case), the value of the implicit function table is known at compile time, no table needs to be passed in at runtime (and we don’t have to repeatedly evaluate the table itself).
+We can even compile-time partially evaluate things within the compilation unit, and use runtime parameters at the module boundaries, just like Swift does.
+
And that’s basically it!
+TL;DR: value oriented programming / mutable value semantics is an interesting “everything is X” approach to get the benefits of functional purity without giving up on mutable hash tables.
+This style of programming doesn’t work with cyclic data structures (values are always trees), so indexes are often used to express auxiliary relations.
+This, however, gets in the way of type-based generic programming — a T is no longer Comparable, only T + Context is.
+A potential fix for that is to base generic programming on explicit dictionary passing combined with implicit value parameter inference.
I already have a dedicated post about a hypothetical Zig language server.
+But perhaps the most important thing I’ve written so far on the topic is the short note at the end of Zig and Rust.
+
If you want to implement an LSP for a language, you need to start with a data model.
+If you correctly implement a store of source code which evolves over time and allows computing (initially trivial) derived data, then filling in the data until it covers the whole language is a question of incremental improvement.
+If, however, you don’t start with a rock-solid data model, and rush to implement language features, you might find yourself needing to make a sharp U-turn several years down the road.
+
I find this pretty insightful!
+At least, this evening I’ve been pondering a particular aspect of the data model, and I think I realized something new about the problem space!
+The aspect is cancellation.
Consider this.
+Your language server is happily doing something very useful and computationally-intensive —
+typechecking a giant typechecker,
+computing comptime Ackermann function,
+or talking to Postgres.
+Now, the user comes in and starts typing in the very file the server is currently processing.
+What is the desired behavior, and how could it be achieved?
+
One useful model here is strong consistency.
+If the language server acknowledged a source code edit, all future semantic requests (like “go to definition” or “code completion”) reflect this change.
+The behavior is as if all changes and requests are sequentially ordered, and the server fully processes all preceding edits before responding to a request.
+There are two great benefits to this model.
+First, for the implementor it’s an easy model to reason about: it’s always clear what the answer to a particular request should be, as the model is fully deterministic.
+Second, the model gives maximally useful guarantees to the user: strict serializability.
+
So consider this sequence of events:
+
+
+User types fo.
+
+
+The editor sends the edit to the language server.
+
+
+The editor requests completions for fo.
+
+
+The server starts furiously typechecking modified file to compute the result.
+
+
+User types o.
+
+
+The editor sends the o.
+
+
+The editor re-requests completions, now for foo.
+
+
+
How does the server deal with this?
+
The trivial solution is to run everything sequentially to completion.
+So, on step 6, the server doesn’t immediately acknowledge the edit, but rather blocks until it fully completes step 4.
+This is a suboptimal behavior, because reads (computing completion) block writes (updating source code).
+As a rule of thumb, writes should be prioritized over reads, because they reflect more up-to-date and more useful data.
+
A more optimal solution is to make the whole data model of the server immutable, such that edits do not modify data inplace, but rather create a separate, new state.
+In this model, computing results for requests 3 and 7 proceeds in parallel, and, crucially, the edit from step 6 is accepted immediately.
+The cost of this model is the requirement that all data structures are immutable.
+It also is a bit wasteful — burning CPU to compute code completion for an already old file is useless, better dedicate all cores to the latest version.
+
A third approach is cancellation.
+On step 6, when the server becomes aware of the pending edit, it actively cancels all in-flight work pertaining to the old state and then applies the modification in-place.
+That way we don’t need to defensively copy the data, and also avoid useless CPU work.
+This is the strategy employed by rust-analyzer.
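rust-analyzer's actual machinery is more elaborate, but the core of the idea can be sketched with a shared flag that in-flight work checks cooperatively (all names here are hypothetical):

```rust
use std::sync::Arc;
use std::sync::atomic::{AtomicBool, Ordering};

struct Cancelled;

#[derive(Clone, Default)]
struct CancellationToken(Arc<AtomicBool>);

impl CancellationToken {
    fn cancel(&self) {
        self.0.store(true, Ordering::Relaxed);
    }
    fn check(&self) -> Result<(), Cancelled> {
        if self.0.load(Ordering::Relaxed) { Err(Cancelled) } else { Ok(()) }
    }
}

// Long-running analysis sprinkles `check` calls and unwinds promptly when
// an edit arrives; only then does the server mutate the state in place.
fn typecheck(token: &CancellationToken) -> Result<(), Cancelled> {
    for _item in 0..1_000_000 {
        token.check()?;
        // ... analyze one item ...
    }
    Ok(())
}
```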
+
It’s useful to think about why the server can’t just, like, apply the edit in place completely ignoring any possible background work.
+The edit ultimately changes some memory somewhere, which might be concurrently read by the code completion thread, yielding a data race and full-on UB.
+It is possible to work-around this by applying feral concurrency control and just wrapping each individual bit of data in a mutex.
+This removes the data race, but leads to excessive synchronization, sprawling complexity and broken logical invariants (function body might change in the middle of typechecking).
+
Finally, there’s one more solution, or rather, an idea for a solution.
+One interesting approach for dealing with memory which is needed now, but not in the future, is semi-space garbage collection.
+We divide the available memory in two equal parts, use one half as a working copy which accumulates useful objects and garbage, and then at some point switch the halves, copying the live objects (but not the garbage) over.
+Another place where this idea comes up is Carmack’s architecture for functional games.
+On every frame, a game copies over the game state applying frame update function.
+Because frames happen sequentially, you only need two copies of game state for this.
+We can think about applying something like that for cancellation — without going for full immutability, we can let the cancelled analysis work with the old half-state, while we switch to the new one.
+
This … is not particularly actionable, but a good set of ideas to start thinking about evolution of a state in a language server.
+And now for something completely different!
Strict consistency is a good default, and works especially well for languages with good support for separate compilation, as the amount of work a language server needs to do after an update is proportional to the size of the update, and to the amount of code on the screen, both of which are typically O(1).
+For Zig, whose compilation model is “start from the entry point and lazily compile everything that’s actually used”, this might be difficult to pull off.
+It seems that Zig naturally gravitates to a smalltalk-like image-based programming model, where the server stores fully resolved code all the time, and, if some edit triggers re-analysis of a huge chunk of code, the user just has to wait until the server catches up.
+
But what if we don’t do strong consistency?
+What if we allow the IDE to temporarily return non-deterministic and wrong results?
+I think we can get some nice properties in exchange, if we use that semi-space idea.
+
The state of our language server would be comprised of three separate pieces of data:
+
+
+A fully analyzed snapshot of the world, ready.
+This is a bunch of source files, plus their ASTs, ZIRs and AIRs.
+This also probably contains an index of cross-references, so that finding all usages of an identifier requires just listing already precomputed results.
+
+
+The next snapshot, which is being analyzed, working.
+This is essentially the same data, but the AIR is being constructed.
+We need two snapshots because we want to be able to query one of them while the second one is being updated.
+
+
+Finally, we also hold ASTs for the files which are currently being modified, pending.
+
+
+
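In Rust-ish pseudocode, the state might be shaped like this (all types are placeholders, used only to pin down the structure):

```rust
use std::collections::HashMap;
use std::sync::Arc;

struct Snapshot;        // sources, ASTs, ZIR/AIR, cross-reference index
struct WorkingSnapshot; // same data, with AIR still under construction
struct Ast;
type FileId = u32;

struct ServerState {
    /// Fully analyzed, inert snapshot: safe to read from any thread.
    ready: Option<Arc<Snapshot>>,
    /// At most one background analysis in flight.
    working: Option<WorkingSnapshot>,
    /// Per-file ASTs, updated synchronously on every edit.
    pending: HashMap<FileId, Ast>,
}
```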
The overall evolution of data is as follows.
+
All edits synchronously go to the pending state.
+pending is organized strictly on a per-file basis, so updating it can be done quickly on the main thread (maaaybe we want to move the parsing off the main thread, but my gut feeling is that we don’t need to).
+pending always reflects the latest state of the world, it is the latest state of the world.
+
Periodically, we collect a batch of changes from pending, create a new working and kick off a full analysis in background.
+A good point to do that would be when there are no syntax errors, or when the user saves a file.
+There’s at most one analysis in progress, so we accumulate changes in pending until the previous analysis finishes.
+
When working is fully processed, we atomically update the ready.
+As ready is just an inert piece of data, it can be safely accessed from whatever thread.
+
When processing requests, we only use ready and pending.
+Processing requires some heuristics.
+ready and pending describe different states of the world.
+pending guarantees that its state is up-to-date, but it only has AST-level data.
+ready is outdated, but it has every bit of semantic information pre-computed.
+In particular, it includes cross-reference data.
+
So, our choices for computing results are:
+
+
+
Use the pending AST.
+Features like displaying the outline of the current file or globally fuzzy-searching function by name can be implemented like this.
+These features always give correct results.
+
+
+
Find the match between the pending AST and the ready semantics.
+This works perfectly for non-local “goto definition”.
+Here, we can temporarily get “wrong” results, or no result at all.
+However, the results we get are always instant.
+
+
+
Re-analyze pending AST using results from ready for the analysis of the context.
+This is what we’ll use for code completion.
+For code completion, pending will be maximally diverging from ready (especially if we use “no syntax errors” as a heuristic for promoting pending to working),
+so we won’t be able to complete based purely on ready.
+At the same time, completion is heavily semantics-dependent, so we won’t be able to drive it through pending.
+And we also can’t launch full semantic analysis on pending (what we effectively do in rust-analyzer), due to the “from the root” nature of the analysis.
+
But we can merge two analysis techniques.
+For example, if we are completing in a function which starts as fn f(comptime T: type, param: T),
+we can use ready to get a set of values of T the function is actually called with, to complete param. in a useful way.
+Dually, if inside f we have something like const list = std.ArrayList(u32){}, we don’t have to comptime evaluate the ArrayList function, we can fetch the result from ready.
+
Of course, we must also handle the case where there’s no ready yet (it’s a first compilation, or we switched branches), so completion would be somewhat non-deterministic.
+
+
+
One important flow where non-determinism would get in the way is refactoring.
+When you rename something, you should be 100% sure that you’ve found all usages.
+So, any refactor would have to be a blocking operation where we first wait for the current working to complete, then update working with the pending accumulated so far, and wait for that to complete, to, finally, apply the refactor using only up-to-date ready.
+Luckily, refactoring is almost always a two-phase flow, reminiscent of a GET/POST flow for HTTP form (more about that).
+Any refactor starts with read-only analysis to inform the user about available options and to gather input.
+For “rename”, you wait for the user to type the new name, for “change signature” the user needs to rearrange params.
+This brief interactive window should give enough headroom to flush all pending changes, masking the latency.
+
I am pretty excited about this setup.
+I think that’s the way to go for Zig.
+
+
+The approach meshes extremely well with the ambition of doing incremental binary patching, both because it leans on complete global analysis, and because it contains an explicit notion of switching from one snapshot to the next one
+(in contrast, rust-analyzer never really thinks about “previous” state of the code. There’s always only the “current” state, with lazy, partially complete analysis).
+
+
+Zig lacks declared interfaces, so a quick “find all calls to this function” operation is required for useful completion.
+Fully resolved historical snapshot gives us just that.
+
+
+Zig is carefully designed to make a lot of semantic information obvious just from the syntax.
+Unlike Rust, Zig lacks syntactic macros or glob imports.
+This makes it possible to do a lot of analysis correctly using only pending ASTs.
+
+
+This approach nicely dodges the cancellation problem I’ve spent half of the blog post explaining, and has a relatively simple threading story, which reduces implementation complexity.
+
+
+Finally, it feels like it should be super fast (if not the most CPU efficient).
+
In this tutorial, I will explain a particular approach to parsing, which gracefully handles syntax errors and is thus suitable for language servers, which, by their nature, have to handle incomplete and invalid code.
+Explaining the problem and the solution requires a somewhat non-trivial worked example, and I want to share a couple of tricks not directly related to resilience, so the tutorial builds a full, self-contained parser, instead of explaining abstractly just the resilience.
+
The tutorial is descriptive, rather than prescriptive — it tells you what you can do, not what you should do.
+
+
+If you are looking into building a production grade language server, treat it as a library of ideas, not as a blueprint.
+
+
+If you want to get something working quickly, I think today the best answer is “just use Tree-sitter”, so you’d better read its docs rather than this tutorial.
+
+
+If you are building an IDE-grade parser from scratch, then techniques presented here might be directly applicable.
+
Let’s look at one motivational example for resilient parsing:
+
+
+
Here, a user is in the process of defining the fib_rec helper function.
+For a language server, it’s important that the incompleteness doesn’t get in the way.
+In particular:
+
+
+
The following function, fib, should be parsed without any errors such that syntax and semantic highlighting is not disturbed, and all calls to fib elsewhere typecheck correctly.
+
+
+
The fib_rec function itself should be recognized as a partially complete function, so that various language server assists can help complete it correctly.
+
+
+
In particular, a smart language server can actually infer the expected type of fib_rec from a call we already have, and suggest completing the whole prototype.
+rust-analyzer doesn’t do that today, but one day it should.
+
+
+
Generalizing this example, what we want from our parser is to recognize as much of the syntactic structure as feasible.
+It should be able to localize errors — a mistake in a function generally should not interfere with parsing unrelated functions.
+As the code is read and written left-to-right, the parser should also recognize valid partial prefixes of various syntactic constructs.
+
Academic literature suggests another lens to use when looking at this problem: error recovery.
+Rather than just recognizing incomplete constructs, the parser can attempt to guess a minimal edit which completes the construct and gets rid of the syntax error.
+From this angle, the above example would look rather like fn fib_rec(f1: u32, /* ) {} */ , where the stuff in a comment is automatically inserted by the parser.
+
Resilience is a more fruitful framing to use for a language server — incomplete code is the ground truth, and only the user knows how to correctly complete it.
+A language server can only offer guesses and suggestions, and they are more precise if they employ post-parsing semantic information.
+
Error recovery might work better when emitting understandable syntax errors, but, in a language server, the importance of clear error messages for syntax errors is relatively lower, as highlighting such errors right in the editor synchronously with typing usually provides tighter, more useful tacit feedback.
The classic approach for handling parser errors is to explicitly encode error productions and synchronization tokens into the language grammar.
+This approach isn’t a natural fit for resilience framing — you don’t want to anticipate every possible error, as there are just too many possibilities.
+Rather, you want to recover as much of a valid syntax tree as possible, and more or less ignore arbitrary invalid parts.
+
Tree-sitter does something more interesting.
+It is a GLR parser, meaning that it non-deterministically tries many possible LR (bottom-up) parses, and looks for the best one.
+This allows Tree-sitter to recognize many complete valid small fragments of a tree, but it might have trouble assembling them into incomplete larger fragments.
+In our example fn fib_rec(f1: u32, , Tree-sitter correctly recognizes f1: u32 as a formal parameter, but doesn’t recognize fib_rec as a function.
+
Top-down (LL) parsing paradigm makes it harder to recognize valid small fragments, but naturally allows for incomplete large nodes.
+Because code is written top-down and left-to-right, LL seems to have an advantage for typical patterns of incomplete code.
+Moreover, there isn’t really anything special you need to do to make LL parsing resilient.
+You sort of… just not crash on the first error, and everything else more or less just works.
+
Details are fiddly though, so, in the rest of the post, we will write a complete implementation of a hand-written recursive descent + Pratt resilient parser.
For lack of imagination on my side, the toy language we will be parsing is called L.
+It is a subset of Rust, which has just enough features to make some syntax mistakes.
+Here’s Fibonacci:
+
+
+
Note that there’s no base case, because L doesn’t have syntax for if.
+Here’s the syntax it does have, as an ungrammar:
+
+
+
The meta syntax here is similar to BNF, with two important differences:
+
+
+the notation is better specified and more familiar (recursive regular expressions),
+
+
+it describes syntax trees, rather than strings (sequences of tokens).
+
+
+
Single quotes signify terminals: 'fn' and 'return' are keywords, 'name' stands for any identifier token, like foo, and '(' is punctuation.
+Unquoted names are non-terminals. For example, x: i32, would be an example of Param.
+Unquoted punctuation denotes meta symbols of ungrammar itself, with semantics identical to regular expressions: zero-or-more repetition is *, zero-or-one is ?, | is alternation, and () is used for grouping.
+
The grammar doesn’t nail the syntax precisely. For example, the rule for Param, Param = 'name' ':' Type ','? , says that Param syntax node has an optional comma, but there’s nothing in the above ungrammar specifying whether the trailing commas are allowed.
+
Overall, L has very little to it — a program is a series of function declarations, each function has a body which is a sequence of statements, the set of expressions is spartan, not even an if. Still, it’ll take us some time to parse all that.
+But you can already try the end result in the text-box below.
+The syntax tree is updated automatically on typing.
+Do make mistakes to see how a partial tree is recovered.
A traditional AST for L might look roughly like this:
+
+
+
Extending this structure to be resilient is non-trivial. There are two problems: trivia and errors.
+
For resilient parsing, we want the AST to contain every detail about the source text.
+We actually don’t want to use an abstract syntax tree, and need a concrete one.
+In a traditional AST, the tree structure is rigidly defined — any syntax node has a fixed number of children.
+But there can be any number of comments and whitespace anywhere in the tree, and making space for them in the structure requires some fiddly data manipulation.
+Similarly, errors (e.g., unexpected tokens) can appear anywhere in the tree.
+
One trick to handle these in the AST paradigm is to attach trivia and error tokens to other tokens.
+That is, for something like
+fn /* name of the function -> */ f() {} ,
+the fn and f tokens would be explicit parts of the AST, while the comment and surrounding whitespace would belong to the collection of trivia tokens hanging off the fn token.
+
+One complication here is that it’s not always just tokens that can appear anywhere; sometimes you can have full trees like that.
+For example, comments might support markdown syntax, and you might actually want to parse that properly (e.g., to resolve links to declarations).
+Syntax errors can also span whole subtrees.
+For example, when parsing pub(crate) nope in Rust, it would be smart to parse pub(crate) as a visibility modifier, and nest it into a bigger Error node.
+
SwiftSyntax meticulously adds error placeholders between any two fields of an AST node, giving rise to
+unexpectedBetweenModifiersAndDeinitKeyword
+and such (source, docs).
+
An alternative approach, used by IntelliJ and rust-analyzer, is to treat the syntax tree as a somewhat dynamically-typed data structure:
+
+
+
This structure does not enforce any constraints on the shape of the syntax tree at all, and so it naturally accommodates errors anywhere.
+It is possible to layer a well-typed API on top of this dynamic foundation.
+An extra benefit of this representation is that you can use the same tree type for different languages; this is a requirement for universal tools.
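Concretely, the tree type might look like this (a sketch along the lines of what rowan and IntelliJ do, minus all the performance tricks; TokenKind and TreeKind are plain enums, listed a bit later):

```rust
struct Token {
    kind: TokenKind,
    text: String,
}

struct Tree {
    kind: TreeKind,
    children: Vec<Child>,
}

enum Child {
    Token(Token),
    Tree(Tree),
}
```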
+
Discussing specifics of syntax tree representation goes beyond this article, as the topic is vast and lacks a clear winning solution.
+To learn about it, take a look at Roslyn, SwiftSyntax, rowan and IntelliJ.
+
To simplify things, we’ll ignore comments and whitespace, though you’ll absolutely want those in a real implementation.
+One approach would be to do the parsing without comments, like we do here, and then attach comments to the nodes in a separate pass.
+Attaching comments needs some heuristics — for example, non-doc comments generally want to be a part of the following syntax node.
+
Another design choice is handling of error messages.
+One approach is to treat error messages as properties of the syntax tree itself, by either inferring them from the tree structure, or just storing them inline.
+Alternatively, errors can be considered to be a side-effect of the parsing process (that way, trees constructed manually during, eg, refactors, won’t carry any error messages, even if they are invalid).
+
Here’s the full set of token and tree kinds for our language L:
+
+
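A plausible version of these enums, reconstructed from the grammar described above (treat the exact list as illustrative):

```rust
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum TokenKind {
    ErrorToken, Eof,

    LParen, RParen, LCurly, RCurly,
    Eq, Semi, Comma, Colon, Arrow,
    Plus, Minus, Star, Slash,

    FnKeyword, LetKeyword, ReturnKeyword, TrueKeyword, FalseKeyword,

    Name, Int,
}

#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum TreeKind {
    ErrorTree,
    File, Fn,
    TypeExpr,
    ParamList, Param,
    Block, StmtLet, StmtReturn, StmtExpr,
    ExprLiteral, ExprName, ExprParen, ExprBinary, ExprCall,
    ArgList, Arg,
}
```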
+
Things to note:
+
+
+explicit Error kinds;
+
+
+no whitespace or comments, as an unrealistic simplification;
+
+
+Eof virtual token simplifies parsing, removing the need to handle Option<Token>;
+
+
+punctuators are named after what they are, rather than after what they usually mean: Star, rather than Mult;
+
+
+a good set of names for the various kinds of braces is {L,R}{Paren,Curly,Brack,Angle}.
+
We won’t be covering the lexer here; let’s just say we have a fn lex(text: &str) -> Vec<Token> function. Two points worth mentioning:
+
+
+The lexer itself should be resilient, but that’s easy — produce an Error token for anything which isn’t a valid token.
+
+
+Writing a lexer by hand is somewhat tedious, but is very simple relative to everything else.
+If you are stuck in analysis-paralysis picking a lexer generator, consider cutting the Gordian knot and writing the lexer by hand.
+
With homogeneous syntax trees, the task of parsing admits an elegant formalization — we want to insert extra parentheses into a stream of tokens.
+
+
+
Note how the sequence of tokens with extra parentheses is still a flat sequence.
+The parsing will be two-phase:
+
+
+in the first phase, the parser emits a flat list of events,
+
+
+in the second phase, the list is converted to a tree.
+
+
+
Here’s the basic setup for the parser:
+
+
+
+
+
open, advance, and close form the basis for constructing the stream of events.
+
+
+
Note how kind is stored in the Open event, but is supplied with the close method.
+This is required for flexibility — sometimes it’s possible to decide on the type of syntax node only after it is parsed.
+The way this works is that the open method returns a Mark which is subsequently passed to close to modify the corresponding Open event.
+
+
+
There’s a set of short, convenient methods to navigate through the sequence of tokens:
+
+
+nth is the lookahead method. Note how it doesn’t return an Option, and uses Eof special value for “out of bounds” indexes.
+This simplifies the call-site: “no more tokens” and “token of a wrong kind” are always handled the same.
+
+
+at is a convenient specialization to check for a specific next token.
+
+
+eat is at combined with consuming the next token.
+
+
+expect is eat combined with error reporting.
+
+
+
These methods are not a very orthogonal basis, but they are a convenient basis for parsing.
+Finally, advance_with_error advances over any token, but also wraps it into an error node.
+
+
+
When writing parsers by hand, it’s very easy to accidentally write the code which loops or recurses forever.
+To simplify debugging, it’s helpful to add an explicit notion of “fuel”, which is replenished every time the parser makes progress,
+and is spent every time it does not.
+
+
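Putting the notes above together, here is a minimal sketch of this machinery (simplified relative to any real implementation; error reporting is stubbed out with eprintln):

```rust
use std::cell::Cell;

enum Event {
    Open { kind: TreeKind },
    Close,
    Advance,
}

struct MarkOpened {
    index: usize,
}

struct Parser {
    tokens: Vec<Token>,
    pos: usize,
    fuel: Cell<u32>,
    events: Vec<Event>,
}

impl Parser {
    fn open(&mut self) -> MarkOpened {
        let mark = MarkOpened { index: self.events.len() };
        // The kind is a placeholder; `close` patches in the real one.
        self.events.push(Event::Open { kind: TreeKind::ErrorTree });
        mark
    }

    fn close(&mut self, m: MarkOpened, kind: TreeKind) {
        self.events[m.index] = Event::Open { kind };
        self.events.push(Event::Close);
    }

    fn advance(&mut self) {
        assert!(!self.eof());
        self.fuel.set(256); // progress made: replenish fuel (256 is arbitrary)
        self.events.push(Event::Advance);
        self.pos += 1;
    }

    fn eof(&self) -> bool {
        self.pos == self.tokens.len()
    }

    fn nth(&self, lookahead: usize) -> TokenKind {
        if self.fuel.get() == 0 {
            panic!("parser is stuck")
        }
        self.fuel.set(self.fuel.get() - 1); // no progress: spend fuel
        self.tokens
            .get(self.pos + lookahead)
            .map_or(TokenKind::Eof, |it| it.kind)
    }

    fn at(&self, kind: TokenKind) -> bool {
        self.nth(0) == kind
    }

    fn eat(&mut self, kind: TokenKind) -> bool {
        if self.at(kind) {
            self.advance();
            true
        } else {
            false
        }
    }

    fn expect(&mut self, kind: TokenKind) {
        if self.eat(kind) {
            return;
        }
        eprintln!("expected {kind:?}"); // error reporting is a stub here
    }

    fn advance_with_error(&mut self, error: &str) {
        let m = self.open();
        eprintln!("{error}");
        self.advance();
        self.close(m, TreeKind::ErrorTree);
    }
}
```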
+
The function to transform a flat list of events into a tree is a bit involved.
+It juggles three things: an iterator of events, an iterator of tokens, and a stack of partially constructed nodes (we expect the stack to contain just one node at the end).
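One way to write it (a sketch; the invariants are enforced with asserts):

```rust
fn build_tree(tokens: Vec<Token>, mut events: Vec<Event>) -> Tree {
    let mut tokens = tokens.into_iter();
    let mut stack: Vec<Tree> = Vec::new();

    // Remove the very last `Close`, so that the root tree stays on the
    // stack when the loop finishes.
    assert!(matches!(events.pop(), Some(Event::Close)));

    for event in events {
        match event {
            // Start a new node: push an empty tree onto the stack.
            Event::Open { kind } => {
                stack.push(Tree { kind, children: Vec::new() })
            }
            // A node is finished: pop it and attach it to its parent.
            Event::Close => {
                let tree = stack.pop().unwrap();
                stack.last_mut().unwrap().children.push(Child::Tree(tree));
            }
            // Attach the next token to the current node.
            Event::Advance => {
                let token = tokens.next().unwrap();
                stack.last_mut().unwrap().children.push(Child::Token(token));
            }
        }
    }

    // The parser guarantees a single root covering all of the tokens.
    assert!(tokens.next().is_none());
    let tree = stack.pop().unwrap();
    assert!(stack.is_empty());
    tree
}
```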
We are finally getting to the actual topic of resilient parsing.
+Now we will write a full grammar for L as a sequence of functions.
+Usually both atomic parser operations, like fn advance, and grammar productions, like fn parse_fn are implemented as methods on the Parser struct.
+I prefer to separate the two and to use free functions for the latter category, as the code is a bit more readable that way.
+
Let’s start with parsing the top level.
+
+
+
+
+
Wrap the whole thing into a File node.
+
+
+
Use the while loop to parse a file as a series of functions.
+Importantly, the entirety of the file is parsed; we break out of the loop only when the eof is reached.
+
+
+
To not get stuck in this loop, it’s crucial that every iteration consumes at least one token.
+If the token is fn, we’ll parse at least a part of a function.
+Otherwise, we consume the token and wrap it into an error node.
+
+
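Putting those three points together, the top-level function might look like this (a sketch, building on the parser above):

```rust
// File = Fn*
fn file(p: &mut Parser) {
    let m = p.open();

    while !p.eof() {
        if p.at(TokenKind::FnKeyword) {
            func(p);
        } else {
            p.advance_with_error("expected a function");
        }
    }

    p.close(m, TreeKind::File);
}
```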
+
Let’s parse functions now:
+
+
+
+
+
When parsing a function, we assert that the current token is fn.
+There’s some duplication with the if p.at(FnKeyword) check at the call-site, but this duplication actually helps readability.
+
+
+
Again, we surround the body of the function with open/close pair.
+
+
+
Although parameter list and function body are mandatory, we precede them with an at check.
+We can still report the syntax error by analyzing the structure of the syntax tree (or we can report it as a side effect of parsing in the else branch if we want).
+It wouldn’t be wrong to just remove the if altogether and try to parse param_list unconditionally, but the if helps with reducing cascading errors.
+
+
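In code, something like this (a sketch; the optional return type is an assumption about L's full grammar):

```rust
// Fn = 'fn' 'name' ParamList ('->' TypeExpr)? Block
fn func(p: &mut Parser) {
    assert!(p.at(TokenKind::FnKeyword));
    let m = p.open();

    p.expect(TokenKind::FnKeyword);
    p.expect(TokenKind::Name);
    if p.at(TokenKind::LParen) {
        param_list(p);
    }
    // Assuming the grammar allows an optional return type.
    if p.eat(TokenKind::Arrow) {
        type_expr(p);
    }
    if p.at(TokenKind::LCurly) {
        block(p);
    }

    p.close(m, TreeKind::Fn);
}
```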
+
Now, the list of parameters:
+
+
+
+
+Inside, we have a standard code shape for parsing a bracketed list.
+It can be extracted into a higher-order function, but typing out the code manually is not a problem either.
+This bit of code starts and ends with consuming the corresponding parenthesis.
+
+
+In the happy case, we loop until the closing parenthesis.
+However, it could also be the case that there’s no closing parenthesis at all, so we add an eof condition as well.
+Generally, every loop we write would have && !p.eof() tacked on.
+
+
+As with any loop, we need to ensure that each iteration consumes at least one token to not get stuck.
+If the current token is an identifier, everything is ok, as we’ll parse at least some part of the parameter.
+
+
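The corresponding code might look like this (a sketch of the basic, pre-recovery version):

```rust
// ParamList = '(' Param* ')'
fn param_list(p: &mut Parser) {
    assert!(p.at(TokenKind::LParen));
    let m = p.open();

    p.expect(TokenKind::LParen);
    while !p.at(TokenKind::RParen) && !p.eof() {
        if p.at(TokenKind::Name) {
            param(p);
        } else {
            break;
        }
    }
    p.expect(TokenKind::RParen);

    p.close(m, TreeKind::ParamList);
}
```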
+
Parsing parameter is almost nothing new at this point:
+
+
+
+
+This is the only interesting bit.
+To parse a comma-separated list of parameters with a trailing comma, it’s enough to check if the following token after parameter is ).
+This correctly handles all three cases:
+
+
+if the next token is ), we are at the end of the list, and no comma is required;
+
+
+if the next token is ,, we correctly advance past it;
+
+
+finally, if the next token is anything else, then it’s not a ), so we are not at the last element of the list and correctly emit an error.
+
+
+
+
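Here is the whole thing (sketch):

```rust
// Param = 'name' ':' TypeExpr ','?
fn param(p: &mut Parser) {
    assert!(p.at(TokenKind::Name));
    let m = p.open();

    p.expect(TokenKind::Name);
    p.expect(TokenKind::Colon);
    type_expr(p);
    // The trailing-comma trick: require a comma unless `)` comes next.
    if !p.at(TokenKind::RParen) {
        p.expect(TokenKind::Comma);
    }

    p.close(m, TreeKind::Param);
}
```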
+
Parsing types is trivial:
+
+
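A sketch, assuming types in L are just names:

```rust
// TypeExpr = 'name'
fn type_expr(p: &mut Parser) {
    let m = p.open();
    p.expect(TokenKind::Name);
    p.close(m, TreeKind::TypeExpr);
}
```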
+
The notable aspect here is naming.
+The production is deliberately named TypeExpr, rather than Type, to avoid confusion down the line.
+Consider fib(92) .
+It is an expression, which evaluates to a value.
+The same thing happens with types.
+For example, Foo<Int> is not a type yet, it’s an expression which can be “evaluated” (at compile time) to a type (if Foo is a type alias, the result might be something like Pair<Int, Int>).
+
Parsing a block gets a bit more involved:
+
+
+
Block can contain many different kinds of statements, so we branch on the first token in the loop’s body.
+As usual, we need to maintain an invariant that the body consumes at least one token.
+For let and return statements that’s easy, they consume the fixed first token.
+For the expression statement (things like 1 + 1;) it gets more interesting, as an expression can start with many different tokens.
+For the time being, we’ll just kick the can down the road and require stmt_expr to deal with it (that is, to guarantee that at least one token is consumed).
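In code (sketch):

```rust
// Block = '{' Stmt* '}'
fn block(p: &mut Parser) {
    assert!(p.at(TokenKind::LCurly));
    let m = p.open();

    p.expect(TokenKind::LCurly);
    while !p.at(TokenKind::RCurly) && !p.eof() {
        match p.nth(0) {
            TokenKind::LetKeyword => stmt_let(p),
            TokenKind::ReturnKeyword => stmt_return(p),
            // `stmt_expr` promises to consume at least one token.
            _ => stmt_expr(p),
        }
    }
    p.expect(TokenKind::RCurly);

    p.close(m, TreeKind::Block);
}
```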
+
Statements themselves are straightforward:
+
+
+
Again, for stmt_expr, we push “must consume a token” invariant onto expr.
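For reference, the three statement parsers might look like this (sketch):

```rust
// StmtLet = 'let' 'name' '=' Expr ';'
fn stmt_let(p: &mut Parser) {
    assert!(p.at(TokenKind::LetKeyword));
    let m = p.open();

    p.expect(TokenKind::LetKeyword);
    p.expect(TokenKind::Name);
    p.expect(TokenKind::Eq);
    expr(p);
    p.expect(TokenKind::Semi);

    p.close(m, TreeKind::StmtLet);
}

// StmtReturn = 'return' Expr ';'
fn stmt_return(p: &mut Parser) {
    assert!(p.at(TokenKind::ReturnKeyword));
    let m = p.open();

    p.expect(TokenKind::ReturnKeyword);
    expr(p);
    p.expect(TokenKind::Semi);

    p.close(m, TreeKind::StmtReturn);
}

// StmtExpr = Expr ';'
fn stmt_expr(p: &mut Parser) {
    let m = p.open();

    expr(p);
    p.expect(TokenKind::Semi);

    p.close(m, TreeKind::StmtExpr);
}
```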
+
Expressions are tricky.
+They always are.
+For starters, let’s handle just the clearly-delimited cases, like literals and parentheses:
+
+
+
In the catch-all arm, we take care to consume the token, to make sure that the statement loop in block can always make progress.
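A sketch of that first version (the literal token kinds Int, TrueKeyword, and FalseKeyword are assumptions):

```rust
fn expr_delimited(p: &mut Parser) {
    let m = p.open();
    match p.nth(0) {
        TokenKind::Int | TokenKind::TrueKeyword | TokenKind::FalseKeyword => {
            p.advance();
            p.close(m, TreeKind::ExprLiteral);
        }
        TokenKind::Name => {
            p.advance();
            p.close(m, TreeKind::ExprName);
        }
        TokenKind::LParen => {
            p.expect(TokenKind::LParen);
            expr(p);
            p.expect(TokenKind::RParen);
            p.close(m, TreeKind::ExprParen);
        }
        _ => {
            // Catch-all: consume the offending token, so that the
            // statement loop in `block` always makes progress.
            if !p.eof() {
                p.advance();
            }
            p.close(m, TreeKind::ErrorTree);
        }
    }
}
```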
+
Next expression to handle would be ExprCall.
+This requires some preparation.
+Consider this example: f(1)(2) .
+
We want the following parenthesis structure here:
+
+
+
The problem is, when the parser is at f, it doesn’t yet know how many Open events it should emit.
+
We solve the problem by adding an API to go back and inject a new Open event into the middle of existing events.
+
+
+
+
+
Here we adjust close to also return a MarkClosed, such that we can go back and add a new event before it.
+
+
+
The new API. It is like open, but also takes a MarkClosed which carries an index of an Open event in front of which we are to inject a new Open.
+In the current implementation, for simplicity, we just inject into the middle of the vector, which is an O(N) operation worst-case.
+A proper solution here would be to use an index-based linked list.
+That is, open_before can push the new open event to the end of the list, and also mark the old event with a pointer to the freshly inserted one.
+To store a pointer, an extra field is needed:
+
+
+
The loop in build_tree needs to follow the open_before links.
+
+
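In the simple flavor (inserting into the middle of the vector), the adjustment might look like this sketch; close replaces the earlier version:

```rust
struct MarkClosed {
    index: usize,
}

impl Parser {
    fn close(&mut self, m: MarkOpened, kind: TreeKind) -> MarkClosed {
        self.events[m.index] = Event::Open { kind };
        self.events.push(Event::Close);
        MarkClosed { index: m.index }
    }

    fn open_before(&mut self, m: MarkClosed) -> MarkOpened {
        let mark = MarkOpened { index: m.index };
        // Simple variant: splice a placeholder `Open` right before the
        // existing one. O(N) in the worst case; the linked-list trick
        // described above avoids this.
        self.events.insert(
            m.index,
            Event::Open { kind: TreeKind::ErrorTree },
        );
        mark
    }
}
```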
+
With this new API, we can parse function calls:
+
+
+
+
+
expr_delimited now returns a MarkClosed rather than ().
+No code changes are required for this, as close calls are already in the tail position.
+
+
+
To parse function calls, we check whether we are at ( and use open_before API if that is the case.
+
+
+
Parsing argument list should be routine by now.
+Again, as an expression can start with many different tokens, we don’t add an if p.at check to the loop’s body, and require arg to consume at least one token.
+
+
+
Inside arg, we use an already familiar construct to parse an optionally trailing comma.
+
+
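Putting it together (a sketch; per the note above, expr_delimited now returns the MarkClosed produced by its close calls, and this expr is superseded by the Pratt version shortly):

```rust
fn expr(p: &mut Parser) {
    let mut lhs = expr_delimited(p);

    // ExprCall = Expr ArgList
    while p.at(TokenKind::LParen) {
        let m = p.open_before(lhs);
        arg_list(p);
        lhs = p.close(m, TreeKind::ExprCall);
    }
}

// ArgList = '(' Arg* ')'
fn arg_list(p: &mut Parser) {
    assert!(p.at(TokenKind::LParen));
    let m = p.open();

    p.expect(TokenKind::LParen);
    while !p.at(TokenKind::RParen) && !p.eof() {
        arg(p);
    }
    p.expect(TokenKind::RParen);

    p.close(m, TreeKind::ArgList);
}

// Arg = Expr ','?
fn arg(p: &mut Parser) {
    let m = p.open();

    expr(p);
    if !p.at(TokenKind::RParen) {
        p.expect(TokenKind::Comma);
    }

    p.close(m, TreeKind::Arg);
}
```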
+
Now only binary expressions are left.
+We will use a Pratt parser for those.
+This is genuinely tricky code, so I have a dedicated article explaining how it all works:
Here, I’ll just dump a pageful of code without much explanation:
+
+
+
+
+
In this version of the Pratt parser, rather than passing numerical precedence, I pass the actual token (learned that from jamii’s post).
+So, to determine whether to break or recur in the Pratt loop, we ask which of the two tokens binds tighter and act accordingly.
+
+
+
When we start parsing an expression, we don’t have an operator to the left yet, so I just pass Eof as a dummy token.
+
+
+
The code naturally handles the case when the next token is not an operator (that is, when expression is complete, or when there’s some syntax error).
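For completeness, here is a sketch of what that pageful might look like, following the description above (the operator set and the two precedence levels are assumptions):

```rust
fn expr(p: &mut Parser) {
    expr_rec(p, TokenKind::Eof);
}

fn expr_rec(p: &mut Parser, left: TokenKind) {
    let mut lhs = expr_delimited(p);

    while p.at(TokenKind::LParen) {
        let m = p.open_before(lhs);
        arg_list(p);
        lhs = p.close(m, TreeKind::ExprCall);
    }

    loop {
        let right = p.nth(0);
        if right_binds_tighter(left, right) {
            let m = p.open_before(lhs);
            p.advance();
            expr_rec(p, right);
            lhs = p.close(m, TreeKind::ExprBinary);
        } else {
            break;
        }
    }
}

fn right_binds_tighter(left: TokenKind, right: TokenKind) -> bool {
    fn tightness(kind: TokenKind) -> Option<usize> {
        [
            // Precedence table, from loosest to tightest binding.
            [TokenKind::Plus, TokenKind::Minus].as_slice(),
            &[TokenKind::Star, TokenKind::Slash],
        ]
        .iter()
        .position(|level| level.contains(&kind))
    }
    let Some(right_tightness) = tightness(right) else {
        // Not an operator at all: the expression is complete, or there
        // is a syntax error.
        return false;
    };
    let Some(left_tightness) = tightness(left) else {
        // No operator to the left: the `Eof` dummy, so the right side wins.
        assert_eq!(left, TokenKind::Eof);
        return true;
    };
    right_tightness > left_tightness
}
```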
Let’s see how resilient our basic parser is.
+Let’s check our motivational example:
+
+
+
Here, the syntax tree our parser produces is surprisingly exactly what we want:
+
+
+
For the first incomplete function, we get Fn, Param and ParamList, as we should.
+The second function is parsed without any errors.
+
+Curiously, we get this great result without much explicit effort to make parsing resilient; it’s a natural outcome of just not failing in the presence of errors.
+The following ingredients help us:
+
+
+homogeneous syntax tree supports arbitrary malformed code,
+
+
+any syntactic construct is parsed left-to-right, and valid prefixes are always recognized,
+
+
+our top-level loop in file is greedy: it either parses a function, or skips a single token and tries to parse a function again.
+That way, if there’s a valid function somewhere, it will be recognized.
+
+
+
Thinking about the last case both reveals the limitations of our current code, and shows avenues for improvement.
+In general, parsing works as a series of nested loops:
+
+
+
If something goes wrong inside a loop, our choices are:
+
+
+skip a token, and continue with the next iteration of the current loop,
+
+
+break out of the inner loop, and let the outer loop handle recovery.
+
+
+
The top-most loop must use the “skip a token” solution, because it needs to consume all of the input tokens.
Right now, each loop either always skips, or always breaks.
+This is not optimal.
+Consider this example:
+
+
+
Here, for f1 we want to break out of param_list loop, and our code does just that.
+For f2 though, the error is a duplicated comma (the user will add a new parameter between x and z shortly), so we want to skip here.
+We don’t, and, as a result, the syntax tree for f2 is a train wreck:
+
+
+
For parameters, it is reasonable to skip tokens until we see something which implies the end of the parameter list.
+For example, if we are parsing a list of parameters and see an fn token, then we’d better stop.
+If we see some less salient token, it’s better to gobble it up.
+Let’s implement the idea:
+
+
+
Here, we use at_any helper function, which is like at, but takes a list of tokens.
+The real implementation would use bitsets for this purpose.
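A sketch of the adjusted param_list (at_any is a straightforward addition to the Parser, and the exact recovery set follows the discussion below):

```rust
impl Parser {
    fn at_any(&self, kinds: &[TokenKind]) -> bool {
        kinds.contains(&self.nth(0))
    }
}

const PARAM_LIST_RECOVERY: &[TokenKind] =
    &[TokenKind::FnKeyword, TokenKind::LCurly];

fn param_list(p: &mut Parser) {
    assert!(p.at(TokenKind::LParen));
    let m = p.open();

    p.expect(TokenKind::LParen);
    while !p.at(TokenKind::RParen) && !p.eof() {
        if p.at(TokenKind::Name) {
            param(p);
        } else {
            // Stop at salient tokens, gobble up everything else.
            if p.at_any(PARAM_LIST_RECOVERY) {
                break;
            }
            p.advance_with_error("expected parameter");
        }
    }
    p.expect(TokenKind::RParen);

    p.close(m, TreeKind::ParamList);
}
```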
+
The example now parses correctly:
+
+
+
What is a reasonable RECOVERY set in a general case?
+I don’t know the answer to this question, but follow sets from formal grammar theory give a good intuition.
+We don’t want exactly the follow set: for ParamList, { is in follow, and we do want it to be a part of the recovery set, but fn is not in follow, and yet it is important to recover on it.
+fn is included because it’s in the follow for Fn, and ParamList is a child of Fn: we also want to recursively include ancestor follow sets into the recovery set.
+
+For expressions and statements, we have the opposite problem — block and arg_list loops eagerly consume erroneous tokens, but sometimes it would be wise to break out of the loop instead.
+
Consider this example:
+
+
+
It gives another train wreck syntax tree, where the g function is completely missed:
+
+
+
Recall that the root cause here is that we require expr to consume at least one token, because it’s not immediately obvious which tokens can start an expression.
+It’s not immediately obvious, but easy to compute — that’s exactly the first set from formal grammars.
+
Using it, we get:
+
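Something along these lines (a sketch; the STMT_RECOVERY set is an assumption, with fn as the obvious member):

```rust
const STMT_RECOVERY: &[TokenKind] = &[TokenKind::FnKeyword];
const EXPR_FIRST: &[TokenKind] = &[
    TokenKind::Int,
    TokenKind::TrueKeyword,
    TokenKind::FalseKeyword,
    TokenKind::Name,
    TokenKind::LParen,
];

fn block(p: &mut Parser) {
    assert!(p.at(TokenKind::LCurly));
    let m = p.open();

    p.expect(TokenKind::LCurly);
    while !p.at(TokenKind::RCurly) && !p.eof() {
        match p.nth(0) {
            TokenKind::LetKeyword => stmt_let(p),
            TokenKind::ReturnKeyword => stmt_return(p),
            _ => {
                if p.at_any(EXPR_FIRST) {
                    stmt_expr(p);
                } else if p.at_any(STMT_RECOVERY) {
                    // Let the outer loop (file) handle recovery.
                    break;
                } else {
                    p.advance_with_error("expected a statement");
                }
            }
        }
    }
    p.expect(TokenKind::RCurly);

    p.close(m, TreeKind::Block);
}
```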
+
+
This fixes the syntax tree:
+
+
+
There’s only one issue left.
+Our expr parsing is still greedy, so, in a case like this
+
+
+
the let will be consumed as a right-hand-side operand of +.
+Now that the callers of expr contain a check for EXPR_FIRST, we no longer need this greediness and can return None if no expression can be parsed:
+
+
+
This gives the following syntax tree:
+
+
+
And this concludes the tutorial!
+You are now capable of implementing an IDE-grade parser for a real programming language from scratch.
+
Summarizing:
+
+
+
Resilient parsing means recovering as much syntactic structure from erroneous code as possible.
+
+
+
+Resilient parsing is important for IDEs and language servers, whose job mostly ends when the code does not have errors anymore.
+
+
+
Resilient parsing is related, but distinct from error recovery and repair.
+Rather than guessing what the user meant to write, the parser tries to make sense of what is actually written.
+
+
+
Academic literature tends to focus on error repair, and mostly ignores pure resilience.
+
+
+
The biggest challenge of resilient parsing is the design of a syntax tree data structure.
+It should provide convenient and type-safe access to well-formed syntax trees, while allowing arbitrary malformed trees.
+
+
+
One possible design here is to make the underlying tree a dynamically-typed data structure (like JSON), and layer typed accessors on top (not covered in this article).
+
+
+
LL style parsers are a good fit for resilient parsing.
+Because code is written left-to-right, it’s important that the parser recognizes well-formed prefixes of incomplete syntactic constructs, and LL does just that.
+
+
+
Ultimately, parsing works as a stack of nested for loops.
+Inside a single for loop, on each iteration, we need to decide between:
+
+
+trying to parse a sequence element,
+
+
+skipping over an unexpected token,
+
+
+breaking out of the nested loop and delegating recovery to the parent loop.
+
+
+
+
+
+first, follow, and recovery sets help make a specific decision.
+
+
+
In any case, if a loop tries to parse an item, item parsing must consume at least one token (if only to report an error).
One of the values of Zig which resonates with me deeply is a mindful approach to dependencies.
+Zig tries hard not to ask too much from the environment, such that, if you get zig version running, you can be reasonably sure that everything else works.
+That’s one of the main motivations for adding an HTTP client to the Zig distribution recently.
+Building software today involves downloading various components from the Internet, and, if Zig wants software built with Zig to be hermetic and self-sufficient, it needs to provide the ability to download files from HTTP servers.
+
There’s one hurdle for self-sufficiency: how do you get Zig in the first place?
+One answer to this question is “from your distribution’s package manager”.
+This is not a very satisfying answer, at least until the language is both post 1.0 and semi-frozen in development.
+And even then, what if your distribution is Windows?
+How many distributions should be covered by “Installing Zig” section of your CONTRIBUTING.md?
+
Another answer would be a version manager, a-la rustup, nvm, or asdf.
+These tools work well, but they are quite complex, and rely on various subtle properties of the environment, like PATH, shell activation scripts and busybox-style multipurpose executable.
+And, well, this also kicks the can down the road — you can use zvm to get Zig, but how do you get zvm?
+
I like how we do this in TigerBeetle.
+We don’t use zig from PATH.
+Instead, we just put the correct version of Zig into ./zig folder in the root of the repository, and run it like this:
+
+
+
Suddenly, whole swaths of complexity go away.
+Quiz time: if you need to add a directory to PATH, which script should be edited so that both the graphical environment and the terminal are affected?
+
Finally, another interesting case study is Gradle.
+Usually Gradle is a negative example, but they do have a good approach for installing Gradle itself.
+The standard pattern is to store two scripts, gradlew and gradlew.bat, which bootstrap the right version of Gradle by downloading a jar file (java itself is not bootstrapped this way though).
+
What all these approaches struggle to overcome is the problem of bootstrapping.
+Generally, if you need to automate anything, you can write a program to do that.
+But you need some pre-existing program runner!
+And there are just no good options out of the box — bash and powershell are passable, but barely, and they are different.
+And “bash” and the set of coreutils also differ depending on the Unix in question.
+But there’s just no good solution here — if you want to bootstrap automatically, you must start with universally available tools.
+
But is there perhaps some scripting language which is shared between Windows and Unix?
+@cspotcode suggests a horrible workaround.
+You can write a script which is both a bash script and a powershell script.
+And it even isn’t too too ugly!
+
+
+
So, here’s an idea for a hermetic Zig version management workflow.
+There’s a canonical, short getzig.ps1 PowerShell/sh script which is vendored verbatim by various projects.
+Running this script downloads an appropriate version of Zig, and puts it into ./zig/zig inside the repository (.gitignore contains /zig).
+Building, testing, and other workflows use ./zig/zig instead of relying on global system state ($PATH).
+
A proof-of-concept getzig.ps1 is at the start of this article.
+Note that I don’t know bash, powershell, and how to download files from the Internet securely, so the above PoC was mostly written by Chat GPT.
+But it seems to work on my machine.
+I clone https://github.com/matklad/hello-getzig and run
+
+
+
on both NixOS and Windows 10, and it prints hello.
+
If anyone wants to make an actual thing out of this idea, here’s possible desiderata:
+
+
+
A single polyglot getzig.sh.ps1 is cute, but using a couple of different scripts wouldn’t be a big problem.
+
+
+
Size of the scripts could be a problem, as they are supposed to be vendored into each repository.
+I’d say 512 lines for combined getzig.sh.ps1 would be a reasonable complexity limit.
+
+
+
The script must “just work” on all four major desktop operating systems: Linux, Mac, Windows, and WSL.
+
+
+
The script should be polymorphic in curl / wget and bash / sh.
+
+
+
It’s ok if it doesn’t work absolutely everywhere — downloading/building Zig manually for an odd platform is also an acceptable workflow.
+
+
+
The script should auto-detect appropriate host platform and architecture.
+
+
+
Zig version should be specified in a separate zig-version.txt file.
+
+
+
After downloading the file, its integrity should be verified.
+For this reason, zig-version.txt should include a hash alongside the version.
+As downloads are different depending on the platform, I think we’ll need some help from Zig upstream here.
+In particular, each published Zig version should include a cross-platform manifest file, which lists hashes and urls of per-platform binaries.
+The hash included into zig-version.txt should be the manifest’s hash.
TL;DR, https://bors.tech delivers a meaningfully better experience, although it suffers from being a third-party integration.
+
Specific grievances:
+
Complexity. This is a vague feeling, but merge queue feels like it is built by complexity merchants — there are a lot of unclear settings and voluminous and byzantine docs.
+Good for allocating extra budget towards build engineering, bad for actual build engineering.
+
GUI-only configuration. Bors is set up using bors.toml in the repository; merge queue is set up by clicking through a web GUI.
+To share config with other maintainers, I resorted to a zoomed-out screenshot of the page.
+
Unclear set of checks. The purpose of the merge queue is to enforce the not-rocket-science rule of software engineering — making sure that the code in the main branch satisfies certain quality invariants (all tests are passing).
+It is impossible to tell what merge queue actually enforces.
+Typically, when you enable merge queue, you subsequently find out that it actually merges anything, without any checks whatsoever.
+
Double latency. One of the biggest benefits of a merge queue for a high velocity project is its asynchrony.
+After submitting a PR, you can do a review and schedule PR to be merged without waiting for CI to finish.
+This is massive: it is a 2X reduction in the human attention required.
+Without queue, you need to look at a PR twice: once to do a review, and once to click merge after the green checkmark is in.
+With the queue, you only need a review, and the green checkmark comes in asynchronously.
+Except that with GitHub merge queue, you can’t actually add a PR to the queue until you get a green checkmark.
+In effect, that’s still 2X attention, and then a PR runs through the same CI checks twice (yes, you can have separate checks for merge queue and PR. No, this is not a good idea, this is complexity and busywork).
+
Lack of delegation. With bors, you can use bors delegate+ to delegate merging of a single, specific pull request to its author.
+This is helpful to drive contributor engagement, and to formalize “LGTM with the nits fixed” approval (which again reduces the number of human round trips).
+
You should still use GitHub merge queue, rather than bors-ng, as that’s now a first-party feature.
+Still, it’s important to understand how things should work, to be able to improve the state of the art some other time.
In this post, we’ll look at how Rust, Go, and Zig express the signature of the cut function — the power tool of string manipulation.
+Cut takes a string and a pattern, and splits the string around the first occurrence of the pattern:
+cut("life", "if") = ("l", "e").
+
At a glance, it seems like a non-orthogonal jumbling together of searching and slicing.
+However, in practice a lot of ad-hoc string processing can be elegantly expressed via cut.
+
A lot of things are key=value pairs, and cut fits perfectly there.
+What’s more, many more complex sequences, like
+--arg=key=value,
+can be viewed as nested pairs.
+You can cut around = once to get --arg and key=value, and then cut the second time to separate key from value.
+
In Rust, this function looks like this:
+
+
+
+Rust’s Option is a good fit for the result type; it clearly describes the behavior of the function when the pattern isn’t found in the string at all.
+Lifetime 'a expresses the relationship between the result and the input — both pieces of result are substrings of &'a self, so, as long as they are used, the original string must be kept alive as well.
+Finally, the separator isn’t another string, but a generic P: Pattern.
+This gives a somewhat crowded signature, but allows using strings, single characters, and even fn(c: char) -> bool functions as patterns.
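In today's standard library this is str::split_once; the commented signature below is approximate (the exact formulation of the Pattern trait has shifted over time), but the behavior check is real:

```rust
// pub fn split_once<'a, P: Pattern<'a>>(&'a self, delimiter: P)
//     -> Option<(&'a str, &'a str)>
fn main() {
    // The pattern can be a &str, a char, or a fn(char) -> bool.
    assert_eq!("life".split_once("if"), Some(("l", "e")));
    assert_eq!("key=value".split_once('='), Some(("key", "value")));
    assert_eq!("life".split_once('z'), None);
}
```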
+
When using the function, there is a multitude of ways to access the result:
+
+
+
Here’s a Go equivalent:
+
+
+
It has a better name!
+It’s important that frequently used building-block functions have short, memorable names, and “cut” is just perfect for what the function does.
+Go doesn’t have an Option, but it allows multiple return values, and any type in Go has a zero value, so a boolean flag can be used to signal None.
+Curiously if the sep is not found in s, after is set to "", but before is set to s (that is, the whole string).
+This is occasionally useful, and corresponds to the last Rust example.
+But it also isn’t something immediately obvious from the signature, it’s an extra detail to keep in mind.
+Which might be fine for a foundational function!
+Similarly to Rust, the resulting strings point to the same memory as s.
+There are no lifetimes, but a potential performance gotcha — if one of the resulting strings is alive, then the entire s can’t be garbage collected.
+
There isn’t much in the way of using the function in Go:
+
+
+
Zig doesn’t yet have an equivalent function in its standard library, but it probably will at some point, and the signature might look like this:
+
+
+
Similarly to Rust, Zig can express optional values.
+Unlike Rust, the option is a built-in, rather than a user-defined type (Zig can express a generic user-defined option, but chooses not to).
+All types in Zig are strictly prefix, so leading ? concisely signals optionality.
+Zig doesn’t have first-class tuple types, but uses very concise and flexible type declaration syntax, so we can return a named tuple.
+Curiously, this anonymous struct is still a nominal, rather than a structural, type!
+Similarly to Rust, prefix and suffix borrow the same memory that s does.
+Unlike Rust, this isn’t expressed in the signature — while in this case it is obvious that the lifetime would be bound to s, rather than sep, there are no type system guardrails here.
+
Because ? is a built-in type, we need some amount of special syntax to handle the result, but it curiously feels less special-case and more versatile than the Rust version.
+
+
+
Moral of the story?
+Work with the grain of the language — expressing the same concept in different languages usually requires a slightly different vocabulary.
I was going to write a long post about designing an IDE-friendly language. I wrote an intro and
+figured that it would make a better, shorter post on its own. Enjoy!
+
The big idea of language server construction is that language servers are not magic — capabilities
+and performance of tooling are constrained by the syntax and semantics of the underlying language.
+If a language is not designed with toolability in mind, some capabilities (e.g., fully automated
+refactors) are impossible to implement correctly. What’s more, an IDE-friendly language turns out to
+be a fast-to-compile language with easy-to-compose libraries!
+
More abstractly, there’s this cluster of properties, unrelated at first sight, but intimately intertwined and
+mutually supportive:
+
+
+parallel, separate compilation,
+
+
+incremental compilation,
+
+
+resilience to errors.
+
+
+
Separate compilation measures how fast we can compile a codebase from scratch if we have an unlimited
+number of CPU cores. For a language server, it solves the cold start problem — time to
+code-completion when the user opens the project for the first time or switches branches. Incremental
+compilation is the steady state of the language server — user types code and expects to see
+immediate effects throughout the project. Resilience to errors is important for two different
+sub-reasons. First, when the user edits the code it is by definition incomplete and erroneous, but a
+language server still must analyze the surrounding context correctly. But the killer feature of
+resilience is that, if you are absolutely immune to some errors, you don’t even have to look at the
+code. If a language server can ignore errors in function bodies, it doesn’t have to look at the
+bodies of functions from dependencies.
+
All three properties, parallelism, incrementality, and resilience, boil down to modularity —
+partitioning the code into disjoint components with well-defined interfaces, such that each
+particular component is aware only about the interfaces of other components.
Let’s do a short drill and observe how the three properties interact at a small scale. Let’s
+minimize the problem of separate compilation to just … lexical analysis. How can we build a
+language that is easier to tokenize for a language server?
+
An unclosed quote is a nasty little problem! Practically, it is rare enough that it doesn’t really
+matter how you handle it, but qualitatively it is illuminating. In a language like Rust, where
+strings can span multiple lines, inserting a " in the middle of a file changes the lexical structure
+of the following text completely (/*, start of a block comment, has the same effect). When tokens
+change, so does the syntax tree and the set of symbols defined by the file. A tiny edit, just one
+symbol, unhinges the semantic structure of the entire compilation unit.
+
Zig solves this problem. In Zig, no token can span several lines. That is, it would be correct to
+first split a Zig source file by \n, and then tokenize each line separately. This is achieved by
+finding better solutions for the problems which usually call for multi-line tokens. Specifically:
+
+
+
there’s a single syntax for comments, //,
+
+
+
double-quoted strings can’t contain a \n,
+
+
+
but there’s a really nice syntax for multiline strings:
+
+
+
+
+
Do you see modules here? Disjoint-partitioning into interface-connected components? From the
+perspective of lexical analysis, each line is a module. And a line always has a trivial, empty
+interface — different lines are completely independent. As a result:
+
First, we can do lexical analysis in parallel. If you have N CPU cores, you can split the file into N
+equal chunks, then in parallel locally adjust chunk boundaries such that they fall on newlines, and
+then tokenize each chunk separately.
+
Second, we have quick incremental tokenization — given a source edit, you determine the set of
+lines affected, and re-tokenize only those. The work is proportional to the size of the edit plus at
+most two boundary lines.
+
Third, any lexical error in a line is isolated just to this line. There’s no unclosed quote
+problem, mistakes are contained.
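A sketch of what this buys, in Rust-ish pseudocode (lex_line is a hypothetical stand-in for a real single-line tokenizer):

```rust
#[derive(Debug)]
struct Token; // kind, text, offsets, ...

// Hypothetical single-line tokenizer: because no token can span a
// newline, it never needs to look at any other line.
fn lex_line(line: &str) -> Vec<Token> {
    line.split_whitespace().map(|_| Token).collect() // stub
}

// Parallelism: lexing the whole file is an embarrassingly parallel map.
fn lex_file(text: &str) -> Vec<Vec<Token>> {
    text.lines().map(lex_line).collect()
}

// Incrementality: after an edit, re-lex only the affected lines.
// Error isolation: a bad token in one line never corrupts another line.
fn relex(tokens: &mut Vec<Vec<Token>>, changed: std::ops::Range<usize>, new_lines: &[&str]) {
    tokens.splice(changed, new_lines.iter().copied().map(lex_line));
}
```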
+
I am by no means saying that line-by-line lexing is a requirement for an IDE-friendly language
+(though it would be nice)! Rather, I want you to marvel how the same underlying structure of the
+problem can be exploited for quarantining errors, reacting to changes quickly, and parallelizing the
+processing.
+
The three properties are just three different faces of modularity in the end!
+
+
I do want to write that “IDE-friendly language” post at some point, but, as a hedge (after all, I
+still owe you “Why LSP Sucks?” one…), here are two comments where I explored the idea somewhat:
+1,
+2.
+
I also recommend these posts, which explore the same underlying phenomenon from the software
+architecture perspective:
People sometimes ask me: “Alex, how do I learn X?”. This article is a compilation of advice I
+usually give. This is “things that worked for me” rather than “the most awesome things on earth”. I
+do consider every item on the list to be fantastic though, and I am forever grateful to people
+putting these resources together.
I don’t think I have any useful advice on how to learn programming from zero. The rest of the post
+assumes that you at least can, given sufficient time, write simple programs. E.g., a program that
+reads a list of integers from an input textual file, sorts them using a quadratic algorithm, and
+writes the result to a different file.
https://projecteuler.net/archives is fantastic. The first 50 problems or so are a perfect “drill”
+to build programming muscle, to go from “I can write a program to sort a list of integers” to “I can
+easily write a program to sort a list of integers”.
+
Later problems are very heavily math based. If you are mathematically inclined, this is perfect —
+you got to solve fun puzzles while also practicing coding. If advanced math isn’t your cup of tea,
+feel free to stop doing problems as soon as it stops being fun.
https://en.wikipedia.org/wiki/Modern_Operating_Systems is fantastic. A version of the
+book was the first
+thick programming related tome I devoured. It gives a big picture of the inner workings of software
+stack, and was a turning point for me personally. After reading this book I realized that I want to
+be a programmer.
https://www.nand2tetris.org is fantastic. It plays a similar “big picture” role as MOS,
+but this time you are the painter. In this course you build a whole computing system yourself,
+starting almost from nothing. It doesn’t teach you how the real software/hardware stack works, but
+it thoroughly dispels any magic, and is extremely fun.
https://cses.fi/problemset/ is fantastic. This is a list of algorithmic problems, which is
+meticulously crafted to cover all the standard topics to a reasonable depth. This is by far the best
+source for practicing algorithms.
https://www.coursera.org/learn/programming-languages is fantastic. This course is a whirlwind tour
+across several paradigms of programming, and makes you really get what programming languages are
+about (and variance).
https://www.tedinski.com/archive/ is fantastic. Work through the whole archive in chronological
+order. This is by far the best resource on “programming in the large”.
Having a great mentor is fantastic, but mentors are not always available. Luckily, programming can
+be mastered without a mentor, if you got past the initial learning step. When you code, you get a
+lot of feedback, and, through trial and error, you can process the feedback to improve your skills.
+In fact, the hardest bit is actually finding the problems to solve (and this article suggests many).
+But if you have the problem, you can self-improve by noticing the following:
+
+
+How you verify that the solution works.
+
+
+Common bugs and techniques to avoid them in the future.
+
+
+Length of the solution: can you solve the problem using shorter, simpler code?
+
+
+Techniques — can you apply anything you’ve read about this week? How would the problem be solved
+in Haskell? Could you apply pattern from language X in language Y?
+
+
+
In this context it is important to solve the same problem repeatedly. E.g., you could try solving
+the same model problem in all languages you know, with a month or two break between attempts.
+Repeatedly doing the same thing and noticing differences and similarities between tries is the
+essence of self-learning.
Learning your first programming language is a nightmare, because you are learning your editing
+environment (PyScripter, IntelliJ IDEA, VS Code) first, simple algorithms second, and the language
+itself third. It gets much easier afterwards!
+
Learning different programming languages is one of the best ways to improve your programming skills.
+By seeing what’s similar, and what’s different, you learn more deeply how things work under the hood.
+Different languages put different idioms to the forefront, and learning several expands your
+vocabulary considerably. As a bonus, after learning N languages, learning N+1st becomes a question
+of skimming through the official docs.
+
In general, you want to cover big families of languages: Python, Java, Haskell, C, Rust, Clojure
+would be a good baseline. Erlang, Forth, and Prolog would be good additions afterwards.
+
+
Level 1
+
+
You are not actually learning algorithms, you are learning programming. At this stage, it doesn’t
+matter how long your code is, how pretty it is, or how efficient it is. The only thing that
+matters is that it solves the problem. Generally, this level ends when you are fairly comfortable
+with recursion. The first few problems from Project Euler are a great resource here.
+
+
Level 2
+
+
Here you learn algorithms proper. The goal here is mostly encyclopedic knowledge of common
+techniques. There are quite a few, but not too many of those. At this stage, the most useful thing
+is understanding the math behind the algorithms — being able to explain algorithm using
+pencil&paper, prove its correctness, and analyze Big-O runtime. Generally, you want to learn the
+name of algorithm or technique, read and grok the full explanation, and then implement it.
+
I recommend doing an abstract implementation first (i.e., not “HashMap to solve problem X”, but
+“just HashMap”). Include tests in your implementation. Use randomized testing (e.g., when testing
+sorting algorithms, don’t use a finite set of examples; generate a million random ones).
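+
To make the randomized-testing idea concrete, here is a sketch of mine in Rust (the rand crate and
+the insertion_sort under test are illustrative choices, not something from the original text):

```rust
use rand::Rng;

// The algorithm under test: a deliberately simple insertion sort.
fn insertion_sort(xs: &mut [i32]) {
    for i in 1..xs.len() {
        let mut j = i;
        while j > 0 && xs[j - 1] > xs[j] {
            xs.swap(j - 1, j);
            j -= 1;
        }
    }
}

// Instead of a handful of fixed examples, compare against std's sort
// on thousands of random inputs.
#[test]
fn randomized_against_std_sort() {
    let mut rng = rand::thread_rng();
    for _ in 0..10_000 {
        let len = rng.gen_range(0..64);
        let mut xs: Vec<i32> = (0..len).map(|_| rng.gen_range(-100..100)).collect();
        let mut expected = xs.clone();
        expected.sort();
        insertion_sort(&mut xs);
        assert_eq!(xs, expected);
    }
}
```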
+
It’s OK and even desirable to implement the same algorithm multiple times. When solving problems,
+like CSES, you could abstract your solutions and re-use them, but it’s better to code everything
+from scratch every time, until you’ve fully internalized the algorithm.
+
+
Level 3
+
+
One day, long after I’ve finished my university, I was a TA for an algorithms course. The lecturer
+for the course was the person who originally taught me to program, through a similar algorithms
+course. And, during one coffee break, he said something like
+
+
+
I was thunderstruck! I didn’t realize that’s the reason why I am learning (well, teaching at that
+point) algorithms! Before, I always muddled through my algorithms by randomly tweaking generally
+correct stuff until it works. E.g., with a binary search, just add +1 somewhere until it doesn’t
+loop on random arrays. After hearing this advice, I went home and wrote my millionth binary
+search, but this time I actually added comments with loop invariants, and it worked from the first
+try! I applied similar techniques for the rest of the course, and since then my subjective
+perception of bug rate (for normal work code) went down dramatically.
+
So this is the third level of algorithms — you hone your coding skills to program without bugs.
+If you are already fairly comfortable with algorithms, try doing CSES again. But this time, spend
+however much time you need double-checking the code before submission, but try to get everything
+correct on the first try.
Here’s the list of things you might want to be able to do, algorithmically. You don’t need to be
+able to code everything on the spot. I think it would help if you know what each word is about, and
+have implemented the thing at least once in the past.
A very powerful exercise is coding a medium-sized project from scratch. Something that takes more
+than a day, but less than a week, and has a meaningful architecture which can be just right, or
+messed up. Here are some great projects to do:
+
+
Ray Tracer
+
+
Given an analytical description of a 3D scene, convert it to a colored 2D image, by simulating a
+path of a ray of light as it bounces off objects.
+
+
Software Rasterizer
+
+
Given a description of a 3D scene as a set of triangles, convert it to a colored 2D image by
+projecting triangles onto the viewing plane and drawing the projections in the correct order.
+
+
Dynamically Typed Programming Language
+
+
An interpreter which reads source code as text, parses it into an AST, and directly executes the
+AST (or maybe converts the AST to byte code for some speedup).
+
+
Statically Typed Programming Language
+
+
A compiler which reads source code as text, and spits out a binary (WASM would be a terrific
+target).
+
+
Relational Database
+
+
Several components:
+
+
+Storage engine, which stores data durably on disk and implements on-disk ordered data structures
+(B-tree or LSM)
+
+
+Relational data model which is implemented on top of primitive ordered data structures.
+
+
+Relational language to express schema and queries.
+
+
+Either a TCP server to accept transactions as a database server, or an API for embedding for an
+in-process “embedded” database.
+
+
+
+
Chat Server
+
+
An exercise in networking and asynchronous programming. Multiple client programs connect to a
+server program. A client can send a message either to a specific different client, or to all other
+clients (broadcast). There are many variations on how to implement this: blocking read/write
+calls, epoll, io_uring, threads, callbacks, futures, manually-coded state machines.
+
+
+
Again, it’s more valuable to do the same exercise six times with variations, than to blast through
+everything once.
Zig has a nominal type system despite the fact that types lack names. A struct type is declared by
+struct { field: T }.
+It’s anonymous; an explicit assignment is required to name the type:
+
+
+
Still, the type system is nominal, not structural. The following does not compile:
+
+
+
The following does:
+
+
+
One place where Zig is structural is anonymous struct literals:
+
+
+
The types of x and y are different, but x can be coerced to y.
+
In other words, Zig structs are anonymous and nominal, but anonymous structs are structural!
Simple type inference for an expression works by first recursively inferring the types of
+subexpressions, and then deriving the result type from that. So, to infer types in
+foo().bar(), we first derive the type of foo(), then lookup method bar on that
+type, and use the return type of the method.
+
More complex type inference works through the so-called unification algorithm. It starts with a similar
+recursive walk over the expression tree, but this walk doesn’t infer types directly, but rather
+assigns a type variable to each subexpression, and generates equations relating type variables. So the
+result of this first phase looks like this:
+
+
+
Then, in the second phase the equations are solved, yielding, in this case, x = Int and y = Int.
+
Usually languages with powerful type systems have unification somewhere, though often unification
+is limited in scope (for example, Kotlin infers types statement-at-a-time).
+
It is curious that Zig doesn’t do unification: type inference is a simple single-pass recursion (or
+at least it should be, I haven’t looked at how it is actually implemented). So, anytime there’s a
+generic function like
+fn reverse(comptime T: type, xs: []T) void,
+the call site has to pass the type in explicitly:
+
+
+
Does it mean that you have to pass the types all the time? Not really! In fact, the only place which
+feels like a burden is the functions in the std.mem module which operate on slices, but that’s just
+because slices are builtin types (a kind of pointer really) without methods. The thing is, when you
+call a method on a “generic type”, its type parameters are implicitly in scope, and don’t have to be
+specified. Study this example:
+
+
+
There’s a runtime parallel here. At runtime, there’s a single dynamic dispatch, which prioritizes
+dynamic type of the first argument, and multiple dynamic dispatch, which can look at dynamic types
+of all arguments. Here, at compile time, the type of the first argument gets a preferential
+treatment. And, similarly to runtime, this covers 80% of use cases! Though, I’d love for things like
+std.mem.eql to be actual methods on slices…
One of the best tricks a language server can pull off for as-you-type analysis is skipping bodies of
+the functions in dependencies. This works as long as the language requires complete signatures. In
+functional languages, it’s customary to make signatures optional, which precludes this crucial
+optimization. As per Modularity Of Lexical
+Analysis, this has
+repercussions for all of:
+
+
+incremental compilation,
+
+
+parallel compilation,
+
+
+robustness to errors.
+
+
+
I always assumed that Zig with its crazy comptime requires autopsy.
+But that’s not actually the case! Zig doesn’t have decltype(auto), signatures are always explicit!
+
Let’s look at, e.g., std.mem.bytesAsSlice:
+
+
+
Note how the return type is not anytype, but the actual, real thing. You could write complex
+computations there, but you can’t look inside the body. Of course, it also is possible to write fn
+foo() @TypeOf(bar()) {, but that feels like fair game — bar() will be evaluated at
+compile time. In other words, only bodies of functions invoked at comptime need to be looked at by
+a language server. This potentially improves performance for this use-case quite a bit!
+
It’s useful to contrast this with Rust. There, you could write
+
+
+
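The original Rust snippet is elided here; as a stand-in of my own, consider a function that returns
+impl Trait without ever mentioning auto traits:

```rust
use std::cell::Cell;
use std::rc::Rc;

// The signature promises only `FnMut() -> u32` ...
fn counter() -> impl FnMut() -> u32 {
    let mut n = 0;
    move || { n += 1; n }
}

// ... and so does this one, but the captured `Rc` is not `Send`.
fn shared_counter() -> impl FnMut() -> u32 {
    let n = Rc::new(Cell::new(0));
    move || { n.set(n.get() + 1); n.get() }
}

fn assert_send<T: Send>(_: T) {}

fn main() {
    assert_send(counter()); // compiles: the hidden closure type happens to be `Send`
    // assert_send(shared_counter()); // error: `Rc<Cell<u32>>` is not `Send`
}
```
+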
Although it feels like you are stating the interface, it’s not really the case. Auto traits like
+Send and Sync leak, and that can be detected by downstream code and lead to, e.g., different
+methods being called via Deref-based specialization depending on : Send being implemented:
+
+
+
Zig is much more strict here, you have to fully name the return type (the name doesn’t have to be
+pretty, take a second look at bytesAsSlice). But it’s not perfect: a genuine leakage happens with
+inferred error types (!T syntax). A bad example would look like this:
+
+
+
Here, to check main, we actually do need to dissect f’s body, we can’t treat the error union
+abstractly. When the compiler analyzes main, it needs to stop to process f signature (which is
+very fast, as it is very short) and then f’s body (this part could be quite slow, there might be a
+lot of code behind that Mystery!). It’s interesting to ponder alternative semantics where, during
+type checking, inferred types are treated abstractly, and error exhaustiveness is a separate late
+pass in the compiler. That way, the compiler only needs f’s signature to check main. And that means
+that bodies of main and f could be checked in parallel.
+
That’s all for today! The type system surprises I’ve found so far are:
+
+
+
Nominal type system despite notable absence of names of types.
+
+
+
Unification-less generics which don’t incur unreasonable annotation burden due to methods “closing
+over” generic parameters.
+
+
+
+Explicit signatures with no Voldemort types, with the
+notable exception of error unions.
“Algorithms” are a useful skill not because you use them at work every day, but because they train you
+to be better at particular aspects of software engineering.
+
Specifically:
+
First, algorithms drill the skill of bug-free coding. Algorithms are hard and frustrating! A subtle
+off-by-one might not matter for simple tests, but breaks corner cases. But if you practice
+algorithms, you get better at this particular skill of writing correct small programs, and I think
+this probably generalizes.
+
To give an array of analogies:
+
+
+
People do cardio or strength exercises not because they need to lift heavy weights in real life.
+Quite the opposite — there’s too little physical exertion in our usual lives, so we need extra
+exercises for our bodies to gain generalized health (which is helpful in day-to-day life).
+
+
+
You don’t practice complex skill by mere repetition. You first break it down into atomic trainable
+sub skills, and drill each sub skill separately in unrealistic conditions. Writing correct
+algorithmy code is a sub skill of software engineering.
+
+
+
+When you optimize a system, you don’t just repeatedly run an end-to-end test until things go fast. You
+first identify the problematic area, then write a targeted micro benchmark to isolate this
+particular effect, and then you optimize that using a much shorter feedback loop.
+
+
+
I still remember two specific lessons I learned when I started doing algorithms many years ago:
+
+
Debugging complex code is hard, first simplify, then debug
+
+
Originally, when I was getting a failed test, I sort of tried to add more code to my program to
+make it pass. At some point I realized that this is going nowhere, and then I changed my workflow
+to first try to remove as much code as I can, and only then investigate the problematic test
+case (which with time morphed into a skill of not writing more code than necessary in the first
+place).
+
+
Single source of truth is good
+
+
+A lot of my early bugs were due to me duplicating the same piece of information in two places and
+then getting them out of sync. Internalizing the single-source-of-truth principle fixed the issues.
+
+
+
Meta note: if you already know this, my lessons are useless. If you don’t yet know them, they are
+still useless and most likely will bounce off you. This is tacit knowledge — it’s very hard to
+convey it verbally, it is much more efficient to learn these things yourself by doing.
+
Somewhat related, I noticed a surprising correlation between programming skills in the small, and
+programming skills in the large. You can solve a problem in five lines of code, or, if you try hard,
+in ten lines of code. If you consistently come up with concise solutions in the small, chances are
+large scale design will be simple as well.
+
I don’t know how true that is, as I never tried to look at a proper study, but it looks very
+plausible from what I’ve seen. If this is true, the next interesting question is: “if you train
+programming-in-the-small skills, do they transfer to programming in the large?”. Again, I don’t
+know, but I’d take this Pascal’s wager.
+
Second, algorithms teach about properties and invariants. Some lucky people get those skills from
+a hard math background, but algorithms are a much more accessible way to learn them, as everything
+is very visual, immediately testable, and has a very short and clear feedback loop.
+
And properties and invariants are what underlie most big and successful systems. Like 90% of the
+code is just fluff and glue, and if you have the skill to see the 10% that carries the architecturally
+salient properties, you can comprehend the system much faster.
+
Third, algorithms occasionally are useful at the job! Just last week on our design walk&talk we
+were brainstorming one particular problem, and I was like
+
+
+
We probably won’t go with that solution as that’s too complex algorithmically for what ultimately is
+a corner case, but it’s important that we understand the problem space in detail before we pick a
+solution.
+
Note also how algorithms vocabulary helps me to think about the problem. In math (including
+algorithms), there’s just like a handful of ideas which are applied again and again under different
+guises. You need some amount of insight of course, but, for most simple problems, what you actually
+need is just an ability to recognize the structure you’ve seen somewhere already.
+
Fourth, connecting to the previous ones, the ideas really do form an interconnected web which, on a
+deep level, underpins a whole lot of stuff. So, if you do have non-zero amount of pure curiosity
+when it comes to learning programming, algorithms cut pretty deep to the foundation. Let me repeat
+the list from the last post, but with explicit connections to other things:
+
+
linear search
+
+
assoc lists in most old functional languages work that way
+
+
binary search
+
+
It is literally everywhere. Also, binary search got a cute name, but actually it isn’t the
+primitive operation. The primitive operation is partition_point, a predicate version of binary
+search. This is what you should add to your language’s stdlib as a primitive, and base everything
+else in terms of it. Also, it is one of the few cases where we know the lower bound of complexity. If
+an algorithm does k binary comparisons, it can give at most 2^k distinct answers. So, to find the
+insertion point among n items, you need at least k questions such that 2^k > n.
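+
As a concrete illustration (my own, in Rust, where the primitive is available as
+slice::partition_point), plain binary search falls out of the predicate version:

```rust
// partition_point takes a slice sorted by the predicate and returns the
// index of the first element for which the predicate is false.
fn insertion_point(xs: &[i32], x: i32) -> usize {
    xs.partition_point(|&y| y < x)
}

// "Classic" binary search is then a two-liner on top of the primitive.
fn contains(xs: &[i32], x: i32) -> bool {
    let i = xs.partition_point(|&y| y < x);
    i < xs.len() && xs[i] == x
}
```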
+
+
quadratic sorting
+
+
We use it at work! Some collections are statically bound by a small constant, and quadratically
+sorting them just needs less machine code. We are also a bit paranoid that production sort
+algorithms are very complex and might have subtle bugs, esp in newer languages.
+
+
merge sort
+
+
This is how you sort things on disk. This is also how LSM-trees, the most practically important
+data structure you haven’t learned about in school, work! And k-way merge also is occasionally
+useful (this is from work from three weeks ago).
+
+
heap sort
+
+
Well, this one is only actually useful for the heap, but I think maybe the kernel uses it when
+it needs to sort something in place, without extra memory, and in guaranteed O(N log N)?
+
+
binary heap
+
+
Binary heaps are everywhere! Notably, simple timers are a binary heap of things in the order of
+expiration. This is also a part of Dijkstra and k-way-merge.
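+
A sketch of that timer idea in Rust (my own example, not from the original text):

```rust
use std::cmp::Reverse;
use std::collections::BinaryHeap;
use std::time::Instant;

// A timer queue: std's BinaryHeap is a max-heap, so Reverse turns it into
// a min-heap ordered by deadline.
struct Timers {
    heap: BinaryHeap<Reverse<(Instant, u64)>>,
}

impl Timers {
    fn schedule(&mut self, deadline: Instant, timer_id: u64) {
        self.heap.push(Reverse((deadline, timer_id)));
    }

    // Pop one timer whose deadline has passed, if any.
    fn pop_expired(&mut self, now: Instant) -> Option<u64> {
        let Reverse((deadline, timer_id)) = *self.heap.peek()?;
        if deadline <= now {
            self.heap.pop();
            Some(timer_id)
        } else {
            None
        }
    }
}
```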
+
+
growable array
+
+
+That’s the most widely used collection of them all! Did you know that grow factor 2 has a
+problem that the size after n reallocations is larger than the sum total of all previous sizes,
+so the allocator can’t re-use the space? Anecdotally, growth factors less than two are preferable
+for this reason.
+
+
binary search tree
+
+
+Again, rust-analyzer green trees are binary search trees using offset as an implicit key.
+Monoid trees are also binary search trees.
+
+
AVL tree
+
+
Ok, this one I actually don’t know a direct application for! But I remember two
+programming-in-the-small lessons AVL could have taught me, but didn’t. I struggled a lot
+implementing all of “small left rotation”, “small right rotation”, “big left rotation”, “big right
+rotation”. Some years later, I’ve learned that you don’t do
+
+
+
as that forces code duplication. Rather, you do children: [Tree; 2] and then you could
+use child_index and child_index ^ 1 to abstract over left-right.
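+
Roughly, the trick looks like this (a sketch of mine, not a complete AVL implementation):

```rust
struct Node {
    children: [Option<Box<Node>>; 2], // 0 = left, 1 = right
    key: i32,
}

// One function covers "small left rotation" and "small right rotation":
// `d` selects which child gets promoted, `d ^ 1` is the opposite side.
fn rotate(mut node: Box<Node>, d: usize) -> Box<Node> {
    let mut child = node.children[d].take().expect("rotation needs a child");
    node.children[d] = child.children[d ^ 1].take();
    child.children[d ^ 1] = Some(node);
    child
}
```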
+
And then some years later still I read in wikipedia that big rotations are actually a composition
+of two small rotations.
+
Actually, I’ve lied that I don’t know connections here. You use the same rotations for the splay
+tree.
+
+
Red Black Tree
+
+
red-black tree is a 2-3 tree is a B-tree. Also, you probably use jemalloc, and it has a red-black
+tree implemented as a C
+macro.
+Left-leaning red-black trees are an interesting variation, which is claimed to be simpler, but is
+also claimed to not actually be simpler, because it is not symmetric and neuters the children
+trick.
+
+
B-tree
+
+
If you use Rust, you probably use B-tree. Also, if you use a database, it stores data either in
+LSM or in a B-tree. Both of these are because B-trees play nice with memory hierarchy.
+
+
hash table
+
+
+Literally everywhere; both chaining and open-addressing versions are widely used.
+
+
Depth First Search
+
+
+This is something I have to code, explicitly or implicitly, fairly often. Every time you have a
+DAG, where things depend on other things, you’ll have a DFS somewhere. In rust-analyzer,
+there are at least a couple — one in the borrow checker for something (I have no idea what that does,
+just grepped for fn dfs) and one in crate graph to detect cycles.
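+
The typical shape of such code (a sketch of mine, not rust-analyzer’s actual implementation) is a
+three-color DFS that looks for a back edge:

```rust
#[derive(Clone, Copy, PartialEq)]
enum Mark {
    White, // not visited yet
    Grey,  // on the current DFS path
    Black, // fully explored
}

// `deps[v]` lists the nodes that node `v` depends on.
fn has_cycle(deps: &[Vec<usize>]) -> bool {
    fn visit(v: usize, deps: &[Vec<usize>], marks: &mut [Mark]) -> bool {
        match marks[v] {
            Mark::Grey => return true, // back edge: a cycle
            Mark::Black => return false,
            Mark::White => {}
        }
        marks[v] = Mark::Grey;
        for &u in &deps[v] {
            if visit(u, deps, marks) {
                return true;
            }
        }
        marks[v] = Mark::Black;
        false
    }

    let mut marks = vec![Mark::White; deps.len()];
    (0..deps.len()).any(|v| visit(v, deps, &mut marks))
}
```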
+
+
Breadth First Search
+
+
+Ditto, any kind of exploration problem is usually solved with bfs. E.g., rust-analyzer uses bfs
+for directory traversal.
+
Which is better, bfs or dfs? Why not both?! Take a look at bdfs from rust-analyzer:
+
+
topological sort
+
+
+Again, comes up every time you deal with things which depend on each other. rust-analyzer has
+crates_in_topological_order
+
+
Strongly Connected Components
+
+
This is needed every time things depend on each other, but you also allow cyclic dependencies. I
+don’t think I’ve needed this one in real life. But, given that SCC is how you solve 2-SAT in
+polynomial time, seems important to know to understand the 3 in 3-SAT
+
+
Minimal Spanning Tree
+
+
Ok, really drawing a blank here! Connects to sorting, disjoint set union (which is needed for
+unification in type-checkers), and binary heap. Seems practically important algorithm though! Ah,
+MST also gives an approximation for planar traveling salesman I think, another border between hard
+& easy problems.
+
+
Dijkstra
+
+
Dijkstra is what I think about when I imagine a Platonic algorithm, though
+I don’t think I’ve used it in practice? Connects to heap.
+
Do you know why we use i, j, k for loop indices? Because D ijk stra!
+
+
Floyd-Warshall
+
+
+This one is cool! Everybody knows why any regular expression can be compiled to an equivalent
+finite state machine. Few people know the reverse, why each automaton has an equivalent regex
+(many people know this fact, but few understand why). Well, because Floyd-Warshall! To convert an
+automaton to regex use the same algorithm you use to find pairwise distances in a graph.
+
+Also, this is the final boss of dynamic programming. If you understand why this algorithm works, you
+understand dynamic programming. Despite being tricky to understand, it’s very easy to implement! I
+randomly stumbled into Floyd-Warshall, when I tried to implement a different, wrong approach, and
+made a bug which turned my broken algo into a correct Floyd-Warshall.
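+
For reference, the whole algorithm fits in a few lines (a sketch of mine; f64::INFINITY stands for
+“no edge”):

```rust
// All-pairs shortest paths over an adjacency matrix: dist[i][j] starts as the
// edge weight, f64::INFINITY if there is no edge, and 0.0 on the diagonal.
fn floyd_warshall(dist: &mut [Vec<f64>]) {
    let n = dist.len();
    for k in 0..n {
        for i in 0..n {
            for j in 0..n {
                let via_k = dist[i][k] + dist[k][j];
                if via_k < dist[i][j] {
                    dist[i][j] = via_k;
                }
            }
        }
    }
}
```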
+
+
Bellman-Ford
+
+
+Again, not many practical applications here, but the theory is well connected. All shortest path
+algorithms are actually fixed-point iterations! But with Bellman-Ford and its explicit edge
+relaxation operator that’s most obvious. Next time you open static analysis textbook and learn
+about fixed point iteration, map that onto the problem of finding shortest paths!
+
+
Quadratic Substring Search
+
+
+This is what your language standard library does.
+
+
Rabin-Karp
+
+
An excellent application of hashes. The same idea, hash(composite) =
+combine(hash(component)*), is used in rust-analyzer to intern syntax
+trees.
+
+
Boyer-Moore
+
+
+This is a beautiful and practical algorithm which probably handles the bulk of real-world searches
+(that is, it’s probably the hottest bit of ripgrep as used by an average person). Delightfully,
+this algorithm is faster than theoretically possible — it doesn’t even look at every byte of
+input data!
+
+
Knuth-Morris-Pratt
+
+
Another “this is how you do string search in the real world” algorithm. It also is the platonic
+ideal of a finite state machine, and almost everything is an FSM. It also is Aho-Corasick.
+
+
Aho-Corasick
+
+
This is the same as Knuth-Morris-Pratt, but also teaches you about tries. Again, super-useful for
+string searches. As it is an FSM, and a regex is an FSM, and there’s a general construct for
+building a product of two FSMs, you can use it to implement fuzzy search. “Workspace symbol”
+feature in rust-analyzer works like this. Here’s a part
+of implementation.
+
+
Edit Distance
+
+
Everywhere in Bioinformatics (not the actual edit distance, but this problem shape). The first
+post on this blog is about this problem:
There are two main historical trends when choosing an implementation language for something
+compiler-shaped.
+
For more language-centric tasks, like a formal specification, or a toy hobby language, OCaml makes
+most sense. See, for example, plzoo or WebAssembly reference
+interpreter.
+
For something implementation-centric and production ready, C++ is often chosen: LLVM, clang, v8,
+HotSpot are all C++.
+
These days, Rust is a great new addition to the landscape. It is influenced most directly by ML and
+C++, combines their strengths, and even brings something new of its own to the table, like seamless,
+safe multithreading. Still, Rust leans heavily towards production readiness side of the spectrum.
+While some aspects of it, like a “just works” build system, help with prototyping as well, there’s
+still extra complexity tax due to the necessity to model physical layout of data. The usual advice,
+when you start building a compiler in Rust, is to avoid pointers and use indexes. Indexes are great!
+In a large codebase, they allow greater decoupling (side tables can stay local to relevant modules),
+improved performance (an index is u32 and nudges you towards struct-of-arrays layouts), and more
+flexible computation strategies (indexes are easier to serialize or plug into incremental
+compilation framework). But they do make programming-in-the-small significantly more annoying, which
+is a deal-breaker for hobbyist tinkering.
+
But OCaml is crufty! Is there something better? Today, I realized that TypeScript might actually be
+OK? It is not really surprising, given how the language works, but it never occurred to me to think
+about TypeScript as an ML equivalent before.
+
So, let’s write a tiny-tiny typechecker in TS!
+
Of course, we start with deno. See A Love Letter to
+Deno for more details, but the
+TL;DR is that deno provides out-of-the-box experience for TypeScript. This is a pain point for
+OCaml, and something that Rust does better than either OCaml or C++. But deno does this better than
+Rust! It’s just a single binary, it comes with linting and formatting, there’s no compilation step,
+and there is a built-in task runner and a watch mode. A dream setup for quick PLT hacks!
+
And then there’s TypeScript itself, with its sufficiently flexible, yet light-ceremony type system.
+
Let’s start with defining an AST. As we are hacking, we won’t bother with making it an IDE-friendly
+concrete syntax tree, or incremental-friendly “only store relative offsets” tree, and will just tag
+AST nodes with locations in file:
+
+
+
Even here, we already see the high-level nature of TypeScript — string is just a string, there’s no
+thinking about usize vs u32 as numbers are just numbers.
+
Usually, an expression is defined as a sum-type. As we want to tag each expression with a location,
+that representation would be slightly inconvenient for us, so we split things up a bit:
+
+
+
One more thing — as we are going for something quick, we’ll be storing inferred types directly in
+the AST nodes. Still, we want to keep raw and type-checked AST separate, so what we are going to do
+here is to parametrize the Expr over associated data it stores. A freshly parsed expression would
+use void as data, and the type checker will set it to Type. Here’s what we get:
+
+
+
A definition of ExprBinary could look like this:
+
+
+
Note how I don’t introduce separate types for, e.g., AddExpr and SubExpr — all binary
+expressions have the same shape, so one type is enough!
+
But we need a tiny adjustment here. Our Expr kind is defined as a union type. To match a value of
+a union type a bit of runtime type information is needed. However, it’s one of the core properties
+of TypeScript that it doesn’t add any runtime behaviors. So, if we want to match on expression kinds
+(and we for sure want!), we need to give a helping hand to the compiler and include a bit of RTTI
+manually. That would be the tag field:
+
+
+
tag: "binary" means that the only possible runtime value for tag is the string "binary".
+
Similarly to various binary expressions, boolean literal and int literal expressions have almost
+identical shape. Almost, because the payload (boolean or number) is different. TypeScript
+allows us to neatly abstract over this:
+
+
+
Finally, for control-flow expressions we only add if for now:
+
+
+
This concludes the definition of the AST! Let’s move on to the type inference! Start with types:
+
+
+
Our types are really simple, we could have gone with type Type = "Int" | "Bool", but
+let’s do this a bit more enterprisy! We define separate types for integer and boolean types. As these
+types are singletons, we also provide canonical definitions. And here is another TypeScript-ism.
+Because TypeScript fully erases types, everything related to types lives in a separate namespace. So
+you can have a type and a value sharing the same name. Which is exactly what we use to define the
+singletons!
+
Finally, we can take advantage of our associated-data parametrized expression and write the
+signature of
+
+
+
As it says on the tin, infer_types fills in Type information into the void! Let’s fill in the
+details!
+
+
+
If at this point we hit Enter, the editor completes:
+
+
+
There’s one problem though. What we really want to write here is something like
+const inferred_type = switch(..),
+but in TypeScript switch is a statement, not an expression.
+So let’s define a generic visitor!
+
+
+
Armed with the visit, we can ergonomically match over the expression:
+
+
+
Before we go further, let’s generalize this visiting pattern a bit! Recall that our expressions are
+parametrized by the type of associated data, and a type-checker-shaped transformation is essentially an
+Expr<U> -> Expr<V>
+mapping.
+
Let’s make this generic!
+
+
+
Transform maps an expression carrying U into an expression carrying V by applying an f
+visitor. Importantly, it’s Visitor<V, V>, rather than a Visitor<U, V>. This is
+counter-intuitive, but correct — we run transformation bottom up, transforming the leaves first.
+So, when the time comes to visit an interior node, all subexpressions will have been transformed!
+
The body of transform is wordy, but regular, rectangular, and auto-completes itself:
+
+
+
+
+
Note how here expr.kind is both Expr<U> and Expr<V> — literals don’t depend on this type
+parameter, and TypeScript is smart enough to figure this out without us manually re-assembling
+the same value with a different type.
+
+
+
This is where that magic with Visitor<V, V> happens.
+
+
+
The code is pretty regular here though! So at this point we might actually recall that TypeScript is
+a dynamically-typed language, and write a generic traversal using Object.keys, while keeping the
+static function signature in-place. I don’t think we need to do it here, but there’s comfort in
+knowing that it’s possible!
+
Now implementing type inference should be a breeze! We need some way to emit type errors though.
+With TypeScript, it would be trivial to accumulate errors into an array as a side-effect, but let’s
+actually represent type errors as instances of a specific type, TypeError (pun intended):
+
+
+
To check ifs and binary expressions, we would also need a utility for comparing types:
+
+
+
We make the Error type equal to any other type to prevent cascading failures. With all that
+machinery in place, our type checker is finally:
+
+
+
An astute reader will notice that our visitor functions now take an extra ast.Location argument.
+TypeScript allows using this argument only in cases where it is needed, cutting down verbosity.
+
And that’s all for today! The end result is pretty neat and concise. It took some typing to get there,
+but TypeScript autocompletion really helps with that! What’s more important, there was very little
+fighting with the language, and the result feels quite natural and directly corresponds to the shape
+of the problem.
+
I am not entirely sure about the conclusion just yet, but I think I’ll be using TypeScript as my tool
+of choice for various small language hacks. It is surprisingly productive due to the confluence of
+three aspects:
+
+
+deno is a perfect scripting runtime! Small, hermetic, powerful, and optimized for effective
+development workflows.
+
+
+TypeScript tooling is great — the IDE is helpful and productive (and deno makes sure that it
+also requires zero configuration)
+
+
+The language is powerful both at runtime and at compile time. You can get pretty fancy with types,
+but you can also just escape to dynamic world if you need some very high-order code.
+
+
+
+
Just kidding, here’s one more cute thing. Let’s say that we want to have lots of syntactic sugar,
+and also want type-safe desugaring. We could tweak our setup a bit for that: instead of Expr and
+ExprKind being parametrized over associated data, we circularly parametrize Expr by the whole
+ExprKind and vice versa:
+
+
+
This allows expressing desugaring in a type-safe manner!
That’s too damn many of them! Some time ago I’ve noticed that my code involving comparisons is often
+hard to understand, and hides bugs. I’ve figured out some rules of thumb to reduce complexity, which I
+want to share.
+
The core idea is to canonicalize things. Both x < y and y > x mean the same, and, if you use
+them with roughly equal frequency, you need to spend extra mental capacity to fold the two versions
+into the single “x tiny, y HUGE” concept in your head.
+
The number line is a great intuition and visualization
+for comparisons. If you order things from small to big,
+A B C D,
+you get an intuitive concept of ordering without using comparison operators. You also plug into your
+existing intuition that the sort function arranges arrays in the ascending order.
+
So, as a first order rule-of-thumb:
+Strongly prefer < and <= over > and >=
+And, when using comparisons, use number line intuition.
+
Some snippets:
+
Checking if a point is inside the interval:
+
+
+
Checking if a point is outside of the interval:
+
+
+
Segment a is inside segment b:
+
+
+
Segments a and b are disjoint (either a is to the left of b or a is to the right of b):
+
+
+
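Following the number-line rule, the four checks above might look roughly like this (a sketch of
+mine, assuming half-open intervals and a hypothetical Segment struct):

```rust
struct Segment {
    lo: i32,
    hi: i32, // half-open: the segment covers lo..hi
}

// Point inside the interval: lo ... x ... hi, reading left to right.
fn point_inside(x: i32, s: &Segment) -> bool {
    s.lo <= x && x < s.hi
}

// Point outside: x is either to the left of lo or to the right of hi.
fn point_outside(x: i32, s: &Segment) -> bool {
    x < s.lo || s.hi <= x
}

// Segment a inside segment b: b.lo ... a.lo ... a.hi ... b.hi.
fn segment_inside(a: &Segment, b: &Segment) -> bool {
    b.lo <= a.lo && a.hi <= b.hi
}

// Disjoint: a entirely to the left of b, or b entirely to the left of a.
fn disjoint(a: &Segment, b: &Segment) -> bool {
    a.hi <= b.lo || b.hi <= a.lo
}
```
+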
A particular common case for ordered comparisons is checking that an index is in bounds for an
+array. Here, the rule about number line works together with another important rule: State
+invariants positively
+
The indexing invariant is spelled as index < xs.len(),
+
and you should prefer to see it exactly that way in the source code. Concretely,
+
+
+
+is hard to get right, because it spells the converse of the invariant, and involves an extra mental
+negation (this is subtle — although there isn’t a literal negation operator, you absolutely do
+think about this as a negation of the invariant). If possible, the code should be reshaped to state
+the invariant positively.
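+
Schematically (a hypothetical lookup of mine, not the original snippet):

```rust
// Hard to get right: it spells `index >= xs.len()`, the converse of the invariant.
fn get_negative(xs: &[i32], index: usize) -> Option<i32> {
    if index >= xs.len() {
        return None;
    }
    Some(xs[index])
}

// Easier: the happy path is guarded by `index < xs.len()`, the invariant itself.
fn get_positive(xs: &[i32], index: usize) -> Option<i32> {
    if index < xs.len() {
        Some(xs[index])
    } else {
        None
    }
}
```
+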
I extolled the benefits of programming with invariants in a couple of recent posts.
+Naturally, I didn’t explain what I think when I write “invariant”. This post fixes that.
+
There are at least three different concepts I label with “invariant”:
+
+
+a general “math” mode of thinking, where you distinguish between fuzzy, imprecise thoughts and
+precise statements with logical meaning.
+
+
+a specific technique for writing correct code when programming in the small.
+
+
+when programming in the large, compact, viral, descriptive properties of the systems.
+
+
+
I won’t discuss the first point here — I don’t know how to describe this better than “that
+thing that you do when you solve a non-trivial math puzzler”. The bulk of the post describes the
+second bullet point, for which I think I have a perfect litmus test to explain exactly what I am
+thinking here. I also touch a bit on the last point in the end.
+
So let’s start with a litmus test program to show invariants in
+the small in action:
+
+
+
You might want to write one yourself before proceeding. Here’s an exhaustive
+test for this functionality,
+using exhaustigen crate:
+
+
+
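A brute-force stand-in without the crate could look like this (assuming the
+fn insertion_point(xs: &[i32], x: i32) -> usize shape that the rest of the post implies):

```rust
// Reference implementation: index of the first element that is >= x.
fn naive_insertion_point(xs: &[i32], x: i32) -> usize {
    xs.iter().take_while(|&&y| y < x).count()
}

#[test]
fn exhaustive_small_inputs() {
    // Enumerate every sorted array of length <= 4 with values in 0..4,
    // and every probe value in 0..5.
    for len in 0..5u32 {
        for code in 0..4u32.pow(len) {
            let mut xs: Vec<i32> =
                (0..len).map(|i| ((code / 4u32.pow(i)) % 4) as i32).collect();
            xs.sort();
            for x in 0..5 {
                assert_eq!(insertion_point(&xs, x), naive_insertion_point(&xs, x));
            }
        }
    }
}
```
+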
Here’s how I would naively write this function. First, I start with defining the boundaries for the
+binary search:
+
+
+
Then, repeatedly cut the interval in half until it vanishes
+
+
+
and recur into the left or the right half accordingly:
+
+
+
Altogether:
+
+
+
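Reconstructed from the surrounding description, the naive version is presumably close to:

```rust
fn insertion_point(xs: &[i32], x: i32) -> usize {
    let mut lo = 0;
    let mut hi = xs.len();
    while lo < hi {
        let mid = lo + (hi - lo) / 2; // midpoint without overflow
        if x < xs[mid] {
            hi = mid;
        } else {
            lo = mid;
        }
    }
    lo
}
```
+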
I love this code! It has so many details right!
+
+
+The insertion_point interface compactly compresses the usually messy result of a binary search to
+just one index.
+
+
+xs / x pair of names for the sequence and its element crisply describes an abstract algorithm on
+sequences.
+
+
+Similarly, lo / hi name pair is symmetric, expressing the relation between the two indexes.
+
+
+Half-open intervals are used for indexing.
+
+
+There is no special casing anywhere, the natural lo < hi condition handles the empty slice.
+
+
+We even dodge Java’s binary search bug by computing midpoint without overflow.
+
+
+
There’s only one problem with this code — it doesn’t work. Just blindly following rules-of-thumb
+gives you working code surprisingly often, but this particular algorithm is an exception.
+
The question is, how do we fix this otherwise great code? And here’s where thinking in invariants helps.
+Before I internalized invariants, my approach would be to find a failing example, and to fumble with
+some plus or minus ones here and there and other special casing to make it work. That is, find a
+concrete problem, solve it. This works, but is slow, and doesn’t allow discovering the problem
+before running the code.
+
The alternative is to actually make an effort and spell out, explicitly, what the code is supposed
+to do. In this case, we want lo and hi to bound the result. That is,
+lo <= insertion_point <= hi
+should hold on every iteration. It clearly holds before we enter the loop. On each iteration, we
+would like to shorten this interval, cutting away the part that definitely does not contain
+the insertion point.
+
Elaborating the invariant, all elements to the left of lo should be less than the target.
+Conversely, all elements to the right of hi should be at least as large as the target.
+
+
+
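Spelled out as a comment (my paraphrase of the stated invariant):

```rust
// Invariant, with n = xs.len():
//   xs[i] <  x   for every i in 0..lo
//   xs[i] >= x   for every i in hi..n
// and therefore lo <= insertion_point <= hi.
```
+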
Let’s now take a second look at the branching condition:
+
+
+
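As the discussion further down confirms, that condition is x < xs[mid]:

```rust
if x < xs[mid] {
    hi = mid;
} else {
    lo = mid;
}
```
+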
It matches neither invariant prong exactly: x is on the left, but inequality is strict. We can
+rearrange the code to follow the invariant more closely:
+
+
+
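Reconstructed from the two changes described below:

```rust
if xs[mid] < x {
    lo = mid + 1;
} else {
    hi = mid;
}
```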
+
+we flip the condition and if-branches, so that xs[mid] < x matches xs[i] < x from the
+invariant for lo
+
+
+to make the invariant tight, we add mid + 1 (if xs[mid] is less than x, we know that the
+insertion point is at least mid + 1)
+
+
+
The code now works. So what went wrong with the original version with x < xs[mid]? In the else
+case, when x >= xs[mid] we set lo = mid, but that’s wrong! It might be the case that x ==
+xs[mid] and x == xs[mid - 1], which would break the invariant for lo.
+
The point isn’t in this particular invariant or this particular algorithm. It’s the general
+pattern that it’s easy to write the code which implements the right algorithm, and sort-of works,
+but is wrong in details. To get the details right for the right reason, you need to understand
+precisely what the result should be, and formulating this as a (loop or recursion) invariant
+helps.
+
+
Perhaps it’s time to answer the title question: an invariant is some property which holds at all times
+during dynamic evolution of the system. In the above example, the evolution is the program
+progressing through subsequent loop iterations. The invariant, the condition binding lo and hi,
+holds on every iteration. Invariants are powerful, because they are compressed descriptions of
+the system, they collapse away the time dimension, which is a huge simplification. Reasoning about
+each particular path the program could take is hard, because there are so many different paths.
+Reasoning about invariants is easy, because they capture properties shared by all execution paths.
+
The same idea applies when programming in the large. In the small, we looked at how the state of a
+running program evolves over time. In the large, we will look at how the source code of the program
+itself evolves, as it is being refactored and extended to support new features. Here are some
+systems invariants from the systems I’ve worked with:
+
Cargo:
+
File system paths entered by users are preserved exactly. If the user types
+cargo frob ../some/dir,
+Cargo doesn’t attempt to resolve ../some/dir to an absolute path and passes the path
+to the underlying OS as is. The reason for that is that file systems are very finicky. Although it
+might look as if two paths are equivalent, there are bound to be cases where they are not. If the
+user typed a particular form of a path, they believe that it’ll work, and any changes can mess
+things up easily.
+
This is a relatively compact invariant — basically, code is just forbidden from calling
+fs::canonicalize.
+
rust-analyzer:
+
Syntax trees are identity-less value types. That is, if you take an object representing an if
+expression, that object doesn’t have any knowledge of where in the larger program the if
+expression is. The thinking about this invariant was that it simplifies refactors — while in the
+static program it’s natural to talk about “if on the line X in file Y”, when you start modifying
+code, identity becomes much more fluid.
+
This is an invariant with far reaching consequences — that means that literally everything in
+rust-analyzer needs to track identities of things explicitly. You don’t just pass around syntax
+nodes, you pass nodes with extra breadcrumbs describing their origin. I think this might have been a
+mistake — while it does make refactoring APIs more principled, refactoring is not the common case!
+Most of the work of a language server consists of read-only analysis of existing code, and the
+actual refactor is just a cherry on top. So perhaps it’s better to try to bind identity more tightly
+into the core data structure, and just use fake identities for temporary trees that arise during
+refactors.
+
A more successful invariant from rust-analyzer is that the IDE has a full, frozen view of a snapshot
+of the world. There’s no API for inferring the types, rather, the API looks as if all the types are
+computed at all times. Similarly, there’s no explicit API for changing the code or talking about
+different historical versions of the code — the IDE sees a single “current” snapshot with all
+derived data computed. Underneath, there’s a smart system to secretly compute the information on
+demand and re-use previous results, but this is all hidden from the API.
+
This is a great, simple mental model, and it provides for a nice boundary between the compiler
+proper and IDE fluff like refactors and code completion. Long term, I’d love to see several
+implementations of the “compiler parts”.
+
TigerBeetle:
+
A lot of thoughtful invariants here! To touch only a few:
+
TigerBeetle doesn’t allocate memory after startup. This simple invariant affects every bit of code
+— whatever you do, you must manage with existing, pre-allocated data structures. You can’t just
+memcpy stuff around, there’s no ambient available space to memcpy to! As a consequence (and,
+historically, as a motivation for the design)
+everything
+has a specific numeric limit.
+
Another fun one is that transaction logic can’t read from disk. Every object which could be touched
+by a transaction needs to be explicitly prefetched into memory before transaction begins. Because
+disk IO happens separately from the execution, it is possible to parallelize IO for a whole batch of
+transactions. The actual transaction execution is then a very tight serial CPU loop without any
+locks.
+
Speaking of disk IO, in TigerBeetle “reading from disk” can’t fail. The central API for reading
+takes a data block address, a checksum, and invokes the callback with data with a matching checksum.
+Everything built on top doesn’t need to worry about error handling. The way this works internally is
+that reads that fail on a local disk are repaired through other replicas in the cluster. It’s just
+that the repair happens transparently to the caller. If the block of data of interest isn’t found on
+the set of reachable replicas, the cluster correctly gets stuck until it is found.
+
+
Summing up: invariants are helpful for describing systems that evolve over time. There’s a
+combinatorial explosion of trajectories that a system could take. Invariants compactly describe
+properties shared by an infinite amount of trajectories.
+
In the small, formulating invariants about program state helps to write correct code.
+
In the large, formulating invariants about the code itself helps to go from a small, simple system
+that works to a large system which is used in production.
+I am Alex Kladov, a programmer who loves simple code and programming languages.
+You can find me on GitHub.
+If you want to contact me, please write an e-mail (address is on the GitHub profile).
+
Code samples on this blog are dual licensed under MIT OR Apache-2.0.
I extolled the benefits of programming with invariants in a couple of recent posts.
+Naturally, I didn’t explain what I think when I write “invariant”. This post fixes that.
+
There are at least three different concepts I label with “invariant”:
+
+
+a general “math” mode of thinking, where you distinguish between fuzzy, imprecise thoughts and
+precise statements with logical meaning.
+
+
+a specific technique for writing correct code when programming in the small.
+
+
+when programming in the large, compact, viral, descriptive properties of the systems.
+
+
+
I wouldn’t discuss the first point here — I don’t know how to describe this better than “that
+thing that you do when you solve non-trivial math puzzler”. The bulk of the post describes the
+second bullet point, for which I think I have a perfect litmus test to explain exactly what I am
+thinking here. I also touch a bit on the last point in the end.
+
So let’s start with a litmus test program to show invariants in
+the small in action:
+
+
+
You might want to write one yourself before proceeding. Here’s an exhaustive
+test for this functionality,
+using exhaustigen crate:
+
+
+
Here’s how I would naively write this function. First, I start with defining the boundaries for the
+binary search:
+
+
+
Then, repeatedly cut the interval in half until it vanishes
+
+
+
and recur into the left or the right half accordingly:
+
+
+
Altogether:
+
+
+
I love this code! It has so many details right!
+
+
+The insertion_point interface compactly compresses usually messy result of a binary search to
+just one index.
+
+
+xs / x pair of names for the sequence and its element crisply describes abstract algorithm on
+sequencies.
+
+
+Similarly, lo / hi name pair is symmetric, expressing the relation between the two indexes.
+
+
+Half-open intervals are used for indexing.
+
+
+There are no special casing anywhere, the natural lo < hi condition handles empty slice.
+
+
+We even dodge Java’s binary search bug by computing midpoint without overflow.
+
+
+
There’s only one problem with this code — it doesn’t work. Just blindly following rules-of-thumb
+gives you working code surprisingly often, but this particular algorithm is an exception.
+
The question is, how do we fix this overwise great code? And here’s where thinking invariants helps.
+Before I internalized invariants, my approach would be to find a failing example, and to fumble with
+some plus or minus ones here and there and other special casing to make it work. That is, find a
+concrete problem, solve it. This works, but is slow, and doesn’t allow discovering the problem
+before running the code.
+
The alternative is to actually make an effort and spell out, explicitly, what the code is supposed
+to do. In this case, we want lo and hi to bound the result. That is,
+lo <= insertion_point <= hi
+should hold on every iteration. It clearly holds before we enter the loop. On each iteration, we
+would like to shorten this interval, cutting away the part that definitely does not contain
+insertion point.
+
Elaborating the invariant, all elements to the left of lo should be less than the target.
+Conversely, all elements to the right of hi should be at least as large as the target.
+
+
+
Let’s now take a second look at the branching condition:
+
+
+
It matches neither invariant prong exactly: x is on the left, but inequality is strict. We can
+rearrange the code to follow the invariant more closely:
+
+
+
+
+we flip the condition and if-branches, so that xs[mid] < x matches xs[i] < x from the
+invariant for lo
+
+
+to make the invariant tight, we add mid + 1 (if xs[mid] is less than x, we know that the
+insertion point is at least mid + 1)
+
+
+
The code now works. So what went wrong with the original version with x < xs[mid]? In the else
+case, when x >= xs[mid] we set lo = mid, but that’s wrong! It might be the case that x ==
+xs[mid] and x == xs[mid - 1], which would break the invariant for lo.
+
The point isn’t in this particular invariant or this particular algorithm. It’s the general
+pattern that it’s easy to write the code which implements the right algorithm, and sort-of works,
+but is wrong in details. To get the details right for the right reason, you need to understand
+precisely what the result should be, and formulating this as a (loop or recursion) invariant
+helps.
+
+
Perhaps it’s time to answer the title question: invariant is some property which holds at all times
+during dynamic evolution of the system. In the above example, the evolution is the program
+progressing through subsequent loop iterations. The invariant, the condition binding lo and hi,
+holds on every iteration. Invariants are powerful, because they are compressed descriptions of
+the system, they collapse away the time dimension, which is a huge simplification. Reasoning about
+each particular path the program could take is hard, because there are so many different paths.
+Reasoning about invariants is easy, because they capture properties shared by all execution paths.
+
The same idea applies when programming in the large. In the small, we looked at how the state of a
+running program evolves over time. In the large, we will look at how the source code of the program
+itself evolves, as it is being refactored and extended to support new features. Here are some
+systems invariants from the systems I’ve worked with:
+
Cargo:
+
File system paths entered by users are preserved exactly. If the user types
+cargo frob ../some/dir,
+Cargo doesn’t attempt to resolve ../some/dir to an absolute path and passes the path
+to the underlying OS as is. The reason for that is that file systems are very finicky. Although it
+might look as if two paths are equivalent, there are bound to be cases where they are not. If the
+user typed a particular form of a path, they believe that it’ll work, and any changes can mess
+things up easily.
+
This is a relatively compact invariant — basically, code is just forbidden from calling
+fs::canonicalize.
+
rust-analyzer:
+
Syntax trees are identity-less value types. That is, if you take an object representing an if
+expression, that object doesn’t have any knowledge of where in the larger program the if
+expression is. The thinking about this invariant was that it simplifies refactors — while in the
+static program it’s natural to talk about “if on the line X in file Y”, when you start modifying
+code, identity becomes much more fluid.
+
This is an invariant with far reaching consequences — that means that literally everything in
+rust-analyzer needs to track identities of things explicitly. You don’t just pass around syntax
+nodes, you pass nodes with extra breadcrumbs describing their origin. I think this might have been a
+mistake — while it does make refactoring APIs more principled, refactoring is not the common case!
+Most of the work of a language server consists of read-only analysis of existing code, and the
+actual refactor is just a cherry on top. So perhaps it’s better to try to bind identity mode tightly
+into the core data structure, and just use fake identities for temporary trees that arise during
+refactors.
+
A more successful invariant from rust-analyzer is that the IDE has a full, frozen view of a snapshot
+of the world. There’s no API for inferring the types, rather, the API looks as if all the types are
+computed at all times. Similarly, there’s no explicit API for changing the code or talking about
+different historical versions of the code — the IDE sees a single “current” snapshot with all
+derived data computed. Underneath, there’s a smart system to secretly compute the information on
+demand and re-use previous results, but this is all hidden from the API.
+
This is a great, simple mental model, and it provides for a nice boundary between the compiler
+proper and IDE fluff like refactors and code completion. Long term, I’d love to see several
+implementations of the “compiler parts”.
+
TigerBeetle:
+
A lot of thoughtful invariants here! To touch only a few:
+
TigerBeetle doesn’t allocate memory after startup. This simple invariant affects every bit of code
+— whatever you do, you must manage with existing, pre-allocated data structures. You can’t just
+memcpy stuff around, there’s no ambient available space to memcpy to! As a consequence (and,
+historically, as a motivation for the design)
+everything
+has a specific numeric limit.
+
Another fun one is that transaction logic can’t read from disk. Every object which could be touched
+by a transaction needs to be explicitly prefetched into memory before transaction begins. Because
+disk IO happens separately from the execution, it is possible to parallelize IO for a whole batch of
+transactions. The actual transaction execution is then a very tight serial CPU loop without any
+locks.
+
Speaking of disk IO, in TigerBeetle “reading from disk” can’t fail. The central API for reading
+takes a data block address, a checksum, and invokes the callback with data with a matching checksum.
+Everything built on top doesn’t need to worry about error handling. The way this works internally is
+that reads that fail on a local disk are repaired through other replicas in the cluster. It’s just
+that the repair happens transparently to the caller. If the block of data of interest isn’t found on
+the set of reachable replicas, the cluster correctly gets stuck until it is found.
+
+
Summing up: invariants are helpful for describing systems that evolve over time. There’s a
+combinatorial explosion of trajectories that a system could take. Invariants compactly describe
+properties shared by an infinite amount of trajectories.
+
In the small, formulating invariants about program state helps to wire correct code.
+
In the large, formulating invariants about the code itself helps to go from a small, simple system
+that works to a large system which is used in production.
That’s too damn many of them! Some time ago I’ve noticed that my code involving comparisons is often
+hard to understand, and hides bugs. I’ve figured some rules of thumb to reduce complexity, which I
+want to share.
+
The core idea is to canonicalize things. Both x < y and y > x mean the same, and, if you use
+them with roughly equal frequency, you need to spend extra mental capacity to fold the two versions
+into the single “x tiny, y HUGE” concept in your head.
+
The number line is a great intuition and visualization
+for comparisons. If you order things from small to big,
+A B C D,
+you get intuitive concept of ordering without using comparison operators. You also plug into your
+existing intuition that the sort function arranges arrays in the ascending order.
+
So, as a first order rule-of-thumb:
+Strongly prefer < and <= over > and >=
+And, when using comparisons, use number line intuition.
+
Some snippets:
+
Checking if a point is inside the interval:
+
+
+
Checking if a point is outside of the interval:
+
+
+
Segment a is inside segment b:
+
+
+
Segments a and b are disjoint (either a is to the left of b or a is to the right of b):
+
+
+
A particular common case for ordered comparisons is checking that an index is in bounds for an
+array. Here, the rule about number line works together with another important rule: State
+invariants positively
+
The indexing invariant is spelled as index < xs.len(),
+
and you should prefer to see it exactly that way in the source code. Concretely, a guard that
+spells the converse of the invariant is hard to get right, because it involves an extra mental
+negation (this is subtle — although there isn’t a literal negation operator, you absolutely do
+think about this as a negation of the invariant). If possible, the code should be reshaped to
+state the invariant positively, as in the sketch below.
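Both shapes, as a TypeScript sketch (xs and index are hypothetical):

```ts
function useElement(xs: number[], index: number): void {
  // Converse of the invariant: even written with <=, this guard reads as a
  // negation of `index < xs.length`, and negations are where bugs hide.
  if (xs.length <= index) {
    console.log("out of bounds");
    return;
  }
  console.log(xs[index]);
}

function useElementReshaped(xs: number[], index: number): void {
  // Invariant stated positively: the guard is literally `index < xs.length`.
  if (index < xs.length) {
    console.log(xs[index]);
  } else {
    console.log("out of bounds");
  }
}
```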
+
+
+TypeScript is Surprisingly OK for Compilers
+2023-08-17
+https://matklad.github.io/2023/08/17/typescript-is-surprisingly-ok-for-compilers
+
+
There are two main historical trends when choosing an implementation language for something
+compiler-shaped.
+
For more language-centric tasks, like a formal specification, or a toy hobby language, OCaml makes
+most sense. See, for example, plzoo or WebAssembly reference
+interpreter.
+
For something implementation-centric and production ready, C++ is often chosen: LLVM, clang, v8,
+HotSpot are all C++.
+
These days, Rust is a great new addition to the landscape. It is influenced most directly by ML and
+C++, combines their strengths, and even brings something new of its own to the table, like seamless,
+safe multithreading. Still, Rust leans heavily towards production readiness side of the spectrum.
+While some aspects of it, like a “just works” build system, help with prototyping as well, there’s
+still extra complexity tax due to the necessity to model physical layout of data. The usual advice,
+when you start building a compiler in Rust, is to avoid pointers and use indexes. Indexes are great!
+In large codebase, they allow greater decoupling (side tables can stay local to relevant modules),
+improved performance (an index is u32 and nudges you towards struct-of-arrays layouts), and more
+flexible computation strategies (indexes are easier to serialize or plug into incremental
+compilation framework). But they do make programming-in-the-small significantly more annoying, which
+is a deal-breaker for hobbyist tinkering.
+
But OCaml is crufty! Is there something better? Today, I realized that TypeScript might actually be
+OK? It is not really surprising, given how the language works, but it never occurred to me to think
+about TypeScript as an ML equivalent before.
+
So, let’s write a tiny-tiny typechecker in TS!
+
Of course, we start with deno. See A Love Letter to
+Deno for more details, but the
+TL;DR is that deno provides out-of-the-box experience for TypeScript. This is a pain point for
+OCaml, and something that Rust does better than either OCaml or C++. But deno does this better than
+Rust! It’s just a single binary, it comes with linting and formatting, there’s no compilation step,
+and there’s a built-in task runner and a watch mode. A dream setup for quick PLT hacks!
+
And then there’s TypeScript itself, with its sufficiently flexible, yet light-ceremony type system.
+
Let’s start with defining an AST. As we are hacking, we won’t bother with making it an IDE-friendly
+concrete syntax tree, or incremental-friendly “only store relative offsets” tree, and will just tag
+AST nodes with locations in file:
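Something like the following could serve as the location type (a sketch; the exact fields are an assumption):

```ts
// Every AST node remembers the source span it was parsed from.
interface Location {
  file: string;
  start: number; // offset of the first character
  end: number;   // offset just past the last character
}
```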
+
+
+
Even here, we already see the high-level nature of TypeScript — string is just a string, there’s no
+thinking about usize vs u32, as numbers are just numbers.
+
Usually, an expression is defined as a sum-type. As we want to tag each expression with a location,
+that representation would be slightly inconvenient for us, so we split things up a bit:
+
+
+
One more thing — as we are going for something quick, we’ll be storing inferred types directly in
+the AST nodes. Still, we want to keep raw and type-checked AST separate, so what we are going to do
+here is to parametrize the Expr over associated data it stores. A freshly parsed expression would
+use void as data, and the type checker will set it to Type. Here’s what we get:
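A possible shape (a sketch; the variant names match the kinds defined in the next few snippets):

```ts
// The location and the associated data live on a wrapper object, while the
// actual sum type lives in `kind`. A freshly parsed expression is an
// Expr<void>; the type checker produces an Expr<Type>.
interface Expr<T> {
  location: Location;
  data: T;
  kind: ExprKind<T>;
}

type ExprKind<T> = ExprBool | ExprInt | ExprBinary<T> | ExprIf<T>;
```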
+
+
+
A definition of ExprBinary could look like this:
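For example (a sketch; the operator set is an arbitrary choice):

```ts
interface ExprBinary<T> {
  op: "+" | "-" | "*" | "==" | "<";
  lhs: Expr<T>;
  rhs: Expr<T>;
}
```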
+
+
+
Note how I don’t introduce separate types for, e.g., AddExpr and SubExpr — all binary
+expressions have the same shape, so one type is enough!
+
But we need a tiny adjustment here. Our Expr kind is defined as a union type. To match a value of
+a union type, a bit of runtime type information is needed. However, it’s one of the core properties
+of TypeScript that it doesn’t add any runtime behaviors. So, if we want to match on expression kinds
+(and we for sure want!), we need to give a helping hand to the compiler and include a bit of RTTI
+manually. That would be the tag field:
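The adjusted sketch:

```ts
interface ExprBinary<T> {
  tag: "binary"; // the only runtime value this field can hold
  op: "+" | "-" | "*" | "==" | "<";
  lhs: Expr<T>;
  rhs: Expr<T>;
}
```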
+
+
+
tag: "binary" means that the only possible runtime value for tag is the string "binary".
+
Similarly to various binary expressions, boolean literal and int literal expressions have almost
+identical shape. Almost, because the payload (boolean or number) is different. TypeScript
+allows us to neatly abstract this over:
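One way to do it (a sketch):

```ts
// One generic shape covers both literal kinds; only the tag and the payload differ.
interface ExprLiteral<Tag, Payload> {
  tag: Tag;
  value: Payload;
}

type ExprBool = ExprLiteral<"bool", boolean>;
type ExprInt = ExprLiteral<"int", number>;
```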
+
+
+
Finally, for control-flow expressions we only add if for now:
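A sketch of the if node, following the same tagging convention (field names are assumptions):

```ts
interface ExprIf<T> {
  tag: "if";
  cond: Expr<T>;
  then_branch: Expr<T>;
  else_branch: Expr<T>;
}
```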
+
+
+
This concludes the definition of the ast! Let’s move on to the type inference! Start with types:
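A sketch of how the singletons could be set up (the tag field is an assumption; the union will later grow an error variant):

```ts
// TypeScript keeps types and values in separate namespaces, so the canonical
// singleton value can share the name of its type.
interface TypeInt  { tag: "Int"; }
interface TypeBool { tag: "Bool"; }
type Type = TypeInt | TypeBool;

const TypeInt: TypeInt = { tag: "Int" };
const TypeBool: TypeBool = { tag: "Bool" };
```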
+
+
+
Our types are really simple, we could have gone with type Type = "Int" | "Bool", but
+let’s do this a bit more enterprisey! We define separate types for integer and boolean types. As these
+types are singletons, we also provide canonical definitions. And here is another TypeScript-ism.
+Because TypeScript fully erases types, everything related to types lives in a separate namespace. So
+you can have a type and a value sharing the same name. Which is exactly what we use to define the
+singletons!
+
Finally, we can take advantage of our associated-data parametrized expression and write the
+signature of infer_types.
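As a sketch, using the definitions above:

```ts
declare function infer_types(expr: Expr<void>): Expr<Type>;
```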
+
+
+
As it says on the tin, infer_types fills in Type information into the void! Let’s fill in the
+details!
+
+
+
If at this point we hit Enter, the editor completes:
+
+
+
There’s one problem though. What we really want to write here is something like
+const inferred_type = switch(..),
+but in TypeScript switch is a statement, not an expression.
+So let’s define a generic visitor!
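A possible visitor (a sketch; the callback names mirror the tags, and the extra Location argument will come in handy for error reporting later):

```ts
// One callback per expression kind; T is the associated data of the visited
// expression, R is the result of the visit.
interface Visitor<T, R> {
  bool(kind: ExprBool, location: Location): R;
  int(kind: ExprInt, location: Location): R;
  binary(kind: ExprBinary<T>, location: Location): R;
  if(kind: ExprIf<T>, location: Location): R;
}

// visit is the expression-flavoured switch: it dispatches on the tag and
// returns whatever the matching callback returns.
function visit<T, R>(expr: Expr<T>, v: Visitor<T, R>): R {
  const kind = expr.kind;
  switch (kind.tag) {
    case "bool": return v.bool(kind, expr.location);
    case "int": return v.int(kind, expr.location);
    case "binary": return v.binary(kind, expr.location);
    case "if": return v.if(kind, expr.location);
  }
}
```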
+
+
+
Armed with the visit, we can ergonomically match over the expression:
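For example, here is a toy helper (hypothetical, purely for illustration) that uses visit to compute the depth of an expression:

```ts
function depth<T>(expr: Expr<T>): number {
  return visit<T, number>(expr, {
    bool: () => 1,
    int: () => 1,
    binary: (kind) => 1 + Math.max(depth(kind.lhs), depth(kind.rhs)),
    if: (kind) =>
      1 + Math.max(depth(kind.cond), depth(kind.then_branch), depth(kind.else_branch)),
  });
}
```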
+
+
+
Before we go further, let’s generalize this visiting pattern a bit! Recall that our expressions are
+parametrized by the type of associated data, and a type-checker-shaped transformation is essentially
+an Expr<U> -> Expr<V> transformation.
+
Let’s make this generic!
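The signature could look like this (a sketch):

```ts
declare function transform<U, V>(expr: Expr<U>, f: Visitor<V, V>): Expr<V>;
```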
+
+
+
Transform maps an expression carrying U into an expression carrying V by applying an f
+visitor. Importantly, it takes a Visitor<V, V>, rather than a Visitor<U, V>. This is
+counter-intuitive, but correct — we run the transformation bottom up, transforming the leaves first.
+So, when the time comes to visit an interior node, all subexpressions will have been transformed!
+
The body of transform is wordy, but regular, rectangular, and auto-completes itself:
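Here is one way the body could go (a sketch under the definitions above, not necessarily the original code):

```ts
function transform<U, V>(expr: Expr<U>, f: Visitor<V, V>): Expr<V> {
  const location = expr.location;
  const kind = expr.kind;
  switch (kind.tag) {
    case "bool":
      // Literals don't mention the data parameter: the very same kind value
      // is both an ExprKind<U> and an ExprKind<V>.
      return { location, kind, data: f.bool(kind, location) };
    case "int":
      return { location, kind, data: f.int(kind, location) };
    case "binary": {
      // Bottom up: subexpressions are transformed first, so the visitor
      // receives an ExprBinary<V>; this is the Visitor<V, V> magic.
      const new_kind: ExprBinary<V> = {
        tag: "binary",
        op: kind.op,
        lhs: transform(kind.lhs, f),
        rhs: transform(kind.rhs, f),
      };
      return { location, kind: new_kind, data: f.binary(new_kind, location) };
    }
    case "if": {
      const new_kind: ExprIf<V> = {
        tag: "if",
        cond: transform(kind.cond, f),
        then_branch: transform(kind.then_branch, f),
        else_branch: transform(kind.else_branch, f),
      };
      return { location, kind: new_kind, data: f.if(new_kind, location) };
    }
  }
}
```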
+
+
+
+
+
Note how here expr.kind is both an ExprKind<U> and an ExprKind<V> — literals don’t depend on this type
+parameter, and TypeScript is smart enough to figure this out without us manually re-assembling
+the same value with a different type.
+
+
+
This is where that magic with Visitor<V, V> happens.
+
+
+
The code is pretty regular here though! So at this point we might actually recall that TypeScript is
+a dynamically-typed language, and write a generic traversal using Object.keys, while keeping the
+static function signature in-place. I don’t think we need to do it here, but there’s comfort in
+knowing that it’s possible!
+
Now implementing type inference should be a breeze! We need some way to emit type errors though.
+With TypeScript, it would be trivial to accumulate errors into an array as a side-effect, but let’s
+actually represent type errors as instances of a specific type, TypeError (pun intended):
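One possible shape (a sketch): make the error just another Type variant, deliberately shadowing the global TypeError name, plus a small constructor helper:

```ts
interface TypeError {
  tag: "Error";
  location: Location;
  message: string;
}

// The Type union from before, extended with the error variant.
type Type = TypeInt | TypeBool | TypeError;

function type_error(location: Location, message: string): TypeError {
  return { tag: "Error", location, message };
}
```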
+
+
+
To check ifs and binary expressions, we would also need a utility for comparing types:
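Something along these lines (a sketch):

```ts
function type_equal(lhs: Type, rhs: Type): boolean {
  // An error compares equal to anything, so a single mistake doesn't cascade
  // into a pile of follow-up diagnostics.
  if (lhs.tag === "Error" || rhs.tag === "Error") return true;
  return lhs.tag === rhs.tag;
}
```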
+
+
+
We make the Error type equal to any other type to prevent cascading failures. With all that
+machinery in place, our type checker is finally:
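A sketch of how it could look with the pieces defined above (the operator typing rules and error messages are my assumptions):

```ts
function infer_types(expr: Expr<void>): Expr<Type> {
  return transform<void, Type>(expr, {
    bool: () => TypeBool,
    int: () => TypeInt,
    binary: (kind, location) => {
      // Subexpressions already carry their inferred types in .data.
      if (!type_equal(kind.lhs.data, TypeInt) || !type_equal(kind.rhs.data, TypeInt)) {
        return type_error(location, "binary operator expects Int operands");
      }
      return kind.op === "==" || kind.op === "<" ? TypeBool : TypeInt;
    },
    if: (kind, location) => {
      if (!type_equal(kind.cond.data, TypeBool)) {
        return type_error(location, "if condition must be Bool");
      }
      if (!type_equal(kind.then_branch.data, kind.else_branch.data)) {
        return type_error(location, "if branches must have the same type");
      }
      return kind.then_branch.data;
    },
  });
}
```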
+
+
+
Astute reader will notice that our visitor functions now take an extra ast.Location argument.
+TypeScript allows each callback to mention this argument only in the cases where it is actually needed, cutting down verbosity.
+
And that’s all for today! The end result is pretty neat and concise. It took some typing to get there,
+but TypeScript autocompletion really helps with that! What’s more important, there was very little
+fighting with the language, and the result feels quite natural and directly corresponds to the shape
+of the problem.
+
I am not entirely sure about the conclusion just yet, but I think I’ll be using TypeScript as my tool
+of choice for various small language hacks. It is surprisingly productive due to the confluence of
+three aspects:
+
+
+deno is a perfect scripting runtime! Small, hermetic, powerful, and optimized for effective
+development workflows.
+
+
+TypeScript tooling is great — the IDE is helpful and productive (and deno makes sure that it
+also requires zero configuration)
+
+
+The language is powerful both at runtime and at compile time. You can get pretty fancy with types,
+but you can also just escape to dynamic world if you need some very high-order code.
+
+
+
+
Just kidding, here’s one more cute thing. Let’s say that we want to have lots of syntactic sugar,
+and also want type-safe desugaring. We could tweak our setup a bit for that: instead of Expr and
+ExprKind being parametrized over associated data, we circularly parametrize Expr by the whole
+ExprKind and vice versa:
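A possible encoding (my names, very much a sketch):

```ts
// Expr is generic over the kind it stores, and each kind is generic over the
// expression type of its children; tying the knot differently yields related
// languages that share node definitions.
interface GExpr<K> {
  location: Location;
  kind: K;
}

interface KBool { tag: "bool"; value: boolean; }
interface KIf<E> { tag: "if"; cond: E; then_branch: E; else_branch: E; }
interface KUnless<E> { tag: "unless"; cond: E; body: E; }

// The core language knows only `if`...
type CoreKind = KBool | KIf<CoreExpr>;
interface CoreExpr extends GExpr<CoreKind> {}

// ...while the sugared language also has `unless`, whose desugaring into `if`
// can then be checked by the types: a desugar function is SugarExpr => CoreExpr.
type SugarKind = KBool | KIf<SugarExpr> | KUnless<SugarExpr>;
interface SugarExpr extends GExpr<SugarKind> {}
```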
+
+
+
This allows expressing desugaring in a type-safe manner!
+
+
+Role Of Algorithms
+2023-08-13
+https://matklad.github.io/2023/08/13/role-of-algorithms
+
+
This is a lobste.rs comment turned into an article, so expect even more abysmal editing than usual.
“Algorithms” are a useful skill not because you use them at work every day, but because they train you
+to be better at particular aspects of software engineering.
+
Specifically:
+
First, algorithms drill the skill of bug-free coding. Algorithms are hard and frustrating! A subtle
+off-by-one might not matter for simple tests, but it breaks corner cases. But if you practice
+algorithms, you get better at this particular skill of writing correct small programs, and I think
+this probably generalizes.
+
To give an array of analogies:
+
+
+
People do cardio or strength exercises not because they need to lift heavy weights in real life.
+Quite the opposite — there’s too little physical exertion in our usual lives, so we need extra
+exercises for our bodies to gain generalized health (which is helpful in day-to-day life).
+
+
+
You don’t practice a complex skill by mere repetition. You first break it down into atomic, trainable
+sub-skills, and drill each sub-skill separately in unrealistic conditions. Writing correct
+algorithm-y code is a sub-skill of software engineering.
+
+
+
When you optimize a system, you don’t just repeatedly run an end-to-end test until things go fast. You
+first identify the problematic area, then write a targeted micro-benchmark to isolate this
+particular effect, and then you optimize that using a much shorter feedback loop.
+
+
+
I still remember two specific lessons I learned when I started doing algorithms many years ago:
+
+
Debugging complex code is hard, first simplify, then debug
+
+
Originally, when I was getting a failed test, I sort of tried to add more code to my program to
+make it pass. At some point I realized that this is going nowhere, and then I changed my workflow
+to first try to remove as much code as I can, and only then investigate the problematic test
+case (which with time morphed into a skill of not writing more code than necessary in the first
+place).
+
+
Single source of truth is good
+
+
A lot of my early bugs were due to me duplicating the same piece of information in two places and
+then getting the copies out of sync. Internalizing the single-source-of-truth principle fixed those issues.
+
+
+
Meta note: if you already know this, my lessons are useless. If you don’t yet know them, they are
+still useless and most likely will bounce off you. This is tacit knowledge — it’s very hard to
+convey it verbally, it is much more efficient to learn these things yourself by doing.
+
Somewhat related, I noticed a surprising correlation between programming skills in the small, and
+programming skills in the large. You can solve a problem in five lines of code, or, if you try hard,
+in ten lines of code. If you consistently come up with concise solutions in the small, chances are
+large scale design will be simple as well.
+
I don’t know how true that is, as I never tried to look at a proper study, but it looks very
+plausible from what I’ve seen. If this is true, the next interesting question is: “if you train
+programming-in-the-small skills, do they transfer to programming in the large?”. Again, I don’t
+know, but I’d take this Pascal’s wager.
+
Second, algorithms teach about properties and invariants. Some lucky people get those skills from
+a hard math background, but algorithms are a much more accessible way to learn them, as everything
+is very visual, immediately testable, and has very short and clear feedback loop.
+
And properties and invariants are what underlie most big and successful systems. Like 90% of the
+code is just fluff and glue, and if you have the skill to see the 10% that is architecturally
+salient properties, you can comprehend the system much faster.
+
Third, algorithms occasionally are useful at the job! Just last week on our design walk&talk we
+were brainstorming one particular problem, and I was like
+
+
+
We probably won’t go with that solution as that’s too complex algorithmically for what ultimately is
+a corner case, but it’s important that we understand problem space in detail before we pick a
+solution.
+
Note also how algorithms vocabulary helps me to think about the problem. In math (including
+algorithms), there’s just like a handful of ideas which are applied again and again under different
+guises. You need some amount of insight of course, but, for most simple problems, what you actually
+need is just an ability to recognize the structure you’ve seen somewhere already.
+
Fourth, connecting to the previous ones, the ideas really do form an interconnected web which, on a
+deep level, underpins a whole lot of stuff. So, if you do have a non-zero amount of pure curiosity
+when it comes to learning programming, algorithms cut pretty deep to the foundation. Let me repeat
+the list from the last post, but with explicit connections to other things:
+
+
linear search
+
+
assoc lists in most old functional languages work that way
+
+
binary search
+
+
It is literally everywhere. Also, binary search got a cute name, but it actually isn’t the
+primitive operation. The primitive operation is partition_point, a predicate version of binary
+search. This is what you should add to your language’s stdlib as a primitive, and express everything
+else in terms of it. Also, it is one of the few cases where we know a lower bound of complexity. If
+an algorithm does k binary comparisons, it can give at most 2^k distinct answers. So, to find the
+insertion point among n items, you need at least k questions such that 2^k > n.
+
+
quadratic sorting
+
+
We use it at work! Some collections are statically bound by a small constant, and quadratically
+sorting them just needs less machine code. We are also a bit paranoid that production sort
+algorithms are very complex and might have subtle bugs, especially in newer languages.
+
+
merge sort
+
+
This is how you sort things on disk. This is also how LSM-trees, the most practically important
+data structure you haven’t learned about in school, work! And k-way merge is also occasionally
+useful (this came up at work three weeks ago).
+
+
heap sort
+
+
Well, this one is only actually useful for the heap, but I think maybe the kernel uses it when
+it needs to sort something in place, without extra memory, and in guaranteed O(N log N)?
+
+
binary heap
+
+
Binary heaps are everywhere! Notably, simple timers are a binary heap of things in the order of
+expiration. This is also a part of Dijkstra and k-way-merge.
+
+
growable array
+
+
That’s the most widely used collection of them all! Did you know that growth factor 2 has the
+problem that the size after n reallocations is larger than the sum total of all previous sizes,
+so the allocator can’t re-use the space? Anecdotally, growth factors less than two are preferable
+for this reason.
+
+
binary search tree
+
+
Again, rust-analyzer green trees are binary search trees using offset as an implicit key.
+Monoid trees are also binary search trees.
+
+
AVL tree
+
+
Ok, this one I actually don’t know a direct application of! But I remember two
+programming-in-the-small lessons AVL could have taught me, but didn’t. I struggled a lot
+implementing all of “small left rotation”, “small right rotation”, “big left rotation”, “big right
+rotation”. Some years later, I’ve learned that you don’t do
+
+
+
as that forces code duplication. Rather, you do children: [Tree; 2] and then you could
+use child_index and child_index ^ 1 to abstract over left-right.
+
And then some years later still I read in wikipedia that big rotations are actually a composition
+of two small rotations.
+
Actually, I’ve lied that I don’t know connections here. You use the same rotations for the splay
+tree.
+
+
Red Black Tree
+
+
red-black tree is a 2-3 tree is a B-tree. Also, you probably use jemalloc, and it has a red-black
+tree implemented as a C
+macro.
Left-leaning red-black trees are an interesting variation, which is claimed to be simpler, but is
+also claimed to not actually be simpler, because it is not symmetric and neuters the children
+trick.
+
+
B-tree
+
+
If you use Rust, you probably use a B-tree. Also, if you use a database, it stores data either in
+an LSM or in a B-tree. Both of these are because B-trees play nice with the memory hierarchy.
+
+
hash table
+
+
Literally everywhere; both chaining and open-addressing versions are widely used.
+
+
Depth First Search
+
+
This is something I have to code, explicitly or implicitly, fairly often. Every time you have a
+DAG, where things depend on other things, you’ll have a DFS somewhere. In rust-analyzer,
+there are at least a couple — one in the borrow checker for something (I have no idea what it does,
+I just grepped for fn dfs) and one in the crate graph to detect cycles.
+
+
Breadth First Search
+
+
Ditto, any kind of exploration problem is usually solved with bfs. E.g., rust-analyzer uses bfs
+for directory traversal.
+
Which is better, bfs or dfs? Why not both?! Take a look at bdfs from rust-analyzer.
+
+
Topological Sort
+
+
Again, this comes up every time you deal with things which depend on each other. rust-analyzer has
+crates_in_topological_order.
+
+
Strongly Connected Components
+
+
This is needed every time things depend on each other, but you also allow cyclic dependencies. I
+don’t think I’ve needed this one in real life. But, given that SCC is how you solve 2-SAT in
+polynomial time, it seems important to know in order to understand the 3 in 3-SAT.
+
+
Minimal Spanning Tree
+
+
Ok, really drawing a blank here! Connects to sorting, disjoint set union (which is needed for
+unification in type-checkers), and binary heap. Seems like a practically important algorithm though! Ah,
+MST also gives an approximation for the planar traveling salesman I think, another border between hard
+& easy problems.
+
+
Dijkstra
+
+
Dijkstra is what I think about when I imagine a Platonic algorithm, though
+I don’t think I’ve used it in practice? Connects to heap.
+
Do you know why we use i, j, k for loop indices? Because D ijk stra!
+
+
Floyd-Warshall
+
+
This one is cool! Everybody knows why any regular expression can be compiled to an equivalent
+finite state machine. Few people know the reverse, why each automaton has an equivalent regex
+(many people know this fact, but few understand why). Well, because Floyd-Warshall! To convert an
+automaton to regex use the same algorithm you use to find pairwise distances in a graph.
+
Also, this is a final boss of dynamic programming. If you understand why this algorithm works, you
+understand dynamic programming. Despite being tricky to understand, it’s very easy to implement! I
+randomly stumbled into Floyd-Warshall, when I tried to implement a different, wrong approach, and
+made a bug which turned my broken algo into a correct Floyd-Warshall.
+
+
Bellman-Ford
+
+
Again, not many practical applications here, but the theory is well connected. All shortest-path
+algorithms are actually fixed-point iterations! But with Bellman-Ford and its explicit edge
+relaxation operator that’s most obvious. Next time you open static analysis textbook and learn
+about fixed point iteration, map that onto the problem of finding shortest paths!
+
+
Quadratic Substring Search
+
+
This is what your language’s standard library does.
+
+
Rabin-Karp
+
+
An excellent application of hashes. The same idea, hash(composite) =
+combine(hash(component)*), is used in rust-analyzer to intern syntax
+trees.
+
+
Boyer-Moore
+
+
This is a beautiful and practical algorithm which probably handles the bulk of real-world searches
+(that is, it’s probably the hottest bit of ripgrep as used by an average person). Delightfully,
+this algorithm is faster than theoretically possible — it doesn’t even look at every byte of
+input data!
+
+
Knuth-Morris-Pratt
+
+
Another “this is how you do string search in the real world” algorithm. It also is the platonic
+ideal of a finite state machine, and almost everything is an FSM. It also is Aho-Corasick.
+
+
Aho-Corasick
+
+
This is the same as Knuth-Morris-Pratt, but also teaches you about tries. Again, super-useful for
+string searches. As it is an FSM, and a regex is an FSM, and there’s a general construct for
+building a product of two FSMs, you can use it to implement fuzzy search. “Workspace symbol”
+feature in rust-analyzer works like this. Here’s a part
+of implementation.
+
+
Edit Distance
+
+
Everywhere in Bioinformatics (not the actual edit distance, but this problem shape). The first
+post on this blog is about this problem:
+It’s not about algorithms though, it’s about CPU-level parallelism.
+
+
+Types and the Zig Programming Language
+2023-08-09
+https://matklad.github.io/2023/08/09/types-and-zig
+
+
Notes on less-than-obvious aspects of Zig’s type system and things that surprised me after diving
+deeper into the language.
Zig has a nominal type system despite the fact that types lack names. A struct type is declared by
+struct { field: T }.
+It’s anonymous; an explicit assignment is required to name the type:
+
+
+
Still, the type system is nominal, not structural. The following does not compile:
+
+
+
The following does:
+
+
+
One place where Zig is structural is anonymous struct literals:
+
+
+
Types of x and y are different, but x can be coerced to y.
+
In other words, Zig structs are anonymous and nominal, but anonymous structs are structural!
Simple type inference for an expression works by first recursively inferring the types of
+subexpressions, and then deriving the result type from that. So, to infer types in
+foo().bar(), we first derive the type of foo(), then lookup method bar on that
+type, and use the return type of the method.
+
More complex type inference works through the so-called unification algorithm. It starts with a similar
+recursive walk over the expression tree, but this walk doesn’t infer types directly; rather, it
+assigns a type variable to each subexpression and generates equations relating the type variables. The
+result of this first phase looks like this:
+
+
+
Then, in the second phase the equations are solved, yielding, in this case, x = Int and y = Int.
+
Usually languages with powerful type systems have unification somewhere, though often unification
+is limited in scope (for example, Kotlin infers types statement-at-a-time).
+
It is curious that Zig doesn’t do unification: type inference is a simple single-pass recursion (or
+at least it should be; I haven’t looked at how it is actually implemented). So, anytime there’s a
+generic function like
+fn reverse(comptime T: type, xs: []T) void,
+the call site has to pass the type in explicitly:
+
+
+
Does it mean that you have to pass the types all the time? Not really! In fact, the only place which
+feels like a burden is the functions in the std.mem module which operate on slices, but that’s just
+because slices are builtin types (a kind of pointer really) without methods. The thing is, when you
+call a method on a “generic type”, its type parameters are implicitly in scope, and don’t have to be
+specified. Study this example:
+
+
+
There’s a runtime parallel here. At runtime, there is single dynamic dispatch, which prioritizes the
+dynamic type of the first argument, and multiple dynamic dispatch, which can look at the dynamic types
+of all arguments. Here, at compile time, the type of the first argument gets a preferential
+treatment. And, similarly to runtime, this covers 80% of use cases! Though, I’d love for things like
+std.mem.eql to be actual methods on slices…
One of the best tricks a language server can pull off for as-you-type analysis is skipping bodies of
+the functions in dependencies. This works as long as the language requires complete signatures. In
+functional languages, it’s customary to make signatures optional, which precludes this crucial
+optimization. As per Modularity Of Lexical
+Analysis, this has
+repercussions for all of:
+
+
+incremental compilation,
+
+
+parallel compilation,
+
+
+robustness to errors.
+
+
+
I always assumed that Zig with its crazy comptime requires autopsy.
+But that’s not actually the case! Zig doesn’t have decltype(auto), signatures are always explicit!
+
Let’s look at, e.g., std.mem.bytesAsSlice:
+
+
+
Note how the return type is not anytype, but the actual, real thing. You could write complex
+computations there, but you can’t look inside the body. Of course, it also is possible to write fn
+foo() @TypeOf(bar()) {, but that feels like a fair game —bar() will be evaluated at
+compile time. In other words, only bodies of functions invoked at comptime needs to be looked at by
+a language server. This potentially improves performance for this use-case quite a bit!
+
It’s useful to contrast this with Rust. There, you could write
+
+
+
Although it feels like you are stating the interface, it’s not really the case. Auto traits like
+Send and Sync leak, and that can be detected by downstream code and lead to, e.g., different
+methods being called via Deref-based specialization depending on : Send being implemented:
+
+
+
Zig is much more strict here, you have to fully name the return type (the name doesn’t have to be
+pretty, take a second look at bytesAsSlice). But it’s not perfect: a genuine leakage happens with
+inferred error types (!T syntax). A bad example would look like this:
+
+
+
Here, to check main, we actually do need to dissect f’s body; we can’t treat the error union
+abstractly. When the compiler analyzes main, it needs to stop to process f’s signature (which is
+very fast, as it is very short) and then f’s body (this part could be quite slow, as there might be a
+lot of code behind that Mystery!). It’s interesting to ponder alternative semantics, where, during
+type checking, inferred types are treated abstractly, and error exhaustiveness is a separate late
+pass in the compiler. That way, the compiler only needs f’s signature to check main. And that means
+that the bodies of main and f could be checked in parallel.
+
That’s all for today! The type system surprises I’ve found so far are:
+
+
+
Nominal type system despite notable absence of names of types.
+
+
+
Unification-less generics which don’t incur unreasonable annotation burden due to methods “closing
+over” generic parameters.
+
+
+
Explicit signatures with no Voldemort types, with the
+notable exception of error unions.
People sometimes ask me: “Alex, how do I learn X?”. This article is a compilation of advice I
+usually give. This is “things that worked for me” rather than “the most awesome things on earth”. I
+do consider every item on the list to be fantastic though, and I am forever grateful to people
+putting these resources together.
I don’t think I have any useful advice on how to learn programming from zero. The rest of the post
+assumes that you at least can, given sufficient time, write simple programs. E.g., a program that
+reads a list of integers from an input textual file, sorts them using a quadratic algorithm, and
+writes the result to a different file.
https://projecteuler.net/archives is fantastic. The first 50 problems or so are a perfect “drill”
+to build programming muscle, to go from “I can write a program to sort a list of integers” to “I can
+easily write a program to sort a list of integers”.
+
Later problems are very heavily math based. If you are mathematically inclined, this is perfect —
+you got to solve fun puzzles while also practicing coding. If advanced math isn’t your cup of tea,
+feel free to stop doing problems as soon as it stops being fun.
https://en.wikipedia.org/wiki/Modern_Operating_Systems is fantastic. A version of the
+book was the first
+thick programming-related tome I devoured. It gives a big picture of the inner workings of the software
+stack, and was a turning point for me personally. After reading this book I realized that I want to
+be a programmer.
https://www.nand2tetris.org is fantastic. It plays a similar “big picture” role as MOS,
+but this time you are the painter. In this course you build a whole computing system yourself,
+starting almost from nothing. It doesn’t teach you how the real software/hardware stack works, but
+it thoroughly dispels any magic, and is extremely fun.
https://cses.fi/problemset/ is fantastic. This is a list of algorithmic problems, which is
+meticulously crafted to cover all the standard topics to a reasonable depth. This is by far the best
+source for practicing algorithms.
https://www.coursera.org/learn/programming-languages is fantastic. This course is a whirlwind tour
+across several paradigms of programming, and makes you really get what programming languages are
+about (and variance).
https://www.tedinski.com/archive/ is fantastic. Work through the whole archive in chronological
+order. This is by far the best resource on “programming in the large”.
Having a great mentor is fantastic, but mentors are not always available. Luckily, programming can
+be mastered without a mentor, if you got past the initial learning step. When you code, you get a
+lot of feedback, and, through trial and error, you can process the feedback to improve your skills.
+In fact, the hardest bit is actually finding the problems to solve (and this article suggests many).
+But if you have a problem, you can self-improve by noticing the following:
+
+
+How you verify that the solution works.
+
+
+Common bugs and techniques to avoid them in the future.
+
+
+Length of the solution: can you solve the problem using shorter, simpler code?
+
+
+Techniques — can you apply anything you’ve read about this week? How would the problem be solved
+in Haskell? Could you apply pattern from language X in language Y?
+
+
+
In this context it is important to solve the same problem repeatedly. E.g., you could try solving
+the same model problem in all languages you know, with a month or two break between attempts.
+Repeatedly doing the same thing and noticing differences and similarities between tries is the
+essence of self-learning.
Learning your first programming language is a nightmare, because you are learning your editing
+environment (PyScripter, IntelliJ IDEA, VS Code) first, simple algorithms second, and the language
+itself third. It gets much easier afterwards!
+
Learning different programming languages is one of the best ways to improve your programming skills.
+By seeing what’s similar and what’s different, you learn more deeply how things work under the hood.
+Different languages put different idioms to the forefront, and learning several expands your
+vocabulary considerably. As a bonus, after learning N languages, learning N+1st becomes a question
+of skimming through the official docs.
+
In general, you want to cover big families of languages: Python, Java, Haskell, C, Rust, Clojure
+would be a good baseline. Erlang, Forth, and Prolog would be good additions afterwards.
+
+
Level 1
+
+
You are not actually learning algorithms, you are learning programming. At this stage, it doesn’t
+matter how long your code is, how pretty it is, or how efficient it is. The only thing that
+matters is that it solves the problem. Generally, this level ends when you are fairly comfortable
+with recursion. The first few problems from Project Euler are a great resource here.
+
+
Level 2
+
+
Here you learn algorithms proper. The goal here is mostly encyclopedic knowledge of common
+techniques. There are quite a few, but not too many of those. At this stage, the most useful thing
+is understanding the math behind the algorithms — being able to explain algorithm using
+pencil&paper, prove its correctness, and analyze Big-O runtime. Generally, you want to learn the
+name of algorithm or technique, read and grok the full explanation, and then implement it.
+
I recommend doing an abstract implementation first (i.e., not “HashMap to solve problem X”, but
+“just HashMap”). Include tests in your implementation. Use randomized testing (e.g., when testing
+sorting algorithms, don’t use a finite set of examples; generate a million random ones).
+
It’s OK and even desirable to implement the same algorithm multiple times. When solving problems,
+like CSES, you could abstract your solutions and re-use them, but it’s better to code everything
+from scratch every time, until you’ve fully internalized the algorithm.
+
+
Level 3
+
+
One day, long after I’ve finished my university, I was a TA for an algorithms course. The lecturer
+for the course was the person who originally taught me to program, through a similar algorithms
+course. And, during one coffee break, he said something like
+
+
+
I was thunderstruck! I didn’t realize that’s the reason why I am learning (well, teaching at that
+point) algorithms! Before, I always muddled through my algorithms by randomly tweaking generally
+correct stuff until it works. E.g., with a binary search, just add +1 somewhere until it doesn’t
+loop on random arrays. After hearing this advice, I went home and wrote my millionth binary
+search, but this time I actually added comments with loop invariants, and it worked from the first
+try! I applied similar techniques for the rest of the course, and since then my subjective
+perception of bug rate (for normal work code) went down dramatically.
+
So this is the third level of algorithms — you hone your coding skills to program without bugs.
+If you are already fairly comfortable with algorithms, try doing CSES again. But this time, spend
+however much you need double-checking the code before submission, but try to get everything
+correct on the first try.
Here’s the list of things you might want to be able to do, algorithmically. You don’t need to be
+able to code everything on the spot. I think it would help if you know what each word is about, and
+have implemented the thing at least once in the past.
A very powerful exercise is coding a medium-sized project from scratch. Something that takes more
+than a day, but less than a week, and has a meaningful architecture which can be just right, or
+messed up. Here are some great projects to do:
+
+
Ray Tracer
+
+
Given an analytical description of a 3D scene, convert it to a colored 2D image, by simulating a
+path of a ray of light as it bounces off objects.
+
+
Software Rasterizer
+
+
Given a description of a 3D scene as a set of triangles, convert it to a colored 2D image by
+projecting triangles onto the viewing plane and drawing the projections in the correct order.
+
+
Dynamically Typed Programming Language
+
+
An interpreter which reads source code as text, parses it into an AST, and directly executes the
+AST (or maybe converts AST to the byte code for some speed up)
+
+
Statically Typed Programming Language
+
+
A compiler which reads source code as text, and spits out a binary (WASM would be a terrific
+target).
+
+
Relational Database
+
+
Several components:
+
+
+Storage engine, which stores data durably on disk and implements on-disk ordered data structures
+(B-tree or LSM)
+
+
+Relational data model which is implemented on top of primitive ordered data structures.
+
+
+Relational language to express schema and queries.
+
+
+Either a TCP server to accept transactions as a database server, or an API for embedding as an
+in-process “embedded” database.
+
+
+
+
Chat Server
+
+
An exercise in networking and asynchronous programming. Multiple client programs connect to a
+server program. A client can send a message either to a specific different client, or to all other
+clients (broadcast). There are many variations on how to implement this: blocking read/write
+calls, epoll, io_uring, threads, callbacks, futures, manually-coded state machines.
+
+
+
Again, it’s more valuable to do the same exercise six times with variations, than to blast through
+everything once.
+
+
+On Modularity of Lexical Analysis
+2023-08-01
+https://matklad.github.io/2023/08/01/on-modularity-of-lexical-analysis
+
+
I was going to write a long post about designing an IDE-friendly language. I wrote an intro and
+figured that it would make a better, shorter post on its own. Enjoy!
+
The big idea of language server construction is that language servers are not magic — capabilities
+and performance of tooling are constrained by the syntax and semantics of the underlying language.
+If a language is not designed with toolability in mind, some capabilities (e.g, fully automated
+refactors) are impossible to implement correctly. What’s more, an IDE-friendly language turns out to
+be a fast-to-compile language with easy-to-compose libraries!
+
More abstractly, there’s this cluster of properties, unrelated at first sight but intimately
+intertwined and mutually supportive:
+
+
+parallel, separate compilation,
+
+
+incremental compilation,
+
+
+resilience to errors.
+
+
+
Separate compilation measures how fast we can compile a codebase from scratch if we have an unlimited
+number of CPU cores. For a language server, it solves the cold start problem — time to
+code-completion when the user opens the project for the first time or switches branches. Incremental
+compilation is the steady state of the language server — user types code and expects to see
+immediate effects throughout the project. Resilience to errors is important for two different
+sub-reasons. First, when the user edits the code it is by definition incomplete and erroneous, but a
+language server still must analyze the surrounding context correctly. But the killer feature of
+resilience is that, if you are absolutely immune to some errors, you don’t even have to look at the
+code. If a language server can ignore errors in function bodies, it doesn’t have to look at the
+bodies of functions from dependencies.
+
All three properties, parallelism, incrementality, and resilience, boil down to modularity —
+partitioning the code into disjoint components with well-defined interfaces, such that each
+particular component is aware only about the interfaces of other components.
Let’s do a short drill and observe how the three properties interact at a small scale. Let’s
+minimize the problem of separate compilation to just … lexical analysis. How can we build a
+language that is easier to tokenize for a language server?
+
An unclosed quote is a nasty little problem! Practically, it is rare enough that it doesn’t really
+matter how you handle it, but qualitatively it is illuminating. In a language like Rust, where
+strings can span multiple lines, inserting a " in the middle of a file changes the lexical structure
+of the following text completely (/*, start of a block comment, has the same effect). When tokens
+change, so does the syntax tree and the set of symbols defined by the file. A tiny edit, just one
+symbol, unhinges semantic structure of the entire compilation unit.
+
Zig solves this problem. In Zig, no token can span several lines. That is, it would be correct to
+first split a Zig source file by \n, and then tokenize each line separately. This is achieved by
+solving, in a better way, the underlying problems that would otherwise require multi-line tokens. Specifically:
+
+
+
there’s a single syntax for comments, //,
+
+
+
double-quoted strings can’t contain a \n,
+
+
+
but there’s a really nice syntax for multiline strings:
+
+
+
+
+
Do you see modules here? Disjoint-partitioning into interface-connected components? From the
+perspective of lexical analysis, each line is a module. And a line always has a trivial, empty
+interface — different lines are completely independent. As a result:
+
First, we can do lexical analysis in parallel. If you have N CPU cores, you can split file into N
+equal chunks, then in parallel locally adjust chunk boundaries such that they fall on newlines, and
+then tokenize each chunk separately.
+
Second, we have quick incremental tokenization — given a source edit, you determine the set of
+lines affected, and re-tokenize only those. The work is proportional to the size of the edit plus at
+most two boundary lines.
+
Third, any lexical error in a line is isolated just to this line. There’s no unclosed quote
+problem, mistakes are contained.
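To make the line-as-module idea concrete, here is a toy sketch in TypeScript (not tied to any real lexer; the token kinds are arbitrary):

```ts
type Token = { kind: "word" | "number" | "punct"; text: string };

// Toy per-line lexer: identifiers, integers, and single punctuation characters.
// Because no token can span a newline, every line can be lexed independently.
function tokenizeLine(line: string): Token[] {
  const tokens: Token[] = [];
  for (const m of line.matchAll(/[A-Za-z_]\w*|\d+|\S/g)) {
    const text = m[0];
    const kind: Token["kind"] =
      /^\d/.test(text) ? "number" : /^\w/.test(text) ? "word" : "punct";
    tokens.push({ kind, text });
  }
  return tokens;
}

// Incremental re-tokenization: after an edit, only the changed lines are
// re-lexed; everything else is reused from the cache.
function retokenize(lines: string[], cache: Token[][], changed: Set<number>): Token[][] {
  return lines.map((line, i) => (changed.has(i) ? tokenizeLine(line) : cache[i]));
}
```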
+
I am by no means saying that line-by-line lexing is a requirement for an IDE-friendly language
+(though it would be nice)! Rather, I want you to marvel how the same underlying structure of the
+problem can be exploited for quarantining errors, reacting to changes quickly, and parallelizing the
+processing.
+
The three properties are just three different faces of modularity in the end!
+
+
I do want to write that “IDE-friendly language” post at some point, but, as a hedge (after all, I
+still owe you “Why LSP Sucks?” one…), here are two comments where I explored the idea somewhat:
+1,
+2.
+
I also recommend these posts, which explore the same underlying phenomenon from the software
+architecture perspective:
+
+Three Different Cuts
+2023-07-16
+https://matklad.github.io/2023/07/16/three-different-cuts
+
+
In this post, we’ll look at how Rust, Go, and Zig express the signature of function cut— the power tool of string manipulation.
+Cut takes a string and a pattern, and splits the string around the first occurrence of the pattern:
+cut("life", "if") = ("l", "e").
+
At a glance, it seems like a non-orthogonal jumbling together of searching and slicing.
+However, in practice a lot of ad-hoc string processing can be elegantly expressed via cut.
+
A lot of things are key=value pairs, and cut fits perfectly there.
+What’s more, many more complex sequences, like
+--arg=key=value,
+can be viewed as nested pairs.
+You can cut around = once to get --arg and key=value, and then cut the second time to separate key from value.
+
In Rust, this function looks like this:
+
+
+
Rust’s Option is a good fit for the result type: it clearly describes the behavior of the function when the pattern isn’t found in the string at all.
+Lifetime 'a expresses the relationship between the result and the input — both pieces of result are substrings of &'a self, so, as long as they are used, the original string must be kept alive as well.
+Finally, the separator isn’t another string, but a generic P: Pattern.
+This gives a somewhat crowded signature, but allows using strings, single characters, and even fn(c: char) -> bool functions as patterns.
+
When using the function, there is a multitude of ways to access the result:
+
+
+
Here’s a Go equivalent:
+
+
+
It has a better name!
+It’s important that frequently used building-block functions have short, memorable names, and “cut” is just perfect for what the function does.
+Go doesn’t have an Option, but it allows multiple return values, and any type in Go has a zero value, so a boolean flag can be used to signal None.
+Curiously, if sep is not found in s, after is set to "", but before is set to s (that is, the whole string).
+This is occasionally useful, and corresponds to the last Rust example.
+But it also isn’t something immediately obvious from the signature, it’s an extra detail to keep in mind.
+Which might be fine for a foundational function!
+Similarly to Rust, the resulting strings point to the same memory as s.
+There are no lifetimes, but a potential performance gotcha — if one of the resulting strings is alive, then the entire s can’t be garbage collected.
+
There isn’t much in the way of using the function in Go:
+
+
+
Zig doesn’t yet have an equivalent function in its standard library, but it probably will at some point, and the signature might look like this:
+
+
+
Similarly to Rust, Zig can express optional values.
+Unlike Rust, the option is a built-in, rather than a user-defined type (Zig can express a generic user-defined option, but chooses not to).
+All types in Zig are strictly prefix, so leading ? concisely signals optionality.
+Zig doesn’t have first-class tuple types, but uses very concise and flexible type declaration syntax, so we can return a named tuple.
+Curiously, this anonymous struct is still a nominal, rather than a structural, type!
+Similarly to Rust, prefix and suffix borrow the same memory that s does.
+Unlike Rust, this isn’t expressed in the signature — while in this case it is obvious that the lifetime would be bound to s, rather than sep, there are no type system guardrails here.
+
Because ? is a built-in type, we need some amount of special syntax to handle the result, but it curiously feels less special-case and more versatile than the Rust version.
+
+
+
Moral of the story?
+Work with the grain of the language — expressing the same concept in different languages usually requires a slightly different vocabulary.
TL;DR, https://bors.tech delivers a meaningfully better experience, although it suffers from being a third-party integration.
+
Specific grievances:
+
Complexity. This is a vague feeling, but merge queue feels like it is built by complexity merchants — there are a lot of unclear settings and voluminous and byzantine docs.
+Good for allocating extra budget towards build engineering, bad for actual build engineering.
+
GUI-only configuration. Bors is set up using bors.toml in the repository; the merge queue is set up by clicking through a web GUI.
+To share config with other maintainers, I resorted to a zoomed-out screenshot of the page.
+
Unclear set of checks. The purpose of the merge queue is to enforce the not-rocket-science rule of software engineering — making sure that the code in the main branch satisfies certain quality invariants (all tests are passing).
+It is impossible to tell what merge queue actually enforces.
+Typically, when you enable merge queue, you subsequently find out that it actually merges anything, without any checks whatsoever.
+
Double latency. One of the biggest benefits of a merge queue for a high velocity project is its asynchrony.
+After submitting a PR, you can do a review and schedule PR to be merged without waiting for CI to finish.
+This is massive: it is a 2X reduction in the human attention required.
+Without the queue, you need to look at a PR twice: once to do a review, and once to click merge after the green checkmark is in.
+With the queue, you only need a review, and the green checkmark comes in asynchronously.
+Except that with GitHub merge queue, you can’t actually add a PR to the queue until you get a green checkmark.
+In effect, that’s still 2X attention, and then a PR runs through the same CI checks twice (yes, you can have separate checks for merge queue and PR. No, this is not a good idea, this is complexity and busywork).
+
Lack of delegation. With bors, you can use bors delegate+ to delegate merging of a single, specific pull request to its author.
+This is helpful to drive contributor engagement, and to formalize “LGTM with the nits fixed” approval (which again reduces number of human round trips).
+
You should still use the GitHub merge queue, rather than bors-ng, as it is now a first-party feature.
+Still, it’s important to understand how things should work, to be able to improve the state of the art some other time.
+The Worst Zig Version Manager
+2023-06-02
+https://matklad.github.io/2023/06/02/the-worst-zig-version-manager
+
+
+
+
One of the values of Zig which resonates with me deeply is a mindful approach to dependencies.
+Zig tries hard not to ask too much from the environment, such that, if you get zig version running, you can be reasonably sure that everything else works.
+That’s one of the main motivations for adding an HTTP client to the Zig distribution recently.
+Building software today involves downloading various components from the Internet, and, if Zig wants software built with Zig to be hermetic and self-sufficient, it needs to provide the ability to download files from HTTP servers.
+
There’s one hurdle for self-sufficiency: how do you get Zig in the first place?
+One answer to this question is “from your distribution’s package manager”.
+This is not a very satisfying answer, at least until the language is both post 1.0 and semi-frozen in development.
+And even then, what if your distribution is Windows?
+How many distributions should be covered by “Installing Zig” section of your CONTRIBUTING.md?
+
Another answer would be a version manager, a-la rustup, nvm, or asdf.
+These tools work well, but they are quite complex, and rely on various subtle properties of the environment, like PATH, shell activation scripts and busybox-style multipurpose executable.
+And, well, this also kicks the can down the road — you can use zvm to get Zig, but how do you get zvm?
+
I like how we do this in TigerBeetle.
+We don’t use zig from PATH.
+Instead, we just put the correct version of Zig into ./zig folder in the root of the repository, and run it like this:
+
+
+
Suddenly, whole swaths of complexity go away.
+Quiz time: if you need to add a directory to PATH, which script should be edited so that both the graphical environment and the terminal are affected?
+
Finally, another interesting case study is Gradle.
+Usually Gradle is a negative example, but they do have a good approach for installing Gradle itself.
+The standard pattern is to store two scripts, gradlew.sh and gradlew.bat, which bootstrap the right version of Gradle by downloading a jar file (java itself is not bootstrapped this way though).
+
What all these approaches struggle to overcome is the problem of bootstrapping.
+Generally, if you need to automate anything, you can write a program to do that.
+But you need some pre-existing program runner!
+And there are just no good options out of the box — bash and powershell are passable, but barely, and they are different.
+And “bash” and the set of coreutils also differ depending on the Unix in question.
+But there’s just no good solution here — if you want to bootstrap automatically, you must start with universally available tools.
+
But is there perhaps some scripting language which is shared between Windows and Unix?
+@cspotcode suggests a horrible workaround.
+You can write a script which is both a bash script and a powershell script.
+And it even isn’t too too ugly!
+
+
+
So, here’s an idea for a hermetic Zig version management workflow.
+There’s a canonical, short getzig.ps1 PowerShell/sh script which is vendored verbatim by various projects.
+Running this script downloads an appropriate version of Zig, and puts it into ./zig/zig inside the repository (.gitignore contains /zig).
+Building, testing, and other workflows use ./zig/zig instead of relying on global system state ($PATH).
+
A proof-of-concept getzig.ps1 is at the start of this article.
+Note that I don’t know bash, powershell, and how to download files from the Internet securely, so the above PoC was mostly written by Chat GPT.
+But it seems to work on my machine.
+I clone https://github.com/matklad/hello-getzig and run
+
+
+
on both NixOS and Windows 10, and it prints hello.
+
If anyone wants to make an actual thing out of this idea, here’s possible desiderata:
+
+
+
A single polyglot getzig.sh.ps1 is cute, but using a couple of different scripts wouldn’t be a big problem.
+
+
+
Size of the scripts could be a problem, as they are supposed to be vendored into each repository.
+I’d say 512 lines for combined getzig.sh.ps1 would be a reasonable complexity limit.
+
+
+
The script must “just work” on all four major desktop operating systems: Linux, Mac, Windows, and WSL.
+
+
+
The script should be polymorphic in curl / wget and bash / sh.
+
+
+
It’s ok if it doesn’t work absolutely everywhere — downloading/building Zig manually for an odd platform is also an acceptable workflow.
+
+
+
The script should auto-detect appropriate host platform and architecture.
+
+
+
Zig version should be specified in a separate zig-version.txt file.
+
+
+
After downloading the file, its integrity should be verified.
+For this reason, zig-version.txt should include a hash alongside the version.
+As downloads are different depending on the platform, I think we’ll need some help from Zig upstream here.
+In particular, each published Zig version should include a cross-platform manifest file, which lists hashes and urls of per-platform binaries.
+The hash included into zig-version.txt should be the manifest’s hash.
Welcome to my resume!
+It consists of two parts.
+The first part is the free-form narrative of what I do work-wise.
+This is something I would be excited to read from a person I am going to work with.
+The second part is a more traditional bullet-list of companies, positions, and projects.
+The resume is available as .html and .pdf.
I used to do math.
+Although I no longer do mathematics daily, it is the basis I use to think about programming.
+I enjoy solving an occasional puzzle.
+See “Generate All the Things” and “Notes on Paxos” articles as examples of math I like.
+
I am a programmer.
+I like writing code just for the sake of it.
+I like deleting code even more.
+I like short, simple, robust and beautiful code, which not only gets the job done, but does it in an obviously correct way.
+See, e.g., ungrammar for an example of a relatively short and self-contained piece of programming.
+
I am a pragmatist.
+The above two points sound outright scary, but don’t worry :)
+While I do enjoy encoding lambda calculus in types, that’s not what I spend most of my time on.
+I see most code as something to be replaced and re-written later, and optimise for making changes over time, not for perfection right now.
+This section from rust-analyzer style guide is a good example of this.
+
I loathe accidental complexity.
+I think I spend most of my time trying to make things simpler, trying to remove parts, trying to make foundational APIs more crisp.
+I have a visceral reaction to the gaps between how things should be and how they are.
+cargo xtask pattern shows to what lengths I am willing to go just to get rid of the mess the unix shell is.
+
I build systems.
+Software engineering is programming integrated over time, and it’s that time dimension that really matters.
+The shape of the software today is determined by accidental, runaway, viral successes of yesterday.
+There’s a reason why VT100 interface is still programmed against today, and it is not its technical adequacy.
+This is not my article, but I like it so much that I’ll advertise it even in my resume.
+System’s thinking is why I am fascinated with Rust and not, eg, with Kotlin.
+Since Java, with its reasonably fast managed runtime, Rust is the first PL revolution which meaningfully changes how we write software, and doesn’t just repackage known-good idioms with a better syntax (which is also important, just not as exciting!).
+
I build open source communities.
+My biggest successes so far I think are IntelliJ Rust and rust-analyzer.
+I didn’t write the hardest, smartest bits of those.
+But I tried very hard to make sure that others can do that, by removing accidental complexity, by making contribution enjoyable, by trying to program the architecture which would be robust to time and systems effects.
+
More generally, I help build moderately large projects, which are combinations of all of the above: people, systematic forces, beautiful mathematical abstractions at the core, and hundreds of thousands of lines of code as a physical manifestation.
+See “One Hundred Thousand Lines of Rust” series for a bunch of concrete, pragmatic lessons I’ve learned so far.
I am a member of the dev-tools team of the Rust programming language. I was the
+original author of both IntelliJ
+Rust and rust-analyzer— the two
+tools which today power IDE support for Rust in virtually every editor. My work
+included both the technical task of writing advanced, incremental, resilient
+compilers and organizing a vibrant community of contributors and maintainers
+to ensure that my direct involvement is not a requirement.
With Ferrous Systems, we brought rust-analyzer project from an MVP to a de-facto
+standard for the ecosystem. I also helped with teaching people to use Rust
+efficiently.
At JetBrains, I have led the development of
+IntelliJ-Rust plugin for the Rust
+programming language. The plugin is a Rust “compiler” written in Kotlin, with
+full-blown parser, name resolution and type inference algorithms, and
+integrations with build tools and debuggers. Besides solving the technical
+problems, I’ve created an open source community around the plugin by mentoring
+issues, writing developer documentation and supporting contributors.
Stepik is an e-learning platform, written in Python, focused on a rich variety of
+practical exercises and ease of creating content. I was on the backend team of
+three from the start of the project. Among other things, I worked on the
+exercises subsystem and students’ code sandboxing and progress tracking, and
+designed and implemented the JSON API interface for the single-page frontend.
This is a super work-in-progress page which collects various rules-of-thumb I use.
+The primary goal so far is to collect the rules for myself, that’s why I don’t link to this page from anywhere yet.
Prefer full names except for extremely common cases (ctx for context), or equal-length pairs
+(next/prev). Use consistent names. Naming variables after types (let thing: Thing) is a way
+to achieve global consistency with little coordination.
+
Build a vocabulary of standard names and re-use it:
+
+
ctx
+
+
“context” of an operation. Typically holds something mutable. Read-only
+context is named params.
+
+
params
+
+
A bag of named arguments. Unlike config, might hold not only pod types.
+
+
config
+
+
Generally user-specified POD parameters.
+
+
sink
+
+
“output” of an internal iterator, typically sink: &mut FnMut(T) or sink: &mut Vec<T>.
Avoid opening file descriptors in favor of bulk operations. To write data to a file, you need to
+follow a lifecycle: open a file descriptor, issue write syscalls, close the file descriptor. Lifecycle
+handling requires complicated type-system machinery and is better avoided. Usually, the standard
+library provides something like std::fs::read_to_string which encapsulates lifecycle management.