Linear stack blowup with multiple Vec::push of big struct with destructor #40883

arielb1 · 2017-03-28T22:40:27Z

STR

#![crate_type="rlib"]

pub struct Big {
    drop_me: [Option<Box<u8>>; 64],
}

pub fn supersize_me(meal: fn() -> Big, out: &mut Vec<Big>) {
    out.push(meal());
    out.push(meal());
    out.push(meal());
    out.push(meal());
    out.push(meal());
    out.push(meal());
    out.push(meal());
    out.push(meal());
    out.push(meal());
    out.push(meal());
    out.push(meal());
    out.push(meal());
    out.push(meal());
    out.push(meal());
    out.push(meal());
    out.push(meal()); // 16 calls to `push`
}

Expected Result

Function should use a small amount of stack space, definitely less than 2 kilobytes (Big is 512 bytes per copy); 1.12.0 with -Z orbit=off uses 1088 bytes of stack.

Actual Result

When compiled, the function uses more than 16384 = 8*64*16*2 bytes of stack space, as is evident from subq $16384, %rsp in the assembly - 2 copies of Big for every call to push.
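The arithmetic behind that figure can be checked with a small sketch. `stack_bytes` is a hypothetical helper introduced only for illustration. It assumes a 64-bit target, where `Option<Box<u8>>` is pointer-sized, so `Big` is 64 * 8 = 512 bytes.

```rust
use std::mem::size_of;

pub struct Big {
    pub drop_me: [Option<Box<u8>>; 64],
}

/// Hypothetical helper: stack bytes reserved if every `push` call keeps
/// `copies_per_push` separate copies of `Big` alive in the frame.
pub fn stack_bytes(pushes: usize, copies_per_push: usize) -> usize {
    size_of::<Big>() * pushes * copies_per_push
}
```

On a 64-bit target, `stack_bytes(16, 2)` is 512 * 16 * 2 = 16384, which matches the `subq $16384, %rsp` above, while a frame that reused the slots would need only about `stack_bytes(1, 2)` = 1024 bytes.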

Notes

This is the root cause for #40573. It is not new, however - it was probably always present in MIR.


arielb1 · 2017-03-29T14:20:52Z

Another example:

#![crate_type="rlib"]
#![feature(rustc_private)]
extern crate rustc;

use rustc::mir::*;

pub fn biggie2(basic_blocks: &mut Vec<BasicBlockData<'static>>,
               mk: fn() -> BasicBlockData<'static>,
               may_panic: fn())
{
    {
        let value = mk();
        may_panic();
        basic_blocks.push(value);
    }

    {
        let value = mk();
        may_panic();
        basic_blocks.push(value);
    }

    {
        let value = mk();
        may_panic();
        basic_blocks.push(value);
    }
}

arielb1 · 2017-03-29T20:07:07Z

And then there's this case, which is probably the big granddaddy of them all, and fixing it probably requires help on the LLVM side:

#![crate_type="rlib"]

pub fn foo(get: fn() -> [u64; 128], sink: fn(u32),
           may_panic: fn([u64; 128]) -> u32,
           something_random_with_a_dtor: Box<u32>) {
    sink(may_panic(get()));
    sink(may_panic(get()));
}

The LLVM IR ends up like this:

define void @_ZN8rust_out3foo17hb77808e3fc28588aE(void ([128 x i64]*)* nocapture, void (i32)* nocapture, i32 ([128 x i64]*)* nocapture, i32* noalias dereferenceable(4)) unnamed_addr #0 personality i32 (i32, i32, i64, %"unwind::libunwind::_Unwind_Exception"*, %"unwind::libunwind::_Unwind_Context"*)* @rust_eh_personality {
entry-block:
  %_19 = alloca [128 x i64], align 8
  %_13 = alloca [128 x i64], align 8
  %4 = bitcast [128 x i64]* %_13 to i8*
  call void @llvm.lifetime.start(i64 1024, i8* %4)
  invoke void %0([128 x i64]* noalias nocapture nonnull sret dereferenceable(1024) %_13)
          to label %bb3 unwind label %bb1

bb1:                                              ; preds = %entry-block, %bb3, %bb4, %bb5, %bb6, %bb7
  %5 = landingpad { i8*, i32 }
          cleanup
  %6 = bitcast i32* %3 to i8*
  tail call void @__rust_deallocate(i8* %6, i64 4, i64 4) #1
  resume { i8*, i32 } %5

bb3:                                              ; preds = %entry-block
  %7 = invoke i32 %2([128 x i64]* noalias nocapture nonnull dereferenceable(1024) %_13)
          to label %bb4 unwind label %bb1

bb4:                                              ; preds = %bb3
  call void @llvm.lifetime.end(i64 1024, i8* %4)
  invoke void %1(i32 %7)
          to label %bb5 unwind label %bb1

bb5:                                              ; preds = %bb4
  %8 = bitcast [128 x i64]* %_19 to i8*
  call void @llvm.lifetime.start(i64 1024, i8* %8)
  invoke void %0([128 x i64]* noalias nocapture nonnull sret dereferenceable(1024) %_19)
          to label %bb6 unwind label %bb1

bb6:                                              ; preds = %bb5
  %9 = invoke i32 %2([128 x i64]* noalias nocapture nonnull dereferenceable(1024) %_19)
          to label %bb7 unwind label %bb1

bb7:                                              ; preds = %bb6
  call void @llvm.lifetime.end(i64 1024, i8* %8)
  invoke void %1(i32 %9)
          to label %bb9 unwind label %bb1

bb9:                                              ; preds = %bb7
  %10 = bitcast i32* %3 to i8*
  tail call void @__rust_deallocate(i8* %10, i64 4, i64 4) #1
  ret void
}

The problem is that either alloca can be alive at bb1, so LLVM assumes they can be alive simultaneously. We can't solve this without splitting bb1, which creates lots of landing pads that also degrade performance and stack usage.
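To picture why the shared landing pad matters, here is a toy model of slot sharing. It is only a sketch and not LLVM's actual StackColoring algorithm: `Slot`, `slot`, and `frame_size` are hypothetical names, liveness is tracked per basic block only, and two slots may share storage only when their sets of live blocks are disjoint.

```rust
use std::collections::BTreeSet;

/// A stack slot: its size in bytes and the basic blocks where it is treated as live.
pub struct Slot {
    pub size: usize,
    pub live_in: BTreeSet<&'static str>,
}

/// Convenience constructor for the toy model.
pub fn slot(size: usize, blocks: &[&'static str]) -> Slot {
    Slot { size, live_in: blocks.iter().copied().collect() }
}

/// Greedy sharing: a slot joins the first group whose live blocks are all
/// disjoint from its own, otherwise it gets fresh storage.
pub fn frame_size(slots: &[Slot]) -> usize {
    // Each group is (bytes reserved, union of live blocks of its members).
    let mut groups: Vec<(usize, BTreeSet<&'static str>)> = Vec::new();
    for s in slots {
        let reusable = groups
            .iter()
            .position(|(_, live)| live.is_disjoint(&s.live_in));
        match reusable {
            Some(i) => {
                groups[i].0 = groups[i].0.max(s.size);
                groups[i].1.extend(s.live_in.iter().copied());
            }
            None => groups.push((s.size, s.live_in.clone())),
        }
    }
    groups.iter().map(|(size, _)| *size).sum()
}
```

With `%_13` live in `entry-block`, `bb3`, `bb4` and `%_19` live in `bb5`, `bb6`, `bb7`, the model reserves 1024 bytes. Adding `bb1` to both live sets, as a conservative analysis would, makes the sets overlap and the model reserves 2048 bytes.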

ranma42 · 2017-03-30T09:48:11Z

@arielb1 Since the lifetimes of the allocas are disjoint, wouldn't it be possible to just use one alloca?

arielb1 · 2017-03-30T10:41:45Z

@ranma42

As a MIR optimization? Certainly. However, that won't help with LLVM inlining (LLVM has the alloca-merging-on-inlining thing, which sometimes improves matters, but not always).
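A hand-written source-level analogue of the "one alloca" idea might look like the sketch below. It is an illustration only: `foo_one_buffer` and its changed signatures (the callee fills a caller-provided buffer instead of returning the array by value) are assumptions, not something proposed in this thread, and it does nothing about the inlining caveat.

```rust
/// Hypothetical variant of `foo` that reuses a single 1024-byte buffer
/// for both calls instead of materialising two temporaries.
pub fn foo_one_buffer(
    fill: fn(&mut [u64; 128]),
    may_panic: fn(&[u64; 128]) -> u32,
    sink: fn(u32),
    _something_random_with_a_dtor: Box<u32>,
) {
    // The only large stack slot in this function.
    let mut buf = [0u64; 128];

    fill(&mut buf);
    sink(may_panic(&buf));

    // Second round overwrites the same storage.
    fill(&mut buf);
    sink(may_panic(&buf));
}
```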

michaelwoerister · 2017-03-30T20:51:32Z

According to @arielb1, this is another example of #35408. Will need LLVM changes to really be fixed. He'll look into it some more.

michaelwoerister · 2017-03-30T20:53:01Z

Possibly related LLVM bug: https://bugs.llvm.org//show_bug.cgi?id=25776

arielb1 · 2017-04-02T16:42:10Z

https://bugs.llvm.org//show_bug.cgi?id=32488 & https://reviews.llvm.org/D31583

brson · 2017-04-04T17:04:36Z

Thanks @arielb1.

nikomatsakis · 2017-04-11T15:53:32Z

@arielb1 can you say a bit more about this "granddaddy example"? The problem seems to be (iiuc) that we unwind to bb1 so we can drop the something_random_with_a_dtor variable, and at that point LLVM considers the allocas alive. Would it be possible for us to issue calls to llvm.lifetime.end intrinsics on the unwind path? Is that not allowed in LLVM for some reason? (It seems like it'd be valid for us to do so, no?)

dotdash · 2017-06-13T11:34:07Z

Patch has landed upstream! Thanks for keeping it on track @arielb1 🥇

arielb1 · 2017-06-13T13:22:39Z

Yay!

brson · 2017-06-15T16:13:31Z

Does the pending LLVM upgrade fix this?


arielb1 mentioned this issue Mar 28, 2017

Current beta breaks my build that works on stable #40573

Closed

michaelwoerister added the P-medium Medium priority label Mar 30, 2017

pnkfelix mentioned this issue Apr 6, 2017

rustc built by MIR overflows its stack for crates with very deep ASTs. #35408

Closed

3 tasks

arielb1 mentioned this issue Jun 19, 2017

Update LLVM to pick StackColoring improvement #42750

Merged

arielb1 added a commit to arielb1/rust that referenced this issue Jun 19, 2017

Update LLVM to pick StackColoring improvement

4f1da87

Fixes rust-lang#40883.

bors added a commit that referenced this issue Jun 21, 2017

Auto merge of #42750 - arielb1:unwind-stack, r=eddyb

03198da

Update LLVM to pick StackColoring improvement Fixes #40883. r? @eddyb

bors closed this as completed in #42750 Jun 21, 2017

nikomatsakis mentioned this issue Sep 19, 2019

record fewer adjustment types in generator witnesses, avoid spurious drops in MIR construction #64584

Merged

RalfJung mentioned this issue Feb 17, 2020

debug_assert a few more raw pointer methods #69208

Merged
