Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[VPlan] Hook IR blocks into VPlan during skeleton creation (NFC) #114292

Merged
merged 42 commits into from
Dec 12, 2024
Merged
Show file tree
Hide file tree
Changes from 26 commits
Commits
Show all changes
42 commits
Select commit Hold shift + click to select a range
1b89761
[VPlan] Hook IR blocks into VPlan during skeleton creation (NFC)
fhahn Oct 11, 2024
4e5c743
Merge remote-tracking branch 'origin/main' into vplan-runtime-checks
fhahn Nov 7, 2024
b87cf14
!fixup address comments, thanks!
fhahn Nov 7, 2024
4ddc87a
Merge remote-tracking branch 'origin/main' into vplan-runtime-checks-tmp
fhahn Nov 7, 2024
599e690
!fixup address latest comments, thanks!
fhahn Nov 7, 2024
1a77b55
[VPlan] Add PredIdx and SuccIdx arguments to connectBlocks (NFC).
fhahn Nov 9, 2024
a2137b4
[VPlan] Add insertOnEdge (NFC).
fhahn Nov 9, 2024
b4d0eac
Merge remote-tracking branch 'origin/main' into vplan-runtime-checks
fhahn Nov 9, 2024
2b4c71f
Merge branch 'main' into vplan-runtime-checks
fhahn Nov 9, 2024
be2c3a6
!fixup use insertOnEdge.
fhahn Nov 9, 2024
466b393
[VPlan] Add insertOnEdge to VPBlockUtils (NFC).
fhahn Nov 9, 2024
10a675e
Merge branch 'main' into vplan-runtime-checks
fhahn Nov 9, 2024
7996451
!fixup cleanup after merge.
fhahn Nov 9, 2024
5ea6f7b
Merge remote-tracking branch 'origin/main' into vplan-runtime-checks
fhahn Nov 9, 2024
64909aa
Merge remote-tracking branch 'origin/main' into vplan-runtime-checks
fhahn Nov 10, 2024
382380a
!fixup formatting
fhahn Nov 10, 2024
333536a
Merge remote-tracking branch 'origin/main' into vplan-runtime-checks
fhahn Nov 25, 2024
e5b8af3
!fixup address latest comments, thanks!
fhahn Nov 25, 2024
df6894a
Merge remote-tracking branch 'origin/main' into vplan-runtime-checks
fhahn Nov 25, 2024
927a66d
!fixup update verifier
fhahn Nov 25, 2024
5468f61
Merge remote-tracking branch 'origin/main' into vplan-runtime-checks
fhahn Dec 4, 2024
3eef601
!fixup address comments, thanks!
fhahn Dec 4, 2024
1d6db4a
Merge remote-tracking branch 'origin/main' into vplan-runtime-checks
fhahn Dec 5, 2024
22eeebe
!fixup address latest comments, thanks!
fhahn Dec 5, 2024
886bcc2
!fixup restore part of assert and fix formatting
fhahn Dec 5, 2024
65ac2d7
!fixup more formatting fixes
fhahn Dec 5, 2024
2f7d530
[VPlan] Use RPOT for VPlan codegen and printing.
fhahn Dec 6, 2024
ba08c2e
Merge branch 'main' into vplan-runtime-checks
fhahn Dec 6, 2024
4c76bec
!fixup update test
fhahn Dec 6, 2024
43c9186
Merge remote-tracking branch 'origin/main' into vplan-runtime-checks
fhahn Dec 6, 2024
7f08758
!fixup update test
fhahn Dec 6, 2024
3709f17
Merge remote-tracking branch 'origin/main' into vplan-runtime-checks
fhahn Dec 6, 2024
a72df24
Merge remote-tracking branch 'origin/main' into vplan-runtime-checks
fhahn Dec 7, 2024
16a6246
!Fixup update after merge
fhahn Dec 7, 2024
87f2815
Merge remote-tracking branch 'origin/main' into vplan-runtime-checks
fhahn Dec 7, 2024
b7b43a8
!fixup fix formatting
fhahn Dec 7, 2024
906603f
Merge remote-tracking branch 'origin/main' into vplan-runtime-checks
fhahn Dec 11, 2024
1e7cac7
Merge remote-tracking branch 'origin/main' into vplan-runtime-checks
fhahn Dec 11, 2024
4a51d29
!fixup use first successor for middle block.
fhahn Dec 12, 2024
f53cf1b
Merge remote-tracking branch 'origin/main' into vplan-runtime-checks
fhahn Dec 12, 2024
a0af583
!fixup update after merging #112138
fhahn Dec 12, 2024
968598b
!fixup use front instead of [0]
fhahn Dec 12, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
64 changes: 39 additions & 25 deletions llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -2428,6 +2428,25 @@ InnerLoopVectorizer::getOrCreateVectorTripCount(BasicBlock *InsertBlock) {
return VectorTripCount;
}

/// Introduces a new VPIRBasicBlock for \p CheckIRBB to \p Plan between the
/// vector preheader and its predecessor, also connecting the new block to the
/// scalar preheader.
static void introduceCheckBlockInVPlan(VPlan &Plan, BasicBlock *CheckIRBB) {
VPBlockBase *ScalarPH = Plan.getScalarPreheader();
VPBlockBase *VectorPH = Plan.getVectorPreheader();
VPBlockBase *PreVectorPH = VectorPH->getSinglePredecessor();
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ScalarPH is expected to be the other successor of PreVectorPH, so could be asserted (or another way to retrieve ScalarPH), although more general w/o this assert?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added an assert for now, than

if (PreVectorPH->getNumSuccessors() != 1) {
assert(PreVectorPH->getNumSuccessors() == 2 && "Expected 2 successors");
assert(PreVectorPH->getSuccessors()[0] == ScalarPH &&
"Unexpected successor");
VPIRBasicBlock *CheckVPIRBB = VPIRBasicBlock::fromBasicBlock(CheckIRBB);
VPBlockUtils::insertOnEdge(PreVectorPH, VectorPH, CheckVPIRBB);
PreVectorPH = CheckVPIRBB;
}
VPBlockUtils::connectBlocks(PreVectorPH, ScalarPH);
PreVectorPH->swapSuccessors();
}

void InnerLoopVectorizer::emitIterationCountCheck(BasicBlock *Bypass) {
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

(Independent): LoopScalarPreHeader is passed as parameter Bypass, while LoopVectorPreHeader is retrieved directly to set TCCheckBlock (before being reset). Would be better to be consistent.

Value *Count = getTripCount();
// Reuse existing vector loop preheader for TC checks.
Expand Down Expand Up @@ -2502,14 +2521,15 @@ void InnerLoopVectorizer::emitIterationCountCheck(BasicBlock *Bypass) {
DT->getNode(Bypass)->getIDom()) &&
"TC check is expected to dominate Bypass");
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This assert was probably added along with the lines below, but should remain, right?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, this is to check the DT updates from SplitBlock I think


// Update dominator for Bypass & LoopExit (if needed).
DT->changeImmediateDominator(Bypass, TCCheckBlock);
BranchInst &BI =
*BranchInst::Create(Bypass, LoopVectorPreHeader, CheckMinIters);
if (hasBranchWeightMD(*OrigLoop->getLoopLatch()->getTerminator()))
setBranchWeights(BI, MinItersBypassWeights, /*IsExpected=*/false);
ReplaceInstWithInst(TCCheckBlock->getTerminator(), &BI);
LoopBypassBlocks.push_back(TCCheckBlock);

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Scalar preheader can be connected in VPlan from the outset, rather than here?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There are a few places that assume the scalar PH has a single predecessor, which would need to be updated. Could pull this in here or adjust as follow-up?

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The single predecessor of Scalar PH in VPlan being middle_block, right?
I.e., scalar loop is initially connected as leftover/remainder loop only, and here connected also as alternative/bypass loop - to handle trip counts too small for the vector loop, and potentially other unvectorized cases.
Perhaps connectScalarAsBypassLoopInVPlan() is more accurate than connectScalarPreheaderInVPlan(), given that scalar PH is already connected in VPlan?

Applying this additional connection earlier, is fine as a later follow-up, perhaps w/ a TODO.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks, renamed and added TODO to connectScalarAsBypassLoopInVPlan

// TODO: Wrap LoopVectorPreHeader in VPIRBasicBlock here.
introduceCheckBlockInVPlan(Plan, TCCheckBlock);
}

BasicBlock *InnerLoopVectorizer::emitSCEVChecks(BasicBlock *Bypass) {
Expand All @@ -2526,6 +2546,8 @@ BasicBlock *InnerLoopVectorizer::emitSCEVChecks(BasicBlock *Bypass) {
"Should already be a bypass block due to iteration count check");
LoopBypassBlocks.push_back(SCEVCheckBlock);
AddedSafetyChecks = true;

introduceCheckBlockInVPlan(Plan, SCEVCheckBlock);
return SCEVCheckBlock;
}

Expand Down Expand Up @@ -2562,6 +2584,7 @@ BasicBlock *InnerLoopVectorizer::emitMemRuntimeChecks(BasicBlock *Bypass) {

AddedSafetyChecks = true;

introduceCheckBlockInVPlan(Plan, MemCheckBlock);
return MemCheckBlock;
}

Expand Down Expand Up @@ -7657,11 +7680,6 @@ DenseMap<const SCEV *, Value *> LoopVectorizationPlanner::executePlan(
OrigLoop->getHeader()->getContext());
VPlanTransforms::optimizeForVFAndUF(BestVPlan, BestVF, BestUF, PSE);

LLVM_DEBUG(dbgs() << "Executing best plan with VF=" << BestVF
<< ", UF=" << BestUF << '\n');
BestVPlan.setName("Final VPlan");
LLVM_DEBUG(BestVPlan.dump());

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Moved later to VPlan::execute(), after is replaces VPBBtoVPIRBB.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yep, can also split off once we are happy

// Perform the actual loop transformation.
VPTransformState State(&TTI, BestVF, BestUF, LI, DT, ILV.Builder, &ILV,
&BestVPlan);
Expand Down Expand Up @@ -7880,8 +7898,6 @@ EpilogueVectorizerMainLoop::emitIterationCountCheck(BasicBlock *Bypass,
DT->getNode(Bypass)->getIDom()) &&
"TC check is expected to dominate Bypass");

// Update dominator for Bypass.
DT->changeImmediateDominator(Bypass, TCCheckBlock);
LoopBypassBlocks.push_back(TCCheckBlock);

// Save the trip count so we don't have to regenerate it in the
Expand All @@ -7896,6 +7912,7 @@ EpilogueVectorizerMainLoop::emitIterationCountCheck(BasicBlock *Bypass,
setBranchWeights(BI, MinItersBypassWeights, /*IsExpected=*/false);
ReplaceInstWithInst(TCCheckBlock->getTerminator(), &BI);

introduceCheckBlockInVPlan(Plan, TCCheckBlock);
return TCCheckBlock;
}

Expand Down Expand Up @@ -7926,9 +7943,6 @@ EpilogueVectorizerEpilogueLoop::createEpilogueVectorizedLoopSkeleton(
EPI.MainLoopIterationCountCheck->getTerminator()->replaceUsesOfWith(
VecEpilogueIterationCountCheck, LoopVectorPreHeader);

DT->changeImmediateDominator(LoopVectorPreHeader,
EPI.MainLoopIterationCountCheck);

EPI.EpilogueIterationCountCheck->getTerminator()->replaceUsesOfWith(
VecEpilogueIterationCountCheck, LoopScalarPreHeader);

Expand All @@ -7939,19 +7953,8 @@ EpilogueVectorizerEpilogueLoop::createEpilogueVectorizedLoopSkeleton(
EPI.MemSafetyCheck->getTerminator()->replaceUsesOfWith(
VecEpilogueIterationCountCheck, LoopScalarPreHeader);

DT->changeImmediateDominator(
VecEpilogueIterationCountCheck,
VecEpilogueIterationCountCheck->getSinglePredecessor());

DT->changeImmediateDominator(LoopScalarPreHeader,
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This one still needed?

(Review to be continued from here)

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yep this one is still needed for now.

EPI.EpilogueIterationCountCheck);
if (!Cost->requiresScalarEpilogue(EPI.EpilogueVF.isVector()))
// If there is an epilogue which must run, there's no edge from the
// middle block to exit blocks and thus no need to update the immediate
// dominator of the exit blocks.
DT->changeImmediateDominator(OrigLoop->getUniqueLatchExitBlock(),
EPI.EpilogueIterationCountCheck);

// Keep track of bypass blocks, as they feed start values to the induction and
// reduction phis in the scalar loop preheader.
if (EPI.SCEVSafetyCheck)
Expand Down Expand Up @@ -8054,6 +8057,16 @@ EpilogueVectorizerEpilogueLoop::emitMinimumVectorEpilogueIterCountCheck(
}
ReplaceInstWithInst(Insert->getTerminator(), &BI);
LoopBypassBlocks.push_back(Insert);

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Entry is conceptually an immutable VPIRBB that wraps the original scalar preheader, where SCEVs can be safely expanded, and all runtime checks are added as successors starting with minimal trip count check that is added as its terminal. Does this change when executing the VPlan of an epilog loop, whose Entry is replicated and transformed from old to new?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think it changes conceptually, but here we need to wrap the new entry to be the new node created here, after executing the plan. (There is existing logic to re-use expanded SCEVs from the first execution of the main plan, to avoid duplicated expansions)

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

All VPlans are initialized with the original scalar preheader as their Entry, but this works only when vectorizing the main loop (w/ or w/o vectorizing the epilog). When vectorizing the epilog loop, Entry no longer serves to host SCEV-expand recipes but should instead refer to the block between Middle and "Vector Epilog PreHeader" (called "Vector Epilogue Trip Count Check", but that's also how the first block is named), as depicted in https://llvm.org/docs/Vectorizers.html#epilogue-vectorization. The documentation of Entry in createInitialVPlan() deserves update?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added a comment, thanks!

// A new entry block has been created for the epilogue VPlan. Hook it in, as
// otherwise we would try to modify the entry to the main vector loop.
VPIRBasicBlock *NewEntry = VPIRBasicBlock::fromBasicBlock(Insert);
VPBasicBlock *OldEntry = Plan.getEntry();
VPBlockUtils::reassociateBlocks(OldEntry, NewEntry);
Plan.setEntry(NewEntry);
delete OldEntry;

introduceCheckBlockInVPlan(Plan, Insert);
return Insert;
}

Expand Down Expand Up @@ -10332,8 +10345,6 @@ bool LoopVectorizePass::processLoop(Loop *L) {
cast<VPHeaderPHIRecipe>(&R)->setStartValue(StartVal);
}

assert(DT->verify(DominatorTree::VerificationLevel::Fast) &&
"DT not preserved correctly");
LVP.executePlan(EPI.EpilogueVF, EPI.EpilogueUF, BestEpiPlan, EpilogILV,
DT, true, &ExpandedSCEVs);
++LoopsEpilogueVectorized;
Expand Down Expand Up @@ -10361,6 +10372,9 @@ bool LoopVectorizePass::processLoop(Loop *L) {
checkMixedPrecision(L, ORE);
}

assert(DT->verify(DominatorTree::VerificationLevel::Fast) &&
"DT not preserved correctly");

std::optional<MDNode *> RemainderLoopID =
makeFollowupLoopID(OrigLoopID, {LLVMLoopVectorizeFollowupAll,
LLVMLoopVectorizeFollowupEpilogue});
Expand Down
97 changes: 67 additions & 30 deletions llvm/lib/Transforms/Vectorize/VPlan.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -170,9 +170,7 @@ VPBasicBlock *VPBlockBase::getEntryBasicBlock() {
}

void VPBlockBase::setPlan(VPlan *ParentPlan) {
assert(
(ParentPlan->getEntry() == this || ParentPlan->getPreheader() == this) &&
"Can only set plan on its entry or preheader block.");
assert(ParentPlan->getEntry() == this && "Can only set plan on its entry.");
Plan = ParentPlan;
}

Expand Down Expand Up @@ -823,16 +821,25 @@ void VPRegionBlock::print(raw_ostream &O, const Twine &Indent,
}
#endif

VPlan::VPlan(VPBasicBlock *OriginalPreheader, VPValue *TC,
VPBasicBlock *EntryVectorPreHeader, VPIRBasicBlock *ScalarHeader)
: VPlan(OriginalPreheader, TC, ScalarHeader) {
VPBlockUtils::connectBlocks(OriginalPreheader, EntryVectorPreHeader);
}

VPlan::VPlan(VPBasicBlock *OriginalPreheader,
VPBasicBlock *EntryVectorPreHeader, VPIRBasicBlock *ScalarHeader)
: VPlan(OriginalPreheader, ScalarHeader) {
VPBlockUtils::connectBlocks(OriginalPreheader, EntryVectorPreHeader);
}

VPlan::~VPlan() {
if (Entry) {
VPValue DummyValue;
for (VPBlockBase *Block : vp_depth_first_shallow(Entry))
Block->dropAllReferences(&DummyValue);

VPBlockBase::deleteCFG(Entry);

Preheader->dropAllReferences(&DummyValue);
delete Preheader;
}
for (VPValue *VPV : VPLiveInsToFree)
delete VPV;
Expand All @@ -855,9 +862,16 @@ VPlanPtr VPlan::createInitialVPlan(Type *InductionTy,
VPIRBasicBlock *Entry =
VPIRBasicBlock::fromBasicBlock(TheLoop->getLoopPreheader());
VPBasicBlock *VecPreheader = new VPBasicBlock("vector.ph");
// Connect entry only to vector preheader initially. Entry will also be
// connected to the scalar preheader later, during skeleton creation when
// runtime guards are added as needed. Note that when executing the VPlan for
// an epilogue vector loop, the original entry block here will be replaced by
// a new VPIRBasicBlock wrapping the entry to the epilogue vector loop after
// generating code for the main vector loop.
VPBlockUtils::connectBlocks(Entry, VecPreheader);
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Worth noting that this connection is subject to insertion of runtime checks.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added a comment, thanks!

VPIRBasicBlock *ScalarHeader =
VPIRBasicBlock::fromBasicBlock(TheLoop->getHeader());
auto Plan = std::make_unique<VPlan>(Entry, VecPreheader, ScalarHeader);
auto Plan = std::make_unique<VPlan>(Entry, ScalarHeader);

// Create SCEV and VPValue for the trip count.

Expand Down Expand Up @@ -1005,15 +1019,21 @@ void VPlan::execute(VPTransformState *State) {
State->CFG.DTU.applyUpdates(
{{DominatorTree::Delete, VectorPreHeader, State->CFG.ExitBB}});

// Replace regular VPBB's for the middle and scalar preheader blocks with
// VPIRBasicBlocks wrapping their IR blocks. The IR blocks are created during
// skeleton creation, so we can only create the VPIRBasicBlocks now during
// VPlan execution rather than earlier during VPlan construction.
// Replace regular VPBB's for the vector preheader, middle and scalar
// preheader blocks with VPIRBasicBlocks wrapping their IR blocks. The IR
// blocks are created during skeleton creation, so we can only create the
// VPIRBasicBlocks now during VPlan execution rather than earlier during VPlan
// construction.
BasicBlock *MiddleBB = State->CFG.ExitBB;
VPBasicBlock *MiddleVPBB = getMiddleBlock();
BasicBlock *ScalarPh = MiddleBB->getSingleSuccessor();
replaceVPBBWithIRVPBB(getVectorPreheader(), VectorPreHeader);
replaceVPBBWithIRVPBB(getMiddleBlock(), MiddleBB);
replaceVPBBWithIRVPBB(getScalarPreheader(), ScalarPh);
replaceVPBBWithIRVPBB(MiddleVPBB, MiddleBB);

LLVM_DEBUG(dbgs() << "Executing best plan with VF=" << State->VF
<< ", UF=" << getUF() << '\n');
setName("Final VPlan");
LLVM_DEBUG(dump());

// Disconnect the middle block from its single successor (the scalar loop
// header) in both the CFG and DT. The branch will be recreated during VPlan
Expand All @@ -1028,8 +1048,11 @@ void VPlan::execute(VPTransformState *State) {
State->CFG.DTU.applyUpdates(
{{DominatorTree::Delete, ScalarPh, ScalarPh->getSingleSuccessor()}});

// Generate code in the loop pre-header and body.
for (VPBlockBase *Block : vp_depth_first_shallow(Entry))
ReversePostOrderTraversal<VPBlockShallowTraversalWrapper<VPBlockBase *>> RPOT(
Entry);
// Generate code for the VPlan, in parts of the vector skeleton, loop body and
// successor blocks including the middle, exit and scalar preheader blocks.
for (VPBlockBase *Block : RPOT)
Block->execute(State);

VPBasicBlock *LatchVPBB = getVectorLoopRegion()->getExitingBasicBlock();
Expand Down Expand Up @@ -1079,9 +1102,6 @@ void VPlan::execute(VPTransformState *State) {
}

State->CFG.DTU.flush();
assert(State->CFG.DTU.getDomTree().verify(
DominatorTree::VerificationLevel::Fast) &&
"DT not preserved correctly");
}

InstructionCost VPlan::cost(ElementCount VF, VPCostContext &Ctx) {
Expand Down Expand Up @@ -1134,12 +1154,10 @@ void VPlan::print(raw_ostream &O) const {

printLiveIns(O);

if (!getPreheader()->empty()) {
O << "\n";
getPreheader()->print(O, "", SlotTracker);
}
ReversePostOrderTraversal<VPBlockShallowTraversalWrapper<const VPBlockBase *>>
RPOT(getEntry());

for (const VPBlockBase *Block : vp_depth_first_shallow(getEntry())) {
for (const VPBlockBase *Block : RPOT) {
O << '\n';
Block->print(O, "", SlotTracker);
}
Expand Down Expand Up @@ -1170,6 +1188,31 @@ std::string VPlan::getName() const {
return Out;
}

VPRegionBlock *VPlan::getVectorLoopRegion() {
// TODO: Cache if possible.
for (VPBlockBase *B : vp_depth_first_shallow(getEntry()))
if (auto *R = dyn_cast<VPRegionBlock>(B))
return R;
return nullptr;
Comment on lines +1172 to +1175
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Better cache the result? Can be done as follow-up.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sounds good, mostly worried about invalidation (also when thinking about #117506 / #108378)

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Leave a TODO?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added thanks!

}

const VPRegionBlock *VPlan::getVectorLoopRegion() const {
for (const VPBlockBase *B : vp_depth_first_shallow(getEntry()))
if (auto *R = dyn_cast<VPRegionBlock>(B))
return R;
return nullptr;
}

VPBasicBlock *VPlan::getScalarPreheader() const {
auto *MiddleVPBB =
cast<VPBasicBlock>(getVectorLoopRegion()->getSingleSuccessor());
auto *LastSucc = MiddleVPBB->getSuccessors().back();
// If scalar preheader is connected to VPlan, it is the last successor of
// MiddleVPBB. If this last successor is a VPIRBasicBlock, it is the Exit
// block rather than the scalar preheader.
return isa<VPIRBasicBlock>(LastSucc) ? nullptr : cast<VPBasicBlock>(LastSucc);
}

LLVM_DUMP_METHOD
void VPlan::printDOT(raw_ostream &O) const {
VPlanPrinter Printer(O, *this);
Expand Down Expand Up @@ -1220,7 +1263,6 @@ static void remapOperands(VPBlockBase *Entry, VPBlockBase *NewEntry,

VPlan *VPlan::duplicate() {
// Clone blocks.
VPBasicBlock *NewPreheader = Preheader->clone();
const auto &[NewEntry, __] = cloneFrom(Entry);

BasicBlock *ScalarHeaderIRBB = getScalarHeader()->getIRBasicBlock();
Expand All @@ -1230,8 +1272,7 @@ VPlan *VPlan::duplicate() {
return VPIRBB && VPIRBB->getIRBasicBlock() == ScalarHeaderIRBB;
}));
// Create VPlan, clone live-ins and remap operands in the cloned blocks.
auto *NewPlan =
new VPlan(NewPreheader, cast<VPBasicBlock>(NewEntry), NewScalarHeader);
auto *NewPlan = new VPlan(cast<VPBasicBlock>(NewEntry), NewScalarHeader);
DenseMap<VPValue *, VPValue *> Old2NewVPValues;
for (VPValue *OldLiveIn : VPLiveInsToFree) {
Old2NewVPValues[OldLiveIn] =
Expand All @@ -1251,7 +1292,6 @@ VPlan *VPlan::duplicate() {
// else NewTripCount will be created and inserted into Old2NewVPValues when
// TripCount is cloned. In any case NewPlan->TripCount is updated below.

remapOperands(Preheader, NewPreheader, Old2NewVPValues);
remapOperands(Entry, NewEntry, Old2NewVPValues);

// Initialize remaining fields of cloned VPlan.
Expand Down Expand Up @@ -1303,8 +1343,6 @@ void VPlanPrinter::dump() {
OS << "edge [fontname=Courier, fontsize=30]\n";
OS << "compound=true\n";

dumpBlock(Plan.getPreheader());

for (const VPBlockBase *Block : vp_depth_first_shallow(Plan.getEntry()))
dumpBlock(Block);

Expand Down Expand Up @@ -1565,7 +1603,6 @@ void VPSlotTracker::assignNames(const VPlan &Plan) {
assignName(Plan.BackedgeTakenCount);
for (VPValue *LI : Plan.VPLiveInsToFree)
assignName(LI);
assignNames(Plan.getPreheader());

ReversePostOrderTraversal<VPBlockDeepTraversalWrapper<const VPBlockBase *>>
RPOT(VPBlockDeepTraversalWrapper<const VPBlockBase *>(Plan.getEntry()));
Expand Down
Loading
Loading