
[InstSimplify] (a ^ b) ? (~a) ^ b : a ^ (~b) #63104

Closed
k-arrows opened this issue Jun 4, 2023 · 7 comments


k-arrows commented Jun 4, 2023

Test code:

int foo(int a, int b)
{
    return (a ^ b) ? (~a) ^ b : a ^ (~b);
}

Clang 15.0.0 (and GCC trunk):

foo(int, int):                               # @foo(int, int)
        mov     eax, edi
        xor     eax, esi
        not     eax
        ret

Clang 16.0.0 (and Clang trunk):

foo(int, int):                               # @foo(int, int)
        mov     ecx, edi
        xor     ecx, esi
        cmp     edi, esi
        not     ecx
        mov     eax, -1
        cmovne  eax, ecx
        ret

https://godbolt.org/z/qdhKnj3af


dc03 commented Jun 9, 2023

I'd like to fix this, but I'm not able to find the exact transform in InstCombine causing this. It appears that the newer Clang converts a ^ (~b) into 0 ^ (-1) (which then gets folded into -1), possibly because it can prove that a ^ b being false means a and b are equal. The older Clang appears to convert both a ^ (~b) and (~a) ^ b into ~(a ^ b), then coalesce the two arms, which leads to the codegen you see.


junaire commented Jun 9, 2023

This fold doesn't introduce new instructions, so what you should really take a look at is InstructionSimplify; maybe

static Value *simplifySelectWithICmpCond(Value *CondVal, Value *TrueVal,

Alive proof: https://alive2.llvm.org/ce/z/KEt9R5


nikic commented Jun 9, 2023

This probably needs a generalization of simplifyWithOpReplaced to look at more than one instruction.


dc03 commented Jun 21, 2023

> This probably needs a generalization of simplifyWithOpReplaced to look at more than one instruction.

@nikic: Not entirely sure where this plays in. Stepping through the code, the transformation comes from here: a ^ ~b gets turned into a ^ (b ^ -1) which then gets reassociated by SimplifyAssociativeOrCommutative here to be (a ^ b) ^ -1, where a ^ b then gets transformed by simplifyByDomEq into 0. I don't see simplifyWithOpReplaced in the call stack at any point.


nikic commented Jun 21, 2023

@dc03 The optimization regression comes from making use of dominating conditions in icmp simplification. However, we don't want to avoid that and instead make sure that the new IR can still be simplified. simplifyWithOpReplaced() is the transform that handles this kind of pattern.


vfdff commented Jun 21, 2023

The regression between LLVM 15 and LLVM 16: https://gcc.godbolt.org/z/Goq9ETh67


vfdff commented Jun 24, 2023

Maybe this is the simplification case: https://alive2.llvm.org/ce/z/TGgJTq. It shows that simplifyWithOpReplaced needs to look at more than one instruction.

@nikic nikic closed this as completed in 3d199d0 Jul 14, 2023
nikic added a commit that referenced this issue Jul 18, 2023
A similar assumption as for the x^x case also existed for the absorber
case, which led to a stage2 miscompile. That assumption is now fixed.

-----

Support replacement of operands not only in the immediate
instruction, but also instructions it uses.

For the most part, this extension is straightforward, but there are
two bits worth highlighting:

First, we can no longer assume that if the Op is a vector, the
instruction also returns a vector. If Op is a vector and the
instruction returns a scalar, we should consider it as a cross-lane
operation.

Second, for the x ^ x special case and the absorber special case, we
can no longer assume that one of the operands is RepOp, as we might
have a replacement higher up the instruction chain.

There is one optimization regression, but it is in a fuzzer-generated
test case.

Fixes #63104.
jdoerfert pushed a commit to jdoerfert/llvm-project that referenced this issue Jul 24, 2023
veselypeta pushed a commit to veselypeta/cherillvm that referenced this issue Sep 6, 2024
veselypeta pushed a commit to veselypeta/cherillvm that referenced this issue Sep 6, 2024