
Optionally encode floating point operations as uninterpreted functions #1057

Closed · 31 commits

Conversation

@can-leh-emmtrix (Contributor) commented Jun 18, 2024

In our ongoing work with alive2, we often encountered cases where the floating-point operations remain largely unchanged while other parts of the function differ. Since Z3's floating-point theory is quite slow, I am looking at using uninterpreted functions to (over-)approximate these floating-point operations.

This PR adds a new command line option --uf-float that encodes all floating-point operations using uninterpreted functions. The encoding is a conservative over-approximation, meaning that if a counterexample is found for the approximation, it is not necessarily a valid counterexample for the original query.
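
For example (assuming the usual alive-tv invocation with a source and a target module):

alive-tv --uf-float src.ll tgt.ll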

Commutativity for FPBinOp and FCmp is encoded using the scheme proposed by Seongwon Bang, Seunghyeon Nam, Inwhan Chun, Ho Young Jhoo, and Juneyoung Lee in "SMT-based Translation Validation for Machine Learning Compiler". This PR does not use the other parts of their encoding, though it might be interesting to encode properties other than commutativity as well.
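
Concretely (this is spelled out further down in the thread), a commutative binary operation $\mathrm{op}$ is replaced by a fresh uninterpreted function $\mathrm{op}'$ whose result is reconstructed with a bitwise and:

$$\mathrm{op}(x, y) = \mathrm{op}'(x, y) \mspace{0.3em} \mathrm{and} \mspace{0.3em} \mathrm{op}'(y, x)$$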

I decided to open this PR as a draft to see whether there is interest in upstreaming this feature and to get some feedback on the direction.

Related: #916

@regehr (Contributor) commented Jun 18, 2024

so my (now former) student Zhengyang has been using floats-as-UF in his Alive-based superoptimizer. @zhengyang92 do you have feedback on this?

@can-leh-emmtrix marked this pull request as ready for review on July 2, 2024, 09:35
@can-leh-emmtrix (Contributor Author)

I'd consider this ready for review now.

As the diff is quite large, I'd recommend reviewing the first three commits individually and then looking at the diff between the third and the last commit.

The main reason for the large diff is that implementing the --uf-float option required indenting the existing code:

if (is_uf_float()) {
  <create UF>
} else {
  <existing code>
}

@can-leh-emmtrix (Contributor Author)

@nunoplopes Do you have any feedback on this?

@nunoplopes (Member)

> @nunoplopes Do you have any feedback on this?

Sorry for the delay. I still have a few bugs to fix on my queue. I'll get back to this afterwards.

ir/instr.cpp (outdated diff excerpt):
}
};

fast_math_flag(FastMathFlags::NNaN, "nnan");
Member:

I don't understand this part. Also, where are the other flags?

Contributor Author:

It handles flags which may result in the operation returning poison. As far as I can tell, the only flags which can result in poison are nnan and ninf. They are handled by creating additional uninterpreted functions op.np_nnan(x, y): bool and op.np_ninf(x, y): bool (in the binary case) whose results are conjoined onto the non_poison expression.

The nsz flag is not relevant, as in uf-float mode we do not really have a zero value.

I don't think I entirely understand how the remaining fast math flags are handled in the existing code, though it looks like the result of the operation is wrapped in a function call to another UF.
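
A minimal sketch of what I mean, using alive2's expr::mkUF as it appears elsewhere in this thread (the variables a, b, res, non_poison, the fmath flag test, and the FastMathFlags::NInf spelling are illustrative placeholders, not the exact PR code):

// Sketch: each poison-relevant flag gets its own boolean UF over the same
// arguments; its result is conjoined onto the non_poison expression.
expr value = expr::mkUF("fadd", {a, b}, res);    // over-approximated result
if (fmath.flags & FastMathFlags::NNaN)           // nnan may produce poison
  non_poison = non_poison && expr::mkUF("fadd.np_nnan", {a, b}, true);
if (fmath.flags & FastMathFlags::NInf)           // ninf may produce poison
  non_poison = non_poison && expr::mkUF("fadd.np_ninf", {a, b}, true);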

auto value = expr::mkUF(name, arg_values, res);
if (is_commutative) {
  assert(args.size() == 2);
  value = value & expr::mkUF(name, {arg_values[1], arg_values[0]}, res);
Member:

I don't think this is a sound over-approximation. Do you need commutativity in your examples?
Would an axiom stating that $\forall x, y.\ \mathrm{fadd}(x, y) = \mathrm{fadd}(y, x)$ suffice?

Contributor Author:

The encoding comes from "SMT-based Translation Validation for Machine Learning Compiler" by Seongwon Bang, Seunghyeon Nam, Inwhan Chun, Ho Young Jhoo, and Juneyoung Lee. The idea is to encode commutativity as

$\mathrm{op}(x, y) = \mathrm{op}'(x, y) \mspace{0.3em} \mathrm{and} \mspace{0.3em} \mathrm{op}'(y, x)$

where $\mathrm{and}$ is the bitwise and operator and $\mathrm{op}'$ is an uninterpreted function. There is a proof of correctness in the supplementary material for the paper.

Contributor Author:

I added a comment referencing the paper.

Member:

The proof is very vague and doesn't show equisatisfiability clearly.
Why not use x <= y ? f(x,y) : f(y, x)?
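That is, canonicalize the argument order and apply a single UF, e.g. (a sketch; assumes alive2's expr::mkIf and ule, with illustrative names):

// Pick an arbitrary total order on the operands, then call one UF:
expr value = expr::mkIf(a.ule(b),                  // canonical order: a <= b
                        expr::mkUF("fadd", {a, b}, res),
                        expr::mkUF("fadd", {b, a}, res));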

Contributor Author:

I would prefer to keep the current encoding, as I know that it works well on our test cases. While I could try other encodings, that would of course take some time. So here is my attempt at constructing a proof. It is of course inspired by the proof given in the supplementary material, but uses a simpler construction for $\mathrm{op}'$ / $f'$.

Please correct me if I am wrong, but since we are already over-approximating, we only need to prove that for every commutative function $\mathrm{op}: \mathrm{BV}^n \times \mathrm{BV}^n \rightarrow \mathrm{BV}^n$ there is a function $\mathrm{op}': \mathrm{BV}^n \times \mathrm{BV}^n \rightarrow \mathrm{BV}^n$ such that

$$\mathrm{op}(x, y) = \mathrm{op}'(x, y) \mspace{0.3em} \mathrm{and} \mspace{0.3em} \mathrm{op}'(y, x)$$

(In the context of SMT solving, this ensures that no model/counterexample can get lost.)

Proof: Let $\mathrm{BV}^n$ be the set of all bit vectors of size $n$ and let $\mathrm{op}: \mathrm{BV}^n \times \mathrm{BV}^n \rightarrow \mathrm{BV}^n$ be a commutative function.

Let $\mathrm{op}'(x, y) := \mathrm{op}(x, y)$. Then

$$ \mathrm{op}'(x, y) \mspace{0.3em} \mathrm{and} \mspace{0.3em} \mathrm{op}'(y, x) = \mathrm{op}(x, y) \mspace{0.3em} \mathrm{and} \mspace{0.3em} \mathrm{op}(y, x) $$

By commutativity of $\mathrm{op}$

$$ = \mathrm{op}(x, y) \mspace{0.3em} \mathrm{and} \mspace{0.3em} \mathrm{op}(x, y) $$

By $\forall x \in \mathrm{BV}^n: x \mspace{0.3em} \mathrm{and} \mspace{0.3em} x = x$

$$ = \mathrm{op}(x, y) $$

Thus we have shown that $\mathrm{op}'$ fulfills the requirement.

@nunoplopes (Member)

I was just thinking about this again and I think there's a better approach, which is much less invasive.
We could implement this in smt/expr.cpp instead and have a global mode to switch to UF encoding. That way, instr.cpp wouldn't be changed, all fmath flags would work, and we have a single place to change things, instead of duplicating semantics in instr.cpp.
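
Roughly like this (a sketch; the global flag and its wiring are hypothetical, while the fadd body mirrors the existing code and the fsub diff shown later in this thread):

// smt/expr.cpp -- hypothetical global mode switch
static bool uf_float_mode = false;  // would be set from a command-line option

expr expr::fadd(const expr &rhs, const expr &rm) const {
  C(rhs, rm);
  if (uf_float_mode)                // UF over-approximation; rm is dropped
    return expr::mkUF("fadd", {*this, rhs}, *this);
  return simplify_const(Z3_mk_fpa_add(ctx(), rm(), ast(), rhs()), *this, rhs);
}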

@can-leh-emmtrix (Contributor Author) commented Aug 7, 2024

I played a bit with the idea, but I was unable to achieve the same performance as when implementing the feature in instr.cpp. I used the uf-float/select test case as my benchmark.

I tried a few different things, but here are the most successful approaches I could come up with.

  • No Approximation: uses Z3's floating-point theory directly
  • expr.cpp / Floating Point: implemented in expr.cpp, using uninterpreted functions that take floating-point values as arguments
  • expr.cpp / Bit Vector: implemented in expr.cpp, using uninterpreted functions that take bit-vector values as arguments
  • instr.cpp: this PR

Here are the results. I measured the total runtime of alive-tv on uf-float/select over 8 runs per approach.

[Chart: total alive-tv runtime on uf-float/select, all four approaches, 8 runs each]

Here is a chart that shows only the approximations.

[Chart: the same measurements, approximation approaches only]

So at least on this benchmark the approach taken by this PR has a clear advantage over inserting the uninterpreted functions in expr.cpp. This also matches my prior observations in our internal examples.

I think the reason for this difference is the other operations applied by fm_poison: it uses FloatType::fromFloat, any_fp_zero, and handle_subnormal, none of which are necessary for the approach taken by this PR. Additionally, I conjecture that examples using instructions which are implemented as a more complex series of Z3 operations (e.g. FMin) will exhibit the same problem. This is something that cannot be fixed in expr.cpp.

Regarding the remaining fast math flags: implementing the same logic as in fm_poison to over-approximate them should work for uf_float too, shouldn't it?

@can-leh-emmtrix (Contributor Author)

Here is the relevant part of the diff for expr.cpp / Floating Point. This is of course very much a prototype and only implements the operations I needed for benchmarking.

diff --git a/smt/expr.cpp b/smt/expr.cpp
index 70353e8a..0af924a2 100644
--- a/smt/expr.cpp
+++ b/smt/expr.cpp
@@ -1205,7 +1205,8 @@ expr expr::fadd(const expr &rhs, const expr &rm) const {
 
 expr expr::fsub(const expr &rhs, const expr &rm) const {
   C(rhs, rm);
-  return simplify_const(Z3_mk_fpa_sub(ctx(), rm(), ast(), rhs()), *this, rhs);
+  //return simplify_const(Z3_mk_fpa_sub(ctx(), rm(), ast(), rhs()), *this, rhs);
+  return expr::mkUF("fsub", {*this, rhs}, *this);
 }
 
 expr expr::fmul(const expr &rhs, const expr &rm) const {
@@ -1282,7 +1283,8 @@ expr expr::foge(const expr &rhs) const {
 }
 
 expr expr::folt(const expr &rhs) const {
-  return binop_fold(rhs, Z3_mk_fpa_lt);
+  //return binop_fold(rhs, Z3_mk_fpa_lt);
+  return expr::mkUF("flt", {*this, rhs}, true);
 }
 
 expr expr::fole(const expr &rhs) const {
@@ -1310,7 +1312,8 @@ expr expr::fuge(const expr &rhs) const {
 }
 
 expr expr::fult(const expr &rhs) const {
-  return funo(rhs) || binop_fold(rhs, Z3_mk_fpa_lt);
+  //return funo(rhs) || binop_fold(rhs, Z3_mk_fpa_lt);
+  return funo(rhs) || expr::mkUF("flt", {*this, rhs}, true);
 }
 
 expr expr::fule(const expr &rhs) const {

Here is the relevant part of the diff for expr.cpp / Bit Vector:

diff --git a/smt/expr.cpp b/smt/expr.cpp
index 70353e8a..729c8ac3 100644
--- a/smt/expr.cpp
+++ b/smt/expr.cpp
@@ -1205,7 +1205,8 @@ expr expr::fadd(const expr &rhs, const expr &rm) const {
 
 expr expr::fsub(const expr &rhs, const expr &rm) const {
   C(rhs, rm);
-  return simplify_const(Z3_mk_fpa_sub(ctx(), rm(), ast(), rhs()), *this, rhs);
+  //return simplify_const(Z3_mk_fpa_sub(ctx(), rm(), ast(), rhs()), *this, rhs);
+  return expr::mkUF("fsub", {float2BV(), rhs.float2BV()}, float2BV()).BV2float(rhs);
 }
 
 expr expr::fmul(const expr &rhs, const expr &rm) const {
@@ -1282,7 +1283,8 @@ expr expr::foge(const expr &rhs) const {
 }
 
 expr expr::folt(const expr &rhs) const {
-  return binop_fold(rhs, Z3_mk_fpa_lt);
+  //return binop_fold(rhs, Z3_mk_fpa_lt);
+  return expr::mkUF("flt", {float2BV(), rhs.float2BV()}, true);
 }
 
 expr expr::fole(const expr &rhs) const {
@@ -1310,7 +1312,8 @@ expr expr::fuge(const expr &rhs) const {
 }
 
 expr expr::fult(const expr &rhs) const {
-  return funo(rhs) || binop_fold(rhs, Z3_mk_fpa_lt);
+  //return funo(rhs) || binop_fold(rhs, Z3_mk_fpa_lt);
+  return funo(rhs) || expr::mkUF("flt", {float2BV(), rhs.float2BV()}, true);
 }
 
 expr expr::fule(const expr &rhs) const {

@nunoplopes (Member)

I think those results are very encouraging and really suggest we should go the expr.cpp way.
Note that I don't think we should be using float2BV() at all. Floats should not exist; only BVs. We can also have the bit-width be user-specified.

This solution is superior in terms of maintainability. And getting rid of floats altogether should close the performance gap.
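
With floats gone, the UF calls need no conversions at all, e.g. (a sketch; assumes float-typed values are already carried as plain bit-vectors of a user-chosen width):

// Hypothetical pure-BV encoding: operands and results are raw bit-vectors,
// so no float2BV()/BV2float() round-trips remain.
expr value = expr::mkUF("fadd", {a, b}, a);     // range sort = a's BV sort
expr lt    = expr::mkUF("flt",  {a, b}, true);  // comparisons return bool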

@can-leh-emmtrix (Contributor Author)

It would still require some changes in instr.cpp or type.cpp to disable checking for NaN and inserting nondeterministic values, though. Do you have a solution for instructions implemented in terms of multiple floating-point and non-floating-point Z3 operations? Ideally they would be reduced to a single UF as well.

@nunoplopes (Member)

> It would still require some changes in instr.cpp or type.cpp to disable checking for NaN and inserting nondeterministic values, though. Do you have a solution for instructions implemented in terms of multiple floating-point and non-floating-point Z3 operations? Ideally they would be reduced to a single UF as well.

I don't remember cases that are not a 1-to-1 mapping with Z3. Do you have an example in mind?

(btw, I'm taking off for vacation tomorrow; I'm back in September)

@can-leh-emmtrix (Contributor Author) commented Aug 19, 2024

> I don't remember cases that are not a 1-to-1 mapping with Z3. Do you have an example in mind?

Looking through the source code I found the following cases:

For the FpConversionOp instructions LRInt, LRound, FPToSInt, and FPToUInt, the non_poison value is also computed using multiple floating-point operations: https://github.com/AliveToolkit/alive2/blob/master/ir/instr.cpp#L1788-L1832

> (btw, I'm taking off for vacation tomorrow; I'm back in September)

Have a nice vacation!

Edit: I will be on vacation until September 23.

@nunoplopes (Member)

Closing this as we don't want to have to duplicate semantics; a solution in smt/expr.cpp would avoid that.
Plus, it's a better solution in the longer term once we start supporting abstraction refinement.
