-
Notifications
You must be signed in to change notification settings - Fork 1.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
(continuously updating) List of benchmarks on which z3str3 return an incorrect result #2758
Comments
Taking a look at these cases. Some are now resolved (z3str3 returns sat), others are actually segfaults and a few are timing out. Is there a way we would prefer to track progress on these (maybe editing the list in the comment)? |
I can rearrange them. |
20 seconds. I'll post the full list tomorrow (I will need to update the segfaults list as well as I tested those on the latest build). |
Actually, what timeout did you use? I could re-test the "timeout" cases if you used a longer interval. |
The timeout I used was 120 seconds |
I re-ran the timeout cases for 120 seconds. Here are the latest results for commit Cases which are still UNSAT:
Cases which timeout:
Cases which segfault:
And all 42 other cases return SAT. |
@mtrberzi Here are some more examples : All the formulas at each level in the assertion stack are satisfiable. But z3str3 gives unsat at some level. For example for the file 1.smt2, z3str3 gives: while z3-seq gives: OS: $ git log -1 |
OK, thanks for the new cases. Is that commit the one you tested on? I'm actually in the middle of replacing the length and value testing algorithm with an improved procedure that should deal with many of these correctness issues. I tracked down several wrong-answer cases to bad behaviour in the legacy algorithm and the best way is to implement this new technique (which should also be more efficient in many cases). I'll let you know when it's ready to be tested, I would really appreciate the cross-check from your tool. |
That sounds good. Just let me know when it's ready :) |
OK, all of these cases now return SAT, or time out. I'll close this issue for now, and look at addressing the slow cases after some of the bigger old issues are resolved. Please do keep reporting any wrong answer cases or crashes you find, I appreciate it. |
Hi,
For the following list of benchmarks, z3str3 return incorrect result:
All the files listed below are sat.
z3str3 returns unsat for all of them.
CVC4 returns sat for all of them.
Z3 (without z3str3) retrurns sat for all of them. For some it hangs. Those files are marked with (z3 hangs).
I am using the z3 nightly build
Z3 version 4.8.8 - 64 bit
The text was updated successfully, but these errors were encountered: