Quantifier instantiation via simplistic E-matching by tautschnig · Pull Request #8224 · diffblue/cbmc

tautschnig · 2024-02-27T20:56:15Z

Use E-matching on index expressions to instantiate quantifiers when eager instantiation (aka enumerative instantiation) cannot be applied.

Each commit message has a non-empty body, explaining why the change was made.
Methods or procedures I have added are documented, following the guidelines provided in CODING_STANDARD.md.
The feature or user visible behaviour I have added or modified has been documented in the User Guide in doc/cprover-manual/
Regression or unit tests are included, or existing tests cover the modified code (in this case I have detailed which ones those are in the commit message).
n/a My commit message includes data points confirming performance improvements (if claimed).
My PR is restricted to a single feature or bugfix.
n/a White-space or formatting changes outside the feature-related changed lines are in commits of their own.

tautschnig · 2024-02-27T21:22:08Z

Despite this being a "draft" PR I would like to ask for feedback on the following aspects:

@remi-delmas-3000 on the overall PR and specifically on the Quantifiers-unbounded-array-overflow test case: we'll have to clean this up to make clear what assertions we really expect to fail.
@kroening and @martin-cs : the expression matching implementation is really crude. Any and all suggestions would be most appreciated.

codecov · 2024-03-05T14:15:11Z

Codecov Report

❌ Patch coverage is 70.40359% with 66 lines in your changes missing coverage. Please review.
✅ Project coverage is 80.00%. Comparing base (15eb10a) to head (0ae286e).

Files with missing lines	Patch %	Lines
src/solvers/flattening/boolbv_quantifier.cpp	70.40%	66 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #8224      +/-   ##
===========================================
- Coverage    80.00%   80.00%   -0.01%     
===========================================
  Files         1700     1700              
  Lines       188252   188470     +218     
  Branches        73       73              
===========================================
+ Hits        150613   150783     +170     
- Misses       37639    37687      +48

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

tautschnig · 2024-03-22T10:57:00Z

@remi-delmas-3000 on the overall PR and specifically on the Quantifiers-unbounded-array-overflow test case: we'll have to clean this up to make clear what assertions we really expect to fail.

This is now done, comment explaining the (expected) verification failure is now included.

kroening · 2024-05-01T16:58:40Z

This needs serious consideration: failing quantifier instantiation is the leading source of proof brittleness, and we are opening the door to that here.

kroening · 2024-05-01T16:59:06Z

I would much prefer a more deterministic, predictable and robust solution to this.

martin-cs · 2024-05-05T21:14:34Z

Hard maybe?

So, it would be good but there is going to be a limit on how much we can do like this. Unless we put a refinement loop around the solver and do something like MBQI. Getting that stable and reliable is tricky. Given the amount of work that has gone into E-matching in Z3 and Andy's decade+ of work on quantifiers in cvc5, I wonder whether it would be more efficient (in terms of dev-hours / things proved) to improve our SMT integration.

To actually answer the question you asked, things that might help:

Make sure both the terms in the quantifiers and the expressions you are using as context are rewritten.
Use a union-find so that "up to equality" is handled. The modern approach might be to use an E-Graph data structure for both this and previous.
You can do a number of optimisations if the quantified variable(s) are only used once in the body or are only used in one atom. That should handle the simple cases but when you have \forall x, y, z . p(x,y) && q(y,z) && r(x,z) I think you need to get into term indexing and ordering the matching.

HTH

Copilot

Pull request overview

This PR implements quantifier instantiation via E-matching for cases where eager (enumerative) instantiation cannot be applied. The implementation uses pattern matching on array index expressions to find suitable instantiation values from the solver's cache.

Key changes:

Added E-matching based quantifier instantiation as a fallback when eager instantiation fails
Removed the lb > ub check in eager instantiation (correctly handles empty ranges with vacuous truth/false)
Updated test expectations to reflect that previously unsupported quantifier cases now work

Reviewed changes

Copilot reviewed 10 out of 10 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
src/solvers/flattening/boolbv_quantifier.cpp	Implements E-matching algorithm via `instantiate_one_quantifier` function and modifies `finish_eager_conversion_quantifiers` to apply it
regression/cbmc/Quantifiers-unbounded-array/test.desc	Test descriptor for new unbounded array quantifier test case
regression/cbmc/Quantifiers-unbounded-array/main.c	Test case demonstrating quantifier instantiation with unbounded arrays and loop invariants
regression/cbmc/Quantifiers-unbounded-array-overflow/test.desc	Test descriptor for overflow detection with quantifiers
regression/cbmc/Quantifiers-unbounded-array-overflow/main.c	Test case that validates overflow detection in loop invariants with quantifiers
regression/cbmc/Quantifiers-two-dimension-array/no-upper-bound.desc	Test descriptor for 2D array quantifier handling
regression/cbmc/Quantifiers-two-dimension-array/no-upper-bound.c	Test case for nested quantifiers over 2D arrays
regression/cbmc/Quantifiers-type/test.desc	Updated test expectations to reflect working quantifier support
regression/cbmc/Quantifiers-invalid-var-range/test.desc	Updated test expectations from KNOWNBUG to CORE status
doc/cprover-manual/cbmc-assertions.md	Updated documentation from "future releases will support" to "CPROVER supports" for quantifiers

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2025-12-19T15:13:45Z

+  else
+    return conjunction(instantiations);


Missing defensive check for quantifier type. Add an assertion or UNREACHABLE statement after the if-else chain to handle the case where q_expr.id() is neither ID_exists nor ID_forall, similar to the pattern used in eager_quantifier_instantiation at line 265.

Suggested change

else

return conjunction(instantiations);

else if(q_expr.id() == ID_forall)

return conjunction(instantiations);

UNREACHABLE;

return {};

conjunction(...) and disjunction(...) helper functions produce the appropriate result for empty operand sequences.

Use E-matching on index expressions to instantiate quantifiers when eager instantiation (aka enumerative instantiation) cannot be applied.

Replace the simplistic E-matching in instantiate_one_quantifier with a complete instantiation approach based on: Yeting Ge and Leonardo de Moura, "Complete instantiation for quantified formulas in Satisfiability Modulo Theories", CAV 2009. The paper shows that for essentially uninterpreted formulas (where quantified variables only appear as arguments of uninterpreted functions), the quantifier can be eliminated by instantiating with a finite set of ground terms computed as the least fixed point of a system of set constraints (ΔF). In CBMC's context, array accesses (index_exprt) play the role of uninterpreted functions. Key changes: - find_index_contexts: Properly decomposes array index expressions into bound_var + offset form using pre-order traversal. Handles patterns var, cast(var), var+r, r+var, var-r. This implements the offset extension from Section 4 of the paper, where for f(x_i + r) we generate constraints S_{k,i} + r ⊆ A_{f,j} and A_{f,j} - r ⊆ S_{k,i}. - collect_ground_indices: Collects ground index terms from both index_exprt (array reads) and with_exprt (array writes) in the bitvector cache, seeding the sets A_{f,j} from the paper. - compute_instantiation_set: Computes the least fixed point of the set constraint system ΔF. Uses integer arithmetic when all offsets and ground indices are numeric constants (avoiding expression growth), with a symbolic fallback using simplify_expr. Bounded by max_iterations and max_terms to handle non-stratified (non-FEU) cases. - arrays_match: SSA-aware array comparison using L1 object identifiers to match across SSA versions. For nested array accesses (e.g., multi-dimensional arrays like a[i][j]), recursively matches the base array and compares simplified indices, since the pattern may contain unsimplified arithmetic (e.g., cast(0 % 2)) that is semantically equal to a constant in the cache. Instantiation terms are sorted (via irept::operator<) before generating the conjunction/disjunction to ensure deterministic SAT clause ordering. Without this, MiniSat performance varies from sub-second to minutes on the same UNSAT instance depending on hash-dependent iteration order, which changes with file paths embedded in source locations. This enables verification of properties that depend on relationships between adjacent array elements, such as: forall j: arr[j+1] == arr[j] + 1 which the previous E-matching could not handle because it did not generate the ground terms needed for the offset positions.

Add two regression tests exercising the complete instantiation approach: - Quantifiers-offset-instantiation: Tests the offset pattern arr[j+1] from Section 4 of Ge & de Moura (2009). The quantifier forall j < n-1: arr[j+1] == arr[j] + 1 requires the set constraint fixed-point to generate ground terms for both arr[j] and arr[j+1] positions. Without the offset handling, the instantiation misses the terms needed to chain the equalities. - Quantifiers-fixpoint-instantiation: Tests that the fixed-point computation correctly propagates through multiple iterations. The property arr[2] > 0 requires two steps of reasoning: j=0: arr[0]>0 => arr[1]>0 j=1: arr[1]>0 => arr[2]>0 This exercises the iterative nature of the ΔF fixed-point from Section 3 of the paper, where instantiating with one set of terms generates new ground terms that feed back into the next iteration.

Add TIMEOUT 1200 to the set_tests_properties call in both copies of the add_test_pl_profile macro (regression/CMakeLists.txt and regression/libcprover-cpp/CMakeLists.txt). This ensures CTest will kill and report as failed any regression test that exceeds 20 minutes, preventing CI jobs from hanging indefinitely on tests that time out.

Add a -t <secs> option (and TESTPL_TIMEOUT environment variable) to test.pl for enforcing a wall-clock timeout on each individual test. The implementation replaces the system() call in run() with fork/exec, placing the child in its own process group (setpgrp). When the timeout fires via Perl's alarm(), the entire process group is killed with SIGKILL, ensuring all child processes (bash, cbmc, solvers) are cleaned up. On timeout the test is marked as failed and a synthetic output file is written so that the pattern-matching logic does not crash. The timeout defaults to 0 (disabled), preserving existing behaviour unless -t is explicitly passed or TESTPL_TIMEOUT is set.

tautschnig force-pushed the features/quantifiers-elimination branch from e08d98e to 24806af Compare February 27, 2024 21:23

tautschnig assigned tautschnig, kroening, remi-delmas-3000 and martin-cs Feb 27, 2024

tautschnig added the Solvers label Feb 27, 2024

tautschnig force-pushed the features/quantifiers-elimination branch from 24806af to 69f8720 Compare March 5, 2024 13:43

tautschnig removed their assignment Mar 5, 2024

tautschnig marked this pull request as ready for review March 5, 2024 14:06

tautschnig requested review from TGWDB, jimgrundy, kroening, martin-cs and peterschrammel as code owners March 5, 2024 14:06

tautschnig force-pushed the features/quantifiers-elimination branch from 6e48e44 to 49b3909 Compare March 5, 2024 15:15

tautschnig self-assigned this Mar 5, 2024

tautschnig force-pushed the features/quantifiers-elimination branch 2 times, most recently from 7bb221f to 64252a0 Compare March 6, 2024 08:59

tautschnig force-pushed the features/quantifiers-elimination branch 2 times, most recently from d821caa to 090c84a Compare March 22, 2024 10:55

tautschnig unassigned remi-delmas-3000 and tautschnig Mar 22, 2024

tautschnig force-pushed the features/quantifiers-elimination branch from 090c84a to 72550cf Compare March 22, 2024 11:12

tautschnig force-pushed the features/quantifiers-elimination branch from 72550cf to 5e379d6 Compare June 11, 2024 12:23

tautschnig mentioned this pull request Jun 11, 2024

CBMC wavefront does not terminate on simple decreases clause in loop #8301

Open

tautschnig force-pushed the features/quantifiers-elimination branch from 5e379d6 to e076dab Compare July 12, 2024 11:38

tautschnig assigned tautschnig and unassigned kroening and martin-cs Jul 12, 2024

tautschnig force-pushed the features/quantifiers-elimination branch 2 times, most recently from 2a85427 to 9be48ee Compare March 25, 2025 10:28

Copilot AI review requested due to automatic review settings December 19, 2025 15:09

tautschnig force-pushed the features/quantifiers-elimination branch from 9be48ee to 9e896a5 Compare December 19, 2025 15:09

Copilot started reviewing on behalf of tautschnig December 19, 2025 15:09 View session

Copilot AI reviewed Dec 19, 2025

View reviewed changes

tautschnig force-pushed the features/quantifiers-elimination branch from 9e896a5 to d7eba68 Compare December 19, 2025 15:28

tautschnig added 4 commits February 23, 2026 10:08

Eager quantifier elimination: support empty ranges

55af2a4

conjunction(...) and disjunction(...) helper functions produce the appropriate result for empty operand sequences.

Quantifier instantiation via simplistic E-matching

3f746db

Use E-matching on index expressions to instantiate quantifiers when eager instantiation (aka enumerative instantiation) cannot be applied.

tautschnig force-pushed the features/quantifiers-elimination branch from d7eba68 to a8283e2 Compare February 23, 2026 16:28

tautschnig added 2 commits February 24, 2026 11:39

tautschnig force-pushed the features/quantifiers-elimination branch from eac08f6 to 0ae286e Compare February 24, 2026 11:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Quantifier instantiation via simplistic E-matching#8224

Quantifier instantiation via simplistic E-matching#8224
tautschnig wants to merge 6 commits into
diffblue:developfrom
tautschnig:features/quantifiers-elimination

tautschnig commented Feb 27, 2024 •

edited

Loading

Uh oh!

tautschnig commented Feb 27, 2024

Uh oh!

codecov Bot commented Mar 5, 2024 •

edited

Loading

Uh oh!

tautschnig commented Mar 22, 2024

Uh oh!

kroening commented May 1, 2024

Uh oh!

kroening commented May 1, 2024

Uh oh!

martin-cs commented May 5, 2024

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Dec 19, 2025

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

-  else
-    return conjunction(instantiations);
+  else if(q_expr.id() == ID_forall)
+    return conjunction(instantiations);
+  UNREACHABLE;
+  return {};

Conversation

tautschnig commented Feb 27, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tautschnig commented Feb 27, 2024

Uh oh!

codecov Bot commented Mar 5, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

tautschnig commented Mar 22, 2024

Uh oh!

kroening commented May 1, 2024

Uh oh!

kroening commented May 1, 2024

Uh oh!

martin-cs commented May 5, 2024

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Dec 19, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

tautschnig commented Feb 27, 2024 •

edited

Loading

codecov Bot commented Mar 5, 2024 •

edited

Loading