Persistent structure-based term identifiers #1830

samcowger · 2023-03-02T20:16:03Z

This adds a field to STApp-constructed Terms to store the hash of the TermF they encompass, giving shared terms a readily-available, (relatively) unique identifier based solely on the shape/structure of their subterms. The goal of this is to provide terms with a persistent, concise reference that does not change between SAW invocations.

In doing this, I changed the Hashable and Eq instances to leverage this field. Before, Hashable was implemented solely in terms of stAppIndex, which was basically guaranteed unique for any single SAW invocation. Now, Hashable is implemented solely in terms of this stored hash. This slightly increases the possibility of term hash collisions, but I believe the likelihood of collisions is no worse than that of automatically-derived instances. RFC: should we now handwrite a Hashable instance for TermF to mitigate this risk somewhat?
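To make the change concrete, here is a minimal sketch of a term type that caches a structural hash and hashes shared terms through that cache rather than through the index. It is an illustration under assumptions, not the saw-core code: the two-constructor `TermF`, the `mix` function, the salt `5381`, and the `mkShared` helper are all hypothetical, and the `Hashable` class is a local stand-in that mirrors the `hashWithSalt` method of the `hashable` package so the example builds with `base` alone.

```haskell
import Data.Bits (xor)
import Data.List (foldl')

-- Local stand-in for the Hashable class of the hashable package
-- (assumption: defined here only so the sketch builds with base alone).
class Hashable a where
  hashWithSalt :: Int -> a -> Int

-- Hypothetical mixing step; not the function the hashable package uses.
mix :: Int -> Int -> Int
mix salt x = (salt * 16777619) `xor` x

instance Hashable Int where
  hashWithSalt = mix

instance Hashable Char where
  hashWithSalt salt = mix salt . fromEnum

instance Hashable a => Hashable [a] where
  hashWithSalt = foldl' hashWithSalt

-- Toy one-layer term functor (far smaller than saw-core's TermF).
data TermF t = Var String | App t t

instance Hashable t => Hashable (TermF t) where
  hashWithSalt salt (Var x)   = salt `hashWithSalt` (0 :: Int) `hashWithSalt` x
  hashWithSalt salt (App f a) = salt `hashWithSalt` (1 :: Int) `hashWithSalt` f `hashWithSalt` a

data Term
  = STApp
      { stAppIndex :: !Int            -- unique within one run only
      , stAppHash  :: !Int            -- cached hash of stAppTermF
      , stAppTermF :: !(TermF Term)
      }
  | Unshared !(TermF Term)

-- Hash shared terms through the cached field, never through the index.
instance Hashable Term where
  hashWithSalt salt (STApp _ h _)  = hashWithSalt salt h
  hashWithSalt salt (Unshared tf) = hashWithSalt salt tf

-- Hypothetical smart constructor: compute the structural hash once.
mkShared :: Int -> TermF Term -> Term
mkShared idx tf = STApp idx (hashWithSalt 5381 tf) tf

main :: IO ()
main = do
  let x1 = mkShared 0 (Var "x")
      t1 = mkShared 1 (App x1 x1)
      -- Same structure, different indices, as if built in another session.
      x2 = mkShared 40 (Var "x")
      t2 = mkShared 41 (App x2 x2)
  print (stAppIndex t1 == stAppIndex t2)          -- indices differ
  print (hashWithSalt 0 t1 == hashWithSalt 0 t2)  -- hashes agree
```

Because the hash of an `App` node only consults the cached hashes of its children, computing it is a constant amount of work per node once the subterms have been built.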

One use case for this (which this PR does not implement, but which a future PR will) is to let users print terms with memoization based on these references, rather than on integer tags that are unique only within a single invocation and change from run to run. This would help users keep their bearings in large, heavily memoized terms (e.g. a proof goal) as they enter and exit SAW REPLs and proof subshells during proof development.

I originally made these changes to avoid computing `TermF` hashes twice, but in doing so I assumed that term hashes never collide. The cost of that assumption far outweighs the cost of recomputing the hash, which is actually quite cheap, at least for terms with cached subterm hashes (which I expect to be most terms).
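As a general illustration of why a hash-keyed table should not assume collision-free hashes, the sketch below stores a bucket of key/value pairs under each hash and compares keys inside the bucket. This is not saw-core's `TermFMap`; the `HashBucketMap` type, its functions, and the deliberately weak `length` hash are hypothetical and chosen only to force a collision.

```haskell
import qualified Data.IntMap.Strict as IntMap
import Data.IntMap.Strict (IntMap)
import Data.Maybe (fromMaybe)

-- Hypothetical map keyed by hash; each hash owns a bucket of (key, value)
-- pairs so that colliding keys stay distinct.
newtype HashBucketMap k v = HashBucketMap (IntMap [(k, v)])

emptyH :: HashBucketMap k v
emptyH = HashBucketMap IntMap.empty

-- Insert or overwrite: the hash picks the bucket, Eq picks the entry.
insertH :: Eq k => (k -> Int) -> k -> v -> HashBucketMap k v -> HashBucketMap k v
insertH hashOf k v (HashBucketMap m) =
  HashBucketMap (IntMap.alter (Just . upsert . fromMaybe []) (hashOf k) m)
  where
    upsert bucket = (k, v) : filter ((/= k) . fst) bucket

-- Look up: find the bucket by hash, then the exact key within it.
lookupH :: Eq k => (k -> Int) -> k -> HashBucketMap k v -> Maybe v
lookupH hashOf k (HashBucketMap m) = IntMap.lookup (hashOf k) m >>= lookup k

main :: IO ()
main = do
  -- A deliberately weak hash: "ab" and "cd" collide.
  let weakHash = length :: String -> Int
      m = insertH weakHash "cd" (2 :: Int) (insertH weakHash "ab" 1 emptyH)
  print (lookupH weakHash "ab" m)
  print (lookupH weakHash "cd" m)
  print (lookupH weakHash "ef" m)
```

A table keyed on the hash alone would return the same entry for `"ab"` and `"cd"`; the in-bucket comparison is the extra work that guards against that.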

samcowger · 2023-03-03T18:37:04Z

One alternative to the approach implemented here, if there is substantial concern about changing the Hashable instance, would be to define a parallel class, say Hashable', whose only purpose would be to provide a structure-based hash in tandem with the current index-based hash, and implement it on the Term and its subordinate types. We would still need to modify the representation of STApp Terms to cache this hash, but could otherwise leave the Hashable and Eq instances unchanged.
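A rough sketch of what that alternative might look like follows. Only the class name `Hashable'` comes from the comment above; the method name `structHash`, the toy `TermF`, the `mix` function, and the `indexHash` stand-in for the existing index-based hash are assumptions made for illustration.

```haskell
import Data.Bits (xor)
import Data.List (foldl')

-- The parallel class; the method name structHash is hypothetical.
class Hashable' a where
  structHash :: a -> Int

-- Hypothetical mixing step.
mix :: Int -> Int -> Int
mix h x = (h * 33) `xor` x

-- Toy one-layer term functor.
data TermF t = Var String | App t t

data Term
  = STApp
      { stAppIndex :: !Int            -- per-run unique index
      , stAppHash  :: !Int            -- cached structural hash
      , stAppTermF :: !(TermF Term)
      }
  | Unshared !(TermF Term)

instance Hashable' t => Hashable' (TermF t) where
  structHash (Var x)   = foldl' (\h c -> mix h (fromEnum c)) 5381 x
  structHash (App f a) = mix (mix 7 (structHash f)) (structHash a)

-- Shared terms answer from the cache; unshared terms recompute.
instance Hashable' Term where
  structHash (STApp _ h _) = h
  structHash (Unshared tf) = structHash tf

-- Stand-in for the existing index-based hash, which stays as it was.
indexHash :: Term -> Int
indexHash (STApp i _ _) = i
indexHash (Unshared tf) = structHash tf

-- Hypothetical smart constructor that fills the cache.
mkShared :: Int -> TermF Term -> Term
mkShared idx tf = STApp idx (structHash tf) tf

main :: IO ()
main = do
  let a = mkShared 3 (Var "x")
      b = mkShared 8 (Var "x")
  print (indexHash a == indexHash b)    -- index-based hashes differ
  print (structHash a == structHash b)  -- structural hashes agree
```

The trade-off is that callers must opt in to the structural hash explicitly, while every existing use of the index-based hash keeps its current behavior.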

eddywestbrook

While you are changing the Term datatype, please add a haddock description to it to explain the design concepts associated with it, including: that the STApp constructor implements hash-consing (maybe put a pointer to Wikipedia or something to explain the concept?) using the stAppIndex field; that the stAppIndex field is always meant to be unique; that the stAppTermF field "ties the knot" for the TermF data-type (it would be great if you could dig up a reference for this style of datatype functor, but it's fine if you can't); and that the stAppHash and stAppFreeVars fields cache the free variables and hash values of the term.
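For readers unfamiliar with the idea, here is a small sketch of hash-consing style sharing. It is a simplification under stated assumptions: an ordered `Data.Map` stands in for the hash table, the `Cache` type and `intern` function are hypothetical, and the real `Term` has more fields than shown. The point it illustrates is that once subterms are interned, comparing unique indices is enough to decide whether a node already exists.

```haskell
import qualified Data.Map.Strict as Map

-- One layer of term structure; the parameter t is where subterms go.
data TermF t = Var String | App t t
  deriving (Eq, Ord)

-- "Tying the knot": the subterms of a Term's TermF are Terms themselves.
data Term = STApp
  { stAppIndex :: !Int            -- unique per interned node
  , stAppTermF :: !(TermF Term)
  }

-- Interned terms are compared by index alone.
instance Eq Term where
  a == b = stAppIndex a == stAppIndex b

instance Ord Term where
  compare a b = compare (stAppIndex a) (stAppIndex b)

-- Hypothetical intern table: next fresh index plus the nodes seen so far.
data Cache = Cache
  { nextIndex :: !Int
  , table     :: !(Map.Map (TermF Term) Term)
  }

emptyCache :: Cache
emptyCache = Cache 0 Map.empty

-- Return the existing node for this structure, or allocate a fresh one.
intern :: TermF Term -> Cache -> (Term, Cache)
intern tf c@(Cache n tbl) =
  case Map.lookup tf tbl of
    Just t  -> (t, c)
    Nothing -> let t = STApp n tf
               in (t, Cache (n + 1) (Map.insert tf t tbl))

main :: IO ()
main = do
  let (x,   c1) = intern (Var "x") emptyCache
      (x',  c2) = intern (Var "x") c1
      (fx,  c3) = intern (App x x') c2
      (fx', c4) = intern (App x x) c3
  print (stAppIndex x == stAppIndex x')    -- same node reused
  print (stAppIndex fx == stAppIndex fx')  -- same node reused
  print (nextIndex c4)                     -- number of distinct nodes
```

Here the four `intern` calls allocate only two nodes, because structurally identical requests return the node that already exists.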

saw-core/src/Verifier/SAW/Term/Functor.hs

samcowger · 2023-03-04T02:33:37Z

> While you are changing the Term datatype, please add a haddock description to it to explain the design concepts associated with it, including: that the STApp constructor implements hash-consing (maybe put a pointer to Wikipedia or something to explain the concept?) using the stAppIndex field; that the stAppIndex field is always meant to be unique; that the stAppTermF field "ties the knot" for the TermF data-type (it would be great if you could dig up a reference for this style of datatype functor, but it's fine if you can't); and that the stAppHash and stAppFreeVars fields cache the free variables and hash values of the term.

Can do!

saw-core/src/Verifier/SAW/Term/Functor.hs

eddywestbrook

Excellent, thanks for adding those haddocks!

samcowger added 9 commits February 13, 2023 13:49

Add stAppHash field to Term to allow term hash memoization

09e6927

Change Hashable Term instance to leverage stored hash

83b0fcd

Modify TermFMap representation to leverage precomputation of term hashes

27a4dd5

Switch term hashing behavior

729ec84

Silence warning

3bee817

Update Hashable Term instance to mitigate some collisions

648471b

Whitespace

b5b4a2d

Update Eq Term instance to satisfy hashWithSalt contract

e0f90f2

samcowger requested review from eddywestbrook and andreistefanescu March 2, 2023 20:16

Merge branch 'master' into persistent-term-hashing

5d5fa94

samcowger mentioned this pull request Mar 3, 2023

Maintaining grounding in large terms during proof development is difficult #1831

Open

eddywestbrook suggested changes Mar 4, 2023

eddywestbrook reviewed Mar 4, 2023

samcowger added 5 commits March 6, 2023 09:45

Haddock

b64d96b

Better Haddock

f2b5bd3

Update PrimName hashing

9913e5d

Comments

2e61cb2

Merge branch 'master' into persistent-term-hashing

2805fdb

samcowger mentioned this pull request Mar 8, 2023

Expose term hashes in memoization #1837

Open

Merge branch 'master' into persistent-term-hashing

286cc56

andreistefanescu approved these changes Mar 14, 2023

samcowger added 2 commits March 29, 2023 10:40

Merge branch 'master' into persistent-term-hashing

c9cebb8

Revert equality to its original index-based semantics

3745791

eddywestbrook approved these changes Mar 30, 2023

samcowger added the "PR: ready to merge" label Mar 30, 2023

mergify bot merged commit ab51c9b into master Mar 30, 2023

m-yac mentioned this pull request May 16, 2023

Update Hashable Term instance so alphaEquiv t1 t2 implies hash t1 == hash t2 #1869

Open

RyanGlScott deleted the persistent-term-hashing branch March 22, 2024 14:57
