Merkle Path with Predecessor/Successor + Universal Query Gadget #388

nicholas-mainardi · 2024-10-10T20:31:51Z

First PR for batching query circuits. It provides:

- A gadget for Merkle-path verification that also locates the predecessor and successor nodes in the BST
- A refactoring of the universal query circuit logic into two components, universal_query_hash_gadget and universal_query_value_gadget, which will then be used separately in the new batching circuits

nikkolasg

The logic looks fine (the Merkle tree verification with predecessor/successor extraction). I mostly left comments about the API and the naming, which are a bit confusing to me. Thanks!

nikkolasg · 2024-10-21T09:30:38Z

verifiable-db/src/query/aggregation/mod.rs

+        }
+    }
+
+    /// Build an instance of `Self` without range-check the `UInt256Target`s


Can you document under which conditions this is safe to use? (I'm reading the PR top to bottom, so maybe you explain it below.)

It's up to the caller to determine whether it's safe to use this method or not, depending on how the values are employed. For instance, if the node info is employed to compute the hash of the node, it should be safe to allocate without range checks since the hash will be different in case the prover provides out-of-range limbs.
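
To make the argument above concrete, here is a minimal sketch in plain Rust (not the actual circuit code). It assumes a toy hash built on `DefaultHasher` and a value split into two limbs that should each fit in 32 bits; `toy_hash` and `compose` are hypothetical helpers, not names from this PR. An out-of-range limb can encode the same integer, but it changes the hashed input, so a check against the expected node hash would reject it.

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::{Hash, Hasher};

/// Toy stand-in for the node hash (hypothetical; the real gadget hashes in-circuit).
fn toy_hash(limbs: &[u64]) -> u64 {
    let mut hasher = DefaultHasher::new();
    limbs.hash(&mut hasher);
    hasher.finish()
}

/// Recompose a value from two limbs, each of which is expected to fit in 32 bits.
/// Nothing here enforces that bound, mirroring an allocation without range checks.
fn compose(limbs: [u64; 2]) -> u128 {
    limbs[0] as u128 + ((limbs[1] as u128) << 32)
}
```

With limbs `[5, 1]` and `[(1 << 32) + 5, 0]`, `compose` returns the same integer while `toy_hash` sees two different inputs. This only covers the case where the limbs feed a hash that is later checked; other uses of the values may still need the range checks.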

nikkolasg · 2024-10-21T10:40:14Z

verifiable-db/src/query/merkle_path.rs

+/// Input wires related to the data of the end node whose membership is proven with `MerklePathWithNeighborsGadget`
+pub struct EndNodeInputs {
+    node_min: UInt256Target,
+    node_max: UInt256Target,
+    left_child_exists: BoolTarget,
+    left_child_info: NodeInfoTarget,
+    right_child_exists: BoolTarget,
+    right_child_info: NodeInfoTarget,
+}


Naming: this comment says the struct is used to verify the Merkle path with neighbors, yet the struct contains the children, not the neighbors (predecessor/successor). It's confusing, since right after, the naming seems more consistent:

/// Target containing data about a neighbor of a node (neighbor can be
/// either the predecessor or the successor of a node)
pub struct NeighborInfoTarget

Could you specify what the data is and why it is required here?

Comments added in commit 600a6e9.

nikkolasg · 2024-10-21T10:42:52Z

verifiable-db/src/query/merkle_path.rs

+pub struct MerklePathWithNeighborsTargetInputs<const MAX_DEPTH: usize>
+where
+    [(); MAX_DEPTH - 1]:,
+{
+    pub(crate) path_inputs: MerklePathTargetInputs<MAX_DEPTH>,
+    pub(crate) end_node_inputs: EndNodeInputs,
+}


Naming: here the name says "WithNeighbors", yet there is no neighbor information (predecessor/successor) inside. Could you clarify what the input is and why it's needed there? And why put "neighbors" in the name if the struct doesn't contain neighbor information? Or maybe I'm just missing it; where is it?

Like right after

pub struct MerklePathWithNeighborsTarget<const MAX_DEPTH: usize>

and it uses

pub(crate) predecessor_info: NeighborInfoTarget,

which makes sense there then.

Because MerklePathWithNeighborsTargetInputs is the data structure that contains only the input wires (i.e., the ones that need to be assigned). The information about the neighbors is an output of the gadget, that's why they appear only in the latter data structure. Might be clearer if I rename the data structure to MerklePathWithNeighborsInputTargets maybe?

lol ok, maybe just MPathInputs or something? It's ok not to spell out in the name everything the struct does; personally I find MerklePathWithNeighborsTargetInputs more confusing.

I added the WithNeighbors suffix to distinguish this from the simpler MerklePathTargetInputs, which belongs to the gadget that only checks membership in the Merkle path, without computing data about the predecessor/successor nodes.

nikkolasg · 2024-10-21T10:45:23Z

verifiable-db/src/query/merkle_path.rs

+pub struct MerklePathWithNeighborsGadget<const MAX_DEPTH: usize>
+where
+    [(); MAX_DEPTH - 1]:,
+{
+    path_gadget: MerklePathGadget<MAX_DEPTH>,
+    end_node_min: U256,
+    end_node_max: U256,
+    end_node_children: [Option<NodeInfo>; 2],
+}


Naming: same here, neighbors are mentioned but no neighbor structs are used, only NodeInfo... Could you clarify what the fields are and what the purpose of the struct is, especially with respect to the "neighbors"?

Same as above: there are no neighbors because this gadget is expected to contain only the input values. I will add comments to clarify the purpose of the additional fields. They're needed because, when the predecessor/successor is not in the path, we can get its value from the min and max of the children of the node.

Comments added in commit 600a6e9.
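
The role of the children's min/max described above can be sketched outside the circuit as follows. This is an illustrative plain-Rust version, not the gadget's code: values are `u64` instead of `U256`, the path is assumed to be ordered from the end node's parent up to the root, `min`/`max` are assumed to be the bounds of each child's subtree, and `ChildInfo`, `PathStep`, `predecessor_value` and `successor_value` are hypothetical names.

```rust
/// Simplified stand-in for the data kept about a child of the end node.
#[derive(Clone, Copy, Debug, PartialEq)]
struct ChildInfo {
    min: u64, // smallest value stored in the child's subtree
    max: u64, // largest value stored in the child's subtree
}

/// One ancestor on the path, ordered from the end node's parent up to the root.
#[derive(Clone, Copy)]
struct PathStep {
    value: u64,
    /// True if the node below this ancestor on the path is its left child.
    came_from_left: bool,
}

/// Predecessor: the max of the left subtree if there is one; otherwise the
/// nearest ancestor that the path reaches from its right child.
fn predecessor_value(left_child: Option<ChildInfo>, path: &[PathStep]) -> Option<u64> {
    match left_child {
        Some(child) => Some(child.max),
        None => path.iter().find(|s| !s.came_from_left).map(|s| s.value),
    }
}

/// Successor: the min of the right subtree if there is one; otherwise the
/// nearest ancestor that the path reaches from its left child.
fn successor_value(right_child: Option<ChildInfo>, path: &[PathStep]) -> Option<u64> {
    match right_child {
        Some(child) => Some(child.min),
        None => path.iter().find(|s| s.came_from_left).map(|s| s.value),
    }
}
```

In a tree with root 10, left child 5, and 7 as the right child of 5, the leaf 7 has no children, so both of its neighbors are found on the path. The root has a left child and takes its predecessor from that child's max instead.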

nikkolasg · 2024-10-21T10:53:01Z

verifiable-db/src/query/merkle_path.rs

+fn build_common<const MAX_DEPTH: usize>(
+    b: &mut CircuitBuilder<F, D>,
+    end_node: HashOutTarget,
+    index_id: Target,
+    with_neighbors: bool,
+    end_node_info: Option<EndNodeInputs>,
+) -> MerklePathWires<MAX_DEPTH>


The API is a bit convoluted imo: with_neighbors must always be consistent with end_node_info, and, looking at the actual function logic, most lines concern only the successor/predecessor computation.
Also, maybe I'm missing something, but it looks like all of that logic only needs the node_hash computed by the "normal / without neighbors" logic, is that correct?

What do you think about extracting the small portion that verifies the hashes, returning all the hashes, and doing the successor/predecessor check in its own function?
That would separate the concerns cleanly imo and also avoid the convoluted arguments, since we would no longer mix the two pieces of logic ("verifying the Merkle path" + "extracting the successor").

I initially considered what you suggested too, but I was unsure whether the build_path method would be well-defined. It probably is, since you suggested this structure as well :) Done in commit 600a6e9.
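
The split discussed in this thread could look roughly like the sketch below. It is a plain-Rust illustration under stated assumptions, not the code from commit 600a6e9: `toy_hash` is a stand-in built on `DefaultHasher`, each node is reduced to a single `payload` word, and `Step`, `path_hashes` and `verify_membership` are hypothetical names. The path-hashing function returns every intermediate hash, so a separate predecessor/successor function can consume them without a `with_neighbors` flag.

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::{Hash, Hasher};

/// Toy node hash over (left child hash, right child hash, node payload).
fn toy_hash(left: u64, right: u64, payload: u64) -> u64 {
    let mut hasher = DefaultHasher::new();
    (left, right, payload).hash(&mut hasher);
    hasher.finish()
}

/// Data for one ancestor on the path, ordered from the end node's parent to the root.
struct Step {
    sibling_hash: u64,   // hash of the child that is not on the path
    payload: u64,        // stand-in for min, max, value, etc. of the ancestor
    is_left_child: bool, // whether the node below on the path is the left child
}

/// Concern 1: recompute the hash of every ancestor; the last entry is the root.
fn path_hashes(end_node_hash: u64, steps: &[Step]) -> Vec<u64> {
    let mut hashes = Vec::with_capacity(steps.len());
    let mut current = end_node_hash;
    for step in steps {
        current = if step.is_left_child {
            toy_hash(current, step.sibling_hash, step.payload)
        } else {
            toy_hash(step.sibling_hash, current, step.payload)
        };
        hashes.push(current);
    }
    hashes
}

/// Membership check built only on the path hashes; neighbor extraction would
/// live in its own function that takes the same steps and hashes as inputs.
fn verify_membership(end_node_hash: u64, steps: &[Step], expected_root: u64) -> bool {
    let root = path_hashes(end_node_hash, steps)
        .last()
        .copied()
        .unwrap_or(end_node_hash); // empty path: the end node is the root
    root == expected_root
}
```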

nikkolasg · 2024-10-21T11:14:12Z

verifiable-db/src/query/merkle_path.rs

+        let rest = node_min[i]
+            .to_targets()
+            .into_iter()
+            .chain(node_max[i].to_targets())
+            .chain(once(index_id))
+            .chain(node_value[i].to_targets())
+            .chain(embedded_tree_hash[i].to_targets())
+            .collect_vec();


Could we use the node input target struct which already defines the sequence of elements to hash ?

I am not sure I fully understand what you mean: are you suggesting adding a method to NodeInfoTarget that computes rest, basically? If so, I am not sure how to name this method: computing rest as part of the whole hash is an optimization, but it's not a "well-defined" operation. What do you think?
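
One possible shape for the method discussed here is sketched below in plain Rust. It assumes field elements can be modeled as `u64` words and a 256-bit value as 8 limbs; `NodeInfoSketch` and `payload_elements` are hypothetical names, not part of the PR. The only thing taken from the diff above is the element order: min, max, index id, value, embedded tree hash.

```rust
/// Simplified stand-in for the node data, with u64 words in place of targets.
struct NodeInfoSketch {
    embedded_tree_hash: [u64; 4],
    value: [u64; 8], // limbs of a 256-bit value
    min: [u64; 8],
    max: [u64; 8],
}

impl NodeInfoSketch {
    /// Elements hashed after the two child hashes, in the same order as `rest`
    /// in the quoted diff: min, max, index id, value, embedded tree hash.
    fn payload_elements(&self, index_id: u64) -> Vec<u64> {
        let mut out = Vec::with_capacity(8 + 8 + 1 + 8 + 4);
        out.extend_from_slice(&self.min);
        out.extend_from_slice(&self.max);
        out.push(index_id);
        out.extend_from_slice(&self.value);
        out.extend_from_slice(&self.embedded_tree_hash);
        out
    }
}
```

Keeping the ordering in one method would mean the path gadget and the node-hash computation cannot drift apart, at the cost of exposing a helper that returns only part of the hash preimage.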

delehef and others added 30 commits September 10, 2024 14:48

feat: allow for either aggregation-only or tabular-only queries

cd191cd

Add Merkle-path verification gadget

719a0c0

merge MPT extraction

3794779

is_multiplier

3793aa1

multiplier for rows

ebdd33a

and api

071a2f0

Add revelation circuit unproven offset

ac3443e

refactoring of cell logic

3b735cd

wip

2b3c289

row leaf passing

d3ae335

Add APIs for simple select

ff8039c

Update general query APIs

5079597

partial fixed

36673ef

full node fixed

6713f66

row tree test passing

90f8d1c

Fix build integration test

3fe4b55

testing merge circuit

8dcdc05

API for merge

0ea3f41

adding one more circuit set size

2d7de6f

Fix build groth-16 crate

f8a79d1

wip

e9c3bcb

Refactor query test cases code

dbb8e7a

Merge with main

799a60f

fix: ORDER BY is not supported

b26ea94

Merge pull request #355 from Lagrange-Labs/feat/drop-scalar-check

a36fd13

feat: allow for either aggregation-only or tabular-only queries

WiP: keep both queries types in a single test

dfaa7d7

Merge branch 'main' into feat/tabular-queries

8fc299f

wip

76a4bdd

compiling

4f7aec5

Merge branch 'feat/tabular-queries' into feat/unproven-limit-offset-queries

c0bb9a6

nicholas-mainardi and others added 5 commits October 15, 2024 13:07

Add LIMIT by default only in simple SELECT queries

c87856f

Avoid running wildcard queries on merged tables

c7bd811

Fix parsil test

a5a6962

fix: MemoryStorage::all_keys_at should not return dead keys

94dca85

feat: add random_key_at

e58faee

nikkolasg approved these changes Oct 21, 2024

nicholas-mainardi added 5 commits October 25, 2024 09:47

Address comments

600a6e9

Merge branch 'feat/merge_table' into feat/tabular-queries

85cdb54

Merge branch 'feat/tabular-queries' into feat/unproven-limit-offset-queries

791ba8a

Merge branch 'feat/unproven-limit-offset-queries' into feat/merkle-path-with-successor-gadget

e17ff40

Simple Select Queries with Unchecked Offset (#365)

5e2df85

This PR introduces a circuit to expose the results of simple `SELECT` queries without aggregation functions, avoiding the need to build a results tree.

Base automatically changed from feat/unproven-limit-offset-queries to feat/tabular-queries October 29, 2024 21:06

Merge branch 'feat/tabular-queries' into feat/merkle-path-with-successor-gadget

55e0421

nicholas-mainardi marked this pull request as ready for review October 29, 2024 21:29

nicholas-mainardi and others added 13 commits October 30, 2024 16:24

Fix build

1b7a41c

chore: clippy

5483d50

[parsil] correctly handle LIMIT/OFFSET

60d3f34

[mp2] use more explicit names

35d9623

fix: imports

5bb0490

Merge branch 'main' into feat/tabular-queries

2e32687

Fix integration test + fmt

3644f30

use u32's for offset & limit

6a27c73

Fix test failing from time to time

ec32679

merged with main

a65bf3b

add reset-db to devenv

d23c0b7

Update result check + re-enable test with nonexisting secondary index

d008cf8

Merge branch 'feat/tabular-queries' into feat/merkle-path-with-successor-gadget

c8f4bff

Base automatically changed from feat/tabular-queries to main November 13, 2024 16:08

nicholas-mainardi added 2 commits November 30, 2024 01:33

Merge branch 'main' into feat/merkle-path-with-successor-gadget

71513df

fmt

3bf074c
