EPIC: Storage #12986

tac0turtle · 2022-08-22T06:36:14Z

alexanderbez · 2022-08-22T14:53:28Z

Thanks for writing up this up @marbar3778! Seems like a pretty good summary to me. Just to highlight and reiterate, things that I think are paramount:

Single logical DB utilization for atomic commits (giving optionality for separate logical DBs when desired by a module).
Refactor store APIs and how stores are registered

One thing I'm still hazy on is IAVL vs some future better accumulator tree structure (e.g. JMT). How much time do we want to spend on IAVL and say refactoring the key layout vs just switching to something better like the JMT?

tac0turtle · 2022-08-25T14:29:31Z

One thing I'm still hazy on is IAVL vs some future better accumulator tree structure (e.g. JMT). How much time do we want to spend on IAVL and say refactoring the key layout vs just switching to something better like the JMT?

Talking with @ValarDragon he believes the change would take 1 week or less of work after a repo cleanup is upstreamed to our version. The JMT work would take much longer. This could be an easy win until we land JMT or something similar.

Would you want to dive into the key format change?

alexanderbez · 2022-08-25T20:01:03Z

Awesome!

Would you want to dive into the key format change?

Sure! From what I've read in the JMT paper, we can adopt a similar tactic, i.e. keys take the form of version || nibble path (e.g. 45 || 0010101) where || denotes concatenation , I'm not sure IAVL uses bit nibble paths for keys, but it is binary and thus we can a similar/identical format. Correct me if I'm wrong @ValarDragon.

adu-crypto · 2022-08-26T08:25:40Z

are we planning on switching to the JMT and refactoring the whole storage?

adu-crypto · 2022-08-26T08:28:04Z

For current store v2 work, besides the SMT itself compared with IAVL tree, I personally think the new store v2 comes with two major differences from store v1:

store v2 uses backing db transaction to commit and control version, which means every time store is committed, db tx is discarded and root store needs to be reloaded next time for read/write. While store v1 manages the version by the tree itself and don’t need to discard tx and reload store after commitment.
store v1 iavlStore caches store nodes in memory, I don’t think store v2 caches smt/databucket/indexbucket in memory

alexanderbez · 2022-08-26T17:37:12Z

are we planning on switching to the JMT and refactoring the whole storage?

The strategy will be as follows (in order):

Refactor store package (API cleaning and improvement -- I'd like to use signal logical DB)
Relatively low-lift/small refactor & improvement of IAVL
Eventual implementation of JMT (~q1 2023)

alexanderbez · 2022-08-26T17:37:48Z

As for the current v2, I think there might be a chance that is actually discarded -- I'll let @marbar3778 chime in on that

tac0turtle · 2022-08-29T12:19:43Z

V2 is still being evaluated. Much of the team has been in and out the past couple weeks so we have been unable to review in a timely manner. There are some concerns taken in the current design and the performance of v2 as it currently stands is not drastically more than v1 with iavl.

With a week or two of work on iavl and another week or two in the store package there is a high probability that v1+iavl will become more performant than v2+smt. For now we are opting to move forward with v1+iavl until we fully evaluate the design decisions of v2+smt.

tac0turtle · 2022-08-29T12:27:54Z

The strategy will be as follows (in order):

Refactor store package (API cleaning and improvement -- I'd like to use signal logical DB)

Relatively low-lift/small refactor & improvement of IAVL

Eventual implementation of JMT (~q1 2023)

lets start with the first two and add in specs/docs for the store package. Do you want to lead it @alexanderbez?

alexanderbez · 2022-08-30T05:08:50Z

The strategy will be as follows (in order):

Refactor store package (API cleaning and improvement -- I'd like to use signal logical DB)

Relatively low-lift/small refactor & improvement of IAVL

Eventual implementation of JMT (~q1 2023)

lets start with the first two and add in specs/docs for the store package. Do you want to lead it @alexanderbez?

yes ser

yihuang · 2022-09-14T01:50:28Z

FYI, v2 store looks not bad in terms of db size reduction: evmos/ethermint#1304 (comment)
But this benchmark didn't count in the cost of snapshot versioning.

yihuang · 2022-09-14T03:21:42Z

The low level db snapshot/checkpoints hard-links the db files, it seems pretty costly in terms of db size intuitively (need some tests to confirm).

https://hackmd.io/@2O2cXDfdQpijemGc_vlHCA/SkXxuTAli
I wrote a proposal to store historical versions explicitly in a similar way to erigon, to not rely on low level db checkpoints.

adu-crypto · 2022-09-14T06:17:59Z

FYI, v2 store looks not bad in terms of db size reduction: evmos/ethermint#1304 (comment) But this benchmark didn't count in the cost of snapshot versioning.

I agree that if we really want to make the blockchain network more usable, we have to take the db size of archive/full node into consideration.

tac0turtle · 2022-09-14T08:42:23Z

Seems like the size comparison is of smt vs iavl. Smt having a smaller footprint is known, this is not part of the worry, the worry is that the semantics of iavl and store v1 are not known and a new implementation was done without this understanding.

Did the benchmark account for the historical versions on disk or only what is in the tree? The data in smt is not historical but only current so it makes sense that its smaller

JayT106 · 2022-09-15T21:32:48Z

The low level db snapshot/checkpoints hard-links the db files, it seems pretty costly in terms of db size intuitively (need some tests to confirm).

not sure why the checkpoints consumes a lot of disk space but definitely need to be checked out
#12251 (comment)

yihuang · 2022-10-06T03:32:02Z

An alternative db design for historical states, moving the discussion here.

reduce archive node db size dramatically (10x)
not consensus breaking

tac0turtle · 2023-06-28T09:37:10Z

closing this epic for now, @alexanderbez when you have a chance could you create a new one for the upcoming storage work?

alexanderbez self-assigned this Aug 22, 2022

alexanderbez added T:Epic Epics C:Store labels Aug 22, 2022

alexanderbez mentioned this issue Aug 22, 2022

Clarify/Change merkelized DB expected delete API #12989

Closed

kocubinski mentioned this issue Aug 29, 2022

EPIC: determinism with maps #13039

Closed

tac0turtle added this to Cosmos-SDK Aug 31, 2022

tac0turtle moved this to 📝 Todo in Cosmos-SDK Aug 31, 2022

tac0turtle assigned facundomedica Sep 5, 2022

alexanderbez mentioned this issue Sep 23, 2022

Store plain historical states to reduce the db size of archive node #13317

Closed

alexanderbez mentioned this issue Oct 10, 2022

Problem: iavl stores end up with inconsistent versions after upgrade #13477

Closed

dangush mentioned this issue Nov 22, 2022

docs: Spec on current cachekv implementation #13977

Merged

13 tasks

angbrav mentioned this issue Nov 25, 2022

docs: store spec template/guideline #14020

Merged

14 tasks

angbrav mentioned this issue Dec 20, 2022

docs: inter-block cache specification #14370

Merged

14 tasks

tac0turtle added the Q2:2023 label Apr 3, 2023

tac0turtle closed this as completed Jun 28, 2023

github-project-automation bot moved this from 📝 Todo to 👏 Done in Cosmos-SDK Jun 28, 2023

tac0turtle removed this from Cosmos-SDK Jul 18, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

EPIC: Storage #12986

EPIC: Storage #12986

tac0turtle commented Aug 22, 2022 •

edited

Loading

alexanderbez commented Aug 22, 2022

tac0turtle commented Aug 25, 2022

alexanderbez commented Aug 25, 2022 •

edited

Loading

adu-crypto commented Aug 26, 2022

adu-crypto commented Aug 26, 2022

alexanderbez commented Aug 26, 2022

alexanderbez commented Aug 26, 2022

tac0turtle commented Aug 29, 2022

tac0turtle commented Aug 29, 2022 •

edited

Loading

alexanderbez commented Aug 30, 2022

yihuang commented Sep 14, 2022 •

edited

Loading

yihuang commented Sep 14, 2022

adu-crypto commented Sep 14, 2022

tac0turtle commented Sep 14, 2022

JayT106 commented Sep 15, 2022 •

edited

Loading

yihuang commented Oct 6, 2022 •

edited

Loading

tac0turtle commented Jun 28, 2023

EPIC: Storage #12986

EPIC: Storage #12986

Comments

tac0turtle commented Aug 22, 2022 • edited Loading

Summary

Problem Definition

Proposal

Work Breakdown

Phase 1

Phase 2

Phase 3

alexanderbez commented Aug 22, 2022

tac0turtle commented Aug 25, 2022

alexanderbez commented Aug 25, 2022 • edited Loading

adu-crypto commented Aug 26, 2022

adu-crypto commented Aug 26, 2022

alexanderbez commented Aug 26, 2022

alexanderbez commented Aug 26, 2022

tac0turtle commented Aug 29, 2022

tac0turtle commented Aug 29, 2022 • edited Loading

alexanderbez commented Aug 30, 2022

yihuang commented Sep 14, 2022 • edited Loading

yihuang commented Sep 14, 2022

adu-crypto commented Sep 14, 2022

tac0turtle commented Sep 14, 2022

JayT106 commented Sep 15, 2022 • edited Loading

yihuang commented Oct 6, 2022 • edited Loading

tac0turtle commented Jun 28, 2023

tac0turtle commented Aug 22, 2022 •

edited

Loading

alexanderbez commented Aug 25, 2022 •

edited

Loading

tac0turtle commented Aug 29, 2022 •

edited

Loading

yihuang commented Sep 14, 2022 •

edited

Loading

JayT106 commented Sep 15, 2022 •

edited

Loading

yihuang commented Oct 6, 2022 •

edited

Loading