new transactions #178

brennanjl · 2023-08-11T06:48:25Z

This branch contains a basic restructuring of transactions, signatures, serialization, and payloads. I'd love feedback on this.

This contains quite a bit of duplication with existing code. I did not want to break the build, but I tried to segment it so it would make sense.

Transactions

The new transactions structure takes into account the weird situation we ran into with signed message vs transactions (the old transactions structure was not super flexible). This new structure allows us to contain really any sort of message. It also is designed to not be coupled to the message signer (moved to Signature).

Furthermore, it also removes the needs for a distinctly defined PayloadType (as found in pkg/tx), in favor of type switching. I believe this is better, but am not sure.

This can be found in pkg/transactions.

Signatures

The new signatures are similar to the old ones, except they should be more flexible for supporting a lot of different signatures from different keys. Essentially, the sender, message, signature, and signature type are all contained within the Signature. The message would actually just be the hash of the transaction payload. When it comes time to verify it, the verifier would hash the payload to make sure it matches the data contained in Signature, and if successful, simply verifies the signature. There is an example of this in the new transactions package.

I have stubbed out the structure for private keys, public keys, and addresses for different key pairs. I think we could actually just use one address type (instead of secp256k1 and ed25519 having their own), but kept it separate for now.

As a team, it seems we are sort of stuck in between a rock and a hard place:

We need to support multiple key types, with various signature algorithms for the different ecosystems we need to support. For example, despite both supporting secp256k1, Ethereum and CometBFT use different signature algorithms. While we don't yet need to support the CometBFT version, I think we are likely in this place with ed25519 signatures between CometBFT and NEAR.
None of us are exactly experts in this type of stuff, so it's tough to quickly make an educated decision.
We don't have a ton of time to learn it.

Because of this, I have left it very general. It would be quite easy to add new signature algorithms in the future while maintaining backwards compatibility using this structure.

This can be found in pkg/crypto.

Serialization

As previously discussed, we have some major issues with json marshaling. Even forgetting the inefficiencies, the nondeterminism could really be a massive footgun that we don't run into until this thing is running. Furthermore, it's quite hard to provide backwards compatibility with that.

As a workaround, I have used Ethereum's RLP for serialization. There is an extra type appended to the front of all serialized messages, so that if we want to switch in the future / support more types of serialization (such as our own), we can do that with backwards compatibility. Backwards compatibility is my biggest concern here, hence this decision.

There are certainly some tradeoffs with RLP, but there were some strong things going for it:

It is battle tested, so it is very reliable, and has a much lower chance of providing us with bugs.
It is widely supported, with implementations in virtually every language. We don't need to make our own implementations in TypeScript/Python.
It is deterministic, as well as pretty compact (efficiency is not a main concern, but its a bonus).

The drawbacks:

It can't contain maps, (they are very hard to make deterministic) which are currently used in almost every payload. That being said, we have talked about wanting to remove them, but it's just forcing our hand.
It does not support any. The canonical way of handling this is using strings. Because of SQLite's typing system, this actually is not an issue, but still is not ideal.

I have already addressed these drawbacks for our specific case, as addressed in the next section.

This can be found in pkg/serialize/rlp, however if we choose to adopt it, it will move into pkg/serialize.

Payloads

The payloads issue is another recurring one. They were in entity (we discussed why this was bad), and until now have lived in serialize (which is absolutely not where they belong). The following was my knee-jerk solution, and the potential downsides are pretty obvious.

I have moved the payloads into a types package. @jchappelow, I'd love to get your thoughts on types packages; I see them all the time, but most software gurus say they are a bad.

This package only holds our payloads (maybe we can rename it to payloads), and keeps them separated there as a means of attempting to define a stable public API in one place (they can't be contained in protobuf because they get passed serialized). If we choose to adopt this, we should strictly limit this package to only our public API types, as the drawbacks of types packages with regards to coupling are easy to see.

This can be found in pkg/types

Conclusion

All of the above is implemented in this branch, with the exception of actual signature + verification algorithms. Since signatures are not Kwil specific (which is exactly the problem), I have sent up the bat signal to a few different devs who may be able to help us get it sorted out given the short time frame.

I definitely want to have everyone on board with this; I've made a ton of major suggestions recently, so if anyone thinks something in here is a bad idea, please feel empowered to speak up.

I tried to tie up the loose ends from the rest of the problems in these suggestions, so that hopefully this is the last massive shift we have to do. In addition to the "signatures" problem that we discussed earlier, this also tries to address:

the payloads issue
the serialization issue
the CometBFT validator keys being entirely separate citizens issue
being able to provide backwards compatibility (hopefully)

I really tried to thread the needle of these concerns, our timeline, and downstream effects this may have on tooling.

Let me know what you all think!

Yaiba · 2023-08-11T14:04:09Z

"Ethereum and CometBFT use different signature algorithms."

There is a hacky way use same public key to generate both Cosmos and Ethereum address, not sure about cometbft

Yaiba · 2023-08-11T14:10:15Z

pkg/crypto/ed25519.go

+package crypto
+
+import (
+	oasisEd25519 "github.com/oasisprotocol/curve25519-voi/primitives/ed25519"


Why not 'crypto/ed25519'?

I was trying to implement the signing algorithm used by CometBFT (they use this under the hood), in order to use validator keys.

We can definitely change the implementation of these, was just what I was toying around with.

jchappelow · 2023-08-11T14:21:44Z

pkg/crypto/signature.go

 type Signature struct {
-	Signature []byte        `json:"signature_bytes"`
+	Message   []byte `json:"message"`
+	Signature []byte `json:"signature_bytes"`
+	Sender    PublicKey


I find this struct a bit awkward, namely the inclusion of both the message and the pubkey itself.
pkg/transactions.SignedMessage contains this structure and at a glance appears to contain a copy of the message, although in interface form.
How is this Signature type contsructed and passed in practice? Same for SignedMessage. Will open this branch in an IDE to see if it's clearer.

I guess if you were to write out a doc string for this exported type, how would it read in terms of "this combines" and "so that"/"which facilitates" and "should be used by" etc.?

I see how the naming and semantics of this are confusing.

The "Message" contained here is actually a hash of the message contained within the transaction. Re-looking at this after stepping away, I agree that the "Message" field could either be renamed hash, or just dropped entirely (since it really is not needed).

The "Signature.Signature" also seems a bit weird too.

I can create a few basic constructors; the creation of them should be pretty easy.

Looping back on this; I agree that the message should not be contained in the Signature. What about the Sender?

Probably shouldn't have it here either IMO. For a signature, the norm is to define it as just the signature data by itself (either a byte slice or an r/s bigint pair). Then it has a verify method that accepts a hash/message and a pubkey (otherwise there's a pure function taking all three).

We also need the SignatureType field (or we define separate concrete types with their own implementations of a Signature interface), but the other stuff is not strictly necessary and more likely to confuse.

You have:

type PublicKey interface { Bytes() []byte Type() KeyType Verify(sign *Signature) error Address() Address }

so we can do

func (s *Signature) Verify() error { return s.Sender.Verify(s) }

I think you're mocking cometbft here with the PubKey interface, but I'm just not sure it's a great example to follow. Here's what cometbft's does in it's secp256k1.PubKey.VerifyMessage: https://github.com/cometbft/cometbft/blob/68c10d2ad0bb1015a7fb3e1283c50fb2e933fb9a/crypto/secp256k1/secp256k1.go#L190-L216. They literally re-parse the signature and do sig.Verify(...). It all feels circular.

It would be great to only have one Verify method we need to use. That is (*Signature).Verify should just work with the pubkey bytes and the signed data.

The other thing is KeyType vs SignatureType. It feels really weird and error prone that these aren't the same. We have two separate secp256k1 signature types, but that's really referring to variations in the DER encoding we're forced to deal with, not the bytes of the r/s components of the successfully-parsed signature, the interpretation of which isn't subject to any variation across implementations. By the time we construct our new Signature struct instance, we should be shed of the encoding variations, in a canonical format where there's no distinction between signature type and pubkey type.

jchappelow · 2023-08-11T14:26:32Z

pkg/types/schema.go

+
+type DropSchema struct {
+	Owner string
+	Name  string
+}
+
+func (s *DropSchema) Bytes() ([]byte, error) {
+	return rlp.Encode(s)
+}
+
+func (s *DropSchema) FromBytes(b []byte) error {


This is a reasonable pattern for defining a "payload". In my local workspace for the validator module, I have this:

// ExecuteActionPayload is a struct that represents the action execution type ExecuteActionPayload struct { Action string `json:"action"` DBID string `json:"dbid"` Params []map[string]any `json:"params"` } func (e *ExecuteActionPayload) Bytes() ([]byte, error) { return json.Marshal(e) } var _ encoding.BinaryUnmarshaler = (*ExecuteActionPayload)(nil) var _ encoding.BinaryMarshaler = (*ExecuteActionPayload)(nil) func (e *ExecuteActionPayload) UnmarshalBinary(b []byte) error { return json.Unmarshal(b, e) } func (e *ExecuteActionPayload) MarshalBinary() ([]byte, error) { return e.Bytes() }

In this case I've just chosen the encoding.BinaryUnmarshaler/encoding.BinaryMarshaler because it is defined and recognized around the standard library, but both Bytes and FromBytes are common. BTW, the semantics of io.Reader and io.Writer interfaces are totally confusing so I'm delighted you didn't go that way.

EDIT: updated the example code above, pasted wrongish thing

So in that case I've obviously continued to use JSON under the hood, but obviously the point is its a good idea to pick a generic interface and run with it.

This may be a stupid question, but are these lines just meant to ensure that it fulfills the interface?:

var _ encoding.BinaryUnmarshaler = (*ExecuteActionPayload)(nil) var _ encoding.BinaryMarshaler = (*ExecuteActionPayload)(nil)

Yah, it's mostly declarative. Expresses to the reader what you're trying to do, and makes a compile error if you mess with the impls.

jchappelow · 2023-08-11T14:35:53Z

pkg/crypto/secp256k1.go

+type Secp256k1Address struct {
+	address common.Address


This is what we do for our account addresses, but the address format is somewhat arbitrary. This might encourage us to forget about how users of other chains might interact with Kwil. I think we've talked about this, but I wanted to make a note.

Ideally it would be all pubkey, with a Kwil-defined address format for other uses like queries and accounts store, but I sense we cannot do that easily, or it completely breaks assumptions about how the users and sdks expect it to work. (They do not identify purely by pubkey, mainly for practical reasons surrounding wallet interactions, if I understand right.).

Yeah, not only do they not identify by pubkey, but they actually expect to be identified by their specific address.

So if some user from Ethereum uses a key, then they will expect that the caller on Kwil (which is exposed to the user in databases, via @caller) is the same as what they have in their Ethereum wallet.

brennanjl · 2023-08-11T16:37:08Z

I made the changes we discussed this morning:

Cleaned Up Signature

I'll blame the weird signature structure on the lack of sleep. It's actually exactly what it was before (minus the actual signature algorithms).

Separated Transactions and Signed Messages

Gavin brought up that we should separate these, and I agree. The one thing to note here is that, within Transaction, everything within the body is checked in the signature. The public key is kept out of this, since in clients like Metamask, this would require revealing the public key before signing, as opposed to using ecrecover afterwards. This does bring up the issue though of hash collisions; it is VERY possible that two users could create the exact same transaction hash.

No More Types

The types got moved into the transactions package. This was suggested by @jchappelow.

Yaiba · 2023-08-11T18:22:33Z

pkg/transactions/transaction.go

+// TransactionBody is the body of a transaction that gets included in the signature
+type TransactionBody struct {
+	// Payload are the raw bytes of the payload data
+	Payload rlp.SerializedData


Should we put dbid out of the payload?
Reasons are:

easy to identify which db this TX for (easy for explorer?) you don't need to look into the payload to know From and To

db engine only executes payload in different DB

I don't think so, because DBID is Dataset Module specific. The Validator Module should not need to know or care about it.

I think the confusion with "From" and "To" (when comparing with Ethereum) is that in Ethereum, a smart contract is an account, just like a wallet. We don't have that same functionality.

brennanjl · 2023-08-14T05:14:30Z

I made the changes discussed above. There are still two outstanding issues:

The CLI is not building. This is due to changes in the private keys.
Crypto is not working, and is simply stubbed out (however I know @jchappelow had some concerns with that, which I am fine with).

jchappelow · 2023-08-14T13:50:21Z

pkg/transactions/transaction.go

+	// hash of the transaction that is signed.  it is kept here as a cache
+	hash []byte


Had same thought. Just have to be cognizant of concurrent access. Slapping on a mutex might be tempting but it's usually best to first understand the access patterns and document the restrictions on the method (not thread safe), unless absolutely necessary.

jchappelow · 2023-08-14T13:52:55Z

In the following comment on the Payload field,

kwil-db/pkg/tx/message.go

Lines 12 to 15 in 2f99564

    
           // This was made after Transaction, and is made to be more general. 
        
           // Unlike Transaction, SignedMessage contains a deserialized payload 
        
           type SignedMessage[T Serializable] struct { 
        
           	Payload   T // we use generic here to give access to the underlying struct fields

to what is it referring? I can do some sleuthing, but where's access to the underlying struct fields happening? Trying to see why Payload Serializable isn't the preferable definition here.

brennanjl · 2023-08-14T14:12:36Z

I think you may be seeing an old version. pkg/tx got renamed to pkg/_tx, to remove it from being compiled. A few of the packages that have changed a lot I have left there temporarily, to allow people to see the differences.

The package that contains the transactions named in this commit is pkg/transactions. I renamed it because I actually didn't really like the tx package name, since it is a common variable name and screws with a lot of linters.

…ypes

sonarcloud · 2023-08-14T15:18:40Z

Kudos, SonarCloud Quality Gate passed!

0 Bugs
0 Vulnerabilities
0 Security Hotspots
6 Code Smells

No Coverage information
0.0% Duplication

Yaiba

I think we can start to work on those changes to slowly integrate our code

* added a new theoretical structure for transactions, signatures, and types * added drop schema payload * made changes discussed this morning * modified repo to reflect new transaction structure * updated CLI to use stubbed out methods. Should now be building * removed old packages

Yaiba reviewed Aug 11, 2023

View reviewed changes

jchappelow reviewed Aug 11, 2023

View reviewed changes

Yaiba reviewed Aug 11, 2023

View reviewed changes

brennanjl force-pushed the new-transactions branch from 9ade0b3 to 3091e50 Compare August 14, 2023 05:12

jchappelow mentioned this pull request Aug 14, 2023

add validator store and module for ABCI app #182

Merged

jchappelow reviewed Aug 14, 2023

View reviewed changes

brennanjl marked this pull request as ready for review August 14, 2023 15:17

brennanjl added 8 commits August 14, 2023 10:17

added a new theoretical structure for transactions, signatures, and t…

1b764e5

…ypes

added drop schema payload

e275709

made changes discussed this morning

a747d9c

minor changes

d1f50d2

modified repo to reflect new transaction structure

2d20043

updated CLI to use stubbed out methods. Should now be building

d324858

removed old packages

47e0f86

removed old packages

6612784

brennanjl force-pushed the new-transactions branch from 83b16a4 to 6612784 Compare August 14, 2023 15:17

Yaiba approved these changes Aug 14, 2023

View reviewed changes

jchappelow approved these changes Aug 14, 2023

View reviewed changes

Yaiba merged commit c78aa8d into main Aug 14, 2023
1 of 3 checks passed

Yaiba deleted the new-transactions branch August 14, 2023 16:25

Yaiba mentioned this pull request Aug 17, 2023

Replace action interface from maps to tuples #156

Closed

Yaiba mentioned this pull request Aug 29, 2023

potential replay attack on view mustsign action #138

Closed

jchappelow added this to the v0.6.0 milestone Sep 28, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

new transactions #178

new transactions #178

brennanjl commented Aug 11, 2023 •

edited

Loading

Yaiba commented Aug 11, 2023

Yaiba Aug 11, 2023

brennanjl Aug 11, 2023

jchappelow Aug 11, 2023

jchappelow Aug 11, 2023

brennanjl Aug 11, 2023

brennanjl Aug 11, 2023

jchappelow Aug 11, 2023 •

edited

Loading

jchappelow Aug 11, 2023 •

edited

Loading

jchappelow Aug 11, 2023

brennanjl Aug 11, 2023

Yaiba Aug 11, 2023

jchappelow Aug 11, 2023

jchappelow Aug 11, 2023

brennanjl Aug 11, 2023

brennanjl commented Aug 11, 2023 •

edited

Loading

Yaiba Aug 11, 2023 •

edited

Loading

brennanjl Aug 11, 2023

brennanjl Aug 11, 2023

brennanjl commented Aug 14, 2023

jchappelow Aug 14, 2023

jchappelow commented Aug 14, 2023

brennanjl commented Aug 14, 2023

sonarcloud bot commented Aug 14, 2023

Yaiba left a comment

		// hash of the transaction that is signed. it is kept here as a cache
		hash []byte

new transactions #178

new transactions #178

Conversation

brennanjl commented Aug 11, 2023 • edited Loading

Transactions

Signatures

Serialization

Payloads

Conclusion

Yaiba commented Aug 11, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jchappelow Aug 11, 2023 • edited Loading

Choose a reason for hiding this comment

jchappelow Aug 11, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

brennanjl commented Aug 11, 2023 • edited Loading

Cleaned Up Signature

Separated Transactions and Signed Messages

No More Types

Yaiba Aug 11, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

brennanjl commented Aug 14, 2023

Choose a reason for hiding this comment

jchappelow commented Aug 14, 2023

brennanjl commented Aug 14, 2023

sonarcloud bot commented Aug 14, 2023

Yaiba left a comment

Choose a reason for hiding this comment

brennanjl commented Aug 11, 2023 •

edited

Loading

jchappelow Aug 11, 2023 •

edited

Loading

jchappelow Aug 11, 2023 •

edited

Loading

brennanjl commented Aug 11, 2023 •

edited

Loading

Yaiba Aug 11, 2023 •

edited

Loading