eth/filters, interfaces.go: EIP-234 Add blockHash option to eth_getLogs #16734

reductionista · 2018-05-12T21:38:56Z

Adds a blockHash option to params field of eth_getLogs JSON-RPC call.

Returns logs matching filter criteria only for all transactions in the block with hash blockHash. This works the same as if you specify fromBlock= toBlock = (# of block whose hash is blockHash), but allows the call to work if the RPC client knows only the block hash but not the block number.

Returns invalid param error if both blockHash and either fromBlock or toBlock are specified.

This implements EIP-234, which includes details on the motivation for it.

I've tested this and it works both for a full node and for a light node, as long as they have finished initial syncing. See some examples here.

holiman

LGTM!

reductionista · 2018-06-01T03:04:32Z

@karalabe Just a reminder, that this is all ready to merge.

Because it's only a wrapper around current functionality that adds a new way of calling the same thing, very low risk that it could negatively affect existing functionality.

tinybike · 2018-06-13T20:43:09Z

@karalabe this is something that is pretty important for augur, any idea when this will be merged?

reductionista · 2018-06-13T22:15:07Z

I've added documentation for this option to the JSON-RPC wiki . I've marked it as "future" in bold; will remove that once this is merged.

karalabe · 2018-06-14T10:33:30Z

I'm not sure this PR fully implements the desired functionality. It assumes that blockHash will be from the canonical chain, because it just converts the hash into a number and start filtering that way. However if the hash is from a side chain, it will filter the canonical block corresponding to the same height, not the side block.

reductionista · 2018-06-14T21:52:45Z

@karalabe Oh, you're right! A big oversight on my part.

I think all I need to add here is a check to make sure the block requested is part of the canonical chain. If it's not, it should just return an error instead of the logs.

@MicahZoltu I hope this behavior is acceptable to you. If you need side-chain logs to be returned, that would be a much larger change, and I'm not even sure it would be possible (as I don't think there is any guarantee that logs from side-chain blocks are stored permanently). Please confirm that returning an error when the requested block hash is not part of the canonical block chain would solve the underlying client problem this is aimed at solving.

MicahZoltu · 2018-06-15T04:32:42Z

It should return an error if the block for the given hash is not found. It would be very desirable if it could pull logs that are still available and return them, even if the block isn't part of the current canonical chain, e.g., if the block just got reorgs out and hasn't yet been deleted/pruned.

I think in the majority of cases where this matters, the user is asking for logs of a block they just got told about and even if a reorg happened in the meantime the node very likely still has the block hanging around (though, certainly I understand that there are no guarantees here).

However, if that isn't possible then a "not-found" error response seems appropriate.

MicahZoltu · 2018-06-15T04:37:59Z

Side note: I have actually solved my specific problem client side, but if I hadn't then returning an error for blocks that just got reorged out would not have solved the problem. The problem we were facing is that reorgs (which are common) resulted in sometimes racing and asking for logs of a block that just got reorged out.

This is further complicated by the fact that the block could get reorged back in before we fetch the next block, so we can't just drop the logs for that block, we would have to drop the whole block. If we could get the logs for the block, then either it will stay reorged out and we'll replace the whole block soon enough or it will get reorged back in and we'll have gotten the logs for it already.

reductionista · 2018-06-15T21:14:05Z

@MicahZoltu thanks for explaining.

Today I looked through the source code more thoroughly, and apparently geth does keep all block receipts permanently, even if they were removed from the chain. So I withdraw my comment about it potentially being impossible (although there still may be no guarantees about this in the protocol itself, in future versions of geth, or in other clients). However, I'm trying to get EIP758 finished before I start a new job in July, and unfortunately I can't afford to take a break right now for a week to add this additional functionality to EIP234.

"This is further complicated by the fact that the block could get reorged back in before we fetch the next block, so we can't just drop the logs for that block, we would have to drop the whole block."

So, would I be correct in saying that this case of a block being removed from the chain temporarily and then added back in again would be the only known instance so far where receiving the logs from non-canonical blocks would be useful? It seems like the same thing could be accomplished by keeping the block, but re-requesting the logs if and when it gets reorg'd back in, right? Is that difficult or cumbersome for a client to implement? Just trying to get a sense of how important this additional functionality would be compared to the rest. Also curious to hear @karalabe's thoughts on it. One possibility is to merge in what we have now, and then if and when I get a chance later I will add the rest.

MicahZoltu · 2018-06-16T06:40:04Z

It seems like the same thing could be accomplished by keeping the block, but re-requesting the logs if and when it gets reorg'd back in, right?

It is possible to never witness the block getting reorged out or back in. Take the following example:

Block 123 hash 0xabc is announced to client

2. Client asks for logs of block 123 3. Server responds with logs from block 0xdef 4. Client asks for block 123 5. Server responds with block 123 hash 0xabc

In this example, the only hint of a reorg the client has is that the logs it got back don't match the logs it expected. Otherwise, it always witnesses the 0xabc chain when talking to the node. The problem here is that if it gets back no logs then it won't witness the reorg and it won't know to throw out the block!

There are two ways for the client to deal with this:

Throw away any block that the client cannot get logs for, this requires the client have a way to identify whether they actually got logs from the expected block or not.
Talk to a server that will give it logs for blocks outside the canonical chain.

MicahZoltu · 2018-06-16T06:41:00Z

I do think that what you have now is significantly better than nothing. Even with my recent client side fix, I just realized that there are still failure scenarios where the client will miss some logs unless what you have is merged.

reductionista · 2018-06-16T16:34:58Z

@MicahZoltu Ok, if I understand correctly, the double-reorg scenario you've described is currently a problem, but would be solved if my present implementation of EIP-234 is merged:

Block 123 hash 0xabc is announced to client
Client requests logs for blockHash 0xabc
Server responds with error: "Block with hash 0xabc was removed from canonical chain."
Client waits until new announcements come, since it's now aware this block was removed.
Block 123 hash 0xdef is announced to client
Client requests logs for blockHash 0xdef
Server responds with error: "Block with hash 0xdef was removed from canonical chain."
Client waits again, since 0xdef was also removed.
Block 123 hash 0xabc is announced to client
Client requests logs for blockHash 0xabc
Server returns logs for blockHash 0xabc
...
The server letting the client know when it's trying to request logs for a removed block allows the two to synchronize, whereas before there was a race condition where if the 2nd two announcements get delayed, invalid logs could be returned and confuse the client.

The main question I'm wondering is whether there is an example of a case where my current implementation would not fix the problem. Based on your most recent comment, it sounds like the answer is no, even though adding the ability to receive logs for removed blocks might potentially be useful in some other cases (that Augur currently doesn't care about). Is that right?

MicahZoltu · 2018-06-17T03:55:50Z

@reductionista You are correct. With the changes I recently made to ethereumjs-blockstream I am now able to solve all known problems except the one described, which would be solved by the node returning an error instead of the logs for the wrong block. Thus, I am satisfied with the solution you have. 😄

reductionista · 2018-06-28T05:01:12Z

@karalabe Please advise if anything else needs to be done in order for this to be merged.

karalabe · 2018-07-12T14:58:12Z

The PR didn't solve the issue it set out to solve. If I understand correctly, the original issue was that we wanted to filter for some logs, but the returned logs were from a different block. The PR in the current implementation verified whether the block hash is part of the canonical chain, and if so, did a filter based on block number. But a reorg can happen after verifying that it's on the canonical chain, but before filtering it. That would lead to the exact same erroneous behavior.

The only way to properly implement this EIP is to extend the filtering mechanism itself so that it can operate either on block numbers or on block hashes. Without explicit support for hash based filtering, this PR will forever have a racey behavior.

I've pushed a commit on top which extends the low level filtering itself to be able to do both range queries as well as block hash queries. Could please all of you verify that a) the new mechanism works as expected and b) the old one still works (i.e. that I didn't bork anything)?

holiman

Have not tested the PR, but the changes LGTM

karalabe · 2018-07-24T09:47:37Z

@reductionista @tinybike @MicahZoltu Btw, I didn't know there was a bounty put on this PR, nor do I want to have a claim in it. The reason I figured I'll fix up the code is to get this merged in faster since we went back and forth for a long time already. The original contribution IMHO did a good job, alas didn't have the full picture. Nonetheless it's because of the original contribution that this PR gets merged, so feel free to award the full bounty to @reductionista.

reductionista · 2018-07-25T05:43:18Z

@karalabe, you are extremely generous!!

Thank you. I will avoid more back-and-forth and accept your generosity. If you ever change your mind and decide you want your half of it, just let me know and I'll send it your way! Assuming I haven't foolishly cashed it in before it goes to the moon, of course. ;-)

ethereum#16734 introduced BlockHash to the FilterQuery struct. However, the ethclient.go was not updated to include BlockHash in the actual rpc request.

#16734 introduced BlockHash to the FilterQuery struct. However, ethclient was not updated to include BlockHash in the actual RPC request.

ethereum/go-ethereum#16734 introduced BlockHash to the FilterQuery struct. However, ethclient was not updated to include BlockHash in the actual RPC request.

…reum#16734)

reductionista requested a review from karalabe as a code owner May 12, 2018 21:38

reductionista force-pushed the eip234 branch from 2c0db1d to 2722420 Compare May 15, 2018 03:07

holiman approved these changes May 25, 2018

View reviewed changes

MicahZoltu mentioned this pull request Jun 13, 2018

Error while using zeroEx.exchange.subscribe: received log with same block number but index newer than previous index 0xProject/0x-monorepo#693

Closed

reductionista force-pushed the eip234 branch from 2722420 to 3611ae3 Compare June 15, 2018 02:00

karalabe force-pushed the eip234 branch from d985573 to b63f877 Compare July 12, 2018 14:56

reductionista requested a review from zsfelfoldi as a code owner July 12, 2018 14:56

reductionista and others added 2 commits July 12, 2018 18:16

eth/filters, ethereum: EIP-234 add blockHash param for eth_getLogs

96339da

accounts, eth, les: blockhash based filtering on all code paths

e1f1d30

karalabe force-pushed the eip234 branch from b63f877 to e1f1d30 Compare July 12, 2018 15:17

karalabe added this to the 1.8.13 milestone Jul 12, 2018

holiman approved these changes Jul 14, 2018

View reviewed changes

karalabe merged commit 21c059b into ethereum:master Jul 24, 2018

fabioberger mentioned this pull request Jul 30, 2018

Add blockHash option to eth_getLogs (EIP 234) openethereum/parity-ethereum#9251

Closed

This was referenced Aug 1, 2018

Race condition ethereumjs/ethereumjs-blockstream#10

Closed

Implement EIP-234 trufflesuite/ganache#136

Closed

Implement EIP-234's get logs by blockHash XLNT/gnarly#38

Open

fabioberger mentioned this pull request Sep 21, 2018

Fix dropped events issue in Order-watcher and Contract-wrappers subscriptions 0xProject/0x-monorepo#1080

Merged

5 tasks

tamirms mentioned this pull request Oct 28, 2018

ethclient: include block hash from FilterQuery #17996

Merged

fjl pushed a commit that referenced this pull request Nov 8, 2018

ethclient: include block hash from FilterQuery (#17996)

b16cc50

#16734 introduced BlockHash to the FilterQuery struct. However, ethclient was not updated to include BlockHash in the actual RPC request.

yoomee1313 mentioned this pull request Dec 23, 2020

abi: import files from latest geth release klaytn/klaytn#815

Merged

9 tasks

kjeom mentioned this pull request Nov 1, 2022

Add block hash feature for getLogs api klaytn/klaytn#1653

Merged

9 tasks

gzliudan mentioned this pull request Mar 14, 2024

improve package eth/filter XinFinOrg/XDPoSChain#491

Merged

19 tasks

gzliudan added a commit to gzliudan/XDPoSChain that referenced this pull request Mar 18, 2024

accounts, eth, les: blockhash based filtering on all code paths (ethe…

a4b557b

…reum#16734)

ibrahimkhled mentioned this pull request Oct 13, 2024

Have not tested the PR, but the changes LGTM Tenderly/tenderly-cli#192

Open

c98tristan mentioned this pull request Nov 22, 2024

Support eth_getLogs query with blockHash params. BuildOnViction/victionchain#480

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

eth/filters, interfaces.go: EIP-234 Add blockHash option to eth_getLogs #16734

eth/filters, interfaces.go: EIP-234 Add blockHash option to eth_getLogs #16734

reductionista commented May 12, 2018 •

edited

Loading

holiman left a comment

reductionista commented Jun 1, 2018

tinybike commented Jun 13, 2018

reductionista commented Jun 13, 2018

karalabe commented Jun 14, 2018

reductionista commented Jun 14, 2018

MicahZoltu commented Jun 15, 2018

MicahZoltu commented Jun 15, 2018

reductionista commented Jun 15, 2018

MicahZoltu commented Jun 16, 2018

MicahZoltu commented Jun 16, 2018 •

edited

Loading

reductionista commented Jun 16, 2018

MicahZoltu commented Jun 17, 2018

reductionista commented Jun 28, 2018

karalabe commented Jul 12, 2018

holiman left a comment

karalabe commented Jul 24, 2018 •

edited

Loading

reductionista commented Jul 25, 2018

eth/filters, interfaces.go: EIP-234 Add blockHash option to eth_getLogs #16734

eth/filters, interfaces.go: EIP-234 Add blockHash option to eth_getLogs #16734

Conversation

reductionista commented May 12, 2018 • edited Loading

holiman left a comment

Choose a reason for hiding this comment

reductionista commented Jun 1, 2018

tinybike commented Jun 13, 2018

reductionista commented Jun 13, 2018

karalabe commented Jun 14, 2018

reductionista commented Jun 14, 2018

MicahZoltu commented Jun 15, 2018

MicahZoltu commented Jun 15, 2018

reductionista commented Jun 15, 2018

MicahZoltu commented Jun 16, 2018

MicahZoltu commented Jun 16, 2018 • edited Loading

reductionista commented Jun 16, 2018

MicahZoltu commented Jun 17, 2018

reductionista commented Jun 28, 2018

karalabe commented Jul 12, 2018

holiman left a comment

Choose a reason for hiding this comment

karalabe commented Jul 24, 2018 • edited Loading

reductionista commented Jul 25, 2018

reductionista commented May 12, 2018 •

edited

Loading

MicahZoltu commented Jun 16, 2018 •

edited

Loading

karalabe commented Jul 24, 2018 •

edited

Loading