WIP: Queriers for staking to increase Gaia-lite performance #2139

fedekunze · 2018-08-24T15:26:55Z

Ref: #2078
Fixes #2009

Targeted PR against correct branch (see CONTRIBUTING.md)
Linked to github-issue with discussion and accepted design OR link to spec that describes this work.
Wrote tests
Updated relevant documentation (docs/)
Added entries in PENDING.md with issue #
reviewed Files changed in the github PR explorer

For Admin Use:

Added appropriate labels to PR (ex. wip, ready-for-review, docs)
Reviewers Assigned
Squashed all commits, uses message "Merge pull request #XYZ: [title]" (coding standards)

codecov · 2018-08-24T15:34:29Z

Codecov Report

Merging #2139 into develop will decrease coverage by 0.8%.
The diff coverage is 17.84%.

@@            Coverage Diff             @@
##           develop   #2139      +/-   ##
==========================================
- Coverage    63.91%   63.1%   -0.81%     
==========================================
  Files          134     135       +1     
  Lines         8194    8348     +154     
==========================================
+ Hits          5237    5268      +31     
- Misses        2604    2727     +123     
  Partials       353     353

x/stake/queryable.go

codecov · 2018-08-27T18:25:46Z

Codecov Report

❗ No coverage uploaded for pull request base (develop@d214952). Click here to learn what that means.
The diff coverage is 44.69%.

@@            Coverage Diff            @@
##             develop   #2139   +/-   ##
=========================================
  Coverage           ?   64.1%           
=========================================
  Files              ?     142           
  Lines              ?    8808           
  Branches           ?       0           
=========================================
  Hits               ?    5646           
  Misses             ?    2758           
  Partials           ?     404

alexanderbez

Left a few comments @fedekunze -- I think it'll be good after this.

x/stake/keeper/delegation.go

x/stake/keeper/validator.go

ValarDragon · 2018-09-01T20:32:29Z

In the future, lets please not combine refactors with actual functionality changes. I'm having a very hard time distilling the actual functionality change. (Aside from the delegation stuff, It seems its some bech32 stuff, but there aren't comments anywhere motivating this, nor discussion in the issue / statement at the top of the PR? It probably makes sense, just not indicating it increases time to grok) Reviewing by diff breaks down as we grow PR size. To quote Anton, quality of PR review goes down as PR size grows.

I.E. The bech32validator stuff is in the same PR, as something thats change the capitalization as some log messages, which is in the same PR as stuff that changes the name of a ton function parameters names. These all sound reasonable, but they don't need to be evaluated together with the delegation speedups.

I saw I was requested to review this. Is it important that I do so? If so I think this should be split up into multiple PR's. Optimizing for quality of PR review is crucial, and something I think we need to start enforcing. I totally understand that slowness of merging things / annoyances of branching make including more refactors into the same PR something that naturally happens. I think as a team we just need to get better at pointing this out and avoiding it. (I'm guilty of it as well!)

I'm not sure this is something thats important for me to review though, as I don't do anything on this part of the codebase, and rige, sunny and bez are reviewing this.

fedekunze · 2018-09-02T11:20:07Z

@ValarDragon
First of all, this IS a refactor. It's about changing the actual implementation of the staking Lite endpoints by introducing Queriers.

Second, I'm fine with splitting PRs into several issues and in general I'm completely in favour of it, that's why I opened #2182 and #2202. Nevertheless, when the PR has been sitting for more than a week waiting to be merged and with more than 3 reviews already, it's definitely a waste of time for the author of the PR. If you want to propose splitting the PR into several ones please do so in the first days that the PR has been labeled as R4R.

I've already discussed with @faboweb that this slowness in merging PRs because of things like this is blocking our work in Voyager. This in particular fixes #2009 which is currently slowing GET requests on the validators page in Voyager. I'm literally working during weekends because we need to move forward with this PR. @alexanderbez and @cwgoes one of the only ones who're actually continuously reviewing PRs and speeding things up. Thank you so much @alexanderbez for reviewing even when sometimes you haven't been requested for review !

Also, you don't need to review the PR if you don't want. I requested a review from you because I think you are bright and smart and because your input can definitely be valuable here, specially in what's related to this comment #2139 (comment).

Cheers

ValarDragon · 2018-09-02T16:30:14Z

Theres a lot of "clearly correct" parts of this PR, that if were in a separate PR would get merged extremely quickly. I'll link some examples.

All of these are clearly correct, and are just refactoring / minor changes that are independent from the Querier feature. If a PR was made of just those, it would likely get merged within 48 hours. But more importantly, it would reduce the diff here by like 200 - 400 lines, which makes it substantially easier to review. (And therefore get this merged faster) Normally including of the minor things isn't a big deal. It only starts being a problem when the PR size grows to be very large. You can even base your second PR on the refactoring PR.

Something that would have made this immensely easier to review would have just been doing a find/replace for delAddr -> delegatorAddr and valAddr -> validatorAddr in a separate PR, as its that change in particular that changed the diff size so much.

I totally understand you've put tons of work went into this. I also see why requests to split it up are frustrating. There isn't any need to do it this time, as people have already reviewed it, but note that when that comment was made, the PR was only R4R for 3 days. However further splitting up PR's similar to this in the future would make them get reviewed / merged much more quickly. (hence alleviating I've already discussed with @faboweb that this slowness in merging PRs because of things like this is blocking our work in Voyager.) Thanks for already starting to scope the PR's by splitting into more issues.

As a separate point of discussion, could we unblock voyager by making a "voyager branch" of the SDK. Then have voyager work of the voyager branch, and you could merge any PR you needed into that branch immediately. (This can be done via command line, no github PR required) When relevant aspects get merged to develop, delete the voyager branch and remake it off of develop again?

(e.g. proposed flow is: Create voyager branch, create feature branch off develop. PR feature branch into develop, just merge it into voyager branch immediately. Once feature is in develop, reset voyager branch to develop.)

ValarDragon · 2018-09-02T16:25:29Z

x/stake/keeper/validator.go

 	}
 	iterator.Close()
-	return validators
+	return validators[:i] // trim


This only adds i items to the validators array, is there a reason to trim it?

x/stake/client/rest/query.go

rigelrozanski

Alllllllright! - I went through this PR in reasonable detail for design (I didn't go through the new tests in detail however - but they look clean which is good)

There are a number of structural changes which I'd requested in my review. My biggest quarrel is that there is a lot of code bloat introduced into the keeper which I am windly against - a lot of bloat can be moved into functional attributes of the the objects (and kept in x/stake/types/ maybe we will want to keep some things in the stake keeper but we should add that to a query_utils.go.

Additionally I request that the all changes to x/gov/ be moved to a new PR just to keep this one clean.

Lastly, there is a bit of refactor stuff which I request is removed from this PR (I've made specific comments throughout)

rigelrozanski · 2018-09-02T20:16:24Z

x/stake/keeper/validator.go

-		if !iterator.Valid() {
-			break
-		}
+	for ; iterator.Valid() && (!retrieve || (retrieve && i < int(maxRetrieve[0]))); iterator.Next() {


I'm quite sure we don't want to combine GetValidators and GetAllValidators - in reality GetAllValidators should rarely be getting called and is kind of an edge case for exporting the state of the entire blockchain, everything else should retrieve predetermined maximum number of validators

rigelrozanski · 2018-09-02T20:20:39Z

x/stake/keeper/validator.go

-// Get the set of all validators, retrieve a maxRetrieve number of records
-func (k Keeper) GetValidators(ctx sdk.Context, maxRetrieve int16) (validators []types.Validator) {
+// Get the set of all validators.  If maxRetrieve is supplied, the respective amount will be returned.
+func (k Keeper) GetBechValidators(ctx sdk.Context, maxRetrieve ...int16) (validators []types.BechValidator) {


As per my previous comment we should rarely be using GetAllValidators so it should be its own edge case. But also, let's remove use of this function altogether, it bloats the keeper - what we should really be doing is just converting the validators to bech once we've retrieved them from GetValidators.

With that implementation we'd have to iterate the validator array twice, one to retrieve the validators and another to convert each of them into BechValidator ...

x/stake/keeper/validator.go

x/stake/keeper/keeper.go

rigelrozanski · 2018-09-02T21:15:25Z

PENDING.md

-      * A new bech32 prefix has been introduced for Tendermint signing keys and
-        addresses, `cosmosconspub` and `cosmoscons` respectively.
-
+          * A new bech32 prefix has been introduced for Tendermint signing keys and addresses, `cosmosconspub` and `cosmoscons` respectively.


unnecessary/ undesirable

https://github.com/cosmos/cosmos-sdk/pull/2103/files if we do introduce new bech32's, it would be cosmosval and cosmosvalpub, not cosmoscons* just fyi.

I think maybe there's something wrong with the sdk impl right now... i don't think the validator operator address should be anything different than a normal address.

@jaekwon @rigelrozanski @ValarDragon @alexanderbez Currently BechValidator is not using bech32 prefix for sdk.ValAddress, just hex. Also its PubKey value uses cosmosconspub prefix. Is that ok ? Because I thought it was supposed to be using cosmosvalpub instead

PENDING.md

rigelrozanski · 2018-09-02T21:22:01Z

x/gov/queryable.go

-	err2 := keeper.cdc.UnmarshalJSON(req.Data, proposalID)
-	if err2 != nil {
-		return []byte{}, sdk.ErrUnknownRequest(fmt.Sprintf("incorrectly formatted request data - %s", err2.Error()))
+	errRes := keeper.cdc.UnmarshalJSON(req.Data, proposalID)


Please undo all these refactor changes (for now) to use errRes and res it's bloating this PR - maybe make a refactor PR after this? - Although, I also don't really agree with this modification either - but we can discuss in the next PR

x/stake/client/rest/query.go

x/stake/client/rest/utils.go

rigelrozanski · 2018-09-02T21:38:02Z

p.s. I just assigned myself to this issue, I'll gladly merge once comments addressed

rigelrozanski · 2018-09-02T21:48:12Z

Also note @fedekunze @faboweb - Voyager should not be blocked on things like merging a PR - The voyager team should work off this branch / their own updated branch which contains any new features they want to test out. Rushing merging PR's like this is a big nono - there are it a LOT going on here and requires thorough review. Previously there were staking rest bugs which were introduced with an LCD refactor PR (#1880) which I'm pretty sure was "rushed" to merge - in this previous PR I had not actually finished reviewing it yet but it got merged. Subsequently I had to fix dem bugs, which will happen from time to time and IS OKAY and expected, but you know, I would have rather caught them in review.

fedekunze · 2018-09-02T22:21:44Z

@rigelrozanski what do you think of my comment ?

validator.BechValidator() is returning the ValAddress in hex for the operator and I'm pretty sure the prefix for the pubkey is also wrong. It's using cosmosconspub were it should be cosmosvalpub.

fedekunze · 2018-09-03T08:48:27Z

@rigelrozanski @ValarDragon

The voyager team should work off this branch / their own updated branch which contains any new features they want to test out

I don't think that's currently an option. We are behind schedule with staking in Voyager and we don't have capacity to maintain another branch. Besides, if we want those changes to be on develop we'd still have to edit the code to address the comments from PR review, making the branch redundant.

fedekunze · 2018-09-03T08:49:54Z

@rigelrozanski @ValarDragon @alexanderbez FYI I'll update the PR here and split it once I've addressed Rigel's comments. Thanks for your reviews

jaekwon · 2018-09-03T09:19:47Z

This PR I think is poking at some structural issues we have with the SDK/LCD system.

Before, the LCD handlers were querying the store (a cryptographically verifiable operation using merkle proofs...) so it would have been possible for the LCD to verify everything along the way... but this refactor is trying to push all the logic into keepers.

I'm not sure why this PR makes things faster -- as the comments say, there are a lot of concerns that are mixed in here and it makes for a big PR that is hard to reason about. Why does this PR increase performance?

Anyways, I'm starting to think that the LCD should maybe be replaced with a full node w/ queriers, so that Voyager etc can query any full node directly. There's no point to the LCD if it's just going to proxy the result of a full node's Querier. If we need this for performance reasons, then we might as well bypass the LCD for now until we figure out how to efficiently call the Querier from an LCD (which I think we can do by moving logic into the Keeper as this PR does, and then calling the keeper with a cliCtx which has Stores that are RemoteStores which handles Merkle proofs).

So I'm in favor of moving logic into keepers, and I think we want to have Voyager just query the full node to avoid unnecessary proxying logic that defeats the point of an LCD in the first place.

Also, @fedekunze why is this PR an improvement in speed? Is it because we're avoiding multiple REST query calls inside of a loop or something? Could Voyager just query the full node instead of proxying through LCD?

We have an SDK standup at 10am PDT M/W/F. Can you make that, or someone familiar with Voyager's requirements of the LCD?

fedekunze · 2018-09-07T10:59:21Z

Splitting and moving to #2249 and #2259

Cherry picked commits from prev branch

2ca7e61

fedekunze added C:x/staking wip labels Aug 24, 2018

fedekunze requested review from cwgoes, ebuchman and rigelrozanski as code owners August 24, 2018 15:26

fedekunze mentioned this pull request Aug 24, 2018

WIP: Fix slow fetch delegations #2078

Closed

9 tasks

Federico Kunze added 2 commits August 24, 2018 20:47

Added new keepers for querier functionalities

5e285e1

Renaming

ec13fbe

fedekunze self-assigned this Aug 24, 2018

Federico Kunze added 4 commits August 24, 2018 21:15

Fixed gov errors and messages

3eacdfa

Added Querier to stake and app

682008f

Update delegation keepers

ceae374

REST Queriers not working

9ba98ab

rigelrozanski reviewed Aug 27, 2018

View reviewed changes

x/stake/queryable.go Outdated Show resolved Hide resolved

Fix marshalling error

8bf6b2c

faboweb mentioned this pull request Aug 28, 2018

Slow GET /stake/delegators/{addr} #2009

Closed

4 tasks

Federico Kunze added 8 commits August 28, 2018 15:54

Querier tests working

cd0ca86

Pool and params working

01304c9

sdk.NewCoin for test handler

7889374

Refactor and renaming

d3b6c3e

Update LCD queries and added more tests for queriers

5fb1547

use sdk.NewCoin

8cfc507

Delegator summary query and tests

044b851

Added more tests for keeper

685f05a

fedekunze changed the title ~~WIP: Queriers for staking to increase Gaia-lite performance~~ R4R: Queriers for staking to increase Gaia-lite performance Aug 29, 2018

fedekunze requested a review from faboweb August 29, 2018 14:23

alexanderbez approved these changes Aug 31, 2018

View reviewed changes

x/stake/keeper/delegation.go Outdated Show resolved Hide resolved

x/stake/keeper/validator.go Outdated Show resolved Hide resolved

Federico Kunze added 2 commits September 1, 2018 20:03

Changed Bech validator

99acd57

Updated remaining tests and bech32 validator

8abb44e

fedekunze requested a review from ValarDragon September 1, 2018 20:18

Merge branch 'develop' into fedekunze/2009-queriers-staking

55ff4b7

ValarDragon reviewed Sep 2, 2018

View reviewed changes

rigelrozanski suggested changes Sep 2, 2018

View reviewed changes

rigelrozanski self-assigned this Sep 2, 2018

Addressed most of Rigel's comments

450539b

cwgoes mentioned this pull request Sep 3, 2018

R4R: Governance CLI uses Querier #2141

Merged

5 tasks

fedekunze changed the title ~~R4R: Queriers for staking to increase Gaia-lite performance~~ WIP: Queriers for staking to increase Gaia-lite performance Sep 4, 2018

fedekunze added wip and removed ready-for-review labels Sep 4, 2018

Federico Kunze added 3 commits September 6, 2018 11:20

Updated tests and types

61efdf6

Make codec to be unexported from keeper

0bd0417

Moved logic to query_utils and updated tests

8a2d144

This was referenced Sep 6, 2018

R4R: Staking Querier pt1 #2249

Merged

R4R: Minor changes on slashing logs and gov Querier #2259

Merged

fedekunze closed this Sep 7, 2018

fedekunze deleted the fedekunze/2009-queriers-staking branch September 7, 2018 10:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: Queriers for staking to increase Gaia-lite performance #2139

WIP: Queriers for staking to increase Gaia-lite performance #2139

fedekunze commented Aug 24, 2018 •

edited

Loading

codecov bot commented Aug 24, 2018 •

edited

Loading

codecov bot commented Aug 27, 2018 •

edited

Loading

alexanderbez left a comment

ValarDragon commented Sep 1, 2018 •

edited

Loading

fedekunze commented Sep 2, 2018

ValarDragon commented Sep 2, 2018 •

edited

Loading

ValarDragon Sep 2, 2018

rigelrozanski left a comment

rigelrozanski Sep 2, 2018 •

edited

Loading

rigelrozanski Sep 2, 2018

fedekunze Sep 3, 2018

rigelrozanski Sep 2, 2018

jaekwon Sep 3, 2018

jaekwon Sep 3, 2018

fedekunze Sep 3, 2018 •

edited

Loading

rigelrozanski Sep 2, 2018

rigelrozanski commented Sep 2, 2018

rigelrozanski commented Sep 2, 2018

fedekunze commented Sep 2, 2018

fedekunze commented Sep 3, 2018

fedekunze commented Sep 3, 2018

jaekwon commented Sep 3, 2018

fedekunze commented Sep 7, 2018

WIP: Queriers for staking to increase Gaia-lite performance #2139

WIP: Queriers for staking to increase Gaia-lite performance #2139

Conversation

fedekunze commented Aug 24, 2018 • edited Loading

codecov bot commented Aug 24, 2018 • edited Loading

Codecov Report

codecov bot commented Aug 27, 2018 • edited Loading

Codecov Report

alexanderbez left a comment

Choose a reason for hiding this comment

ValarDragon commented Sep 1, 2018 • edited Loading

fedekunze commented Sep 2, 2018

ValarDragon commented Sep 2, 2018 • edited Loading

ValarDragon Sep 2, 2018

Choose a reason for hiding this comment

rigelrozanski left a comment

Choose a reason for hiding this comment

rigelrozanski Sep 2, 2018 • edited Loading

Choose a reason for hiding this comment

rigelrozanski Sep 2, 2018

Choose a reason for hiding this comment

fedekunze Sep 3, 2018

Choose a reason for hiding this comment

rigelrozanski Sep 2, 2018

Choose a reason for hiding this comment

jaekwon Sep 3, 2018

Choose a reason for hiding this comment

jaekwon Sep 3, 2018

Choose a reason for hiding this comment

fedekunze Sep 3, 2018 • edited Loading

Choose a reason for hiding this comment

rigelrozanski Sep 2, 2018

Choose a reason for hiding this comment

rigelrozanski commented Sep 2, 2018

rigelrozanski commented Sep 2, 2018

fedekunze commented Sep 2, 2018

fedekunze commented Sep 3, 2018

fedekunze commented Sep 3, 2018

jaekwon commented Sep 3, 2018

fedekunze commented Sep 7, 2018

fedekunze commented Aug 24, 2018 •

edited

Loading

codecov bot commented Aug 24, 2018 •

edited

Loading

codecov bot commented Aug 27, 2018 •

edited

Loading

ValarDragon commented Sep 1, 2018 •

edited

Loading

ValarDragon commented Sep 2, 2018 •

edited

Loading

rigelrozanski Sep 2, 2018 •

edited

Loading

fedekunze Sep 3, 2018 •

edited

Loading