Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

internal/grpcsync: support two ways to schedule a callback with the serializer #7408

Merged
merged 3 commits into from
Jul 12, 2024

Conversation

easwars
Copy link
Contributor

@easwars easwars commented Jul 12, 2024

The earlier API to add callbacks to the serializer, Schedule, was returning a bool to indicate whether or not the callback was added to the serializer. This had the following problems:

  • No way to mandate that callers actually look at the return value
  • Downstream errors because of the failure to schedule a callback (and the failure to look at the return value) can be very hard to debug
  • Very few callers need to take specific action when the call to Schedule fails, while for most of the others, it is totally OK for the call to Schedule to fail.

This PR splits the API into two:

  • MaybeSchedule: A best effort API that fails silently if the serializer is closed
  • ScheduleOr: An API which runs an onFailure func if the callback cannot be scheduled

RELEASE NOTES: none

@easwars easwars added the Type: Internal Cleanup Refactors, etc label Jul 12, 2024
@easwars easwars added this to the 1.66 Release milestone Jul 12, 2024
@easwars easwars requested a review from dfawley July 12, 2024 17:33
@easwars easwars force-pushed the serializer_api_change branch from 556f171 to 290191e Compare July 12, 2024 17:49
Copy link

codecov bot commented Jul 12, 2024

Codecov Report

Attention: Patch coverage is 91.66667% with 4 lines in your changes missing coverage. Please review.

Project coverage is 81.38%. Comparing base (e7d8822) to head (343432d).
Report is 2 commits behind head on master.

Additional details and impacted files
@@            Coverage Diff             @@
##           master    #7408      +/-   ##
==========================================
- Coverage   81.45%   81.38%   -0.08%     
==========================================
  Files         348      350       +2     
  Lines       26752    26845      +93     
==========================================
+ Hits        21792    21848      +56     
- Misses       3773     3799      +26     
- Partials     1187     1198      +11     
Files Coverage Δ
balancer_wrapper.go 88.74% <100.00%> (+1.98%) ⬆️
internal/grpcsync/callback_serializer.go 100.00% <100.00%> (ø)
internal/grpcsync/pubsub.go 76.47% <100.00%> (ø)
resolver_wrapper.go 85.10% <100.00%> (ø)
...s/internal/balancer/cdsbalancer/cluster_watcher.go 100.00% <100.00%> (ø)
...rnal/balancer/clusterresolver/resource_resolver.go 93.33% <100.00%> (ø)
xds/internal/resolver/serviceconfig.go 88.14% <100.00%> (ø)
xds/internal/xdsclient/authority.go 86.09% <100.00%> (ø)
xds/internal/resolver/watch_service.go 89.47% <83.33%> (ø)
xds/internal/xdsclient/clientimpl_watchers.go 76.74% <50.00%> (ø)
... and 1 more

... and 26 files with indirect coverage changes

Copy link
Member

@dfawley dfawley left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This LGTM but one minor simplification suggestion. Naming question: Maybe is OK and what I originally proposed, but how do you feel about TrySchedule?

@@ -94,8 +94,8 @@ func newCCBalancerWrapper(cc *ClientConn) *ccBalancerWrapper {
// the underlying balancer. This is always executed from the serializer, so
// it is safe to call into the balancer here.
func (ccb *ccBalancerWrapper) updateClientConnState(ccs *balancer.ClientConnState) error {
errCh := make(chan error)
ok := ccb.serializer.Schedule(func(ctx context.Context) {
errCh := make(chan error, 1)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This can stay unbounded, and maybe improved a little:

func uCCS() {
	errCh := make(chan error)
	defer close(errCh)
	uccs := ...
	onFailure := func() {}
...

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We need to block on reading from this channel when the uccs callback is successfully scheduled on the serializer (because we want to handle state updates inline). But if we don't send something on the channel from the onFailure func, the last line return <-errCh will block forever, right? This is equivalent, isn't it? https://go.dev/play/p/GHPD4MLhis6

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Um, yes, I tried to simplify too much, oops. You can still make the channel unbuffered (sorry for saying "unbounded"), and not defer the close but call close in the onFailure:

https://go.dev/play/p/oqDNkOs_hj7

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done.

@@ -309,8 +309,8 @@ func (b *cdsBalancer) UpdateClientConnState(state balancer.ClientConnState) erro
b.lbCfg = lbCfg

// Handle the update in a blocking fashion.
done := make(chan struct{})
ok = b.serializer.Schedule(func(context.Context) {
errCh := make(chan error, 1)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Similar here as above.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I can't make it unbuffered here because if ScheduleOr fails, it will call onFailure inline and this means that the write to the channel on line 327 errCh <- errBalancerClosed will happen before the read on line 330. Therefore it will just hang.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ah yes, that makes sense.

@dfawley dfawley assigned easwars and unassigned dfawley Jul 12, 2024
@easwars easwars assigned dfawley and unassigned easwars Jul 12, 2024
@easwars easwars merged commit d27ddb5 into grpc:master Jul 12, 2024
13 checks passed
@easwars easwars deleted the serializer_api_change branch July 12, 2024 21:47
SaiTejaSuvvari02 added a commit to SaiTejaSuvvari02/grpc-go that referenced this pull request Jul 14, 2024
printchard pushed a commit to printchard/grpc-go that referenced this pull request Jul 30, 2024
printchard pushed a commit to printchard/grpc-go that referenced this pull request Jul 30, 2024
@github-actions github-actions bot locked as resolved and limited conversation to collaborators Jan 9, 2025
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants