Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GODRIVER-2533 Fix data race from NumberSessionsInProgress. #1085

Merged
merged 4 commits into from
Oct 4, 2022
Merged

GODRIVER-2533 Fix data race from NumberSessionsInProgress. #1085

merged 4 commits into from
Oct 4, 2022

Conversation

benjirewis
Copy link
Contributor

GODRIVER-2533

Fixes data race in session.Pool by switching checkedOut from an int to an atomically-accessed int64. Adds a regression test for the data race.

Copy link
Collaborator

@matthewdale matthewdale left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Atomic logic looks good!

select {
case <-ctx.Done():
return
}
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This select statement seems like it will block until the ctx.Done() channel is closed and will never reach the call below. To get around that, either add a default case or check if the context is done in the for loop.

E.g.

for ctx.Err() == nil {
	_ = mt.Client.NumberSessionsInProgress()
}

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm a little concerned about the busy waiting here as it occupies the CPU while waiting. Will this be an issue?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

TIL about the anti-pattern of busy-waiting, thanks 🧑‍🔧 . I modified the test to only run each operation 5 times and used a channel to sync the goroutines' execution as opposed to a context. Note that the test fails now only about 75% of the time when checkedOut is not atomically accessed (as opposed to closer to 95% of the time with the previous design). I think that's fine, though.

Copy link
Collaborator

@matthewdale matthewdale Oct 4, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think short busy-wait loops in tests that are trying to exercise a specific non-deterministic behavior are probably OK, but I do think the "run N times" loops are a good way to achieve that also. I left some suggestions about possible ways to further improve the reliability of the tests in my subsequent review.

mongo/integration/sessions_test.go Outdated Show resolved Hide resolved
@benjirewis
Copy link
Contributor Author

@matthewdale thank you for catching that the test was not actually running the functions 😮‍💨 . Should be fixed now; I also used a WaitGroup in addition to the context to better synchronize the goroutines and avoid more data races between the goroutines and test cleanup.

@benjirewis benjirewis requested a review from matthewdale October 4, 2022 04:19
Copy link
Collaborator

@matthewdale matthewdale left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good 👍

mongo/integration/sessions_test.go Outdated Show resolved Hide resolved
mongo/integration/sessions_test.go Show resolved Hide resolved
mongo/integration/sessions_test.go Show resolved Hide resolved
@benjirewis
Copy link
Contributor Author

@matthewdale thanks. Removed channel, increased iteration count to 100, and added a 100µs sleep.

Copy link
Collaborator

@qingyang-hu qingyang-hu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@benjirewis benjirewis merged commit 1205fe4 into mongodb:master Oct 4, 2022
@benjirewis benjirewis deleted the numberSessionsDataRace.2533 branch October 4, 2022 19:46
Julien-Beezeelinx pushed a commit to Julien-Beezeelinx/mongo-go-driver that referenced this pull request Oct 20, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants