chore: fix data races #1080

nktks · 2024-12-02T08:33:39Z

PR Checklist

Read the Contributing documentation.
Read the Code of conduct documentation.
Name your Pull Request title clearly, concisely, and prefixed with the name of the primarily affected package you changed according to Good commit messages (such as memory: add interfaces for X, Y or util: add whizzbang helpers).
Check that there isn't already a PR that solves the problem the same way to avoid creating a duplicate.
Provide a description in this PR that addresses what the PR is solving, or reference the issue that it solves (e.g. Fixes #123).
~~Describes the source of new concepts.~~ <- Not applicable: this PR adds no new concepts.
References existing implementations as appropriate.
~~Contains test coverage for new functions.~~ <- Not applicable: this PR adds no new functions.
Passes all golangci-lint checks.

What

Hello.
Thank you for maintaining this great module!
I would like to:

fix OpenAI Chat LLM thread safety #281 (comment)
add the -race flag to go test in CI so that data races are caught.
fix a data race in a test case: https://github.com/tmc/langchaingo/actions/runs/12115516980/job/33774019172?pr=1080

leventov · 2024-12-04T10:17:45Z

There is at least one more race: the ernieclient.Client.accessToken field is written in autoRefresh() without synchronization. This is pretty much a syntactically catchable thing; I wonder why go vet or other tools don't catch it.

nktks · 2024-12-05T02:08:17Z

@leventov
Hello.
Are you referring to https://github.com/nktks/langchaingo/blob/fix/data-race/llms/ernie/internal/ernieclient/ernieclient.go#L178 ?

I think the normal use case is to call ernieclient.New() only once per *ernieclient.Client instance.
In that case, autoRefresh() is called only from ernieclient.New(), and accessToken is written by only one goroutine in autoRefresh(), so data races won't occur.

leventov · 2024-12-05T07:49:37Z

@nktks yes, only one goroutine. But isn't writing a plain field c.accessToken from one goroutine and then reading it from others (any goroutines that use the Client for making LLM calls) a data race? Shouldn't such a value handover happen via atomic.Value or a channel, rather than a plain field write/read?
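To make the atomic.Value suggestion concrete, here is a minimal sketch of a token handover between one writer goroutine and several readers. It is not the ernieclient code: tokenHolder, newTokenHolder, set, and get are hypothetical names invented for this illustration, and the token strings are dummy values.

```go
package main

import (
	"fmt"
	"sync"
	"sync/atomic"
)

// tokenHolder is a hypothetical stand-in for the part of a client
// that stores an access token refreshed in the background.
type tokenHolder struct {
	token atomic.Value // always holds a string
}

// newTokenHolder stores an initial value so Load never returns nil.
func newTokenHolder(initial string) *tokenHolder {
	h := &tokenHolder{}
	h.token.Store(initial)
	return h
}

// set is what a refresh goroutine would call.
func (h *tokenHolder) set(tok string) { h.token.Store(tok) }

// get is what request goroutines would call.
func (h *tokenHolder) get() string {
	tok, _ := h.token.Load().(string)
	return tok
}

func main() {
	h := newTokenHolder("token-0")
	var wg sync.WaitGroup

	// One writer, mimicking a periodic refresh.
	wg.Add(1)
	go func() {
		defer wg.Done()
		for i := 1; i <= 100; i++ {
			h.set(fmt.Sprintf("token-%d", i))
		}
	}()

	// Several readers, mimicking concurrent LLM calls.
	for r := 0; r < 4; r++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			for i := 0; i < 100; i++ {
				_ = h.get()
			}
		}()
	}

	wg.Wait()
	fmt.Println(h.get()) // prints "token-100"
}
```

Because every access goes through Store and Load, running this with the race detector enabled should report nothing, whereas the same program with a plain string field would be flagged.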

leventov · 2024-12-05T07:52:10Z

Here's from https://go.dev/ref/mem:

Reads of memory locations larger than a single machine word are encouraged but not required to meet the same semantics as word-sized memory locations, observing a single allowed write w. For performance reasons, implementations may instead treat larger operations as a set of individual machine-word-sized operations in an unspecified order. This means that races on multiword data structures can lead to inconsistent values not corresponding to a single write. When the values depend on the consistency of internal (pointer, length) or (pointer, type) pairs, as can be the case for interface values, maps, slices, and strings in most Go implementations, such races can in turn lead to arbitrary memory corruption.
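As a small illustration of why this matters for a string field such as accessToken: with the standard gc toolchain, a string value occupies more than one machine word (a pointer plus a length). The sketch below only measures sizes; stringWords, sliceWords, and interfaceWords are hypothetical helper names, and the exact layout is an implementation detail rather than a language guarantee.

```go
package main

import (
	"fmt"
	"unsafe"
)

// wordSize is the size of one machine word on the current platform.
const wordSize = unsafe.Sizeof(uintptr(0))

// stringWords reports how many machine words a string header occupies.
func stringWords() uintptr {
	var s string
	return unsafe.Sizeof(s) / wordSize
}

// sliceWords reports how many machine words a slice header occupies.
func sliceWords() uintptr {
	var b []byte
	return unsafe.Sizeof(b) / wordSize
}

// interfaceWords reports how many machine words an interface value occupies.
func interfaceWords() uintptr {
	var v any
	return unsafe.Sizeof(v) / wordSize
}

func main() {
	fmt.Println("string words:   ", stringWords())
	fmt.Println("slice words:    ", sliceWords())
	fmt.Println("interface words:", interfaceWords())
}
```

Since each of these values spans several words, an unsynchronized write may be observed half-done by a concurrent reader, which is the inconsistency the quoted passage warns about.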

nktks · 2024-12-05T09:21:38Z

@leventov Thank you!
Indeed, concurrently reading and writing accessToken is a data race.

I added a test case to reproduce it.
https://github.com/tmc/langchaingo/actions/runs/12176399051/job/33962066313

Then I fixed it by using sync.RWMutex.
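For readers following along, a fix of this kind can be sketched roughly as below. This is not the code from the commits in this PR: client, setAccessToken, and getAccessToken are hypothetical, simplified names, and the loop in main only imitates a refresh goroutine running next to concurrent callers.

```go
package main

import (
	"fmt"
	"sync"
)

// client is a hypothetical, simplified stand-in for a client whose
// access token is refreshed in the background.
type client struct {
	mu          sync.RWMutex // guards accessToken
	accessToken string
}

// setAccessToken takes the write lock; a refresh goroutine would call it.
func (c *client) setAccessToken(tok string) {
	c.mu.Lock()
	defer c.mu.Unlock()
	c.accessToken = tok
}

// getAccessToken takes the read lock, so many callers can read in parallel.
func (c *client) getAccessToken() string {
	c.mu.RLock()
	defer c.mu.RUnlock()
	return c.accessToken
}

func main() {
	c := &client{}
	c.setAccessToken("initial")

	var wg sync.WaitGroup

	// Writer: imitates the periodic token refresh.
	wg.Add(1)
	go func() {
		defer wg.Done()
		for i := 0; i < 100; i++ {
			c.setAccessToken(fmt.Sprintf("refreshed-%d", i))
		}
	}()

	// Readers: imitate goroutines building authenticated requests.
	for r := 0; r < 4; r++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			for i := 0; i < 100; i++ {
				_ = c.getAccessToken()
			}
		}()
	}

	wg.Wait()
	fmt.Println(c.getAccessToken()) // prints "refreshed-99"
}
```

An RWMutex suits this access pattern because writes are rare (one refresh at a time) while reads happen on every request; a plain sync.Mutex or atomic.Value would also remove the race.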

nktks added 3 commits December 2, 2024 17:32

ci: add -race flag to go test in CI

a551e3d

openaiclient: fix data race at createChat

7052f6f

chains_test: use mutex to lock

f7fac22

nktks marked this pull request as ready for review December 2, 2024 09:10

nktks changed the title ~~fix data race in openaiclient~~ fix data races Dec 2, 2024

nktks changed the title ~~fix data races~~ chore: fix data races Dec 2, 2024

nktks added 3 commits December 5, 2024 17:56

add fail test case to r/w accessToken from some goroutines

b4bb8d1

fix data race for accessToken

5000ec8

fix lint

1aec997
