-
Notifications
You must be signed in to change notification settings - Fork 17.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
spec: define initialization order more precisely #57411
Comments
CC @griesemer |
Given the widespread use of More generally, what if a package does:
while another does
or even different files in the same package? |
Depending on the implementation of this,
That is a great question... couldn't this raise a syntax error ? Like a "pick a lane bud, you're drunk !" 😄 |
packaged does the first one, packagee does the second are you just never allowed to import these two packages together? |
Without having thought about it too much (therefore don't take my word for it), I think I would simply raise a syntax error on both import lines (for packaged and packagee...) There is hardly any reason why one would want to have 2 different import orders, and if there was a reason, that reason doesn't exist as of today, because at the moment, package initialization order isn't guaranteed (and so we can eliminate that problem by raising a syntax error.) |
Actually, I see a potential problem with applications with huge dependency trees with open source packages. One would want to take this into consideration if something is to be done... |
You should not connect to a server from Instead, packagea should have a proper constructor that gets invoked. Each unit test is then responsible for calling it. This establishes proper isolation between each test case, which is a very useful property. Meanwhile, unit tests for other packages that depend on packagea should likely be using some interface instead of direct function calls, which allows for mocking. |
See also #31636. You can force the init order by ordering the imports accordingly and into "blocks" so that they don't get re-ordered (example). Note that this is not prescribed by the language, it's an implementation behavior. That said, it is much more robust to be explicit and call initialization functions explicitly if they cause side effects. |
I guess you are right, there are probably other ways to get succint code without connecting to a server in the init() function. However the fundamental question regarding side effects is still of interest, for other use cases. |
it would be useful to have real usecases motivating these discussions. |
This proposal is only useful if package A's initialization is dependent on some other package B, despite the fact that A does not import B directly or indirectly. Writing code this way seems extremely confusing and error prone, especially because it will silently break if B imports A directly or indirectly, since then A would have to be initialized first no matter what. |
I'm glad that in the example you provided, the same assumptions are made; however while testing this I experienced some irregularities.
|
VSCode's goimport doesn't rearrange imports in different groups if they are separated by a blank line: import (
_ "b"
// blank line
_ "a"
) (at least for me.) That said, let me emphasize again: it's much better to explicitly call initialization functions that have important side effects if you want robust code. |
Thanks for chiming in; as I've mentioned, this doesn't seem to affect import order consistently for unit tests. It initializes packages in the proper order during normal execution, but when running unit tests, this order is not followed; Let me share an example: -- go.mod --
module somepackage
go 1.19
-- somepackage.go --
package somepackage
import (
"os"
)
var ENV_VAR = os.Getenv("ENV_VAR")
-- mock/mock.go --
package mock
import "os"
const ENV_VAR_VAL = "localhost"
var _ = os.Setenv("ENV_VAR", ENV_VAR_VAL)
-- somepackage_test.go --
package somepackage_test
import (
"testing"
"somepackage/mock"
_ "somepackage/mock"
"somepackage"
)
func TestSomePackage(t *testing.T) {
if somepackage.ENV_VAR != mock.ENV_VAR_VAL {
t.Fail()
}
} If I run this code on my machine (i.e. run the unit test |
I also just confirmed that in the example above, running the tests in a separate test package causes the behavior to change: // test/test_test.go
package test_test
import (
"testing"
"somepackage/mock"
_ "somepackage/mock"
"somepackage"
)
func TestTests(t *testing.T) {
if somepackage.ENV_VAR != mock.ENV_VAR_VAL {
t.Fail()
}
} In this case, the desired import order is the same, but the result is not the same, and the test passes. |
What matters here is also in which order the files are presented to the compiler by the But I cannot but emphasize one more time: depending on these orders leads to extremely fragile code. If there are important side-effects that your code depends on, they should be encoded explicitly by calling respective initialization functions in the desired order. |
That is true, and I agree; the practice should be avoided in most instances. However, the problem of reproducibility and consistency still remains. The commands I'm using to run unit tests is
I'm just trying to understand why it was, that with all my careful research and attentive reading of the Go spec, I ended up believing that I can have some side effects propagated during the import process; and then ended up in the end realizing that the import order is inconsistent, and that the import for side effects functionality doesn't guarantee any import order and so should be avoided altogether. If I am to import some package for its side effects, but can't have anything from the import/initialization process depend on it (implicitly), well then isn't the import for side effects functionality rendered useless, since we can't derive any assumption from it? I mean that, if the only code that can rely on side effects from imports is the one that sits in the package that is importing for side effects, couldn't the imported package (the one with the side effects) simply expose a method that is called in the init process of the importing package ? I'm just thinking that if calling the initialization manually in my package is a better approach, then importing a package for its side effects isn't really all that useful and should probably be discouraged; am I missing something ? Should a mention about this possible inconsistency be included in the spec, in the section where import for side effects is brought up? |
This proposal has been added to the active column of the proposals project |
If we do anything here I think we should do the algorithm discussed in #31636. That issue was closed with a note in the CL that we should consider implementing the more deterministic algorithm later. CC @randall77 |
Specifically, the algorithm proposed in #31636 is "find the lexically earliest package [as sorted by import path] that is not initialized yet, but has had all its dependencies initialized, Initialize that package, and repeat." |
@the-zucc The |
My sense of this so far has been that A significant (but acceptable) consequence of that concession is that because (I will concede that there are some packages in stdlib that seem to be intentionally designed around the existence of initialization side-effects, such as how It seems to me that the existence of However, I'm very skeptical about the import order of dependencies of a particular package being significant. The author of a package fully controls what it directly depends on, and so can avoid using combinations of packages that don't make sense together, can directly import the packages whose side-effects they depend on, and avoid directly importing packages that contain known bugs (init-order-related or otherwise). I share the same reaction to others in this discussion that having an However, if we put aside that particular part of the problem then we have an example where the It feels like a bug to me that I feel skeptical that the specification needs to guarantee anything stronger than that. |
@apparentlymart I am definitely sympathetic to that point of view. Still, for a counter-argument, see #31636 (comment) . |
Change https://go.dev/cl/478916 mentions this issue: |
Retry in CL 478916. Google internal ran into two issues with this change. I will explain one; the other is very similar. The problem is a undefined ordering between Package
Package
Typically package Critically for the issue at hand, Without even an indirect dependence, the initialization order is unspecified (before this CL, that is). If the initialization of The fundamental problem here is that we're trying to establish dependence by name (a string) and not by an actual language construct reference. It would be better if the only way to set a flag was to get the address of it somehow (which would require exporting it, at least in test mode). In any case, that ship has sailed with how We're working on fixing some of these implicit ordering dependence things inside Google so we can retry the CL. The question here is, how much do we expect external code to run into this? Anything we can do to mitigate it? One option to consider is having |
A possibility: if |
Change https://go.dev/cl/480215 mentions this issue: |
As part of developing #57411, we ran into cases where a flag was defined in one package init and Set in another package init, and there was no init ordering implied by the spec between those two packages. Changes in initialization ordering as part of #57411 caused a Set to happen before the definition, which makes the Set silently fail. This CL makes the Set fail loudly in that situation. Currently Set *does* fail kinda quietly in that situation, in that it returns an error. (It seems that no one checks the error from Set, at least for string flags.) Ian suggsted that instead we panic at the definition site if there was previously a Set called on that (at the time undefined) flag. So Set on an undefined flag is ok and returns an error (as before), but defining a flag which has already been Set causes a panic. (The API for flag definition has no way to return an error, and does already panic in some situations like a duplicate definition.) Update #57411 Change-Id: I39b5a49006f9469de0b7f3fe092afe3a352e4fcb Reviewed-on: https://go-review.googlesource.com/c/go/+/480215 Run-TryBot: Keith Randall <[email protected]> Reviewed-by: Ian Lance Taylor <[email protected]> TryBot-Result: Gopher Robot <[email protected]> Reviewed-by: Keith Randall <[email protected]>
Change https://go.dev/cl/498755 mentions this issue: |
Also move all the language changes to the same part of the release notes. For #56351 For #57411 Change-Id: Id1c51b5eb8f7d85e61a2ae44ee7d73bb13036631 Reviewed-on: https://go-review.googlesource.com/c/go/+/498755 TryBot-Bypass: Ian Lance Taylor <[email protected]> Reviewed-by: Ian Lance Taylor <[email protected]> Reviewed-by: Keith Randall <[email protected]> Auto-Submit: Ian Lance Taylor <[email protected]> Reviewed-by: Dmitri Shuralyov <[email protected]> Auto-Submit: Ian Lance Taylor <[email protected]>
Change https://go.dev/cl/499278 mentions this issue: |
For #57411 Change-Id: I56c112bb03dde24c2e2643c9b72ce06158a8e717 Reviewed-on: https://go-review.googlesource.com/c/go/+/499278 TryBot-Bypass: Keith Randall <[email protected]> Reviewed-by: Ian Lance Taylor <[email protected]> Reviewed-by: Keith Randall <[email protected]>
Change https://go.dev/cl/501696 mentions this issue: |
For #57411. Change-Id: I94982d939d16ad17174f801cc167cc10ddc8da30 Reviewed-on: https://go-review.googlesource.com/c/go/+/501696 Reviewed-by: Keith Randall <[email protected]> Reviewed-by: Robert Griesemer <[email protected]> TryBot-Bypass: Robert Griesemer <[email protected]> Reviewed-by: Ian Lance Taylor <[email protected]> Reviewed-by: Keith Randall <[email protected]>
So now there is no way to force initialization order? It seems sad. |
@hunshcn Initialization order is still under user control. If you have a concrete concern, please file an issue explaining your concern with a reproducible case or clear example. Thanks. |
Can you guide me? I tried package main
import (
_ "issue/packageB"
)
import (
"fmt"
_ "issue/packageA"
)
func main() {
fmt.Println("ok")
} and
packageA/B content is simple.It has one file named a.go package packageB
import "fmt"
func init() {
fmt.Println("packageB/a.go init")
} In the above two import methods, PackageA is executed before PackageB. |
If you want packageB to be initialized before packageA, then do, in packageA
|
This is the simplest case. In a normal scenario, packageA/B may be a third-party package. |
Can you be specific about the two third-party packages you're having trouble with? |
I don't think the specific package is critical. In go1.20, adjusting the import order in main can affect initialization order. go1.20 cannot But for your question, I will also make an example to describe the scene. Please give me some time. |
@randall77 Thank You for Your Time in my project (a k8s controller) there are two workqueue metric registration, "sigs.k8s.io/controller-runtime/pkg/metrics" and "k8s.io/component-base/metrics/prometheus/workqueue". The simultaneous import of the two packages is not explicitly declared by me, I just want to use "sigs.k8s.io/controller-runtime/pkg/metrics". "k8s.io/component-base/metrics/prometheus/workqueue" from the deep dependency They both registry workqueue metric, but workqueue provider can only registry once. So the import order becomes very important. I have no idea to deal with this problem on go 1.21 at present. In go 1.20, I only need to import "sigs.k8s.io/controller-runtime/pkg/metrics" at the top of package main to solve it. |
I'm a bit confused by your answer. You quote two packages that you want to initialize in a specific order, but I don't understand why you need them initialized in a specific order. My best guess: are they both calling into |
@randall77 You are very wise. Just an hour ago,I located the citation exception through dependency analysis. Two years ago, our team members accidentally imported However, this is also an example, and defining initialization order may be a potential requirement. |
I think I still need more details to understand.
Does that mean I'm right about
I don't understand what this means.
By "this change", you mean the one you're currently working on? So there was a latent bug for 2 years that you're now tripping up on? Shouldn't that latent bug be fixed instead of papering back over it by reordering initialization? |
No, after discovering what I said, I fixed it. I mean, similar things may happen in other scenes. Of course, I provided a bad example. I'm sorry. |
Ok, thanks for the discussion. |
Context
Package Initialization
When importing a package, it is initialized according to the sequence described in the Go specification. If that package has dependencies, it makes sense that these are initialized first, and that is what is described in the spec. In many instances, this is very useful and helps to write very succinct code, eliminating much boilerplate code.
Importing for Side Effects
In the Go specification, there is mention that it is possible to import packages for their side-effects:
Small Issue
Currently, the import order is predictable for packages with import relations. This is very useful and works flawlessly. However, as I've come to realize it, within a series of import statements in a single file, a package that is imported (for its side effects) early in the list of imports has no guarantee of early initialization. Consequently, the initializations are not ran in order, which makes results unpredictable at times.
Example Use Case - Mocking An Execution Environment Before Running Unit Tests
The package names here are not necessarily representative of what has caused this issue, they are just to explain the use case; let's say I have a package,
packagea
, that initializes a connection to a server, either in itsinit()
or via variable initializers.In this case, I've observed 2 things:
packageb
will be imported and initialized beforepackagea
-- in my case, the package imported the earliest was initialized at the end, effectively not propagating its side effects early enough in the program execution for them to apply to other package initializations, as the environment variable was not set which causedpackagea
to default to the "production code" server URL, and not the testing URL;Proposed Solution
Enable some way of specifying that a package should be imported before any other package for its side effects. As an example, initialization behavior for grouped imports could remain the same, but for separate import statements, statement order could define the initialization order.
EDIT: as mentioned in the comments, the proposed solution wouldn't work, unfortunately; however it still gives an idea of what a candidate implementation could look like.
The text was updated successfully, but these errors were encountered: