ensure extension triggers are only run by the package that satified them #48513

vtjnash · 2023-02-03T19:54:05Z

Oscar had noticed a problem where loading an extension might trigger loading another package, which might re-trigger attempting to load the same extension. And then that causes a deadlock from waiting for the extension to finish loading (it appears to be a recursive import triggered from multiple places). Instead alter the representation, to be similar to a semaphore, so that it will be loaded only exactly by the final package that satisfied all dependencies for it.

This approach could still encounter an issue if the user imports a package (C) which it does not explicitly list as a dependency for extension. But it is unclear to me that we actually want to solve that, since it weakens and delays the premise that Bext is available shortly after A and B are both loaded.

# module C; using A, B; end;; module A; end;; module B; end;; module Bext; using C; end
# using C, Bext / A / B
starts C -> requires A, B to load
loads A -> defines Bext (extends B, but tries to also require C) loads B -> loads Bext (which waits for C -> deadlock!)
finish C -> now safe to load Bext

While using this order would have been fine.
# using A, B, Bext / C
loads A -> defines Bext (extends B, but tries to also require C) loads B -> starts Bext
loads C
finish Bext

Oscar had noticed a problem where loading an extension might trigger loading another package, which might re-trigger attempting to load the same extension. And then that causes a deadlock from waiting for the extension to finish loading (it appears to be a recursive import triggered from multiple places). Instead alter the representation, to be similar to a semaphore, so that it will be loaded only exactly by the final package that satisfied all dependencies for it. This approach could still encounter an issue if the user imports a package (C) which it does not explicitly list as a dependency for extension. But it is unclear to me that we actually want to solve that, since it weakens and delays the premise that Bext is available shortly after A and B are both loaded. \# module C; using A, B; end;; module A; end;; module B; end;; module Bext; using C; end \# using C, Bext / A / B starts C -> requires A, B to load loads A -> defines Bext (extends B, but tries to also require C) loads B -> loads Bext (which waits for C -> deadlock!) finish C -> now safe to load Bext While using this order would have been fine. \# using A, B, Bext / C loads A -> defines Bext (extends B, but tries to also require C) loads B -> starts Bext loads C finish Bext

vtjnash · 2023-02-03T21:58:41Z

Oscar helped discuss offline that this deadlock case (mentioned in the comments) is actually potentially a severe flaw here right now in EXT_DORMATORY. We have unintentionally created a situation where user might easily accidentally create a loading graph that deadlocks unpredictably when loading it. In practice, that means, in addition to this PR being sensible, that we may want to forbid extension packages from loading any new packages (unless we can prove they do not depend on any of the triggers?). It should thus potentiall be forbidden from accessing any package that was not explicitly listed as one of its triggers (any others it wants transitively it could access via using Parent.Other if desired).

KristofferC · 2023-02-04T07:10:58Z

It should thus potentiall be forbidden from accessing any package that was not explicitly listed as one of its triggers (any others it wants transitively it could access via using Parent.Other if desired).

Originally, I had that as the implementation and you went via Parent.Other to get dependencies. That however excluded the case where you wanted to load a package in an extension that was not a trigger nor was it loaded into Parent. This seemed to be a quite common desire so I allowed that.

The code that allows that is this:

julia/base/loading.jl

Lines 824 to 825 in c4fd8a4

    
           # `name` is not an ext, do standard lookup as if this was the parent 
        
           return identify_package(PkgId(UUID(uuid), dep_name), name)

To understand better, the problem in the previous code was the use of Base.loaded_modules which doesn't actually say if the module has been loaded but only if it has been started to be loaded?

KristofferC · 2023-02-04T07:25:15Z

This also seems subsume #48509 and fix #48498?

vtjnash · 2023-02-04T14:56:44Z

There are a couple problems we observed, and this only really addresses one of them.

This addresses the case where multiple extensions need to be loaded. Previously each extension that loaded would trigger a new walk through the list, and then load the next extension in sequence. This is not strictly a problem, but is a bit awkward as it means the stack keeps getting longer and complicates the locking order of the loaded packages in some cases.

Originally, I had that as the implementation and you went via Parent.Other to get dependencies. That however excluded the case where you wanted to load a package in an extension that was not a trigger nor was it loaded into Parent. This seemed to be a quite common desire so I allowed that.

Secondly, we observed that this should never have been allowed. It would be very useful, if it was valid. But it creates a cycle in the loading graph, which leads to unpredictable deadlocks, and causing the extension to sometimes hang (v1.9) or error (master today). It turns out that we must forbid this (for now), to prevent such unreliable behavior from catching users unawares and making PkgEval unreliable. We can re-evaluate later if we want to design a solution for it to reallow it later. Of note, it also must be forbidden (aka strongly discouraged) from loading those extra, unexpected packages via other mechanisms (e.g. during __init__ or Requires.jl too) for the same reason that they will sometimes cause it to deadlock or error unpredictably.

vtjnash · 2023-02-04T14:59:26Z

problem in the previous code was the use of Base.loaded_modules

FWIW, this was not a problem. I still check both of those lists, I just moved where it checked them, and implemented the TODO comment about making this more efficient than a repeated scan of the list.

KristofferC · 2023-02-04T15:08:12Z

But it creates a cycle in the loading graph, which leads to unpredictable deadlocks, and causing the extension to sometimes hang (v1.9) or error (master today).

Would moving the extension into the parent package create the same cycle?

vtjnash · 2023-02-05T04:07:07Z

Commonly, yes. The same extension being loaded explicitly by the parent does suffer the same problem (the parent depends on weakdep depends on external which depends on the parent), but not if the same content is moved into another context downstream of the external dependency.

KristofferC · 2023-02-05T10:36:24Z

FWIW, this was not a problem.

But the previous code only loaded the extension when all the triggers were in Base.loaded_modules (so only the last one should trigger the loading). So what was the issue then?

vtjnash · 2023-02-05T14:18:57Z

Yeah, we realized later this is not really the source of the problem with deadlock that we had encountered (though it was a possible cause), but I didn't go back to update the PR text. This is only a change to make it so that only exactly that package will run all of its triggers, and not any other package (including weakdeps) that happen to finish loading around the same time.

KristofferC · 2023-02-06T11:12:16Z

Something seems strange here, when an extension fails to load, it tries to load it again. I introduced a syntax error in https://github.com/KristofferC/PGFPlotsX.jl/blob/master/ext/ColorsExt.jl and I get:

julia> using PGFPlotsX

julia> using Colors
[ Info: Precompiling ColorsExt [283d1826-985b-5544-82b8-7fd9aa83b823]
ERROR: LoadError: UndefVarError: `eeh` not defined
Stacktrace:
 [1] top-level scope
   @ ~/JuliaPkgs/PGFPlotsX.jl/ext/ColorsExt.jl:7
 [2] include
   @ ./Base.jl:456 [inlined]
 [3] include_package_for_output(pkg::Base.PkgId, input::String, depot_path::Vector{String}, dl_load_path::Vector{String}, load_path::Vector{String}, concrete_deps::Vector{Pair{Base.PkgId, UInt128}}, source::Nothing)
   @ Base ./loading.jl:2031
 [4] top-level scope
   @ stdin:2
in expression starting at /Users/kristoffercarlsson/JuliaPkgs/PGFPlotsX.jl/ext/ColorsExt.jl:1
in expression starting at stdin:2
┌ Error: Error during loading of extension ColorsExt of PGFPlotsX
└ @ Base loading.jl:1187
[ Info: Precompiling ColorsExt [283d1826-985b-5544-82b8-7fd9aa83b823]
ERROR: LoadError: UndefVarError: `eeh` not defined
Stacktrace:
 [1] top-level scope
   @ ~/JuliaPkgs/PGFPlotsX.jl/ext/ColorsExt.jl:7
 [2] include
   @ ./Base.jl:456 [inlined]
 [3] include_package_for_output(pkg::Base.PkgId, input::String, depot_path::Vector{String}, dl_load_path::Vector{String}, load_path::Vector{String}, concrete_deps::Vector{Pair{Base.PkgId, UInt128}}, source::Nothing)
   @ Base ./loading.jl:2031
 [4] top-level scope
   @ stdin:2
in expression starting at /Users/kristoffercarlsson/JuliaPkgs/PGFPlotsX.jl/ext/ColorsExt.jl:1
in expression starting at stdin:2
┌ Error: Error during loading of extension ColorsExt of PGFPlotsX
└ @ Base loading.jl:1187

with this PR (and using #48550 to make stuff a bit shorter).

On master it only prints once.

base/loading.jl

vtjnash · 2023-02-06T14:18:16Z

Oh, I think that is actually correct behavior, but only because of a different bug. The original PR for weakdeps usually puts 2 copies of the weakdep into the array. But the try-catch was in the wrong place, so it would initially skip the second one and then come back to all of them the next time you loaded any package to try again. I meant to file an issue, but forgot.

KristofferC · 2023-02-06T14:31:54Z

The original PR for weakdeps usually puts 2 copies of the weakdep into the array.

I actually think #48352 was what made this happen.

vtjnash · 2023-02-06T14:44:07Z

I guess that might sometimes, but I think that PR is correct. I was referring to the presence of a call to that insert_extension_triggers function from both tryrequire and require however seeming that it may potentially insert duplicates.

(cherry picked from commit b2adb8548d7f2a38dc73ea2de1be271e688a545c)

ensure extension triggers are only run by the package that satified them

Krastanov · 2023-02-13T21:26:30Z

This pull request seems to have caused a drop in coverage of 5 percentage points.
Or maybe I am simply misunderstanding the codecov report?

https://app.codecov.io/gh/JuliaLang/julia/commits

https://coveralls.io/builds/56686154

vtjnash · 2023-02-13T22:09:39Z

This just happened to be the commit where we fixed the coverage computation (JuliaCI/julia-buildkite#188)

vtjnash requested review from KristofferC and oscardssmith February 3, 2023 19:54

vtjnash force-pushed the jn/extend-once branch from 251dbef to d7b2062 Compare February 3, 2023 19:56

KristofferC added the backport 1.9 Change should be backported to release-1.9 label Feb 4, 2023

This comment was marked as off-topic.

Sign in to view

vtjnash mentioned this pull request Feb 5, 2023

cycle creation issue with new weakdeps code #48533

Closed

KristofferC mentioned this pull request Feb 6, 2023

improve error message when an extension fails to load #48550

Merged

KristofferC reviewed Feb 6, 2023

View reviewed changes

base/loading.jl Show resolved Hide resolved

KristofferC mentioned this pull request Feb 6, 2023

only add extensions from one environment #48558

Merged

KristofferC added 2 commits February 6, 2023 20:14

improve error message when an extension fails to load (#48550)

ed1ecea

only add extensions from one environment (#48558)

628b15c

(cherry picked from commit b2adb8548d7f2a38dc73ea2de1be271e688a545c)

KristofferC merged commit b99cce5 into master Feb 7, 2023

KristofferC deleted the jn/extend-once branch February 7, 2023 07:59

KristofferC added a commit that referenced this pull request Feb 7, 2023

Merge pull request #48513 from JuliaLang/jn/extend-once

d4e0f36

ensure extension triggers are only run by the package that satified them

KristofferC mentioned this pull request Feb 7, 2023

run_extension_callbacks introduced a data race regression in loading #48498

Closed

KristofferC removed the backport 1.9 Change should be backported to release-1.9 label Feb 7, 2023

KristofferC mentioned this pull request Feb 8, 2023

@time_imports giving ERROR: type Pair has no field id on nightly #48590

Closed

KristofferC mentioned this pull request May 8, 2023

prevent order dependency if an extension is loaded or not #49681

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ensure extension triggers are only run by the package that satified them #48513

ensure extension triggers are only run by the package that satified them #48513

vtjnash commented Feb 3, 2023 •

edited

Loading

vtjnash commented Feb 3, 2023

KristofferC commented Feb 4, 2023 •

edited

Loading

KristofferC commented Feb 4, 2023

This comment was marked as off-topic.

vtjnash commented Feb 4, 2023

vtjnash commented Feb 4, 2023

KristofferC commented Feb 4, 2023

vtjnash commented Feb 5, 2023

KristofferC commented Feb 5, 2023

vtjnash commented Feb 5, 2023

KristofferC commented Feb 6, 2023 •

edited

Loading

vtjnash commented Feb 6, 2023

KristofferC commented Feb 6, 2023

vtjnash commented Feb 6, 2023

Krastanov commented Feb 13, 2023

vtjnash commented Feb 13, 2023

ensure extension triggers are only run by the package that satified them #48513

ensure extension triggers are only run by the package that satified them #48513

Conversation

vtjnash commented Feb 3, 2023 • edited Loading

vtjnash commented Feb 3, 2023

KristofferC commented Feb 4, 2023 • edited Loading

KristofferC commented Feb 4, 2023

This comment was marked as off-topic.

vtjnash commented Feb 4, 2023

vtjnash commented Feb 4, 2023

KristofferC commented Feb 4, 2023

vtjnash commented Feb 5, 2023

KristofferC commented Feb 5, 2023

vtjnash commented Feb 5, 2023

KristofferC commented Feb 6, 2023 • edited Loading

vtjnash commented Feb 6, 2023

KristofferC commented Feb 6, 2023

vtjnash commented Feb 6, 2023

Krastanov commented Feb 13, 2023

vtjnash commented Feb 13, 2023

vtjnash commented Feb 3, 2023 •

edited

Loading

KristofferC commented Feb 4, 2023 •

edited

Loading

KristofferC commented Feb 6, 2023 •

edited

Loading