Make the "second Base copy" trick actually work #26111

Keno · 2018-02-19T03:50:06Z

As was mentioned in #25988, there is a handy trick where you can load
a second copy of Base on top of an existing copy. This is useful for
at least two reasons:

1. Base printing is available, so things like MethodErrors print nicely.
2. Even if the load fails, the resulting (broken) copy of Base is inspectable
   by standard introspection tools from the REPL, as long as you're a bit
   careful not to mix types from the two copies of Base.

However, as I mentioned in #26079, this only actually works until about version.jl,
at which point things crash. This is because at that point it tries to use PCRE,
which uses `Ref(0)`. `Ref` is actually an abstract type in Core, even though the
type of the constructed object (`RefValue`) is in Base. As a result, the new Base
gets the wrong kind of `RefValue` (the one from the original Base) and things break.

Luckily this is easily fixed by using an explicit `RefValue` call in the relevant places.
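
To illustrate the `Ref`/`RefValue` split described above, here is a minimal sketch that can be run in an ordinary session with a single Base. The variable names are illustrative only and are not taken from the PR; the sketch shows where each type lives, not the second-Base failure itself.

```julia
# `Ref` is the abstract type; for a plain value, `Ref(0)` builds a concrete
# `RefValue`. The two types are defined in different modules.
r = Ref(0)

ref_is_abstract = isabstracttype(Ref)       # is `Ref` itself abstract?
ref_home        = parentmodule(Ref)         # module that defines `Ref`
concrete_home   = parentmodule(typeof(r))   # module that defines `RefValue`

# Naming the concrete type explicitly avoids going through the shared
# abstract `Ref` entry point.
explicit  = Base.RefValue{Int}(0)
same_type = typeof(explicit) === typeof(r)
```

In a second copy of Base, an unqualified `RefValue` refers to that copy's own type, which is presumably why the explicit call sidesteps the problem.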

A second problem we run into is that modules nested under our new Base get a default
import of the old Base (unless we declare the new Base to be the global top module, but
that would break the REPL after loading the new Base, which defeats reason 2 above).
I suggest (and implement in this PR) to have the default import be the next topmodule along
the parent link chain (as we already do for syntax defined in Base), which makes this work.
A small related detail is that in all such modules `import Base: x` needs to be replaced by
`import .Base: x`, to make sure we resolve the identifier `Base` (as imported from our
new top module) rather than the global name `Base` (which still refers to the old module).
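
The difference between the two import forms can be sketched with ordinary modules. The names `Outer`, `Helper`, and `describe` below are hypothetical stand-ins, not part of the PR: `Helper` plays the role of the identifier `Base` that is visible inside the enclosing module.

```julia
module Outer                      # hypothetical enclosing module
    module Helper                 # hypothetical; stands in for the local `Base`
        describe() = "Outer.Helper"
    end

    # The leading dot resolves `Helper` as an identifier in the current
    # module. Without the dot, `import Helper: describe` would look for a
    # top-level module or package named `Helper` instead.
    import .Helper: describe

    const result = describe()
end
```

This only demonstrates relative versus global name resolution in general; it does not load a second copy of Base.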

I changed sysimg.jl to avoid loading stdlibs in second Base mode, to avoid having to implement
the same changes there. Since the stdlibs are already decoupled from Base, they can already
be developed separately fairly easily, so there's not much reason to include them in this trick.

For completeness, there are a couple of ways to use this trick, but perhaps the simplest is:

```
cd("base")
baremodule NotBase
    Core.include(NotBase, "sysimg.jl")
end
```

from the REPL.

StefanKarpinski · 2018-02-19T22:11:54Z

These all seem like good changes to me which make the meaning of the code more precise.

JeffBezanson · 2018-02-20T15:49:45Z

base/refpointer.jl

```diff
-            ptrs[i] = unsafe_convert(P, root)::P
-            roots[i] = root
+###
+if parentmodule(@__MODULE__) === Main
```

It would be better not to rely on Main for this. It's not clear if Main should be the parent module of Base and Core; in fact that seems like a holdover from the old code loading model. For now this check should be factored into a variable like is_primary_base_module or something.
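
One way to read this suggestion, as a minimal sketch with hypothetical module names (`PrimaryDemo`, `Nested`) and not the code that was merged: compute the check once, store it in a constant, and branch on the constant wherever the distinction matters, so the reliance on `Main` lives in a single place.

```julia
module PrimaryDemo                # hypothetical stand-in for a top-level Base
    # The only place that mentions `Main`; everything else tests the constant.
    const is_primary_module = parentmodule(@__MODULE__) === Main

    module Nested                 # hypothetical; its parent is PrimaryDemo
        const is_primary_module = parentmodule(@__MODULE__) === Main
    end
end

# A nested module never has `Main` as its parent, so its flag is false.
nested_flag = PrimaryDemo.Nested.is_primary_module
```

If the definition of "primary" later stops depending on `Main`, only the constant's initializer would need to change.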

JeffBezanson · 2018-02-20T15:50:38Z

> I suggest (and implement in this PR) to have the default import be the next topmodule along
> the parent link chain

How does this work? It looks to me like the second loaded copy of Base does not set the topmodule flag.

Keno · 2018-02-20T16:19:18Z

> How does this work? It looks to me like the second loaded copy of Base does not set the topmodule flag.

It sets the topmodule flag, it just doesn't set the primary flag.

Keno requested a review from JeffBezanson February 19, 2018 03:50

JeffBezanson reviewed Feb 20, 2018

Keno force-pushed the kf/secondbase branch from a49a34d to 3316acc Compare February 20, 2018 21:38

Keno force-pushed the kf/secondbase branch from 3316acc to 84e68f3 Compare February 20, 2018 22:40

JeffBezanson approved these changes Feb 20, 2018

Keno merged commit b1189ce into master Feb 21, 2018

StefanKarpinski deleted the kf/secondbase branch February 21, 2018 05:42
