Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow unresolved symlinks inside a tree artifact #15454

Open
GWLeal opened this issue May 10, 2022 · 21 comments
Open

Allow unresolved symlinks inside a tree artifact #15454

GWLeal opened this issue May 10, 2022 · 21 comments
Labels
P3 We're not considering working on this, but happy to review a PR. (No assignee) team-Rules-API API for writing rules/aspects: providers, runfiles, actions, artifacts type: feature request

Comments

@GWLeal
Copy link

GWLeal commented May 10, 2022

Hello,

I've noticed some strange behavior regarding how bazel evaluates invalid symlinks.

UC1:
When I declare an invalid symlink, in a custom rule, and I build a bazel target of that kind bazel prints out the message:
Error in declare_symlink: actions.declare_symlink() is not allowed; use the --experimental_allow_unresolved_symlinks command line option
As expected when the suggested option is added the target builds successfully.

UC2:
However when I declare a folder and add a invalid symlink in that folder I get:
ERROR: (...): Failed to resolve relative path invalid_symlink inside TreeArtifact (...) The associated file is either missing or is an invalid symlink.
Adding the --experimental_allow_unresolved_symlinks does not work in this case.

This seems like inconsistent behavior. Also it is pretty common to pack build artifacts that contain invalid symlinks that will only become valid when the package is installed on the intended target.
I looked in the documentation and did not find a way to circumvent the issue.

I prepared a minimal working example to replicate the problem: https://github.com/GWLeal/bazel_invalid_symlinks_ex

UC1:
bazel build //test:symlink_test

UC2:
bazel build //test:dir_symlink_test

I'm currently using bazel version 5.1.1

Thank you in advance for taking the time to look into this :)

@lberki
Copy link
Contributor

lberki commented May 10, 2022

I'm not too surprised. --experimental_allow_unresolved_symlinks only works for "regular" artifacts not those that are part of TreeArtifacts. This is one of the reasons why it's still experimental...

/cc @coeuvre @alexjski

@coeuvre maybe this is something that could conceivably be fixed as a drive-by fix while you are poking at metadata handling for remote execution? (I dorealize this is a long shot)

@alexjski
Copy link
Contributor

--experimental_allow_unresolved_symlinks

What is the use case for having unresolved symlinks?

@GWLeal
Copy link
Author

GWLeal commented May 10, 2022

--experimental_allow_unresolved_symlinks

What is the use case for having unresolved symlinks?

@alexjski in this case we generate an image with the application filesystem that is mounted in one of the target's partitions as read-only. One of the symlinks points to a file in the OS partition in the target. But I would say that having unresolved symlinks in the context of packaging is common.

(thank you for the replying so quickly)

@lberki
Copy link
Contributor

lberki commented Jun 30, 2022

After some discussion with @lfpino we came to the conclusion that there are two missing pieces before we can stabilize this:

  1. Support on Bazel's RE interface (this is known to not work)
  2. Support in various sandboxes (code suggests that this works, but I don't know off the top of my head if it does or it is tested)

I specifically don't want to add any requirement concerning Windows because symlinks on Windows are a shaky edifice anyway that is not used widely so I don't think it makes sense for Bazel to handle symlinks on Windows better than Windows itself.

So if @lfpino fixes the above two issues, I see no obstacle to stabilizing this functionality (the flag will stay since we need to disable it at Google, but it can default to true and the --experimental_* prefix can be removed)

Since I won't be able to attend to this in the near future, @coeuvre graciously agreed to oversee the process and review the code.

@fmeum
Copy link
Collaborator

fmeum commented Jun 30, 2022

I have RBE support implemented on a branch, which I can submit for review after #15700 and #15773. Specifically, the tests I added in #15700 seems to indicate that everything works pretty well on Windows (except possibly the Windows sandbox, which I haven't tested with yet).

@lfpino Let me know if you want to sync on this.

@lfpino
Copy link
Contributor

lfpino commented Jul 1, 2022

@fmeum I've only started looking at this recently, I'd definitely love to meet and chat about this.

@ismell
Copy link

ismell commented Oct 6, 2022

I have RBE support implemented on a branch, which I can submit for review after #15700 and #15773. Specifically, the tests I added in #15700 seems to indicate that everything works pretty well on Windows (except possibly the Windows sandbox, which I haven't tested with yet).

@lfpino Let me know if you want to sync on this.

Looks like the two issues stated have been fixed. What's the status of the RBE branch?

@fmeum
Copy link
Collaborator

fmeum commented Oct 6, 2022

@ismell --experimental_allow_unresolved_symlinks should just work with RBE with Bazel HEAD. The flag is planned to be flipped for Bazel 6. I don't think that support covers unresolved symlinks in tree artifacts though.

@ismell
Copy link

ismell commented Oct 6, 2022

Ah, I was looking for the unresolved symlinks in tree artifacts support. Is that still being worked on or has it been shelved?

@fmeum
Copy link
Collaborator

fmeum commented Oct 6, 2022

I don't know, but I think @larsrc-google does.

@larsrc-google
Copy link
Contributor

Unresolved symlinks in tree artifacts are not currently being worked on, to the best of my knowledge.

@lberki
Copy link
Contributor

lberki commented Oct 7, 2022

@larsrc-google correct; we had a discussion with @tjgq and the outcome was that it's highly non-trivial how to handle them, because technically, each file in tree artifacts could require different treatment (Bazel could either resolve any symlink or not) and there isn't an obvious way to tell Bazel about each file what to do.

The simplest way to resolve that to make the decision at the level of tree artifact (i.e. either every symlink is resolved or every symlink is unresolved), but making that behavior consistent requires some thought about how it works with remote execution and maybe changes to the protocol which we didn't have time for for Bazel 6.0 so we decided to shelve the problem for now.

@larsrc-google
Copy link
Contributor

I'm adding this to the docs in my CL removing experimental_.

@brandjon brandjon added team-Rules-API API for writing rules/aspects: providers, runfiles, actions, artifacts and removed team-Build-Language labels Nov 5, 2022
@comius comius added team-Local-Exec Issues and PRs for the Execution (Local) team and removed team-Rules-API API for writing rules/aspects: providers, runfiles, actions, artifacts labels Nov 18, 2022
@comius
Copy link
Contributor

comius commented Nov 18, 2022

I changed the team, because the issue is more concerned with the execution phase than it is with Rules API. Local exec seemed to be the closest pick, although probably not precise one.

@tjgq
Copy link
Contributor

tjgq commented Nov 22, 2022

@comius The feature request here is to support unresolved symlinks inside a tree artifact; the current semantics are to transparently resolve symlinks into the files they point to.

I don't think this is purely an "execution phase" change, because it would affect the way artifacts are tracked by Skyframe, and require some sort of API to opt into it: either a flag to change the semantics for every tree artifact, or some way to toggle it on a per-tree or per-file-inside-tree basis. This seems within the purview of Build API (and perhaps even Core, given the Skyframe implications).

@tjgq tjgq added team-Build-Language team-Rules-API API for writing rules/aspects: providers, runfiles, actions, artifacts and removed team-Build-Language team-Local-Exec Issues and PRs for the Execution (Local) team labels Nov 29, 2022
@lberki
Copy link
Contributor

lberki commented Nov 30, 2022

I think it's all of the above: one would need to

  • Come up with an API to say that symlinks (or some symlinks) in a tree artifact are unresolved
  • Implement that on the execution side
  • Keep track of the metadata in Skyframe and the local action cache

@comius comius added P3 We're not considering working on this, but happy to review a PR. (No assignee) and removed untriaged labels Feb 14, 2023
@tjgq tjgq changed the title --experimental_allow_unresolved_symlinks does not work when declaring a folder Allow unresolved symlinks inside a tree artifact Oct 16, 2023
@tjgq tjgq removed their assignment Oct 16, 2023
copybara-service bot pushed a commit that referenced this issue Dec 4, 2023
… matching a locally produced one.

This CL harmonizes the criteria for permitting a symlink for locally and remotely produced tree artifacts, namely:

1. A symlink to an absolute path is allowed.
2. A symlink to a relative path is allowed, as long as the target is inside the tree artifact.
3. The target path must exist (a consequence of tree artifacts transparently dereferencing symlinks; see also #15454).

as enforced by TreeArtifactValue#visitTree (see https://cs.opensource.google/bazel/bazel/+/master:src/main/java/com/google/devtools/build/lib/skyframe/TreeArtifactValue.java;l=585;drc=de9d1f59915a36229978d46b78a22c9e5389db92).

The admonition about symlinks potentially compromising hermeticity is still valid, but there's little gain in making the behavior divergent between local and remote execution. I don't believe we can go in the other direction and extend the restriction to cover local execution, as it would break existing projects.

The --incompatible_remote_disallow_symlink_in_tree_artifact flag is deleted. It was added at a time when symlink resolution was extremely unreliable when building without the bytes; the state of symlink handling has improved a lot since then. I'm deleting the flag entirely (as opposed to making it a no-op) because it was only introduced in the Bazel 7 tree and hasn't made it into a stable release yet.

Makes #18284 obsolete.

PiperOrigin-RevId: 587691220
Change-Id: I470a8fc523bdbb7577ad5a564b6b2c0acab17d7d
tjgq added a commit to tjgq/bazel that referenced this issue Dec 4, 2023
…rtifact, matching a locally produced one.

This CL harmonizes the criteria for permitting a symlink for locally and remotely produced tree artifacts, namely:

1. A symlink to an absolute path is allowed.
2. A symlink to a relative path is allowed, as long as the target is inside the tree artifact.
3. The target path must exist (a consequence of tree artifacts transparently dereferencing symlinks; see also bazelbuild#15454).

as enforced by TreeArtifactValue#visitTree (see https://cs.opensource.google/bazel/bazel/+/master:src/main/java/com/google/devtools/build/lib/skyframe/TreeArtifactValue.java;l=585;drc=de9d1f59915a36229978d46b78a22c9e5389db92).

The admonition about symlinks potentially compromising hermeticity is still valid, but there's little gain in making the behavior divergent between local and remote execution. I don't believe we can go in the other direction and extend the restriction to cover local execution, as it would break existing projects.

The --incompatible_remote_disallow_symlink_in_tree_artifact flag is deleted. It was added at a time when symlink resolution was extremely unreliable when building without the bytes; the state of symlink handling has improved a lot since then. I'm deleting the flag entirely (as opposed to making it a no-op) because it was only introduced in the Bazel 7 tree and hasn't made it into a stable release yet.

Makes bazelbuild#18284 obsolete.

PiperOrigin-RevId: 587691220
Change-Id: I470a8fc523bdbb7577ad5a564b6b2c0acab17d7d
keertk pushed a commit that referenced this issue Dec 4, 2023
…rtifact, matching a locally produced one. (#20426)

This CL harmonizes the criteria for permitting a symlink for locally and
remotely produced tree artifacts, namely:

1. A symlink to an absolute path is allowed.
2. A symlink to a relative path is allowed, as long as the target is
inside the tree artifact.
3. The target path must exist (a consequence of tree artifacts
transparently dereferencing symlinks; see also #15454).

as enforced by TreeArtifactValue#visitTree (see
https://cs.opensource.google/bazel/bazel/+/master:src/main/java/com/google/devtools/build/lib/skyframe/TreeArtifactValue.java;l=585;drc=de9d1f59915a36229978d46b78a22c9e5389db92).

The admonition about symlinks potentially compromising hermeticity is
still valid, but there's little gain in making the behavior divergent
between local and remote execution. I don't believe we can go in the
other direction and extend the restriction to cover local execution, as
it would break existing projects.

The --incompatible_remote_disallow_symlink_in_tree_artifact flag is
deleted. It was added at a time when symlink resolution was extremely
unreliable when building without the bytes; the state of symlink
handling has improved a lot since then. I'm deleting the flag entirely
(as opposed to making it a no-op) because it was only introduced in the
Bazel 7 tree and hasn't made it into a stable release yet.

Makes #18284 obsolete.

PiperOrigin-RevId: 587691220
Change-Id: I470a8fc523bdbb7577ad5a564b6b2c0acab17d7d
@aherrmann
Copy link
Contributor

I ran into this issue while updating rules_zig to support the upcoming Zig 0.12.0 release (as of 0.12.0-dev.3366+8e7d9afda).
The Zig compiler intentionally generates a dangling symlink to store metadata in its global cache directory.
rules_zig currently treats Zig's cache directories as an output, see aherrmann/rules_zig#87.

With Bazel 7.0.2 this behavior raises errors of the form

ERROR: /home/aj/src/rules_zig/main/e2e/workspace/include-dependencies/zig-std-include/BUILD.bazel:5:11: Error while validating output TreeArtifact File:[[<execution_root>]bazel-out/k8-fastbuild/bin]include-dependencies/zig-std-include/.zig-cache/global/binary : Failed to resolve relative path o/8c1a455b9d0322528743f00b10ab8b93/lld.id inside TreeArtifact /home/aj/.cache/bazel/_bazel_aj/0deefcb713c28fbc7261691ef6069b08/execroot/_main/bazel-out/k8-fastbuild/bin/include-dependencies/zig-std-include/.zig-cache/global/binary. The associated file is either missing or is an invalid symlink.

With Bazel 7.1.0 this behavior raises a very confusing error of the form

ERROR: /home/aj/src/rules_zig/main/e2e/workspace/include-dependencies/zig-std-include/BUILD.bazel:5:11: TreeArtifact include-dependencies/zig-std-include/.zig-cache/global/binary was not created
ERROR: /home/aj/src/rules_zig/main/e2e/workspace/include-dependencies/zig-std-include/BUILD.bazel:5:11: Building include-dependencies/zig-std-include/main.zig as Zig binary bazel-out/k8-fastbuild/bin/include-dependencies/zig-std-include/binary failed: not all outputs were created or valid

@tjgq
Copy link
Contributor

tjgq commented Mar 21, 2024

This changed in 5506a0f which was cherry-picked as 1d46604. Let me send a CL to restore the clearer error message.

[EDIT: above is incorrect; the commit that caused the error message to change was actually 4247c20]

copybara-service bot pushed a commit that referenced this issue Mar 21, 2024
…cannot be canonicalized (because one of the components is a non-directory or a dangling symlink).

This improves the error message for a tree artifact containing a dangling symlink, which regressed in 4247c20 (see #15454 (comment)).

PiperOrigin-RevId: 617870632
Change-Id: I6847084a52b1e4bb7d8a9384ad6cd5d015dddf1b
@aherrmann
Copy link
Contributor

@tjgq Thank you! I tested Bazel as of commit b78d73f and can confirm that the error message is improved:

ERROR: /home/aj/src/rules_zig/main/e2e/workspace/include-dependencies/zig-std-include/BUILD.bazel:5:11: error while validating output tree artifact include-dependencies/zig-std-include/.zig-cache/global/binary: child o/8c1a455b9d0322528743f00b10ab8b93/lld.id is a dangling symbolic link
ERROR: /home/aj/src/rules_zig/main/e2e/workspace/include-dependencies/zig-std-include/BUILD.bazel:5:11: Building include-dependencies/zig-std-include/main.zig as Zig binary include-dependencies/zig-std-include/binary failed: not all outputs were created or valid
Target //include-dependencies/zig-std-include:binary failed to build

bazel-io pushed a commit to bazel-io/bazel that referenced this issue Mar 25, 2024
…cannot be canonicalized (because one of the components is a non-directory or a dangling symlink).

This improves the error message for a tree artifact containing a dangling symlink, which regressed in bazelbuild@4247c20 (see bazelbuild#15454 (comment)).

PiperOrigin-RevId: 617870632
Change-Id: I6847084a52b1e4bb7d8a9384ad6cd5d015dddf1b
iancha1992 pushed a commit that referenced this issue Mar 27, 2024
…he path cannot be canonicalized (because one of the components is a non-directory or a dangling symlink). (#21800)

This improves the error message for a tree artifact containing a
dangling symlink, which regressed in
4247c20
(see
#15454 (comment)).

PiperOrigin-RevId: 617870632
Change-Id: I6847084a52b1e4bb7d8a9384ad6cd5d015dddf1b

Commit
b78d73f

Co-authored-by: Googler <[email protected]>
bazel-io pushed a commit to bazel-io/bazel that referenced this issue Apr 3, 2024
…cannot be canonicalized (because one of the components is a non-directory or a dangling symlink).

This improves the error message for a tree artifact containing a dangling symlink, which regressed in bazelbuild@4247c20 (see bazelbuild#15454 (comment)).

PiperOrigin-RevId: 617870632
Change-Id: I6847084a52b1e4bb7d8a9384ad6cd5d015dddf1b
keertk pushed a commit that referenced this issue Apr 3, 2024
…he path cannot be canonicalized (#21889)

This improves the error message for a tree artifact containing a
dangling symlink, which regressed in
4247c20
(see
#15454 (comment)).

PiperOrigin-RevId: 617870632
Change-Id: I6847084a52b1e4bb7d8a9384ad6cd5d015dddf1b

Commit
b78d73f

Co-authored-by: Googler <[email protected]>
@Wyverald
Copy link
Member

Wyverald commented May 2, 2024

@coeuvre @tjgq What's the current status of this issue? Is it fixed/mitigated? Are we still targeting 7.2.0 for a fix?

@tjgq
Copy link
Contributor

tjgq commented May 6, 2024

@Wyverald This issue is a feature request that we haven't yet decided to implement. The sub-discussion started by #15454 (comment) is about a confusing error message, which has been fixed at head (b78d73f) and cherry-picked into 7.2.0 (86ff365).

anguslees added a commit to anguslees/rules_proto_grpc that referenced this issue May 10, 2024
`cp -r` copies the _symlinks_, not the destination of symlinks.  This results in
a TreeArtifact with symlinks to content _outside_ the TreeArtifact, which is not
well defined in bazel[^1].  At least bazel 7.1.1's linux-sandbox (bazel
--spawn_strategy=sandboxed on Linux) does not follow the symlinks and destroys
the temporary directory containing the actual files after the action, leaving a
TreeArtifact of dangling symlinks.

`cp -rL` chases the symlinks and creates a TreeArtifact of regular files.  This
has the desired behaviour in all bazel versions/sandboxes.

[^1]: See discussion in bazelbuild/bazel#15454.  It is
unclear whether we _want_ to resolve these symlinks or leave them dangling, and
behaviour varies across bazel versions and execution environment.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
P3 We're not considering working on this, but happy to review a PR. (No assignee) team-Rules-API API for writing rules/aspects: providers, runfiles, actions, artifacts type: feature request
Projects
None yet
Development

No branches or pull requests