Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BPF] detach cali programs from devices no longer in regex #7008

Merged
merged 5 commits into from
Nov 25, 2022

Conversation

tomastigera
Copy link
Contributor

@tomastigera tomastigera commented Nov 18, 2022

Description

[BPF] fv test for prog cleanups

[BPF] detach cali programs from devices no longer in regex

It can happen that the device regexp changes and calico programs then
remain attached to devices that we do not observe anymore. For instance,
user has the default ethX in the regexp, but those devices are bonded
and the user does not have bondX in the regexp. The user fixes the
regexp to include bondX only, but then the programs stay attached to
ethX. User has to remove the programs manually or restart the node.

We figure out that the programs on devices not in regexp are ours when
those programs have a jump map pinned inthe calico bpffs hierarchy.

This is not really a problem for workload devices as they usually
disapear and programs get detached then.

[BPF] add ListCalicoAttached() to list calico programs

We need that to list programs only attached by calico and not attached
to other ifaces by someone else. We may want to clean up someof these if
we are not interested in a device anymore.

[BPF] bpfEndpointManager.ensureStarted should  exec once in tests too.

[BPF] cleaning up maps is for both TC and XDP

Move it from tc package to bpf package with some code reorganization for
simplicity.

Also lock the clean up lock whenupdating XDP. It wasn't synced till now.

Related issues/PRs

Todos

  • Tests
  • Documentation
  • Release note

Release Note

ebpf: cleanup previously attached programs when BPFDataIfacePattern changes.

Reminder for the reviewer

Make sure that this PR has the correct labels and milestone set.

Every PR needs one docs-* label.

  • docs-pr-required: This change requires a change to the documentation that has not been completed yet.
  • docs-completed: This change has all necessary documentation completed.
  • docs-not-required: This change has no user-facing impact and requires no docs.

Every PR needs one release-note-* label.

  • release-note-required: This PR has user-facing changes. Most PRs should have this label.
  • release-note-not-required: This PR has no user-facing changes.

Other optional labels:

  • cherry-pick-candidate: This PR should be cherry-picked to an earlier release. For bug fixes only.
  • needs-operator-pr: This PR is related to install and requires a corresponding change to the operator.

Move it from tc package to bpf package with some code reorganization for
simplicity.

Also lock the clean up lock whenupdating XDP. It wasn't synced till now.
We need that to list programs only attached by calico and not attached
to other ifaces by someone else. We may want to clean up someof these if
we are not interested in a device anymore.
@tomastigera tomastigera requested a review from a team as a code owner November 18, 2022 19:57
@marvin-tigera marvin-tigera added this to the Calico v3.25.0 milestone Nov 18, 2022
@marvin-tigera marvin-tigera added docs-pr-required Change is not yet documented release-note-required Change has user-facing impact (no matter how small) labels Nov 18, 2022
@tomastigera tomastigera added docs-not-required Docs not required for this change and removed docs-pr-required Change is not yet documented labels Nov 18, 2022
Copy link
Member

@mazdakn mazdakn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, but I am not confident enough with dataplane/bpf_ep_manager so maybe someone else should review as well.
I left few comments fixes.

felix/bpf/attach.go Outdated Show resolved Hide resolved
felix/bpf/attach.go Outdated Show resolved Hide resolved
felix/dataplane/linux/bpf_ep_mgr.go Show resolved Hide resolved
It can happen that the device regexp changes and calico programs then
remain attached to devices that we do not observe anymore. For instance,
user has the default ethX in the regexp, but those devices are bonded
and the user does not have bondX in the regexp. The user fixes the
regexp to include bondX only, but then the programs stay attached to
ethX. User has to remove the programs manually or restart the node.

We figure out that the programs on devices not in regexp are ours when
those programs have a jump map pinned inthe calico bpffs hierarchy.

This is not really a problem for workload devices as they usually
disapear and programs get detached then.
Copy link
Member

@mazdakn mazdakn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@tomastigera tomastigera merged commit 8ae1e82 into projectcalico:master Nov 25, 2022
@tomastigera tomastigera deleted the tomas-bpf-hep-unload branch November 25, 2022 01:11
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
docs-not-required Docs not required for this change release-note-required Change has user-facing impact (no matter how small)
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants