-
Notifications
You must be signed in to change notification settings - Fork 120
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Restore kind-1.19-sriov provider files
In order to safely test the new kind-1.22-sriov provider periodic and presubmits jobs on CI without interrupting the current jobs, it is necessary to restore the old kind-1.19-sriov files. Signed-off-by: Or Mergi <[email protected]>
- Loading branch information
Showing
24 changed files
with
2,148 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,10 @@ | ||
filters: | ||
".*": | ||
reviewers: | ||
- qinqon | ||
- oshoval | ||
- phoracek | ||
- ormergi | ||
approvers: | ||
- qinqon | ||
- phoracek |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,101 @@ | ||
# K8S 1.19.11 with SR-IOV in a Kind cluster | ||
|
||
Provides a pre-deployed containerized k8s cluster with version 1.19.11 that runs | ||
using [KinD](https://github.com/kubernetes-sigs/kind) | ||
The cluster is completely ephemeral and is recreated on every cluster restart. The KubeVirt containers are built on the | ||
local machine and are then pushed to a registry which is exposed at | ||
`localhost:5000`. | ||
|
||
This version also expects to have SR-IOV enabled nics (SR-IOV Physical Function) on the current host, and will move | ||
physical interfaces into the `KinD`'s cluster worker node(s) so that they can be used through multus and SR-IOV | ||
components. | ||
|
||
This providers also deploys [multus](https://github.com/k8snetworkplumbingwg/multus-cni) | ||
, [sriov-cni](https://github.com/k8snetworkplumbingwg/sriov-cni) | ||
and [sriov-device-plugin](https://github.com/k8snetworkplumbingwg/sriov-network-device-plugin). | ||
|
||
## Bringing the cluster up | ||
|
||
```bash | ||
export KUBEVIRT_PROVIDER=kind-1.19-sriov | ||
export KUBEVIRT_NUM_NODES=3 | ||
make cluster-up | ||
|
||
$ cluster-up/kubectl.sh get nodes | ||
NAME STATUS ROLES AGE VERSION | ||
sriov-control-plane Ready control-plane,master 20h v1.19.11 | ||
sriov-worker Ready worker 20h v1.19.11 | ||
sriov-worker2 Ready worker 20h v1.19.11 | ||
|
||
$ cluster-up/kubectl.sh get pods -n kube-system -l app=multus | ||
NAME READY STATUS RESTARTS AGE | ||
kube-multus-ds-amd64-d45n4 1/1 Running 0 20h | ||
kube-multus-ds-amd64-g26xh 1/1 Running 0 20h | ||
kube-multus-ds-amd64-mfh7c 1/1 Running 0 20h | ||
|
||
$ cluster-up/kubectl.sh get pods -n sriov -l app=sriov-cni | ||
NAME READY STATUS RESTARTS AGE | ||
kube-sriov-cni-ds-amd64-fv5cr 1/1 Running 0 20h | ||
kube-sriov-cni-ds-amd64-q95q9 1/1 Running 0 20h | ||
|
||
$ cluster-up/kubectl.sh get pods -n sriov -l app=sriovdp | ||
NAME READY STATUS RESTARTS AGE | ||
kube-sriov-device-plugin-amd64-h7h84 1/1 Running 0 20h | ||
kube-sriov-device-plugin-amd64-xrr5z 1/1 Running 0 20h | ||
``` | ||
|
||
## Bringing the cluster down | ||
|
||
```bash | ||
export KUBEVIRT_PROVIDER=kind-1.19-sriov | ||
make cluster-down | ||
``` | ||
|
||
This destroys the whole cluster, and moves the SR-IOV nics to the root network namespace. | ||
|
||
## Setting a custom kind version | ||
|
||
In order to use a custom kind image / kind version, export `KIND_NODE_IMAGE`, `KIND_VERSION`, `KUBECTL_PATH` before | ||
running cluster-up. For example in order to use kind 0.9.0 (which is based on k8s-1.19.1) use: | ||
|
||
```bash | ||
export KIND_NODE_IMAGE="kindest/node:v1.19.1@sha256:98cf5288864662e37115e362b23e4369c8c4a408f99cbc06e58ac30ddc721600" | ||
export KIND_VERSION="0.9.0" | ||
export KUBECTL_PATH="/usr/bin/kubectl" | ||
``` | ||
|
||
This allows users to test or use custom images / different kind versions before making them official. | ||
See https://github.com/kubernetes-sigs/kind/releases for details about node images according to the kind version. | ||
|
||
## Running multi SR-IOV clusters locally | ||
|
||
Kubevirtci SR-IOV provider supports running two clusters side by side with few known limitations. | ||
|
||
General considerations: | ||
|
||
- A SR-IOV PF must be available for each cluster. In order to achieve that, there are two options: | ||
|
||
1. Assign just one PF for each worker node of each cluster by using `export PF_COUNT_PER_NODE=1` (this is the default | ||
value). | ||
2. Optional method: `export PF_BLACKLIST=<PF names>` the non used PFs, in order to prevent them from being allocated to | ||
the current cluster. The user can list the PFs that should not be allocated to the current cluster, keeping in mind | ||
that at least one (or 2 in case of migration), should not be listed, so they would be allocated for the current | ||
cluster. Note: another reason to blacklist a PF, is in case its has a defect or should be kept for other operations ( | ||
for example sniffing). | ||
|
||
- Clusters should be created one by another and not in parallel (to avoid races over SR-IOV PF's). | ||
- The cluster names must be different. This can be achieved by setting `export CLUSTER_NAME=sriov2` on the 2nd cluster. | ||
The default `CLUSTER_NAME` is `sriov`. The 2nd cluster registry would be exposed at `localhost:5001` automatically, | ||
once the `CLUSTER_NAME` | ||
is set to a non default value. | ||
- Each cluster should be created on its own git clone folder, i.e: | ||
`/root/project/kubevirtci1` | ||
`/root/project/kubevirtci2` | ||
In order to switch between them, change dir to that folder and set the env variables `KUBECONFIG` | ||
and `KUBEVIRT_PROVIDER`. | ||
- In case only one PF exists, for example if running on prow which will assign only one PF per job in its own DinD, | ||
Kubevirtci is agnostic and nothing needs to be done, since all conditions above are met. | ||
- Upper limit of the number of clusters that can be run on the same time equals number of PFs / number of PFs per | ||
cluster, therefore, in case there is only one PF, only one cluster can be created. Locally the actual limit currently | ||
supported is two clusters. | ||
- In order to use `make cluster-down` please make sure the right `CLUSTER_NAME` is exported. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,60 @@ | ||
# How to troubleshoot a failing kind job | ||
|
||
If logging and output artifacts are not enough, there is a way to connect to a running CI pod and troubleshoot directly from there. | ||
|
||
## Pre-requisites | ||
|
||
- A working (enabled) account on the [CI cluster](shift.ovirt.org), specifically enabled to the `kubevirt-prow-jobs` project. | ||
- The [mkpj tool](https://github.com/kubernetes/test-infra/tree/master/prow/cmd/mkpj) installed | ||
|
||
## Launching a custom job | ||
|
||
Through the `mkpj` tool, it's possible to craft a custom Prow Job that can be executed on the CI cluster. | ||
|
||
Just `go get` it by running `go get k8s.io/test-infra/prow/cmd/mkpj` | ||
|
||
Then run the following command from a checkout of the [project-infra repo](https://github.com/kubevirt/project-infra): | ||
|
||
```bash | ||
mkpj --pull-number $KUBEVIRTPRNUMBER -job pull-kubevirt-e2e-kind-k8s-sriov-1.17.0 -job-config-path github/ci/prow/files/jobs/kubevirt/kubevirt-presubmits.yaml --config-path github/ci/prow/files/config.yaml > debugkind.yaml | ||
``` | ||
|
||
You will end up having a ProwJob manifest in the `debugkind.yaml` file. | ||
|
||
It's strongly recommended to replace the job's name, as it will be easier to find and debug the relative pod, by replacing `metadata.name` with something more recognizeable. | ||
|
||
The $KUBEVIRTPRNUMBER can be an actual PR on the [kubevirt repo](https://github.com/kubevirt/kubevirt). | ||
|
||
In case we just want to debug the cluster provided by the CI, it's recommended to override the entry point, either in the test PR we are instrumenting (a good sample can be found [here](https://github.com/kubevirt/kubevirt/pull/3022)), or by overriding the entry point directly in the prow job's manifest. | ||
|
||
Remember that we want the cluster long living, so a long sleep must be provided as part of the entry point. | ||
|
||
Make sure you switch to the `kubevirt-prow-jobs` project, and apply the manifest: | ||
|
||
```bash | ||
kubectl apply -f debugkind.yaml | ||
``` | ||
|
||
You will end up with a ProwJob object, and a pod with the same name you gave to the ProwJob. | ||
|
||
Once the pod is up & running, connect to it via bash: | ||
|
||
```bash | ||
kubectl exec -it debugprowjobpod bash | ||
``` | ||
|
||
### Logistics | ||
|
||
Once you are in the pod, you'll be able to troubleshoot what's happening in the environment CI is running its tests. | ||
|
||
Run the follow to bring up a [kind](https://github.com/kubernetes-sigs/kind) cluster with a single node setup and the SR-IOV operator already setup to go (if it wasn't already done by the job itself). | ||
|
||
```bash | ||
KUBEVIRT_PROVIDER=kind-k8s-sriov-1.17.0 make cluster-up | ||
``` | ||
|
||
The kubeconfig file will be available under `/root/.kube/kind-config-sriov`. | ||
|
||
The `kubectl` binary is already on board and in `$PATH`. | ||
|
||
The container acting as node is the one named `sriov-control-plane`. You can even see what's in there by running `docker exec -it sriov-control-plane bash`. |
115 changes: 115 additions & 0 deletions
115
cluster-up/cluster/kind-1.19-sriov/certcreator/certlib/selfsign.go
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,115 @@ | ||
package certlib | ||
|
||
import ( | ||
"bytes" | ||
cryptorand "crypto/rand" | ||
"crypto/rsa" | ||
"crypto/x509" | ||
"crypto/x509/pkix" | ||
"encoding/pem" | ||
"fmt" | ||
"math/big" | ||
"math/rand" | ||
"time" | ||
) | ||
|
||
type SelfSignedCertificate struct { | ||
DNSNames []string | ||
CommonName string | ||
Certificate *bytes.Buffer | ||
PrivateKey *bytes.Buffer | ||
} | ||
|
||
func (s *SelfSignedCertificate) Generate() error { | ||
var caPEM *bytes.Buffer | ||
|
||
randomSource := rand.New(rand.NewSource(time.Now().Unix())) | ||
caCertificateConfig := &x509.Certificate{ | ||
SerialNumber: big.NewInt(randomSource.Int63()), | ||
Subject: pkix.Name{ | ||
Organization: []string{"kubvirt.io"}, | ||
}, | ||
NotBefore: time.Now(), | ||
NotAfter: time.Now().AddDate(1, 0, 0), | ||
IsCA: true, | ||
ExtKeyUsage: []x509.ExtKeyUsage{x509.ExtKeyUsageClientAuth, x509.ExtKeyUsageServerAuth}, | ||
KeyUsage: x509.KeyUsageDigitalSignature | x509.KeyUsageCertSign, | ||
BasicConstraintsValid: true, | ||
} | ||
|
||
caPrivateKey, err := rsa.GenerateKey(cryptorand.Reader, 4096) | ||
if err != nil { | ||
return fmt.Errorf("failed to generate CA private key: %v", err) | ||
} | ||
|
||
caSelfSignedCertificateBytes, err := x509.CreateCertificate( | ||
cryptorand.Reader, | ||
caCertificateConfig, | ||
caCertificateConfig, | ||
&caPrivateKey.PublicKey, | ||
caPrivateKey) | ||
if err != nil { | ||
return fmt.Errorf("failed to generate CA certificate: %v", err) | ||
} | ||
|
||
// PEM encode CA cert | ||
caPEM = new(bytes.Buffer) | ||
err = pem.Encode(caPEM, &pem.Block{ | ||
Type: "CERTIFICATE", | ||
Bytes: caSelfSignedCertificateBytes, | ||
}) | ||
if err != nil { | ||
return fmt.Errorf("failed to encode CA certificate bytes to PEM: %v", err) | ||
} | ||
|
||
serverCertificateConfig := &x509.Certificate{ | ||
DNSNames: s.DNSNames, | ||
SerialNumber: big.NewInt(randomSource.Int63()), | ||
Subject: pkix.Name{ | ||
CommonName: s.CommonName, | ||
Organization: []string{"kubevirt.io"}, | ||
}, | ||
NotBefore: time.Now(), | ||
NotAfter: time.Now().AddDate(1, 0, 0), | ||
SubjectKeyId: []byte{1, 2, 3, 4, 6}, | ||
ExtKeyUsage: []x509.ExtKeyUsage{x509.ExtKeyUsageClientAuth, x509.ExtKeyUsageServerAuth}, | ||
KeyUsage: x509.KeyUsageDigitalSignature, | ||
} | ||
|
||
serverPrivateKey, err := rsa.GenerateKey(cryptorand.Reader, 4096) | ||
if err != nil { | ||
return fmt.Errorf("failed to generate server private key: %v", err) | ||
} | ||
|
||
// Signing server certificate | ||
serverCertificateBytes, err := x509.CreateCertificate( | ||
cryptorand.Reader, | ||
serverCertificateConfig, | ||
caCertificateConfig, | ||
&serverPrivateKey.PublicKey, | ||
caPrivateKey) | ||
if err != nil { | ||
return fmt.Errorf("failed to sign server certificate: %v", err) | ||
} | ||
|
||
// PEM encode the server cert and key | ||
s.Certificate = new(bytes.Buffer) | ||
err = pem.Encode(s.Certificate, &pem.Block{ | ||
Type: "CERTIFICATE", | ||
Bytes: serverCertificateBytes, | ||
}) | ||
if err != nil { | ||
return fmt.Errorf("failed to encode server certificate bytes to PEM: %v", err) | ||
} | ||
|
||
s.PrivateKey = new(bytes.Buffer) | ||
err = pem.Encode(s.PrivateKey, &pem.Block{ | ||
Type: "RSA PRIVATE KEY", | ||
Bytes: x509.MarshalPKCS1PrivateKey(serverPrivateKey), | ||
}) | ||
if err != nil { | ||
return fmt.Errorf("failed to encode server private key bytes to PEM: %v", err) | ||
} | ||
|
||
return nil | ||
} |
Oops, something went wrong.