
[BUG] Restarting the installation process can cause certificate problems if K8s was not fully configured #2669

Closed
romsok24 opened this issue Oct 6, 2021 · 5 comments

Comments

@romsok24
Contributor

romsok24 commented Oct 6, 2021

Describe the bug
When rerunning the epicli installation process, one can fail with this error:

failed to connect to {https://127.0.0.1:2379  <nil> 0 <nil>}. Err :connection error: desc = "transport: authentication handshake failed: x509: certificate signed by unknown authority (possibly because of \"crypto/rsa: verification error\" while trying to verify candidate authority certificate \"etcd-ca\")"
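The mismatch behind this handshake failure can be illustrated with plain openssl (a hedged sketch; the file names and CN values are illustrative, not epicli's actual PKI layout): a server certificate signed by the first run's `etcd-ca` fails verification against a regenerated CA, even though both CAs carry the same CN.

```shell
# Sketch: two CAs with the same CN ("etcd-ca"), as left behind by an
# interrupted installation that was then restarted.
workdir=$(mktemp -d)
cd "$workdir"

# CA from the first (interrupted) run
openssl req -x509 -newkey rsa:2048 -nodes -days 1 \
  -keyout ca-old.key -out ca-old.crt -subj "/CN=etcd-ca"

# Server certificate signed by the old CA
openssl req -newkey rsa:2048 -nodes \
  -keyout server.key -out server.csr -subj "/CN=etcd-server"
openssl x509 -req -in server.csr -CA ca-old.crt -CAkey ca-old.key \
  -CAcreateserial -out server.crt -days 1

# CA regenerated by the second run: same CN, but a different key
openssl req -x509 -newkey rsa:2048 -nodes -days 1 \
  -keyout ca-new.key -out ca-new.crt -subj "/CN=etcd-ca"

openssl verify -CAfile ca-old.crt server.crt          # prints "server.crt: OK"
openssl verify -CAfile ca-new.crt server.crt || true  # fails: not signed by this CA
```

This is the same situation etcd ends up in when some certs in `/etc/kubernetes/pki/` were signed by the previous run's CA and the rest by the new one.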

How to reproduce
Steps to reproduce the behavior:

  1. execute epicli init ... (with params)
  2. if the installation from step 1 fails partway through the Kubernetes component creation, execute step 1 again

Expected behavior
During the epicli preflight phase there should be a task that checks for the existence of the /etc/kubernetes/pki/ folder
and cleans it if it exists, to ensure that all the certs from a brand-new installation run are signed with the most current CA cert.
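The preflight cleanup described above could be sketched as an Ansible task (a hypothetical sketch; the task names and register variable are illustrative, not epicli's actual code, and it would need a guard so it only runs for a brand-new installation rather than on every apply):

```yaml
# Hypothetical preflight sketch: wipe stale PKI material so a fresh run
# re-signs every cert with a single CA.
- name: Check for a leftover Kubernetes PKI directory
  ansible.builtin.stat:
    path: /etc/kubernetes/pki
  register: k8s_pki_dir

- name: Remove PKI leftovers from an interrupted installation
  ansible.builtin.file:
    path: /etc/kubernetes/pki
    state: absent
  when: k8s_pki_dir.stat.exists
```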

Config files

Environment

  • OS: Ubuntu 18.04.4 LTS

  • epicli version: 1.0.1


DoD checklist

  • Changelog updated (if affected version was released)
  • COMPONENTS.md updated / doesn't need to be updated
  • Automated tests passed (QA pipelines)
    • apply
    • upgrade
  • Case covered by automated test (if possible)
  • Idempotency tested
  • Documentation updated / doesn't need to be updated
  • All conversations in PR resolved
  • Backport tasks created / doesn't need to be backported
@atsikham
Contributor

atsikham commented Oct 6, 2021

@romsok24 is this a consistent issue? On which step did it fail during the first run? An investigation may be needed when it occurs.

We have the ability to re-generate certificates. Someone could have a custom validity period, so I think the pki folder should not be cleaned up on every apply.

@romsok24
Contributor Author

romsok24 commented Oct 7, 2021

Failing ansible task is: TASK [kubernetes_common : Update in-cluster configuration]

IMO the failure is not related to the code but to the fact that, as I wrote, some of the certs in /etc/kubernetes/pki/ are signed with the CA cert from the previous run and the others with the current one.

Cleaning this folder would probably be a good solution for this.

@przemyslavic
Collaborator

Seems to be related to #1175.

@atsikham
Contributor

atsikham commented Jan 4, 2022

My proposal is to check this task after #2828 as it might be related.

@atsikham atsikham changed the title [BUG] Restarting the installation process can cause certificate problems if the k8s was not fully configured [BUG] Restarting the installation process can cause certificate problems if K8s was not fully configured Jan 24, 2022
@atsikham atsikham self-assigned this Jan 24, 2022
@przemyslavic
Collaborator

Tested the apply command multiple times after cancelling the build at different stages. It went smoothly on re-apply.
The task on which the first build failed may be relevant here. I would close this task and re-open it if the issue occurs again.

@przemyslavic przemyslavic self-assigned this Feb 4, 2022
@seriva seriva closed this as completed Feb 4, 2022