consul connect ca w/ vault provider stuck, can't rotate to new pki mounts or back to consul ca provider #8576
Labels
theme/certificates
Related to creating, distributing, and rotating certificates in Consul
theme/consul-vault
Relating to Consul & Vault interactions
type/bug
Feature does not function as expected
I've had a consul cluster operating off and on for about a year now. Overall the original consul version we used was 1.5.3. A normal procedure we'd do periodically was to update the consul ca config using set-config to rotate the pki mounts that connect ca used with the vault provider. Today this failed and the only recent change I'm aware of is that I upgraded consul to 1.8.2 about a month ago to begin using ingress gateways and other new features we require. The upgrade seemed to go fine at the time, no problems, client agents connect and everything seemed good.
I now seem to be stuck, when I attempted to use set-config with a config file that used a new RootPKIPath and IntermediatePKIPath, it created the new root pki mount in vault but I think it failed during cross signing or something. The reason I attempted to do this rotation is that the root CA cert that consul has been using expired about 4 days ago.
Some background info I came across that it had expired while I was trying to test connectivity between ingress gateway and an application's sidecar proxy, eventually after digging for hours I was able to get envoy trace logs in the sidecar and saw cert expired related errors, then was able to find from vault side that the ca vault pki mount root cert expired. I feel like this is another bug or feature request with consul UI as everything there was green and appeared fine setup wise even though the certs in play were expired that the sidecar was using.
My current issue for this report is that I can not get consul connect ca to rotate to new pki mounts via consul connect ca set-config approach, or get it to move from the vault provider back to the consul provider. Both of these attempts gives me this error at cli and I feel completely stuck. I fear that the consul cluster is hosed and that I may have to setup new server nodes from scratch again.
I've also tried setting ForceWithoutCrossSigning: true , and that hasn't changed anything either. I'd really appreciate some guidance if there is an approach or something I can try to get out of this situation without the cluster being borked and requiring a new setup.
Using Vault version 1.1.1 and consul 1.8.2
The text was updated successfully, but these errors were encountered: