Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Not able to launch the 1.8 RC build on my Windows 10 #4191

Closed
gunamata opened this issue Mar 14, 2023 · 3 comments
Closed

Not able to launch the 1.8 RC build on my Windows 10 #4191

gunamata opened this issue Mar 14, 2023 · 3 comments
Labels
kind/bug Something isn't working platform/windows
Milestone

Comments

@gunamata
Copy link
Contributor

I seem to have broken something with my local Rancher Desktop setup as part of 1.8 RC testing. At this state, none of Factory Reset, wsl --shutdown, Machine restart, uninstall & re-install seem to help! Unfortunately, I don't know what got me into this state. 😞

Here's the error screenshot. From the log messages, it looks like something to do with the rancher-desktop-guestagent but I am not sure.

image

2023-03-14T22:26:16.294Z: Registered distributions: Ubuntu
2023-03-14T22:26:16.580Z: Registered distributions: Ubuntu
2023-03-14T22:26:18.346Z: Registered distributions: rancher-desktop,Ubuntu
2023-03-14T22:26:18.483Z: Registered distributions: rancher-desktop,Ubuntu
2023-03-14T22:26:18.484Z: Creating initial data distribution...
2023-03-14T22:26:21.212Z: Did not find a valid mount, mounting /mnt/wsl/rancher-desktop/run/data
2023-03-14T22:26:24.158Z: Installing C:\Users\GunasekharMatamalam\AppData\Local\Programs\Rancher Desktop\resources\resources\linux\internal\trivy as /mnt/c/Users/GunasekharMatamalam/AppData/Local/Programs/Rancher Desktop/resources/resources/linux/internal/trivy into /usr/local/bin/trivy ...
2023-03-14T22:26:24.403Z: Installing C:\Users\GunasekharMatamalam\AppData\Local\Programs\Rancher Desktop\resources\resources\linux\internal\rancher-desktop-guestagent as /mnt/c/Users/GunasekharMatamalam/AppData/Local/Programs/Rancher Desktop/resources/resources/linux/internal/rancher-desktop-guestagent into /usr/local/bin//rancher-desktop-guestagent ...
2023-03-14T22:26:27.280Z: WSL: executing: cat /root/.docker/config.json: Error: wsl.exe exited with code 1

Logs:

logs-20230314.zip

@gunamata gunamata added kind/bug Something isn't working platform/windows labels Mar 14, 2023
@gunamata gunamata added this to the 1.8 milestone Mar 14, 2023
@gunamata gunamata changed the title Not able to launch the 1.8 RC build Not able to launch the 1.8 RC build on my Windows 10 Mar 14, 2023
@gunamata
Copy link
Contributor Author

@IsaSih and I investigated a bit and could figure out the repro steps. This happens while executing the Testcase#88

Here are the repro steps:

  • Connect to Cisco AnyConnect VPN
  • Enable experimental network tunnel rdctl set --experimental.virtual-machine.networking-tunnel=true
  • At this state, pulling a image from private registry did not work.
    docker pull registry.suse.de/suse/containers/caps/2/containers/registry/harbor-nginx:2.0.2
  • Disable experimental network tunnel rdctl set --experimental.virtual-machine.networking-tunnel=false
  • At this state, the status bar in the GUI keep running and if left like that runs into the error shown in the issue description
  • If Cisco AnyConnect is disconnected then the GUI status bar progress completes and the app seems to get to a working condition. However, the RD icon in the system tray still shows red!

@Nino-K
Copy link
Member

Nino-K commented Mar 15, 2023

So I went ahead and tried this out, it is a known issue that also occurs on 1.7. The issue is related to the routing on the host network while VPN is enabled. When the rancher desktop makes a request to the kubeAPI on WSL_IP:6443 the request is being mishandled by the host due to the higher route priority on the VPN interface and it never gets routed to the WSL VM. I have documented a workaround for our users here.

However, as for the test case #88 you would need to make you one of the following occurs:

  1. run rdctl set --experimental.virtual-machine.networking-tunnel=true during the startup even when rancher desktop UI is stuck at the "starting k8s" stage.
  2. Or, start the RD first and run the rdctl set --experimental.virtual-machine.networking-tunnel=true and then start your cisco annyconnect
  3. Or, run the workaround and then enable the rdctl set --experimental.virtual-machine.networking-tunnel=true.

Please note that this will not be an issue once the new network is a default network. This only occurs during this transition.

@gunamata
Copy link
Contributor Author

I could sort this out with @Nino-K 's help. First, I could switch between enabling and disabling the experimental tunnel without connected to VPN while launching RD. Second, there were DNS issues on my machine due to multiple VPN clients running at the same time, which was causing problem resolving the DNS for the private registry. Closing the issue.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/bug Something isn't working platform/windows
Projects
None yet
Development

No branches or pull requests

2 participants