-
Notifications
You must be signed in to change notification settings - Fork 59
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
PXE: coreos-installer error "end of file before message length reached" #439
Comments
hmm that is interesting for sure. As an experiment could you use the same setup (with 31.20200210.3.0), but try with the following command line (which will download the latest stable from the Fedora hosted repository):
This should give us at least another data point and help us narrow down the field of possible issues. |
Thanks @dustymabe! removing the |
That's good news. Now I'm wondering why it wasn't working for you with locally copied images. Do you want to try again to copy down a raw.xz and raw.xz.sig (maybe of the latest stable this time) and see if you see the problem again? |
I did try with "fedora-coreos-31.20200310.3.0-metal.x86_64.raw.xz" but got the same error. Note that the gpg key ID I imported and used to succesfuly verify the signature locally was "3C3359C4". By letting cores-installer download the latest image, I can see that it also does use the "50CB390B3C3359C4" gpg key ID and successfully verifies the signature. |
IIUC "3C3359C4" is just a short notation for "50CB390B3C3359C4". "3C3359C4" is the last 8 characters of "50CB390B3C3359C4". |
Ok, so the cause of the issue was probably the integrity of the image downloaded in the PXE environment. I'll have a look at the configuration of my local nginx server. |
This means that the server at I'm not sure what's the underlying root cause, but the symptoms likely point to a server misconfiguration or a network issue. |
@fclerg any updates here. Were you able to investigate your local set up to see what the problem was? |
I looked at few things in my local setup but no luck.
In case the issue come from other headers, I got to a point where the response from my local Nginx server matches the one from the Fedora hosted repo:
The only difference I can see at this point is that I am using HTTP locally instead of HTTPS when using the Fedora hosted repo. |
I could inspect the downloaded archive from the emergency mode. I couldn't find it there though. Do you know where it is supposed to be ? |
|
@fclerg are you still able to reproduce this issue with the latest released artifacts ? |
I tried with the latest release and the install runs well. Has something been done ? However this "Read Disk" stage takes around 20 minutes to complete. What is it exactly doing ? |
That's good to hear, though I don't know of anything specifically that has been done to adress the issue you were seeing. We did do a new release of coreos-installer, but that only made it into the testing stream right now. What exact version of FCOS did you try? Earlier in the disucssion here we were thinking it could be related to your environment. Maybe an issue in your environment resolved itself?
Yeah we have an open issue where we are trying to investigate slow performance of coreos-installer: coreos/coreos-installer#184. Can you test by downloading the latest testing stream ISO and do an install without |
The problem really seems to have been fixed with the initramfs 31.20200420.3.0. Basically the signature error shows up when coreos-installer fetches a local image with initramfs 31.20200310.3.0 (and before). Here is a recap of some use cases I tried :
It must a combination of both my environment and something that has been done in the latest version. Removing the |
There were a lot of packages that changed from
You'd have to use |
Since this issue has been resolved one way or another I'll close it out. Feel free to continue the conversation, though. |
Host Operating System Version: Proxmox 5.2-12
Target Operating System Version: fedora-coreos-31.20200210.3.0
coreos-installer Version: coreos-installer 0.1.2
Expected Behavior
Successful installation of FCOS
Actual Behavior
The relevant part seems to be :
Caused by: end of file before message length reached
Reproduction Steps
PXE boot. Here is my iPXE config (it is provided by Matchbox to match a configuration with a machine):
Other Information
After it fails I can access the emergency mode for maintenance.
From there I could confirm that the guest gets its IP config with DHCP and that it has access to the internet.
I re-tried the PXE install but always get the same error
I didn't find anything obvious about the failure. I am not sure where to look at, based on this "end of file before message length reached" error message.
The image I use is fedora-coreos-31.20200210.3.0-metal.x86_64.raw.xz.
I use a copy of the image in a local web repo and I can confirm its signature and integrity :
The text was updated successfully, but these errors were encountered: