Error with function load_rhb_at_positions_no_NAs #93
Comments
Hi again,

The problem is that the founders can belong to different 16way populations, so they were duplicated in the ID column of the .SAMPLES file, which contained 416 rows for the founders instead of 100. Is it possible to indicate that a founder ID belongs to two or more populations? Otherwise I will just split the .SAMPLES file by family.

Best,
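For readers hitting the same problem, duplicated founder IDs can be spotted before running STITCH. The following is only a minimal sketch in Python, not the check that was added to STITCH itself. It assumes the sample file is whitespace-delimited with a header row and the ID in the first column; the function name `duplicated_ids` and the example column names and IDs are hypothetical.

```python
from collections import Counter


def duplicated_ids(sample_lines, id_column=0):
    """Return IDs that occur more than once, in order of first appearance.

    Assumes (hypothetically) a whitespace-delimited file whose first
    line is a header and whose ID sits in column `id_column`.
    """
    rows = [line.split() for line in sample_lines if line.strip()]
    ids = [row[id_column] for row in rows[1:]]  # skip the header row
    counts = Counter(ids)
    # dict.fromkeys keeps first-seen order while dropping repeats
    return [sid for sid in dict.fromkeys(ids) if counts[sid] > 1]


# Made-up example: founder F001 is listed under two populations
example = [
    "ID POP",
    "F001 popA",
    "F002 popA",
    "F001 popB",
]
dups = duplicated_ids(example)
```

To check a real file, the lines could be read with `open(path).read().splitlines()` and passed to the function; any non-empty result means at least one founder appears in more than one row.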
Thanks for the message, and for identifying the problem. I've added a check to the code that should throw an error for that in the future.

The only use of the reference sample file (alongside the reference haplotypes and legend file) is to exclude certain populations. Say you had prepared a file using all 1000 Genomes samples (from multiple populations), and in one run you wanted to use only Europeans, and in another only East Asians, or something like that.

Note STITCH can use reference haplotypes in one of three ways
PS: there are no actual "*" characters in the haps file, right? It should contain only 0 and 1 (plus spaces and newlines).
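A quick way to answer that question for a given file is to scan each row for tokens other than 0 and 1. This is a minimal sketch under the assumption that alleles are separated by whitespace; `invalid_tokens` is a hypothetical helper, not part of STITCH.

```python
def invalid_tokens(hap_line):
    """Return every whitespace-separated token that is not exactly '0' or '1'."""
    return [tok for tok in hap_line.split() if tok not in ("0", "1")]


clean = invalid_tokens("0 0 1 1 0")        # a well-formed row
starred = invalid_tokens("0* 1* 0 1")      # a row with stray "*" characters
```

An empty list means the row contains only 0 and 1; anything else lists the offending tokens so they can be cleaned up before the file is passed on.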
I got the following error when executing STITCH with reference files:
It comes from the following function of the script reference-binary.R:
The first row of the reference hap file looks like this:
0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 1* 1* 0* 0* 1* 1* 0* 0* 1* 1* 0* 0* 0* 0* 1* 1* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 1* 1* 0* 0* 1* 1* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 1* 1* 0* 0* 1* 1* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 1* 0* 0* 0* 0* 1* 1* 0* 0* 1* 1* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 1* 1* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0* 0*
Notice there are 200 haplotypes for 100 founders. I suppose STITCH is taking only the 16x2 haplotypes from each corresponding specified 16way population.
What could be causing the issue? I find it strange that the colClasses vector has 832 elements.
Jose
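The numbers in this report can be cross-checked with a little arithmetic: 832 is exactly 2 × 416, which is consistent with the column count being derived from a sample file with 416 rows rather than from the 200 haplotypes actually present. The sketch below compares the two counts; it assumes diploid samples and one sample per row, and `haplotype_counts` is a hypothetical helper rather than anything from STITCH.

```python
def haplotype_counts(hap_line, n_sample_rows, ploidy=2):
    """Return (haplotypes found in the row, haplotypes implied by the sample file)."""
    found = len(hap_line.split())
    expected = ploidy * n_sample_rows
    return found, expected


# A row with 200 haplotypes checked against a sample file with 416 rows
row = " ".join(["0"] * 200)
found, expected = haplotype_counts(row, 416)
mismatch = found != expected
```

With 100 unique founders the two numbers would agree at 200, so a mismatch like this one points at the sample file rather than the haplotype file.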