Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Genome GRCh38 not detected #18

Open
micknudsen opened this issue Jan 9, 2024 · 2 comments
Open

Genome GRCh38 not detected #18

micknudsen opened this issue Jan 9, 2024 · 2 comments

Comments

@micknudsen
Copy link

Hi,

I'm trying to generate a library from scratch based on GRCh38 with some regions masked (GRC exclusions). Very early in the main script, the reference genome FASTA is copied to a file with the more generic name ref_genome.fa, and then the genome build is not detected in the masking step.

https://github.com/NCIP/ctat-genome-lib-builder/blob/1cf500c07a67c0240228b8f9c245c0db3e507e4f/util/mask_confounding_features_from_genome.pl#L59

As a consequence, the script dies with the error message Error, didn't locate PAR features in $gencode_gtf. My original input FASTA file is called GCA_000001405.15_GRCh38_no_alt_analysis_set.GRC_exclusions_masked.fasta.

I have made a hack and changed the line to my $IS_GRCH38 = 1, and now everything seems to be running smoothly.

@brianjohnhaas
Copy link
Collaborator

brianjohnhaas commented Jan 9, 2024 via email

@matej-zatko
Copy link

I was struggling with the same issue, thank you @micknudsen for providing the workaround. Would love to see this bug fixed as it prevents the library from being built.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants