Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ref_genome.fa.star.idx/genomeParameters.txt is missing #5

Open
Benoitdw opened this issue Jan 14, 2022 · 1 comment
Open

ref_genome.fa.star.idx/genomeParameters.txt is missing #5

Benoitdw opened this issue Jan 14, 2022 · 1 comment

Comments

@Benoitdw
Copy link

Benoitdw commented Jan 14, 2022

I'm trying to build the CTAT genome lib because the plug and play resources for Fv1.9 do not contain the genome lib build with gencode-v38 which is use during my pipeline.
I tried to generate the library with :
prep_genome_lib.pl --genome_fa /data/hg38.fasta --gtf /data/gtf/gencode.v38.annotation.gtf --dfam_db human --fusion_annot_lib /data/CTAT/StarFv1.9/CTAT_HumanFusionLib.mini.dat.gz --pfam_db current --human_gencode_filter

However the directory generated contains only 39 files (ls -R gencode_38 | wc -l) vs 68 for the plug and play one (gencode_v33).

When I try to launch fusion inspector, it leads to a failed during the STAR process because ref_genome.fa.star.idx/genomeParameters.txt is missing:

* Running CMD: /usr/local/bin/STAR  --runThreadN 5  --genomeDir /ref/CTAT/StarFv1.9/gencode_38/ref_genome.fa.star.idx  --outSAMtype BAM SortedByCoordinate  --twopassMode Basic  --alignSJDBoverhangMin 10  --genomeSuffixLengthMax 10000 --limitBAMsortRAM 49837599450  --alignInsertionFlush Right   --alignMatesGapMax 100000  --alignIntronMax 100000  --readFilesIn /data/BRAINSTORM1_KDM26344_OKDM_1298_20220112104053_R1.concat.fastq.gz /data/BRAINSTORM1_KDM26344_OKDM_1298_20220112104053_R2.concat.fastq.gz  --genomeFastaFiles /config/temp/fi_workdir/finspector.fa  --outSAMfilter KeepAllAddedReferences  --sjdbGTFfile /config/temp/fi_workdir/finspector.gtf  --alignSJstitchMismatchNmax 5 -1 5 5  --scoreGapNoncan -6  --readFilesCommand 'gunzip -c' 
Jan 14 11:05:59 ..... started STAR run
Jan 14 11:05:59 ..... loading genome

EXITING because of FATAL ERROR: could not open genome file /ref/CTAT/StarFv1.9/gencode_38/ref_genome.fa.star.idx/genomeParameters.txt
SOLUTION: check that the path to genome files, specified in --genomeDir is correct and the files are present, and have user read permsissions

There is my gencode_v38 directory generated :

 .
├──  __chkpts
│  ├──  makeblastdb.ok
│  ├──  ref_annot.gtf.mini.sortu.ok
│  ├──  ref_annot.gtf.ok
│  ├──  ref_genome.fa.ok
│  └──  ref_genome_fai.ok
├──  ctat_genome_lib_build_dir
│  ├──  __chkpts
│  ├──  ref_genome.fa
│  ├──  ref_genome.fa.fai
│  ├──  ref_genome.fa.nhr
│  ├──  ref_genome.fa.nin
│  ├──  ref_genome.fa.nsq
│  └──  ref_genome.fa.star.idx
├──  ref_annot.gtf
├──  ref_annot.gtf.mini.sortu
├──  ref_genome.fa
├──  ref_genome.fa.fai
├──  ref_genome.fa.nhr
├──  ref_genome.fa.nin
├──  ref_genome.fa.nsq
└──  ref_genome.fa.star.idx
   ├──  chrLength.txt
   ├──  chrName.txt
   ├──  chrNameLength.txt
   └──  chrStart.txt

Does anyone knows which command line was used to generate the plug and play genome lib?

@brianjohnhaas
Copy link
Collaborator

brianjohnhaas commented Jan 14, 2022 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants