-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
FASTA host file does not work in IGV #74
Comments
So the problem is not the (unzipped) FASTA file, but the compressed index ( Do you think, it's okay to work with an uncompressed index, @hoelzer , @matthuska ? |
Yes, I unzipped the host.fa file but then was not able to load it into igv. And got the weird output I screenshoted above. It is working for you, @MarieLataretu ? Then maybe this was some rki/HPC/weird problem... ? Maybe it's fine to work with an uncompressed index that an user can also directly load into igv or other browsers? +1 from me if that solves that problem. But maybe we should first see if we can reproduce it (e.g. via running on another machine) |
@hoelzer try renaming the index from .fai to .gzi and leave the other files as they were (do not decompress the host fasta file). Does that work? If not, we can use a non-compressed index, I don't think it's very big but we can check. |
For me, it worked with an uncompressed index (and the same uncompressed FASTA)! I overwrote the index file with Users would still have to uncompress the FASTA (save space) - the index file would be uncompressed and shouldn't be so big 🤔 |
IGV can not handle compressed reference genomes! |
+1 that sounds like a good solution. Let me see if I can reproduce the problem and then fix it like that.... but I only have my mac m2 here and I would not trust any errors that come from running on that machine ;) |
Hm... I am running the [75/dbea91] process > prepare_contamination:concat_contamination [100%] 1 of 1, failed: 1 ✘
[- ] process > clean:minimap2 -
[- ] process > clean:sort_bam -
[- ] process > clean:index_bam -
[- ] process > clean:idxstats_from_bam -
[- ] process > clean:flagstats_from_bam -
[- ] process > clean:split_bam -
[- ] process > clean:index_bam2 -
[- ] process > clean:fastq_from_bam -
[- ] process > qc:nanoplot -
[- ] process > qc:format_nanoplot_report -
[- ] process > qc:multiqc -
WARN: Task runtime metrics are not reported when using macOS without a container engine
ERROR ~ Error executing process > 'prepare_contamination:concat_contamination'
Caused by:
Process `prepare_contamination:concat_contamination` terminated with an error exit status (1)
Command executed:
# Combine input files, rename duplicate sequences (by id) if found, and compress
seqkit seq sc2.fa.gz | seqkit rename | bgzip -@ 1 -c > db.fa.gz
samtools faidx db.fa.gz --gzi-idx db.fa.fai
Command exit status:
1
Command output:
(empty)
Command error:
[faidx] Could not build fai index db.fa.gz.fai or compressed index db.fa.fai maaaybe an mac m2 problem... I tried with the docker and mamba profile - same error. |
ok... that's probably bc
does not work on mac : ) So the downloaded file is then empty... I will look into that and then the index thing |
forget it |
Then I ran nextflow run git/clean/clean.nf -profile local,mamba --host sc2 --input_type nano --input git/clean/test/nanopore.fastq.gz --cleanup_work_dir and tried to open the resulting host FASTA in IGV. To do that, I This time, it worked! But note that I ran on the
@MarieLataretu please see my final comment below. I was not sure where I should suggest that change bc I am confused with the different branches : ) |
Ok, so I think the only thing that I need to get it run on mac is a change of this line:
to
This needs to be done in |
Yes. I can do that. |
Hey, I reopened the issue because now I ran But when I overwrite the index file via samtools faidx host.fa --fai-idx host.fa.fai it works (as suggested by @MarieLataretu ).
I think that's a good solution so +1 from me to change --gzi-idx to --fai-idx. And then the user has to |
- fixes #74 - only the FASTA file has to be decompressed for IGV usage
Resolved by @MarieLataretu 's #79 . |
has these files. But when I
gunzip
thehost.fa.gz
I can not load it into IGV.The text was updated successfully, but these errors were encountered: