Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Usage of indexed files #31

Closed
famosab opened this issue Nov 18, 2024 · 2 comments
Closed

Usage of indexed files #31

famosab opened this issue Nov 18, 2024 · 2 comments

Comments

@famosab
Copy link

famosab commented Nov 18, 2024

While working on the nf-core modules I encountered two things which I could not determine from the docs:

  1. Does the tool use the indexes of the sorted bam files if they are present? And does MuSE expect the index to be passed alongside the bam file?
  2. What exactly is meant by "faidx indexed reference genome file"? When I tested the tool it seemed to me that just the genome fasta file was sufficient.
@jiyunmaths
Copy link
Collaborator

Hi @famosab, Yes, both MuSE 1 and MuSE 2 utilize the index files of BAM files to efficiently access reads at specific genomic locations. If these index files are missing, please generate them using tools such as Samtools or GATK before running MuSE.
For the reference genome index file, MuSE 1 required it to quickly retrieve the reference allele at a specified genomic position. In contrast, MuSE 2 leverages an in-memory dictionary data structure that stores the entire reference genome, enabling it to access the reference allele directly without needing an index file. As a result, the reference genome index file is no longer required for MuSE 2.

@famosab
Copy link
Author

famosab commented Nov 29, 2024

Thanks for the detailed answer :)
Maybe you can extend the docs to add the info of the indices!

@famosab famosab closed this as completed Nov 29, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants