-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
-o option only works for sketching databases, but not samples #7
Comments
This is somewhat of a tricky issue... The way the CLI is designed, -o only works for genomes because all genomes are grouped together, so they can all be renamed at once. There is no ambiguity. But because sylph can sketch reads and genomes with the In sylph v0.5, I am adding an option called --sample-names so that users can rename read sketch files to a list of sample names. This is probably what one wants for the -o option for reads. If you have specific ideas on what -o should output for reads, let me know. For now, I will add a warning for when the user only uses |
IMO, sylph sketch should process reads by considering they come from a single sample and generate a single sylsp file, no matter the number of fastq files provided. My lab and others generate multiple fastq files per sample to reach a target sequencing depth. Currently, sylph interface is not very convenient for that purpose. At the end, the output file as the name of the file descriptor (e.g: 63.sylsp) that has to renamed later. |
Hmm very interesting. Thanks for the input. I think I will keep this format for now because most software I'm aware of only processes one read pair per sample. What you're saying makes sense, perhaps as an optional mode of input. I will add an option for renaming in sylph v0.5 though. |
Hi @bluenote-1577, I am following up regarding the samples that consist of multiple FASTQ files. In addition to providing the file paths directly on the command line, I was considering a feature where sylph could accept a text file as input. This file would list, for each sample, the file paths of the associated FASTQ files. For example, similar to tools like Simka, the file format could look like this:
Let me know your thoughts on this idea. |
@fplazaonate that sounds like a pretty nice idea. I'll look into this for the |
Hi @bluenote-1577,
-o option seems to be ignored while sketching samples.
Could you fix this?
The text was updated successfully, but these errors were encountered: