-
Notifications
You must be signed in to change notification settings - Fork 603
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
FastQ Screen plots don't work with %No Hits 0.00 #1126
Comments
As it happens, I just found the same in an unrelated RNA-seq experiment:
I think MultiQC is not treating the value of |
Good spot! Could you please pop some example data up to https://github.com/ewels/MultiQC_TestData/tree/master/data/modules/fastq_screen as a PR? That would be fab ✨ |
Are you challenging my Github skills again? Will try to make it work! |
Yeah, looks like the current code uses a mahoosive regular expression: This is pretty ugly and should really be rewritten.. |
Just noticed that you have a new warning from
|
Ok, problem (1) solved - these warnings are now gone:
This was simply that the test was checking The first Bisulfite issue thing is separate though I think. The plot doesn't show any alignment to E.Coli even though in the report it should be. Looking into that now. Phil Phil |
Now handles values in a more sensible way instead of using a huge regex. Fixes error in #1126
I tracked down the error - it was because E.Coli had I didn't really like the huge regex, so I have completely rewritten the parsing code to instead loop through the columns and dynamically take the header fields. This is a bit more generic and hopefully less vulnerable to these kinds of hiccups. Phil |
Great thanks. So maybe @StevenWingett should also look into why FastQ Screen comes up with |
Maybe 😉 I'm feeling nice this morning and was thinking of adding a plot for you if there is Bisulfite data in the FastQ Screen report. Does FastQ Screen itself plot this data somehow? Could you attach the output here so that I can mimic it if so? Phil |
Definitely need to make use of your good mood this morning. FastQ Screen produces something like this: |
It could be nice to scale the plot so that it doesn't stretch over four A4 pages and requires a lot of scrolling :) |
See #1126 for discussion of bisfulfite plot
Come on now, give me some credit 😉 I made it so that the bar plot is tabbed, with tabs sorted by the total read count across all samples (hidden if I have also updated the main plot so that (a) the fancy version doesn't show if you have a mixture of different genomes (fixes previously unreported bug) and (b) zero-count genomes are hidden and (c) genomes are sorted by the total read count across all samples. See an example report: multiqc_report.html |
Looking great! Thanks a lot! |
There was also an annoying bug where the |
Hello everyone. Since everyone is looking at FastQ Screen, I thought I should join in. @FelixKrueger, could you point to the file that is generating -0.00. Thanks :-) |
To be honest I don't know Steven, it is somewhere on the cluster... You were looking over my shoulder when we generated this report, don't you remember its location?... |
@FelixKrueger No, I thought it was rude to look at your screen. I'll dig around..... |
Description of bug:
While experimenting with Nextflow I found that that MultiQC plots the results from FastQ Screen fine when run in standard mode, but the bars for the organism in question (here E. coli) are missing if FastQ Screen had been run in
--bisulfite
mode.MultiQC Error log:
File that triggers the error:
This file works fine:
Ecoli_standardMode_R1_screen.txt
This one doesn't:
Ecoli_bisulfiteMode_R1_screen.txt
Additional context
I think the issue might have to do with the fact that the file
*screen.txt
contains and additional section for the bisulfite strands found in the sample in some way:MultiQC doesn't seem to be able to extract the
%Hit_no_genomes: 0.00
correctly. I did a few experiments by deleting the last section and ending in%Hit_no_genomes: 0.00
like in the standardMode files, but it still fails. If you change the number from0.00
to any other number, the plot for E. coli and NoHits starts showing, suggesting that it doesn't like the value of 0.00? Sometimes it assigned values to mouse though, so something weird is going on...Thanks for looking into this.
The text was updated successfully, but these errors were encountered: