
memory issue? #39

Closed
lucacozzuto opened this issue Sep 8, 2022 · 4 comments

Comments

@lucacozzuto

Dear developers,
Thanks for your valuable tool! I'm trying to use it on some Nanopore data and I get the following error:

[Thu Sep  8 14:45:34 2022] creating directory for output: KO_fastqc
[limitst]	using file /falco-1.1.0/Configuration/limits.txt
[adapters]	using file /falco-1.1.0/Configuration/adapter_list.txt
[contaminants]	using file /falco-1.1.0/Configuration/contaminant_list.txt
[Thu Sep  8 14:45:34 2022] Started reading file KO.fq.gz
[Thu Sep  8 14:45:34 2022] reading file as gzipped FASTQ format
[running falco|=                                                  |  2%]/ 2: 19870 Killed                  falco -o KO_fastqc -t 1 KO.fq.gz

I allocated 80 GB of RAM, so I don't think memory is the limiting factor.

Luca

@andrewdavidsmith
Collaborator

@lucacozzuto can you provide some additional information, for example a small piece of the input file? Also the command line -- the verbose output and progress have been snipped, so we can't see the arguments or the filenames. If the filenames contain confidential information, it would help if you could copy the files to generic names and post the exact command line you used. Thanks!

@lucacozzuto
Author

Many thanks for your quick answer!
This is the command line:

falco -o KO_fastqc -t 1  KO.fq.gz

The file is huge (59 GB), and some reads are up to 1 Mb long.
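For context, the longest read in a file like this can be checked with a one-liner along these lines (a sketch, not from the thread; it assumes a plain gzipped FASTQ with one record per four lines, and reuses the `KO.fq.gz` filename from above):

```shell
# Sketch: report the longest read length in a gzipped FASTQ
# (assumes the standard 4-line-per-record layout; KO.fq.gz is the file from this thread)
gzip -dc KO.fq.gz | awk 'NR % 4 == 2 { if (length($0) > max) max = length($0) } END { print max + 0 }'
```

The `NR % 4 == 2` condition selects only the sequence lines, so header and quality lines never inflate the result.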

@guilhermesena1
Collaborator

Hello,

Thank you for reaching out about the issue. I was able to replicate the problem with synthetic, very long reads.

This is not so much a memory issue as a bug in falco: we weren't accounting for read lengths as large as the ones currently produced by Oxford Nanopore.

If you are working with a clone of the repo, I pushed a fix at 2f82110 that may resolve the issue. On my machine with 16 GB of RAM, I was able to run falco to completion on a simulated read of 30 million bases.
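For anyone who wants to reproduce that test, a 30-million-base synthetic read can be generated with something like the following sketch (the filename, base, and quality choices are assumptions, not taken from the commit):

```shell
# Sketch: build a gzipped FASTQ containing a single 30-million-base read
# (filename and base/quality characters are assumptions for illustration)
N=30000000
{ printf '@synthetic_read_1\n'
  head -c "$N" /dev/zero | tr '\0' 'A'   # sequence line: N copies of 'A'
  printf '\n+\n'
  head -c "$N" /dev/zero | tr '\0' 'I'   # quality line: N copies of 'I'
  printf '\n'
} | gzip > long_read.fq.gz
# Then, with the fixed build:  falco -o long_fastqc long_read.fq.gz
```

A run that is killed on this file before the fix but completes after it would confirm the long-read bug rather than a genuine out-of-memory condition.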

If at all possible, could you let us know if you can run falco to completion on your data with this commit?

Thank you very much in advance!

@lucacozzuto
Author

Dear @guilhermesena1, it worked!
Thanks for this fix! I managed to add it to my Nextflow pipeline as a replacement for FastQC. I also made a Dockerfile with your tool, so if you want, I can add it to your repo.

Best,

Luca
