-
Notifications
You must be signed in to change notification settings - Fork 82
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add complexity filter? #13
Comments
Good idea - do you know any tool to be able to do/achieve that? |
PRINSEQ I believe |
Possibly a better (more recent) tool designed specifically for the case described above: https://github.com/OpenGene/fastp#polyg-tail-trimming |
Fastp is really a nice tool. I'll add this before AdapterRemoval, keeping adapters and qualities untouched and only performing the poly_g_trimming on demand (default off, but people can turn it on if they want to!) |
Some notes for myself:
|
As of commit 24c3329 , this is implemented and also covered by test cases for both single end and paired end data. |
In our group we've noticed that we regularly get lots of poly G reads from NextSeq data which don't get discarded by the sequencer or demultiplexer. This can mess up some downstream statistics if not thrown out.
Maybe we could consider having as a module some form of complexity filter to remove low complexity reads?
The text was updated successfully, but these errors were encountered: