Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add PCR error filter #251

Open
mmelendrez opened this issue Jun 7, 2016 · 1 comment
Open

add PCR error filter #251

mmelendrez opened this issue Jun 7, 2016 · 1 comment

Comments

@mmelendrez
Copy link
Member

We should look at developing a tool or option into ngsmapper, unless one is already in dev that automates finding the PCR bias in reads. Currently the options are (i) hard crop the reads which could shorten reads to the point of possible not assembling or not assembling correctly (ii) visually picking them out and manually correcting during manual curation. To do (ii) you need to be able to view the BAMs which especially in variant level analysis can get enormous and take forever to load onsite with a good connection - let along over VPN > cluster! My options from here are (i) attempting to use VPN > putty + Xming > cluster > load IGV > load BAMs. Ive I do this it will take hours at best IF VPN doesn't kick me out. (ii) Use geneious, I attempted this - hours to load a BAMs let alone scan through one for the site with potential PCR issues.

A method of identifying such areas where PCR primer issues occur is to look for mutations within 20 bp of the end of a read. Typically - when it's PCR primer bias, mutations are found at the same spot within 20 bp of the end and they are 'stacked' meaning there are a bunch of reads that end at the same coordinate all with the mutation.

Another way of 'seeing' this is there will typically (not always) be a bias of +/- reads. So if there is PCR Primer bias the mutation will be found in 5000+ reads but perhaps only 200- reads (forward/reverse).

For 'real' mutations ideally they need to be found more or less equally on forward and reverse reads, inside the reads as opposed to on the ends.

Thoughts? Ina needs to be brought in for comments on this as well.

@mmelendrez
Copy link
Member Author

considerations per Michael and Mel convo:

  • threshold for forward reverse 'bias'
  • how many constitutes a 'stack' or a 'bunch' of them , stack threshold

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant