Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Trimming amplicon primers? #21

Open
Thomieh73 opened this issue Jun 24, 2024 · 4 comments
Open

Trimming amplicon primers? #21

Thomieh73 opened this issue Jun 24, 2024 · 4 comments
Assignees
Labels
question Further information is requested

Comments

@Thomieh73
Copy link

Hi,
I am trying to find a tool that can process nanopore data and identify and trim my PCR primers in the dataset?

How would I do that with porechop_abi? is that possible?

@qbonenfant qbonenfant self-assigned this Jun 24, 2024
@qbonenfant qbonenfant added the question Further information is requested label Jun 24, 2024
@qbonenfant
Copy link
Collaborator

Hi,
Porechop_ABI should be able to retrieve and trim PCR adapters.
In fact, several PCR adapters already exists in the static database imported from the standard Porechop.

There is two ways you may approach this task:

  • Manually specify the forward/reverse sequences as start/end adapters in a custom adapter file using -cap option. This will give you the best control over the sequence to trim,
  • Let the Ab Initio algorithm guess what the adapter sequences are using the -abi option.

The other options are up to you, but standard parameters should do the trick in most case.

I would suggest a "guess-only" (-go) run before proceeding for quality control and sanity check. It is a lot faster than a full trimming run and will only display potential adapters found in the dataset.

@Thomieh73
Copy link
Author

Hey, thanks for the quick answer.

So I used 16s rRNA primers and they have degenerate bases, like this AGRGTTTGATYHTGGCT.

Do you think Porechop_abi is able to handle that if I use that as custom adapters, or should i rather write out all versions of the primers.

@qbonenfant
Copy link
Collaborator

qbonenfant commented Jun 24, 2024

As far as I know, IUPAC sequences are not supported by Porechop's trimming algorithm, which is the one we use.
Enumerating all possible variation may slow things down a bit during the adapter selection phase, but should result in a cleaner trimm. You may even discard the static database (-ddb) if you want to speed things up a bit.

I do not have experience with degenerate bases on ONT sequencer, but I think it may be useful to perform a "--guess-only" run anyway. If the sequencer has any kind of bias, or maybe acts strange on such bases, the ABI algorithm will build several consensus sequence that should match the forms that are actually present in your dataset.

@Thomieh73
Copy link
Author

Okay, thanks for the feedback. That is really helpful. I will try it out my dataset.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants