Semi-enzymatic digests not completing? #106

Open
hbarsnes opened this issue Dec 4, 2023 · 1 comment

@hbarsnes
Contributor

hbarsnes commented Dec 4, 2023

Seems like semi-enzymatic digests are not completing?

At least that is what happens if I try the following settings:

		"enzyme": {
			"missed_cleavages": 2,
			"min_len": 8,
			"max_len": 30,
			"cleave_at": "RK",
			"restrict": "P",
			"c_terminal": true,
			"semi_enzymatic": true
		},

If I remove the last line, or replace it with "semi_enzymatic": null, the search completes and is as fast as before.

I'm using the default SearchGUI/PeptideShaker example dataset/input, hence there should be no unexpected issues there.

Any idea what is happening?

@lazear
Owner

lazear commented Dec 4, 2023

At the moment, semi-enzymatic is kinda unusable unless you have a ton of RAM. Fragment indexing necessitates pre-digesting every peptide and generating every fragment... which is pretty resource intensive for semi-enzymatic (or no-enzyme).
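
To give a rough feel for why the index grows so much, here is a minimal Python sketch that counts peptide spans for one protein under the settings from the issue (cleave after K/R, not before P, 2 missed cleavages, length 8 to 30), with and without semi-enzymatic cleavage. It is not Sage's implementation: the function names, the toy protein sequence, and the "b and y ions only" fragment estimate are all assumptions made for illustration.

```python
def cleavage_sites(seq, cleave_at="KR", restrict="P"):
    """Boundaries for C-terminal cleavage: after a residue in `cleave_at`,
    unless the following residue is in `restrict`."""
    sites = [0]
    for i, aa in enumerate(seq[:-1]):
        if aa in cleave_at and seq[i + 1] not in restrict:
            sites.append(i + 1)
    sites.append(len(seq))
    return sites


def digest(seq, missed_cleavages=2, min_len=8, max_len=30, semi=False):
    """Return the set of (start, end) peptide spans for one protein."""
    sites = cleavage_sites(seq)
    peptides = set()
    for i, start in enumerate(sites[:-1]):
        # Allow up to `missed_cleavages` internal sites to be skipped.
        for end in sites[i + 1 : i + 2 + missed_cleavages]:
            if min_len <= end - start <= max_len:
                peptides.add((start, end))
            if semi:
                # Keep one enzymatic terminus and let the other fall anywhere.
                for k in range(start + 1, end):
                    if min_len <= end - k <= max_len:
                        peptides.add((k, end))
                    if min_len <= k - start <= max_len:
                        peptides.add((start, k))
    return peptides


def fragment_count(peptides):
    """Assumed estimate: one b and one y ion per peptide bond."""
    return sum(2 * (end - start - 1) for start, end in peptides)


# Made-up sequence, purely for illustration.
protein = "MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQAPILSRVGDGTQDNLSGAEKAVQVK"
full = digest(protein)
semi = digest(protein, semi=True)
print(len(full), fragment_count(full))
print(len(semi), fragment_count(semi))
```

Every fully enzymatic span contributes a number of extra semi-enzymatic spans roughly proportional to its length, and each of those brings its own fragments, so the memory needed for the fragment index scales accordingly across a whole FASTA file.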

I am working on an internal database splitting solution to at least partially alleviate the problem (and hopefully improve it over time). In the meantime, you can confirm that it works by either reducing the number of missed cleavages or using a significantly smaller FASTA file (or by processing the FASTA database in chunks: #97 (comment))
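
As a rough illustration of the manual chunking workaround, a FASTA file could be split into smaller files along these lines. This is only a sketch: it is not the internal database splitting mentioned above, it does not reproduce whatever is described in the #97 comment, and the function names, output file naming, and default chunk size are hypothetical. It also says nothing about how results from separate chunk searches should be combined or how FDR should be handled across chunks.

```python
from itertools import islice
from pathlib import Path


def read_fasta(path):
    """Yield (header, sequence) records from a FASTA file."""
    header, parts = None, []
    with open(path) as handle:
        for line in handle:
            line = line.strip()
            if line.startswith(">"):
                if header is not None:
                    yield header, "".join(parts)
                header, parts = line, []
            elif line:
                parts.append(line)
    if header is not None:
        yield header, "".join(parts)


def split_fasta(path, out_dir, proteins_per_chunk=5000):
    """Write consecutive groups of proteins to separate FASTA files.

    Returns the list of chunk file paths that were written.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    records = read_fasta(path)
    written = []
    index = 0
    while True:
        batch = list(islice(records, proteins_per_chunk))
        if not batch:
            break
        chunk_path = out_dir / f"chunk_{index:04d}.fasta"
        with open(chunk_path, "w") as out:
            for header, seq in batch:
                out.write(f"{header}\n{seq}\n")
        written.append(chunk_path)
        index += 1
    return written
```

Calling `split_fasta("database.fasta", "chunks", proteins_per_chunk=2000)` (both paths are placeholders) would produce `chunks/chunk_0000.fasta`, `chunks/chunk_0001.fasta`, and so on, each of which could then be searched on its own with a smaller fragment index.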
