-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Combinatorial explosion in the number of answers returned to a query #33
Comments
From TAQA:
from Sharat: four issues to be broken down into
big picture -> deep dive is important from Chris B: perhaps another issue here is: this is a known query with many results - sorry. Or, can we filter/sort our way out of this one? - is this doable? Suggest to the user that they tighten this up. Here are the common predicates associated with the answers you're getting back, can we try to help the user write a better query? from Sharat: agree; this is the best we can do, here are ways to tighten it up. Andrew: Big "hub" nodes are taken into account in the Normalized Google Distance (which is used in scoring by BTE and ARAX) - this is a tunable parameter. |
For the case of grouping records, O&O has a tracking issue at NCATSTranslator/Ordering-Organizing#15, with a few additional comments. |
This is a major issue, but how likely is it to bite us for the September release? |
This issue is to formally report a known Translator issue, namely, a tendency for answer sets to explode combinatorially with certain types of queries.
For instance, during the October 2022 QotM, Translator team members found that moving from connections between ATP1A3 and chemical entities or diseases yields a reasonable number of results; however, when adding in intermediary genes and pathways, the answer sets explode and become unmanageable.
Example from comment posted by @colleenXu here:
"Not sure how to get from ATP1A3 -> related genes -> ChemicalEntity, Procedure, Treatment in a way that doesn't explode / become unmanageable
Pathways / BiologicalProcessOrActivity...caused explosions since they were linked to pathways that had lots of genes"
The text was updated successfully, but these errors were encountered: