-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
prematurely hitting edge threshold? #480
Comments
This would appear to be inferred mode-only behavior. Template-0 returns 3 results, and then Template-1 returns 3560 results, which exceeds the "absolute maximum" (Creative-query limit of 1000 + 500 buffer), which triggers behavior to toss out that template's results and cease operations. Two things here:
|
Current Behavior of inferred-mode cutoffs:For the below explanation, I'm using "accepted results" to refer to results which have been added to the final response, whether by concatenation or result-merging.
Given the scenario in this issue, it may be worth discussing if we want to define something more complex to account for more nuanced situations, such as currently-accepted results being a very low number. The numbers 1000 and 1500 were chosen to not overburden the client. We could try splitting results arbitrarily and then trimming the returned KG, or simply changing those numbers, or some other strategy? |
(copied from #449 (comment) but maybe the discussion is in this separate issue?) From the 8/10 lab meeting on Translator stuff (added feature): (Jackson's plate of work) "should be technically feasible to "top off" results and trim resulting KG"
Also edit the TRAPI logs to make the cap clearer: "the cap of 1500 is reached because the template just run has 3000 results" |
Also I plan to look into this some more |
This is actually due to the Automat registration change, which was addressed by 2802269 and I think @tokebe deployed earlier today (internal lab Slack link)? What I did: This is the ARAX link for the Huntington's creative-mode query 7 days ago: https://arax.ncats.io/?r=16560b7c-fb31-4e1e-ab31-32a8f3e67ded vs This is the link for the same query 3 days ago: https://arax.ncats.io/?r=00642b42-6199-4877-ab92-06bea730fdae When reviewing the TRAPI logs, BTE is providing only the results from template-0 both times. But in the more recent run, the Automat KP MetaEdges are not found/used leading to less results (3 vs 502). For both runs, BTE gets > 3500 results from the next template (template-1), which it doesn't incorporate into the results set (which Jackson described above). |
I still think that the feature described above from the lab meeting is still useful, and we'd want it soonish (since code freeze-ish is August 29?) |
"top off" behavior implemented here |
Marking as done (top-off code implemented in biothings/bte_trapi_query_graph_handler#119 and deployed) |
In the latest update to NCATSTranslator/testing#223, BTE reports 3 results, whereas past runs have returned ~500 results. It looks like BTE is prematurely hitting our edge count threshold. This is an inferred / creative mode query, but I'm not sure this is specific to inferred / creative mode...
Some relevant bits of the log:
The text was updated successfully, but these errors were encountered: