Disable unused simple query string features #3327
Comments
We are currently using the default flags for the simple query string, which is identical to `ALL`.
I used a query like this to investigate how much each feature was used in the last 4 weeks:
I also noticed that many of the requests are the same as the ones on the https://openverse.org/search-help page. For many features, most of the requests that use them are the examples from the search help page. This probably means that users try them out but don't end up using them consistently, so these features can be turned off. So, for each feature I selected the responses that include the feature but don't include the example from the search help page, like
Flags we can remove
Fantastic research, Olga! 👏 👏 👏
Problem
Simple query string makes it easy for us to accept user input to directly pass as a query term to Elasticsearch without worrying about messy or bad inputs causing unexpected behaviour for users. However, simple query string is expensive, so if it's possible to turn off any of the features that (a) we don't see people using (when we check logs) or (b) don't want people using (fuzzy, wildcard, etc?) then we can gain some performance benefits.
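To make the idea concrete, here is a minimal sketch of how a restricted set of operators could be passed through the `flags` parameter of a `simple_query_string` query. The helper name `build_simple_query`, the field name, and the particular flags kept in the usage note are assumptions for illustration only, not the project's actual code or a decision about which flags to keep.

```python
# Operator names accepted by the `flags` parameter of simple_query_string.
VALID_FLAGS = {
    "ALL", "AND", "ESCAPE", "FUZZY", "NEAR", "NONE", "NOT", "OR",
    "PHRASE", "PRECEDENCE", "PREFIX", "SLOP", "WHITESPACE",
}


def build_simple_query(user_input, enabled_flags, fields):
    """Build a simple_query_string clause that enables only `enabled_flags`.

    Hypothetical helper: the name and signature are illustrative.
    """
    unknown = set(enabled_flags) - VALID_FLAGS
    if unknown:
        raise ValueError(f"Unknown simple query string flags: {sorted(unknown)}")
    # Multiple flags are joined with "|"; an empty list disables all operators.
    flags = "|".join(enabled_flags) if enabled_flags else "NONE"
    return {
        "simple_query_string": {
            "query": user_input,
            "fields": list(fields),
            "flags": flags,
        }
    }
```

For example, `build_simple_query("cat + dog", ["AND", "OR", "NOT", "PHRASE"], ["title"])` would leave fuzzy, prefix, and slop operators disabled, so those characters would be treated as plain text rather than operators.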
Description
This is a two-part issue. For each of the simple query string features that can be turned off by limiting operators (the `flags` parameter), investigate whether it has been used in the last month by querying the API logs in CloudWatch. Any feature that has been used fewer than 100 times in the last month is worth turning off: that is roughly 3 requests per day, a statistically meaningless number in the context of several hundred thousand requests per day.
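The decision rule above could be sketched roughly as follows, assuming the raw query strings have already been exported from the logs. The regular expressions are simplified, hypothetical detectors for a few operators, not an exact parser of the simple query string syntax, and the function names are illustrative.

```python
import re
from collections import Counter

# Simplified, hypothetical detectors for a few operators (assumptions).
FEATURE_PATTERNS = {
    "FUZZY_OR_SLOP": re.compile(r"~\d+"),   # e.g. word~2 or "a phrase"~3
    "PREFIX": re.compile(r"\w\*"),          # e.g. net*
    "PRECEDENCE": re.compile(r"\(.*\)"),    # e.g. (cat | dog)
    "PHRASE": re.compile(r'"[^"]+"'),       # e.g. "big dog"
}

MONTHLY_THRESHOLD = 100  # threshold proposed in this issue


def count_feature_usage(queries, known_examples=frozenset()):
    """Count queries using each feature, skipping known documentation examples."""
    counts = Counter({name: 0 for name in FEATURE_PATTERNS})
    for query in queries:
        if query in known_examples:
            continue
        for name, pattern in FEATURE_PATTERNS.items():
            if pattern.search(query):
                counts[name] += 1
    return counts


def flags_to_disable(counts, threshold=MONTHLY_THRESHOLD):
    """Return the features used fewer than `threshold` times, sorted by name."""
    return sorted(name for name, n in counts.items() if n < threshold)
```

Passing the examples from the search help page as `known_examples` keeps requests that merely try out the documented syntax from inflating the counts.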
My guess is we can probably turn off the following:
Note: This also requires modifying the frontend "search syntax" guide to remove mention of features we no longer support after this change, hence the "frontend" label alongside the API label.
Additional context
Suggested to us by Greg who works on Jetpack.