Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BLOG] Optimizing Hybrid Search in OpenSearch #3454

Open
wrigleyDan opened this issue Nov 19, 2024 · 4 comments
Open

[BLOG] Optimizing Hybrid Search in OpenSearch #3454

wrigleyDan opened this issue Nov 19, 2024 · 4 comments
Assignees
Labels
new blog New blog post

Comments

@wrigleyDan
Copy link
Contributor

Describe the blog post

Hybrid search is hard:

  • How do users figure out which parameter set is best for them?
  • What is the best normalization technique and how much neural/keyword search is ideal?

There exists no “one size fits all” solution. The best configuration depends on a plethora of factors related to any given search application’s data, users, or domain.
With the hybrid search optimizer, a set of notebooks, OpenSearch users are empowered to find the best parameter set for their application and explore ways to dynamically predict how the best hybrid search query looks like individually per query.
The blog post should share the approach and also experimentation results that were achieved using the ESCI dataset.

Preliminary outline:

  1. Intro
  2. Problem Statement: It's hard to know the ideal parameters for hybrid search
  3. Hybrid search as a parameter optimization problem: identifying the best parameter set for hybrid search
  4. Hybrid search as a query understanding problem: dynamically predict how the best hybrid search query looks like individually per query
  5. Future work and how to contribute

Expected Title

Optimizing Hybrid Search in OpenSearch

Authors Name

Daniel Wrigley

Authors Email

[email protected]

Target Draft Date

11/26/2024

Blog Post Category

technical, partners

Target Publication Date

12/10/2024

Additional Info

@wrigleyDan wrigleyDan added enhancement New feature or request new blog New blog post untriaged labels Nov 19, 2024
@wrigleyDan
Copy link
Contributor Author

tagging @krisfreedain who had the initial conversation about this blog post together with Eric

@pajuric
Copy link

pajuric commented Nov 20, 2024

@wrigleyDan - Thanks for filing a blog issue. I am the blog manager, and wanted to reach out and let you know that when you have a draft ready for review, you are welcome to include it here in markdown format or email via G-docs. My email is [email protected]. You can also find me on public Slack - Eric and I work together often.

@wrigleyDan
Copy link
Contributor Author

@pajuric - thanks for following up on this.

I sent you the current draft via email. Looking forward to working on this together with you!

@pajuric
Copy link

pajuric commented Dec 9, 2024

@wrigleyDan - I just reviewed the draft and provided feedback. Next step is to implement the edits and open a blog PR including the content so I can get this through reviews. Thanks again.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
new blog New blog post
Projects
Status: In Progress
Development

No branches or pull requests

2 participants