Make query endpoint ready for production workloads #3755

Closed
9 of 12 tasks
julian-risch opened this issue Dec 23, 2022 · 0 comments

julian-risch commented Dec 23, 2022

Problem Statement
As a developer running Haystack in a production environment, I want the /query endpoint to be scalable and reliable so that my system is stable.

User Tasks

  • Pull a Docker image
  • Customize the default Haystack setup
  • Configure additional services
  • Configure a pipeline
  • Deploy the container in a pod
    • 🔴 The official Helm chart is not working with the new Haystack images
  • Ensure the container is healthy
    • 🔴 Pain point: we don't know if there was a problem caching the model until we use it
  • Receive query from user
  • Ensure we have enough resources to respond fast
    • 🔴 No idea if there's enough GPU RAM left for larger batches
    • 🔴 Missing guidance on horizontal autoscaling
    • 🔴 Concurrency is an issue
  • Combine queries in batches
    • 🔴 Feature doesn't exist, look at other ML frameworks for inspiration
  • Send queries to Haystack
  • Run pipeline
    • 🟡 We suspect it can be more efficient
  • Get the query result
  • Handle result schema problems
    • 🔴 Schema change shouldn't go unnoticed
  • Send results to user
  • Change Haystack version
    • 🔴 Updating is risky because we don't know in advance whether an upgrade will cause problems
  • Ensure the deployment is healthy
    • 🔴 Very limited service observability
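
To make the "Combine queries in batches" pain point more concrete, here is a minimal sketch of request micro-batching in front of a batch-capable model call. It is only an illustration: `MicroBatcher`, `run_batch`, `max_batch_size`, and `max_wait_s` are hypothetical names, not part of Haystack or its REST API, and `run_batch` stands in for whatever batch inference call the pipeline would expose.

```python
import asyncio
from typing import Callable, List


class MicroBatcher:
    """Collects single queries and answers them with one batch call.

    Hypothetical helper, not a Haystack class. Create it inside a running
    event loop so the queue is bound to that loop on older Python versions.
    """

    def __init__(self, run_batch: Callable[[List[str]], List[str]],
                 max_batch_size: int = 8, max_wait_s: float = 0.01):
        self.run_batch = run_batch            # placeholder for the real batch call
        self.max_batch_size = max_batch_size  # upper bound on queries per batch
        self.max_wait_s = max_wait_s          # how long to wait for more queries
        self.queue: asyncio.Queue = asyncio.Queue()

    async def submit(self, query: str) -> str:
        """Called once per incoming request; resolves when the batch is done."""
        future = asyncio.get_running_loop().create_future()
        await self.queue.put((query, future))
        return await future

    async def worker(self) -> None:
        """Background task: drain the queue into batches and run them."""
        loop = asyncio.get_running_loop()
        while True:
            # Block until at least one query arrives, then open a time window.
            batch = [await self.queue.get()]
            deadline = loop.time() + self.max_wait_s
            while len(batch) < self.max_batch_size:
                remaining = deadline - loop.time()
                if remaining <= 0:
                    break
                try:
                    batch.append(await asyncio.wait_for(self.queue.get(), remaining))
                except asyncio.TimeoutError:
                    break
            queries = [query for query, _ in batch]
            try:
                # Runs synchronously here; a real model call would likely be
                # moved to an executor so it does not block the event loop.
                results = self.run_batch(queries)
            except Exception as exc:
                for _, future in batch:
                    if not future.done():
                        future.set_exception(exc)
                continue
            for (_, future), result in zip(batch, results):
                if not future.done():
                    future.set_result(result)


async def demo(queries: List[str]) -> List[str]:
    """Usage example with a fake batch function that upper-cases queries."""
    batcher = MicroBatcher(lambda qs: [q.upper() for q in qs],
                           max_batch_size=4, max_wait_s=0.05)
    task = asyncio.create_task(batcher.worker())
    try:
        return list(await asyncio.gather(*(batcher.submit(q) for q in queries)))
    finally:
        task.cancel()
```

The trade-off is the usual one for this pattern: a larger `max_wait_s` yields fuller batches at the cost of added latency per request, so both limits would need tuning against the available GPU memory.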

Tasks

  1. P3
  2. enhancement
  3. P2 journey:advanced topic:rest_api type:documentation
  4. P3 journey:advanced topic:docker topic:installation type:enhancement
    mayankjobanputra
  5. P1 topic:rest_api
    mayankjobanputra
  6. topic:rest_api type:feature wontfix
  7. P2 topic:pipeline type:feature wontfix
  8. P1 journey:advanced topic:rest_api type:enhancement
    bogdankostic
  9. P2 topic:rest_api
  10. topic:rest_api type:enhancement wontfix
  11. topic:rest_api type:feature wontfix