You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Problem Statement
As a developer running Haystack in a production environment, I want the /query endpoint to be scalable and reliable so that my system is stable.
User Tasks
Pull a Docker image
Customize the default Haystack setup
Configure additional services
Configure a pipeline
Deploy the container in a pod
🔴 The official Helm chart is not working with the new Haystack images
Ensure the container is healthy
🔴 Pain point: we don't know if there was a problem caching the model until we use it
Receive query from user
Ensure we have enough resources to respond fast
🔴 No idea if there's enough GPU RAM left for larger batches
🔴 Missing guidance on Autoscaling horizontally
🔴 Concurrency is an issue
Combine queries in batches
🔴 Feature doesn't exist, look at other ML frameworks for inspiration
Send queries to Haystack
Run pipeline
🟡 We suspect it can be more efficient
Get the query result
Handle result schema problems
🔴 Schema change shouldn't go unnoticed
Send results to user
Change Haystack version
🔴 update is risky as we don't know in advance if upgrade causes problems
ensure the deployment is healthy
🔴 very limited service observability
The content you are editing has changed. Please copy your edits and refresh the page.
Problem Statement
As a developer running Haystack in a production environment, I want the
/query
endpoint to be scalable and reliable so that my system is stable.User Tasks
Tasks
explore the world
demo using the Helm chart haystack-demos#6/query
endpoint #3911gunicorn
settings in REST API #3913meta
#3790The text was updated successfully, but these errors were encountered: