Make query endpoint ready for production workloads #3755

julian-risch · 2022-12-23T07:48:50Z

Problem Statement
As a developer running Haystack in a production environment, I want the /query endpoint to be scalable and reliable so that my system is stable.

User Tasks

Pull a Docker image
Customize the default Haystack setup
Configure additional services
Configure a pipeline
Deploy the container in a pod
- 🔴 The official Helm chart is not working with the new Haystack images
Ensure the container is healthy
- 🔴 Pain point: we don't know if there was a problem caching the model until we use it
Receive query from user
Ensure we have enough resources to respond fast
- 🔴 No idea if there's enough GPU RAM left for larger batches
- 🔴 Missing guidance on Autoscaling horizontally
- 🔴 Concurrency is an issue
Combine queries in batches
- 🔴 Feature doesn't exist, look at other ML frameworks for inspiration
Send queries to Haystack
Run pipeline
- 🟡 We suspect it can be more efficient
Get the query result
Handle result schema problems
- 🔴 Schema change shouldn't go unnoticed
Send results to user
Change Haystack version
- 🔴 update is risky as we don't know in advance if upgrade causes problems
ensure the deployment is healthy
- 🔴 very limited service observability

Tasks

Give feedback

Update Helm chart to new Haystack Docker images haystack-helm#3

P3
Deploy explore the world demo using the Helm chart haystack-demos#6

enhancement
Provide guideline on how to deploy Haystack for being used in production #3910

P2 journey:advanced topic:rest_api type:documentation
Provide a Docker image blueprint for air-gapped environments #3618

P3 journey:advanced topic:docker topic:installation type:enhancement
rest_api: worker timeout prevents bootstrapping the app #3870

P1 topic:rest_api
Allow batches of queries in REST API's /query endpoint #3911

topic:rest_api type:feature wontfix
Catch CUDA out-of-memory errors and enhance their error message #3912

P2 topic:pipeline type:feature wontfix
Investigate alternatives to gunicorn settings in REST API #3913

P1 journey:advanced topic:rest_api type:enhancement
Ensure that there are no race conditions in REST API /query endpoint #3914

P2 topic:rest_api
REST API file-upload schema for meta #3790

topic:rest_api type:enhancement wontfix
Add REST API endpoint exposing application metrics #3915

topic:rest_api type:feature wontfix
https://github.com/deepset-ai/haystack-private/issues/46
Options

The text was updated successfully, but these errors were encountered:

julian-risch added topic:speed topic:rest_api epic labels Dec 23, 2022

julian-risch added this to Haystack Public Roadmap Dec 23, 2022

julian-risch moved this to Q1 2023 in Haystack Public Roadmap Dec 23, 2022

masci added the epic:in-progress Epic is in progress label Mar 13, 2023

masci assigned bogdankostic May 10, 2023

julian-risch mentioned this issue Jul 4, 2023

Shape Requirements for REST API #5266

Closed

julian-risch added epic:abandoned Epic was abandoned and not finished and removed epic:in-progress Epic is in progress labels Jul 14, 2023

masci closed this as completed Dec 6, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make query endpoint ready for production workloads #3755

Make query endpoint ready for production workloads #3755

julian-risch commented Dec 23, 2022 •

edited by silvanocerza

Loading

Tasks

Make query endpoint ready for production workloads #3755

Make query endpoint ready for production workloads #3755

Comments

julian-risch commented Dec 23, 2022 • edited by silvanocerza Loading

Tasks

julian-risch commented Dec 23, 2022 •

edited by silvanocerza

Loading