Detect GPU on startup for default multi-pass indexing value #2242

pablodanswer · 2024-08-27T19:19:01Z

Description

How Has This Been Tested?

[Describe the tests you ran to verify your changes]

Accepted Risk

[Any know risks or failure modes to point out to reviewers]

Related Issue(s)

[If applicable, link to the issue(s) this PR addresses]

Checklist:

All of the automated tests pass
All PR comments are addressed and marked resolved
If there are migrations, they have been rebased to latest main
If there are new dependencies, they are added to the requirements
If there are new environment variables, they are added to all of the deployment methods
If there are new APIs that don't require auth, they are added to PUBLIC_ENDPOINT_SPECS
Docker images build and basic functionalities work
Author has done a final read through of the PR right before merge

vercel · 2024-08-27T19:19:03Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
internal-search	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	Aug 31, 2024 0:20am

backend/danswer/utils/gpu_utils.py

yuhongsun96 · 2024-08-30T23:23:17Z

backend/danswer/main.py

@@ -341,6 +370,9 @@ async def lifespan(app: FastAPI) -> AsyncGenerator:

        translate_saved_search_settings(db_session)

+        # update multipass indexing setting based on GPU availability
+        update_default_multipass_indexing(db_session)


I would place this later (after the embedding warm up setup) because that step has backoff and retries. Otherwise this call to check GPU could run before the model server is actually up

note that the call later on checks for the inference server, not indexing

backend/danswer/main.py

yuhongsun96 · 2024-08-31T00:33:08Z

backend/danswer/utils/gpu_utils.py

+    else:
+        model_server_url = f"{MODEL_SERVER_HOST}:{MODEL_SERVER_PORT}"
+
+    if "http" not in model_server_url:


No need to change but if this is now repeated functionality, we can pull it out into a function

vercel bot deployed to Preview August 27, 2024 19:19 View deployment

vercel bot deployed to Preview August 27, 2024 19:26 View deployment

yuhongsun96 approved these changes Aug 30, 2024

View reviewed changes

fix mypy

4c549a5

vercel bot deployed to Preview August 31, 2024 00:14 View deployment

pablodanswer added 6 commits August 30, 2024 17:15

search settings startup

1379675

append the values

f725aa1

on fresh starts, detect GPU to set multipass indexing

083bc4b

squash

1440985

add gpu

16672be

add retries

c1fbbfa

pablodanswer force-pushed the model_startup_gpu branch from c03a1c7 to c1fbbfa Compare August 31, 2024 00:19

vercel bot deployed to Preview August 31, 2024 00:20 View deployment

yuhongsun96 reviewed Aug 31, 2024

View reviewed changes

yuhongsun96 merged commit 76db4b7 into main Aug 31, 2024
6 of 7 checks passed

yuhongsun96 deleted the model_startup_gpu branch August 31, 2024 00:38

onimsha mentioned this pull request Sep 13, 2024

chore/merge upstream 2024091301 mindvalley/danswer#55

Merged

8 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Detect GPU on startup for default multi-pass indexing value #2242

Detect GPU on startup for default multi-pass indexing value #2242

pablodanswer commented Aug 27, 2024

vercel bot commented Aug 27, 2024 •

edited

Loading

yuhongsun96 Aug 30, 2024

yuhongsun96 Aug 30, 2024

yuhongsun96 Aug 31, 2024

Detect GPU on startup for default multi-pass indexing value #2242

Detect GPU on startup for default multi-pass indexing value #2242

Conversation

pablodanswer commented Aug 27, 2024

Description

How Has This Been Tested?

Accepted Risk

Related Issue(s)

Checklist:

vercel bot commented Aug 27, 2024 • edited Loading

yuhongsun96 Aug 30, 2024

Choose a reason for hiding this comment

yuhongsun96 Aug 30, 2024

Choose a reason for hiding this comment

yuhongsun96 Aug 31, 2024

Choose a reason for hiding this comment

vercel bot commented Aug 27, 2024 •

edited

Loading