
AI tracking does not work properly in Python's asynchronous generator scenarios. #3823

Closed
uraurora opened this issue Nov 25, 2024 · 7 comments

@uraurora

Environment

SaaS (https://sentry.io/)

Steps to Reproduce

  1. I have an HTTP service that returns a streaming response (SSE) from an upstream interface; my local endpoint mainly relays the data and reports its token consumption.
  2. Locally, I use Python FastAPI and a Python asynchronous generator to yield each event.
  3. Inside the asynchronous generator I created a span with sentry_sdk.start_span(op="ai.chat_completions.create.xxx", name="xxx") as span, and decorated the function with ai_track. I'm not sure whether the op value is set correctly.

Expected Result

I expect LLM Monitoring to work for the streaming API as well, but it seems that only the non-streaming API shows up.

Actual Result

The streaming API does not show anything in LLM Monitoring. I'm not sure whether there's an issue with my configuration or whether this method of invocation is not currently supported.

Product Area

Insights

Link

https://moflow.sentry.io/insights/ai/llm-monitoring/?project=4508239351447552&statsPeriod=24h

DSN

No response

Version

2.19.0

@szokeasaurusrex
Member

szokeasaurusrex commented Nov 26, 2024

Hi @uraurora, thank you for opening this issue.

I am having trouble understanding what you are trying to do, and what the problem is. Could you please provide specific steps on how to reproduce the problem? If possible, please provide a code snippet that we can run, so that we can see what you are trying to do.

@uraurora
Author

uraurora commented Nov 27, 2024

> Hi @uraurora, thank you for opening this issue.
>
> I am having trouble understanding what you are trying to do, and what the problem is. Could you please provide specific steps on how to reproduce the problem? If possible, please provide a code snippet that we can run, so that we can see what you are trying to do.

Hi. In simple terms, I use FastAPI as the backend and want to record token consumption for an LLM interface with streaming responses. After calling the interface, nothing related shows up in the Sentry dashboard (Insights > AI > LLM Monitoring). The code is as follows:

import sentry_sdk
from sentry_sdk.ai.monitoring import ai_track, record_token_usage
from fastapi import APIRouter, status
from sse_starlette.sse import EventSourceResponse

router = APIRouter()


@ai_track("sentry-ai-track-test-pipeline")
async def stream():
    # assume this is an LLM stream call
    with sentry_sdk.start_span(op="ai.chat_completions.create.xxx", name="sentry-ai-track-test") as span:
        token = 0
        for i in range(10):
            token += 1
            yield f"{i}"

        # report the accumulated token count on the span
        record_token_usage(span, total_tokens=token)


@router.post(
    "/xxx/xxx",
    response_class=EventSourceResponse,
    status_code=status.HTTP_200_OK,
)
async def sse_api() -> EventSourceResponse:
    return EventSourceResponse(stream())

[screenshot]

@antonpirker
Member

Can you link us a transaction that is in the "Performance" tab on Sentry.io that contains the spans ai.chat_completions.create.* that you are creating?

In general the spans you create must contain the data described here: https://develop.sentry.dev/sdk/telemetry/traces/modules/llm-monitoring/

If we have a link to a transaction, we can see if the spans in this transaction have the correct format.
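For illustration, a minimal sketch of a span carrying the data those docs describe (the openai op suffix, the pipeline name, and the model id below are assumptions for the sketch, not values from this issue):

import sentry_sdk
from sentry_sdk.ai.monitoring import record_token_usage

with sentry_sdk.start_span(op="ai.chat_completions.create.openai", name="OpenAI Chat Completion") as span:
    span.set_data("ai.pipeline.name", "my-pipeline")  # must match the name passed to ai_track
    span.set_data("ai.model_id", "gpt-4o-mini")       # model id, used for cost calculation
    span.set_data("ai.streaming", True)
    # ... call the model here ...
    record_token_usage(span, prompt_tokens=10, completion_tokens=20, total_tokens=30)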

@uraurora
Author

uraurora commented Nov 28, 2024

> Can you link us a transaction that is in the "Performance" tab on Sentry.io that contains the spans ai.chat_completions.create.* that you are creating?
>
> In general the spans you create must contain the data described here: https://develop.sentry.dev/sdk/telemetry/traces/modules/llm-monitoring/
>
> If we have a link to a transaction, we can see if the spans in this transaction have the correct format.

Yes, here it is. Indeed, I suspect that I used incorrect span information.

[screenshot: trace]

[screenshot: LLM Monitoring]

@antonpirker
Member

Hey @uraurora !

Yes, some of the span data is not correct. I have created a small sample script that creates correct spans: https://github.com/antonpirker/testing-sentry/blob/main/test-llm-manual-instrumentation/main.py

One bigger thing is that "ai.chat_completions.create.xxx" is not allowed; instead of xxx it needs to be one of openai, cohere, langchain, huggingface_hub.

Also notice that the pipeline_name var is used twice in the example. This is how Sentry matches spans together.

If you also want to see the dollar amount spent, then it is important to set ai.model_id.

Hope this helps!
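For completeness, applying those three points to the snippet above yields something like the following sketch (the openai suffix and the model id are illustrative assumptions; substitute your actual provider and model):

import sentry_sdk
from sentry_sdk.ai.monitoring import ai_track, record_token_usage

PIPELINE_NAME = "sentry-ai-track-test-pipeline"


@ai_track(PIPELINE_NAME)
async def stream():
    # op must end in one of: openai, cohere, langchain, huggingface_hub
    with sentry_sdk.start_span(op="ai.chat_completions.create.openai", name="sentry-ai-track-test") as span:
        # the same pipeline name ties this span to the ai_track pipeline
        span.set_data("ai.pipeline.name", PIPELINE_NAME)
        # needed for the dollar-amount calculation
        span.set_data("ai.model_id", "gpt-4o-mini")
        token = 0
        for i in range(10):
            token += 1
            yield f"{i}"

        record_token_usage(span, total_tokens=token)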

@uraurora
Author

uraurora commented Nov 29, 2024

Great! Thank you for your assistance. The token consumption now displays correctly.
