feat(gen ai): showcase different options for computation-based metric #12756

Valeriy-Burlaka · 2024-11-08T15:05:33Z

Description

Fixes #

Note: Before submitting a pull request, please open an issue for discussion if you are not associated with Google.

Checklist

I have followed Sample Guidelines from AUTHORING_GUIDE.MD
README is updated to include all relevant information
Tests pass: nox -s py-3.9 (see Test Environment Setup)
Lint pass: nox -s lint (see Test Environment Setup)
These samples need a new API enabled in testing projects to pass (let us know which ones)
These samples need a new/updated env vars in testing projects set to pass (let us know which ones)
This sample adds a new sample directory, and I updated the CODEOWNERS file with the codeowners for this sample
This sample adds a new Product API, and I updated the Blunderbuss issue/PR auto-assigner with the codeowners for this sample
Please merge this PR for me once it is approved

msampathkumar · 2024-11-08T15:28:36Z

generative_ai/evaluation/get_rouge_score.py

@@ -37,7 +39,37 @@ def get_rouge_score() -> EvalResult:
    life, including endangered species, it faces serious threats from
    climate change, ocean acidification, and coral bleaching."""

-    # Compare pre-generated model responses against the reference (ground truth).
+    # Option1: Run model inference and evaluate model response against the reference (ground truth)


The code samples looks too big now!

Yep, I understand

Valeriy-Burlaka · 2024-11-08T15:35:57Z

generative_ai/evaluation/get_rouge_score.py

@@ -37,7 +39,37 @@ def get_rouge_score() -> EvalResult:
    life, including endangered species, it faces serious threats from
    climate change, ocean acidification, and coral bleaching."""

-    # Compare pre-generated model responses against the reference (ground truth).
+    # Option1: Run model inference and evaluate model response against the reference (ground truth)


@msampathkumar , I'm thinking about showcasing 2 different options of using the computation-based metrics — Bring-your-own-response (BYOR) and with running model inference.
The reason is that for me, as a developer, the line between these options wasn't immediately obvious (hence this issue with the "prompt" column being silently unused), so I want to make it crystal-clear.

While I understand your point, this code samples is still too big(100 lines). Let me check with the tech writing team.

Also note, I don't see any example response section for this part of the code.

feat(gen ai): showcase different options for computation-based metric

215e7d2

Valeriy-Burlaka self-assigned this Nov 8, 2024

Valeriy-Burlaka requested review from a team as code owners November 8, 2024 15:05

Valeriy-Burlaka marked this pull request as draft November 8, 2024 15:05

product-auto-label bot added the samples Issues that are directly related to samples. label Nov 8, 2024

msampathkumar reviewed Nov 8, 2024

View reviewed changes

Valeriy-Burlaka commented Nov 8, 2024

View reviewed changes

StaticScuzzi approved these changes Nov 13, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(gen ai): showcase different options for computation-based metric #12756

feat(gen ai): showcase different options for computation-based metric #12756

Valeriy-Burlaka commented Nov 8, 2024

msampathkumar Nov 8, 2024

Valeriy-Burlaka Nov 12, 2024

Valeriy-Burlaka Nov 8, 2024 •

edited

Loading

msampathkumar Nov 12, 2024

msampathkumar Nov 12, 2024

feat(gen ai): showcase different options for computation-based metric #12756

Are you sure you want to change the base?

feat(gen ai): showcase different options for computation-based metric #12756

Conversation

Valeriy-Burlaka commented Nov 8, 2024

Description

Checklist

msampathkumar Nov 8, 2024

Choose a reason for hiding this comment

Valeriy-Burlaka Nov 12, 2024

Choose a reason for hiding this comment

Valeriy-Burlaka Nov 8, 2024 • edited Loading

Choose a reason for hiding this comment

msampathkumar Nov 12, 2024

Choose a reason for hiding this comment

msampathkumar Nov 12, 2024

Choose a reason for hiding this comment

Valeriy-Burlaka Nov 8, 2024 •

edited

Loading