feat: Implement DeepEvalEvaluator
#346
Conversation
Force-pushed from e03b23f to 053b71f.
Force-pushed from 35bd9ee to 60e5922.
`DeepEvalEvaluator` and `DeepEvalMetrics`
I'm not familiar with deepeval, but the code looks good to me. I left a question about the API docs, now that we merged the CI job that attempts to run `hatch run docs` on integrations' changes merged on `main`. Another option might be making that CI job more resilient.
Thanks for adding the docs, LGTM
Related to #250.
We introduce `DeepEvalEvaluator`, a component that uses the DeepEval LLM evaluation framework to calculate evaluation metrics for RAG pipelines (among others). Refer to deepset-ai/haystack#6784 for an overview of the API design.

This PR introduces the following user-facing classes:

- `DeepEvalMetric` - An enumeration that lists the supported DeepEval metrics. Currently, only metrics related to RAG pipelines are supported.
- `DeepEvalEvaluator` - The pipeline component that interfaces with the evaluation framework. It accepts a single metric and its optional parameters. The inputs to the pipeline are dynamically configured depending on the metric. This is done with the help of a metric descriptor table that contains metadata about input/output conversion formats, expected inputs/outputs, etc.

The output of the component is a nested list of metric results. Each input can have one or more results, depending on the metric. Each result is a dictionary containing the following keys and values:

- `name` - The name of the metric.
- `score` - The score of the metric.
- `explanation` - An optional explanation of the score.
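To make the descriptor-table design and the output shape concrete, here is a minimal, self-contained sketch of the idea. The metric names, the input names (`questions`, `contexts`, `responses`), and the `METRIC_DESCRIPTORS` table are illustrative assumptions, not the actual integration code; the real component delegates scoring to DeepEval rather than returning placeholder scores.

```python
from enum import Enum


class DeepEvalMetric(Enum):
    """Subset of RAG-related metrics (names are illustrative)."""
    ANSWER_RELEVANCY = "answer_relevancy"
    FAITHFULNESS = "faithfulness"


# Hypothetical descriptor table: metric -> expected pipeline inputs.
# The real table also carries input/output conversion metadata.
METRIC_DESCRIPTORS = {
    DeepEvalMetric.ANSWER_RELEVANCY: ("questions", "contexts", "responses"),
    DeepEvalMetric.FAITHFULNESS: ("questions", "contexts", "responses"),
}


def run_evaluator(metric, **inputs):
    """Validate inputs against the descriptor table and return a nested
    list of result dicts: one inner list per input sample."""
    expected = METRIC_DESCRIPTORS[metric]
    missing = [name for name in expected if name not in inputs]
    if missing:
        raise ValueError(f"Missing inputs for {metric.value}: {missing}")
    n_samples = len(inputs[expected[0]])
    # Each input can yield one or more results; a single placeholder
    # result per input stands in for the framework's actual scores.
    return [
        [{"name": metric.value, "score": 0.0, "explanation": None}]
        for _ in range(n_samples)
    ]
```

A usage example: `run_evaluator(DeepEvalMetric.FAITHFULNESS, questions=["q"], contexts=[["c"]], responses=["r"])` returns a one-element outer list whose inner list holds a single `{"name": ..., "score": ..., "explanation": ...}` dict, matching the output shape described above.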