[MAINTENANCE] Standardize result object from Checkpoint action runs #10647

cdkini · 2024-11-08T15:59:38Z

We should have a consistent API for ValidationAction.run - this new model is a lightweight wrapper around a status and the existing payload but should allow us to be better prepared for extensibility that users want.

Description of PR changes above includes a link to an existing GitHub issue
PR title is prefixed with one of: [BUGFIX], [FEATURE], [DOCS], [MAINTENANCE], [CONTRIB]
Code is linted - run invoke lint (uses ruff format + ruff check)
Appropriate tests and docs have been updated

For more information about contributing, visit our community resources.

After you submit your PR, keep the page open and monitor the statuses of the various checks made by our continuous integration process at the bottom of the page. Please fix any issues that come up and reach out on Slack if you need help. Thanks for contributing!

netlify · 2024-11-08T15:59:53Z

✅ Deploy Preview for niobium-lead-7998 canceled.

Name	Link
🔨 Latest commit	`ab11718`
🔍 Latest deploy log	https://app.netlify.com/sites/niobium-lead-7998/deploys/672e9261bf71a500087ae11d

for more information, see https://pre-commit.ci

…tations/great_expectations into m/_/better_reporting

codecov · 2024-11-08T18:14:00Z

Codecov Report

Attention: Patch coverage is 82.60870% with 8 lines in your changes missing coverage. Please review.

Project coverage is 80.38%. Comparing base (d51f6c0) to head (ab11718).

✅ All tests successful. No failed tests found.

Files with missing lines	Patch %	Lines
great_expectations/checkpoint/actions.py	87.17%	5 Missing ⚠️
great_expectations/checkpoint/util.py	25.00%	3 Missing ⚠️

Additional details and impacted files

@@           Coverage Diff            @@
##           develop   #10647   +/-   ##
========================================
  Coverage    80.37%   80.38%           
========================================
  Files          463      463           
  Lines        40113    40130   +17     
========================================
+ Hits         32241    32258   +17     
  Misses        7872     7872

Flag	Coverage Δ
3.10	`68.10% <82.60%> (+0.01%)`	⬆️
3.10 athena or openpyxl or pyarrow or project or sqlite or aws_creds	`?`
3.10 aws_deps	`?`
3.10 big	`?`
3.10 clickhouse	`?`
3.10 filesystem	`?`
3.10 mssql	`?`
3.10 mysql	`?`
3.10 postgresql	`?`
3.10 spark_connect	`?`
3.10 trino	`?`
3.11	`68.10% <82.60%> (+0.01%)`	⬆️
3.11 athena or openpyxl or pyarrow or project or sqlite or aws_creds	`?`
3.11 aws_deps	`?`
3.11 big	`?`
3.11 clickhouse	`?`
3.11 filesystem	`?`
3.11 mssql	`?`
3.11 mysql	`?`
3.11 postgresql	`?`
3.11 spark_connect	`?`
3.11 trino	`?`
3.12	`68.10% <82.60%> (+0.01%)`	⬆️
3.12 athena or openpyxl or pyarrow or project or sqlite or aws_creds	`55.42% <41.30%> (+<0.01%)`	⬆️
3.12 aws_deps	`46.15% <30.43%> (+<0.01%)`	⬆️
3.12 big	`54.75% <30.43%> (-0.01%)`	⬇️
3.12 databricks	`47.89% <39.13%> (+<0.01%)`	⬆️
3.12 filesystem	`61.72% <56.52%> (+<0.01%)`	⬆️
3.12 mssql	`51.49% <30.43%> (-0.01%)`	⬇️
3.12 mysql	`51.56% <30.43%> (-0.01%)`	⬇️
3.12 postgresql	`54.64% <39.13%> (+<0.01%)`	⬆️
3.12 snowflake	`48.87% <39.13%> (+<0.01%)`	⬆️
3.12 spark	`58.07% <39.13%> (+<0.01%)`	⬆️
3.12 spark_connect	`46.44% <30.43%> (+<0.01%)`	⬆️
3.12 trino	`52.69% <39.13%> (+<0.01%)`	⬆️
3.9	`68.12% <82.60%> (+<0.01%)`	⬆️
3.9 athena or openpyxl or pyarrow or project or sqlite or aws_creds	`55.42% <41.30%> (+<0.01%)`	⬆️
3.9 aws_deps	`46.17% <30.43%> (+<0.01%)`	⬆️
3.9 big	`54.76% <30.43%> (-0.01%)`	⬇️
3.9 clickhouse	`43.04% <30.43%> (+<0.01%)`	⬆️
3.9 databricks	`47.91% <39.13%> (+<0.01%)`	⬆️
3.9 filesystem	`61.74% <56.52%> (+<0.01%)`	⬆️
3.9 mssql	`51.48% <30.43%> (-0.01%)`	⬇️
3.9 mysql	`51.54% <30.43%> (-0.01%)`	⬇️
3.9 postgresql	`54.62% <39.13%> (+<0.01%)`	⬆️
3.9 snowflake	`48.88% <39.13%> (+<0.01%)`	⬆️
3.9 spark	`58.03% <39.13%> (+<0.01%)`	⬆️
3.9 spark_connect	`46.46% <30.43%> (+<0.01%)`	⬆️
3.9 trino	`52.67% <39.13%> (+<0.01%)`	⬆️
cloud	`?`
docs-basic	`53.37% <45.65%> (+<0.01%)`	⬆️
docs-creds-needed	`52.94% <45.65%> (+<0.01%)`	⬆️
docs-spark	`52.42% <45.65%> (+0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

…tations/great_expectations into m/_/better_reporting

…_expectations into m/_/better_reporting

billdirks

Thanks for standardizing this more. I left a few comments inline.

billdirks · 2024-11-08T21:47:21Z

great_expectations/checkpoint/actions.py

+        run_info: A dictionary containing information about the run.
+    """
+
+    status: ValidationActionRunStatus


Should we include the action name (eg the class name) in this result object or will it be obvious since this is embedded inside something? Eg if I see not_run will I know that is associated with a pager duty action?

It seems like the run_info dict has a key which is a string like <action_type>_result. If we put the action type in it's own field, it would make that info easier to parse out.

It'll be obvious since this is only ever used in Checkpoint.run - my thinking is we can take the aggregated results (of these actions) and either log or append to the checkpoint result object.

Don't want to make too many changes here until we can plan better.

billdirks · 2024-11-08T21:48:14Z

great_expectations/checkpoint/actions.py

+    """
+
+    status: ValidationActionRunStatus
+    run_info: dict


Does the run_info vary from action to action or can we be more concrete about this?

Ah yeah I don't love our current dicts but I'm trying to minimize changes. They're all pretty much {"action_type_in_snake_case": "success_or_failure"} so they could be simplified with a simple bool but I think I want to keep it this way for a few reasons:

Minimize changes (for now)

Allow for more complex run_info (UpdateDataDocs leverages this and future custom actions might want to include more details?)

billdirks · 2024-11-08T21:50:16Z

great_expectations/checkpoint/actions.py

-        return {"pagerduty_alert_result": "none sent"}
+        return ValidationActionResult(
+            status=ValidationActionRunStatus.NOT_RUN,
+            run_info={"pagerduty_alert_result": "none sent"},


For ms teams, the equivalent value is None instead of "none sent". Can these be the same thing?

Same note as above - don't love these dicts but doing what I can to minimize changes. If we can determine a consistent standard, I'd be happy to make changes across the board.

billdirks · 2024-11-08T21:54:57Z

tests/actions/test_core_actions.py

@@ -427,6 +432,9 @@ def test_run_integration_success(
        self,
        checkpoint_result: CheckpointResult,
    ):
+        if not os.environ.get("GX_MS_TEAMS_WEBHOOK"):


I don't like programmatically skipping tests. What has happened in the past is someone makes a change and we start skipping this in CI without realizing it. I'd rather just fail and require this env variable or have this moved to a different marker where we require this.

cdkini · 2024-11-08T22:18:27Z

great_expectations/checkpoint/checkpoint.py

+
+        action_results = self._run_actions(checkpoint_result=checkpoint_result)
+        # TODO(cdkini): Figure out how to handle action results
+        print(action_results)  # We could add these to our checkpoint result or logger.warning?


This is where I'm unsure but the goal with this whole effort is to make action results more transparent - we currently silently fail or skip

If we log, there's still a chance can lose track of it.
I'd lean towards updating the checkpoint result but we need to be careful about what we add there - don't want it becoming as cumbersome as its V0 counterpart.

first pass

7a6fef2

cdkini and others added 8 commits November 8, 2024 11:00

comment

17aa22d

[pre-commit.ci] auto fixes from pre-commit.com hooks

54c2d53

for more information, see https://pre-commit.ci

foo

4722d62

Merge branch 'm/_/better_reporting' of https://github.com/great-expec…

98a4a77

…tations/great_expectations into m/_/better_reporting

implement all actions

a6bd5de

update tests

208a530

patch issue with data docs

91ff2d2

misc cleanup

3dc2bf4

cdkini changed the title ~~[MAINTENANCE] Better error reporting around Checkpoint action failures~~ [MAINTENANCE] Standardize reuslt object from Checkpoint action runs Nov 8, 2024

cdkini changed the title ~~[MAINTENANCE] Standardize reuslt object from Checkpoint action runs~~ [MAINTENANCE] Standardize result object from Checkpoint action runs Nov 8, 2024

revert checkpoint

791787e

cdkini marked this pull request as ready for review November 8, 2024 18:14

cdkini added 5 commits November 8, 2024 13:14

Merge branch 'develop' into m/_/better_reporting

b153ca0

add docstring

c9ea050

Merge branch 'm/_/better_reporting' of https://github.com/great-expec…

896b838

…tations/great_expectations into m/_/better_reporting

Merge branch 'develop' of https://github.com/great-expectations/great…

6adff81

…_expectations into m/_/better_reporting

update test

969d021

billdirks reviewed Nov 8, 2024

View reviewed changes

cdkini added 2 commits November 8, 2024 17:06

misc updates from review

0018bc9

plumb into checkpoint

3875cec

cdkini commented Nov 8, 2024

View reviewed changes

cdkini added 2 commits November 8, 2024 17:31

mypy

8028a67

Merge branch 'develop' into m/_/better_reporting

ab11718

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MAINTENANCE] Standardize result object from Checkpoint action runs #10647

[MAINTENANCE] Standardize result object from Checkpoint action runs #10647

cdkini commented Nov 8, 2024 •

edited

Loading

netlify bot commented Nov 8, 2024 •

edited

Loading

codecov bot commented Nov 8, 2024 •

edited

Loading

billdirks left a comment

billdirks Nov 8, 2024

billdirks Nov 8, 2024

cdkini Nov 8, 2024

billdirks Nov 8, 2024

cdkini Nov 8, 2024

billdirks Nov 8, 2024

cdkini Nov 8, 2024

billdirks Nov 8, 2024

cdkini Nov 8, 2024

cdkini Nov 8, 2024

[MAINTENANCE] Standardize result object from Checkpoint action runs #10647

Are you sure you want to change the base?

[MAINTENANCE] Standardize result object from Checkpoint action runs #10647

Conversation

cdkini commented Nov 8, 2024 • edited Loading

netlify bot commented Nov 8, 2024 • edited Loading

✅ Deploy Preview for niobium-lead-7998 canceled.

codecov bot commented Nov 8, 2024 • edited Loading

Codecov Report

billdirks left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

cdkini commented Nov 8, 2024 •

edited

Loading

netlify bot commented Nov 8, 2024 •

edited

Loading

codecov bot commented Nov 8, 2024 •

edited

Loading