Improve Pipeline logs error states #2300

manaswinidas · 2023-12-12T19:34:50Z

JIRA:
RHOAIENG-254
RHOAIENG-255

500 Internal Error state will be handled by RHOAIENG-1067

Description

Adds error messages for failed and cleaned-up pods for Pipeline logs
Adds error message in case there is a network issue
Hides the Logs toolbar toolbar for such states

Cleaned-up pods(Error message according to this Slack message):

No internet(Error message according to this Slack message):

Failed pod(Error message according to this Slack message):

How Has This Been Tested?

Click on any running pipeline with a failed node and check the error state in the Logs Tab
Click on any node of an old pipeline(pods are cleaned-up) and check the error state
Check the error state for no-internet state too.

Test Impact

Request review criteria:

Self checklist (all need to be checked):

The developer has manually tested the changes and verified that the changes work
Commits have been squashed into descriptive, self-contained units of work (e.g. 'WIP' and 'Implements feedback' style messages have been removed)
Testing instructions have been added in the PR body (for PRs involving changes that are not immediately obvious).
The developer has added tests or explained why testing cannot be added (unit tests & storybook for related changes)

If you have UI changes:

Included any necessary screenshots or gifs if it was a UI change.
Included tags to the UX team if it was a UI/UX change (find relevant UX in the SMEs section).

After the PR is posted & before it merges:

The developer has tested their solution on a cluster by using the image produced by the PR to main

manaswinidas · 2023-12-12T19:37:32Z

WIP to add tests and screenshots for cleaned-up pods.

manaswinidas · 2023-12-19T13:44:54Z

@yih-wang can you check the error states? Do we want to show the toolbar in any of the states above?

yih-wang · 2023-12-20T08:28:30Z

@manaswinidas No, I don't think we will include any error state in the toolbar. The only two ways we display the error messages are:

From the inline alert as what you show in the screenshot - that will show the pod error or some other general issue, e.g. network issue
From the step dropdown - that will specify which step is encountering the error, offering users detailed insights into the exact point of failure for more effective troubleshooting

yih-wang · 2023-12-20T08:29:53Z

Oops sorry that I misread the question...
So for the failed to fetch case, is that true that all the steps will fail to fetch the log, or is it possible that only the current step failed to fetch while other steps could still have logs avaible?

manaswinidas · 2024-01-02T14:24:31Z

@yih-wang it's highly unlikely that some step logs may be fetched while some cannot, in case of network issues.

yih-wang · 2024-01-04T09:40:53Z

@manaswinidas Then I think we should still show the toolbar in the 'No internet' error case to provide the ability to switch to other steps.

manaswinidas · 2024-01-08T18:17:18Z

@yih-wang But we can't retrieve logs when there is no internet connection, even if we are able to switch the steps using the dropdown or the download dropdown, it's doing nothing because there is no internet. Here's a screen recording to demonstrate the same. Do we still show the toolbar in this case?

Screen.Recording.2024-01-08.at.11.43.27.PM.mov

yih-wang · 2024-01-09T12:08:56Z

@manaswinidas Oops, read your previous message again and realized you were saying it's unlikely that some steps have logs while others do not... Then yes you are right, we don't show the toolbar in the network issue case too.

Gkrumbach07

on error, you are still polling, i assume this is intended?

frontend/src/concepts/pipelines/content/pipelinesDetails/pipelineRun/runLogs/LogsTabStatus.tsx

manaswinidas · 2024-01-15T14:48:02Z

@yih-wang Can you have a final look at the screenshots?

yih-wang · 2024-01-16T07:42:13Z

@manaswinidas Didn't we combine case 1 (failed/cleaned-up pods) and case 3 (failed pod) you show in the screenshots?

manaswinidas · 2024-01-16T08:04:28Z

@yih-wang I did according to this discussion we had last month

yih-wang · 2024-01-16T08:13:31Z

The error messages look good to me.
Should the 1st case apply to only cleaned-up pods (now it's for failed/cleaned-up pods) since we have an error for failed pods in the 3rd case?

manaswinidas · 2024-01-16T08:17:40Z

@yih-wang Thanks for pointing it out. I updated it just now.

Gkrumbach07 · 2024-01-16T14:26:34Z

code looks good to me.

/lgtm

just need @yih-wang approval and an advisor

manaswinidas · 2024-01-16T14:43:46Z

We have @yih-wang approval here. She was just asking me to change the PR description as it was outdated.

openshift-ci · 2024-01-17T09:44:51Z

New changes are detected. LGTM label has been removed.

manaswinidas · 2024-01-17T09:48:53Z

Rebased, cleaned up a few nits after the last merge.

Gkrumbach07 · 2024-01-17T15:37:34Z

/lgtm all still works

openshift-ci · 2024-01-17T16:18:24Z

[APPROVALNOTIFIER] This PR is APPROVED

Approval requirements bypassed by manually added approval.

This pull-request has been approved by: mturley

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

openshift-ci bot requested review from christianvogt and dpanshug December 12, 2023 19:34

manaswinidas changed the title ~~Handle failed and cleaned-up pods~~ WIP: Handle failed and cleaned-up pods Dec 12, 2023

openshift-ci bot added the do-not-merge/work-in-progress This PR is in WIP state label Dec 12, 2023

manaswinidas force-pushed the improve-log-error-state branch from 19da91e to ae4ba27 Compare December 19, 2023 13:41

manaswinidas changed the title ~~WIP: Handle failed and cleaned-up pods~~ Handle failed and cleaned-up pods Dec 19, 2023

openshift-ci bot removed the do-not-merge/work-in-progress This PR is in WIP state label Dec 19, 2023

manaswinidas force-pushed the improve-log-error-state branch from ae4ba27 to ee092af Compare December 19, 2023 14:31

manaswinidas changed the title ~~Handle failed and cleaned-up pods~~ Improve Pipeline logs error states Dec 20, 2023

openshift-merge-robot added the needs-rebase PR needs to be rebased label Jan 4, 2024

manaswinidas force-pushed the improve-log-error-state branch from ee092af to 6eb59ca Compare January 8, 2024 18:11

openshift-merge-robot removed the needs-rebase PR needs to be rebased label Jan 8, 2024

manaswinidas force-pushed the improve-log-error-state branch 2 times, most recently from df0ae8a to 9e76f19 Compare January 8, 2024 19:29

Gkrumbach07 reviewed Jan 9, 2024

View reviewed changes

frontend/src/concepts/pipelines/content/pipelinesDetails/pipelineRun/runLogs/LogsTabStatus.tsx Outdated Show resolved Hide resolved

frontend/src/concepts/pipelines/content/pipelinesDetails/pipelineRun/runLogs/LogsTabStatus.tsx Outdated Show resolved Hide resolved

manaswinidas force-pushed the improve-log-error-state branch 3 times, most recently from febc1d8 to 44e0dfa Compare January 15, 2024 14:44

openshift-ci bot assigned Gkrumbach07 Jan 16, 2024

openshift-ci bot added the lgtm label Jan 16, 2024

mturley approved these changes Jan 16, 2024

View reviewed changes

openshift-ci bot assigned mturley Jan 16, 2024

Improve log error state

a91976f

manaswinidas force-pushed the improve-log-error-state branch from 44e0dfa to a91976f Compare January 17, 2024 09:44

openshift-ci bot removed the lgtm label Jan 17, 2024

manaswinidas added lgtm approved labels Jan 17, 2024

openshift-merge-bot bot merged commit 991cd37 into opendatahub-io:f/pipelines-enhancement Jan 17, 2024
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve Pipeline logs error states #2300

Improve Pipeline logs error states #2300

manaswinidas commented Dec 12, 2023 •

edited

Loading

manaswinidas commented Dec 12, 2023

manaswinidas commented Dec 19, 2023 •

edited

Loading

yih-wang commented Dec 20, 2023

yih-wang commented Dec 20, 2023 •

edited

Loading

manaswinidas commented Jan 2, 2024

yih-wang commented Jan 4, 2024

manaswinidas commented Jan 8, 2024

yih-wang commented Jan 9, 2024

Gkrumbach07 left a comment

manaswinidas commented Jan 15, 2024

yih-wang commented Jan 16, 2024

manaswinidas commented Jan 16, 2024

yih-wang commented Jan 16, 2024

manaswinidas commented Jan 16, 2024

Gkrumbach07 commented Jan 16, 2024

manaswinidas commented Jan 16, 2024

openshift-ci bot commented Jan 17, 2024

manaswinidas commented Jan 17, 2024

Gkrumbach07 commented Jan 17, 2024

openshift-ci bot commented Jan 17, 2024

Improve Pipeline logs error states #2300

Improve Pipeline logs error states #2300

Conversation

manaswinidas commented Dec 12, 2023 • edited Loading

Description

How Has This Been Tested?

Test Impact

Request review criteria:

manaswinidas commented Dec 12, 2023

manaswinidas commented Dec 19, 2023 • edited Loading

yih-wang commented Dec 20, 2023

yih-wang commented Dec 20, 2023 • edited Loading

manaswinidas commented Jan 2, 2024

yih-wang commented Jan 4, 2024

manaswinidas commented Jan 8, 2024

yih-wang commented Jan 9, 2024

Gkrumbach07 left a comment

Choose a reason for hiding this comment

manaswinidas commented Jan 15, 2024

yih-wang commented Jan 16, 2024

manaswinidas commented Jan 16, 2024

yih-wang commented Jan 16, 2024

manaswinidas commented Jan 16, 2024

Gkrumbach07 commented Jan 16, 2024

manaswinidas commented Jan 16, 2024

openshift-ci bot commented Jan 17, 2024

manaswinidas commented Jan 17, 2024

Gkrumbach07 commented Jan 17, 2024

openshift-ci bot commented Jan 17, 2024

manaswinidas commented Dec 12, 2023 •

edited

Loading

manaswinidas commented Dec 19, 2023 •

edited

Loading

yih-wang commented Dec 20, 2023 •

edited

Loading