Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Application Alerting for BIP #3017

Closed
2 tasks done
meganhicks opened this issue May 22, 2024 · 2 comments
Closed
2 tasks done

Application Alerting for BIP #3017

meganhicks opened this issue May 22, 2024 · 2 comments

Comments

@meganhicks
Copy link

meganhicks commented May 22, 2024

As a member of the VRO platform team responsible for supporting partner teams, I need to be alerted about any connectivity issues with BIP. This will allow me to proactively notify partner teams about potential interruptions in their services

Acceptance Criteria:

  • When the monitor for BIP detects a disconnect or significant delay in service there is an alert that is triggered notifying the team.
  • Share why the thresholds were chosen to the team in slack

What you need to know

Implementation / Developer Steps (to be filled out by ticket assignee)

linked ticket-https://app.zenhub.com/workspaces/vro-team-6557e67173391c000e1409f3/issues/gh/department-of-veterans-affairs/abd-vro/2932

@meganhicks meganhicks changed the title Copy of Application Monitoring for BIP Application Alerting for BIP May 22, 2024
@meganhicks meganhicks mentioned this issue May 22, 2024
17 tasks
@meganhicks meganhicks mentioned this issue Jun 26, 2024
26 tasks
@meganhicks meganhicks mentioned this issue Jul 30, 2024
19 tasks
@meganhicks meganhicks mentioned this issue Aug 13, 2024
18 tasks
@chengjie8 chengjie8 self-assigned this Aug 29, 2024
@Ponnia-M Ponnia-M self-assigned this Sep 22, 2024
@dfitchett
Copy link
Contributor

Current Monitors can be viewed on this dashboard. They need to be re-evaluated for accuracy.

@dfitchett
Copy link
Contributor

dfitchett commented Oct 1, 2024

Datadog Dashboard

For this ticket, I did the following:

  • Updated Monitor: VRO : BIP Elevated Average Latency
    • Lowered threshold for alert to 1500ms, removed threshold. High latency indicates issues with downstream BIP claims API.
  • Updated Monitor: VRO : BIP non-2xx responses
    • Lowered threshold to 1 non-2xx response since any error should be looked into
  • Added monitor overlay for VRO : BIP Elevated Average Latency to request duration chart in dashboard

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants