-
Notifications
You must be signed in to change notification settings - Fork 779
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[chip/prim_esc_receiver] check if prim_esc_receiver is out-of-sync #14887
Comments
Triaged for |
This was mainly to test synchronization corner cases that were pointed out in the alert handler revere ping test cases. Assigning to @msfschaffner to review priority. I suspect this could potentially be lowered to P3 if we are ok with the current coverage. |
Regarding the assertion, I think adding a check that makes sure that the alert handler and escalation receiver are both reset together (with some small margin) should do the trick here. |
|
Discussed with @matutem - this does not seem to be very important, given existing escalations test, hence moving this to backlog. |
We have a number of escalation tests in both simulation and fpga. I think various RTL changes have made this area much more safe than at the time this issue was filed:
My preference would be to add SVA to check the alert handler escalation request and pwrmgr detecting it happen no more than a few cycles apart. As for pwrmgr triggering escalation when a ping timeout occurs, we could add the following assertions:
The assertions should run in both unit and top level tests. I think we have enough stimulus to verify this. I think this doesn't need to block D/V2(S). |
More about this: it seems much of this is addressed if we assert each individual esc_sender/receiver pair is connected to the same clk and rst_n signals. That can be run as a connectivity test. |
The proposal is to create a connectivity config that checks all prim_esc_sender for the clk_i and rst_ni. The check is expected to fail because pwrmgr's esc_receiver rst_i is NOT connected to the same signal as alert_handler's esc_sender rst_i, which is captured in #23888. |
… and rst_n The escalation sender and receiver pairs need to connect to the same clock and reset signals, or they can malfunction. This adds a connectivity configuration for these tests, but it is expected to fail due to the mis-connection of pwrmgr rst_ni described in lowRISC#23888. Fixes lowRISC#14887 Signed-off-by: Guillermo Maturana <[email protected]>
Discussed in the triage meeting: postponing this to M7 because the problem described in #23888 is well understood but won't be changed for Earlgrey-PROD (because it doesn't have a significant impact). Thus the test in PR #23894, which currently fails because it tests the eventual behavior, needs to be adapted to test the current behavior and pass. |
… and rst_n The escalation sender and receiver pairs need to connect to the same clock and reset signals, or they can malfunction. This adds a connectivity configuration for these tests, but it is expected to fail due to the mis-connection of pwrmgr rst_ni described in lowRISC#23888. Fixes lowRISC#14887 Signed-off-by: Guillermo Maturana <[email protected]>
This discussion comes from PR: #14663 (comment)
We would like some checking between
prim_esc_receiver
andalert_handler
, andpwrmgr
to make sure they are not out-of-sync.For example - When alert_handler/pwrmgr reset, will
prim_esc_receiver
counters clear to 0?Currently if the
prim_esc_receiver
did not reset, and we configured the alert_handler to ping after reset, it is hard to find this error.So the current suggestions are:
@moidx @tjaychen @msfschaffner
The text was updated successfully, but these errors were encountered: