Change made some time ago to delay snmp in favor of faster fast-reboot stopped working #3453

Closed
stepanblyschak opened this issue Sep 13, 2019 · 8 comments

@stepanblyschak
Collaborator

Description

Steps to reproduce the issue:

  1. Run `docker ps` after a fast-reboot.
  2. Observe that the uptime of the snmp container is the same as that of all other containers.

Describe the results you received:
Delaying snmp really does affect startup time in fast and warm reboot. Now that snmp is no longer delayed, startup is slower, which causes various issues (BGP flap, LAG flap).

Describe the results you expected:
SNMP container start is delayed

Additional information you deem important (e.g. issue happens only occasionally):

**Output of `show version`:**

```
(paste your output here)
```

**Attach debug file `sudo generate_dump`:**

```
(paste your output here)
```
@pavel-shirshov
Contributor

@stepanblyschak are you saying that it no longer matters whether we delay the snmp start or not?

@jleveque
Contributor

@stepanblyschak: I am also confused as to what you are requesting here. Could you please add more information?

@stepanblyschak
Collaborator Author

@jleveque @pavel-shirshov
The change 07ea974#diff-d39caff3b7cc5743b317762b959ae11fR2 introduces a 3m30s delay for the counters thread and the SNMP container. AFAIK it was introduced to speed up system startup in the fast-reboot flow (note: this affects warm-reboot as well).

The change 6eca27e#diff-f1225e24f7a66e6e78aedf4c7d7514f7R14 made SNMP start right after SWSS, so the previously introduced timer no longer has any effect.

The question is:

  1. Did we optimize SONiC startup so much that the delay is no longer needed? If so, should we keep the current behavior and just delete snmp.timer, since it does not work?
  2. Or did the second change break the optimization introduced by the first change, so that it should be fixed and SNMP should start with a delay again? Note that SNMP will spam errors while counters are not available.
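
For illustration, the interaction between the two changes can be sketched with the unit fragments below. This is a minimal sketch rather than the actual files from the repository: only the 3m30s delay and `WantedBy=swss.service` come from this thread, while the `Description`, the `After=` ordering, and the overall layout are assumptions.

```ini
# --- snmp.timer (sketch; contents assumed) ---
[Unit]
Description=Delay the snmp container until the system has started

[Timer]
# Fire 3m30s after boot and activate snmp.service.
OnBootSec=3min 30s
Unit=snmp.service

[Install]
WantedBy=timers.target

# --- snmp.service (sketch; only the lines relevant here) ---
[Unit]
# Assumed ordering: snmp is started after swss.
After=swss.service

[Install]
# Once enabled, this makes swss.service want snmp.service,
# so starting swss also starts snmp without waiting for the timer.
WantedBy=swss.service
```

Under these assumptions, whichever trigger comes first starts the service. Since swss starts well before the 3m30s mark, snmp is already running when the timer fires and the delay has no effect.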

@jleveque
Contributor

@qiluo-msft to comment

@qiluo-msft
Collaborator

My understanding is that we should delay starting the snmp docker container, but the current code does not do so.

@lguohan
Collaborator

lguohan commented Sep 18, 2019

@pavel-shirshov, how should we resolve this issue?

@pavel-shirshov
Contributor

@lguohan
we can:

  1. Revert the change that breaks the feature.
  2. Or just remove the timer, if fast-reboot still works and we don't find any regressions.

@stepanblyschak
Collaborator Author

Should be fixed by #3657

There is another interesting side effect of combining snmp.timer with WantedBy=swss.service. If we run config reload right after startup, snmp.timer may fire during the swss restart, after snmp has already been stopped. That starts snmp in the middle of the config reload and causes the swss restart to fail. This should also be fixed by #3657.
