Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Tailing pod logs with higher log rate fails. #1230

Open
evilbulgarian opened this issue Nov 6, 2023 · 2 comments
Open

Tailing pod logs with higher log rate fails. #1230

evilbulgarian opened this issue Nov 6, 2023 · 2 comments
Labels
kind/bug Something isn't working

Comments

@evilbulgarian
Copy link

Description

We notices that on the flatcaros nodes when tailing pods with higher log volume tailing the logs fails. After increasing fs.inotify.max_user_watches seems to alleviate the issue.

Impact

debugging cluster issues is impacted

Environment and steps to reproduce

  1. flatcaros stable nodes running busy containers
  2. kubectl logs | stern logs should work on busy containers

Expected behavior

stern and logs should work even on busy containers

Additional information

@jepio
Copy link
Member

jepio commented Nov 7, 2023

hi @evilbulgarian,
According to kernel docs the default value of fs.inotify.max_user_watches automatically scales with available memory at boot.

What value does your node have by default, and what value is sufficient for your usecase?

@evilbulgarian
Copy link
Author

FlatcarOS Setup:

fs.inotify.max_queued_events = 16384
fs.inotify.max_user_instances = 128
fs.inotify.max_user_watches = 1007804
user.max_inotify_instances = 128
user.max_inotify_watches = 1007804
core@ip-10-37-4-218 ~ $ free -m
               total        used        free      shared  buff/cache   available
Mem:          127684       30225       79784         260       17674       96004
Swap:              0           0           0

Using: fs.inotify.max_user_watches: 524288

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/bug Something isn't working
Projects
Status: 📝 Needs Triage
Development

No branches or pull requests

2 participants