Add instructions for structured logging migration #4793

serathius · 2020-05-20T14:26:51Z

Ref kubernetes/enhancements#1602
/cc @44past4 @brancz @DirectXMan12 @thockin

Wrote instructions for structured logging migration.

contributors/devel/sig-instrumentation/migration-to-structured-logging.md

thockin

I wrote a lot of comments and then you addressed them later, so I deleted them. Good job :)

thockin · 2020-05-27T18:23:12Z

contributors/devel/sig-instrumentation/migration-to-structured-logging.md

+minimal design from [logr] thus there is no one-to-one mapping.
+
+Simplified mapping between functions:
+* `klog.Infof`, `klog.Info`, `klog.Infoln`, `klog.InfoDepth` -> `klog.InfoS`


The "depth" ones are challenging because they presume that popping up the call-stack is possible. I count about 8 such files that make these calls in k/k. Do you think we need an InfoS variant that includes depth? That would be ugly to add to logr because it tries to abstract multiple implementations, but maybe that's something we should tackle in logr and klog, if it is a missing capability...
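To make the "popping up the call-stack" concern concrete, here is a minimal standard-library sketch of what a Depth-style call does. The names `callerName`, `infoDepth`, `logHelper`, `logHelperNoDepth`, and `businessLogic` are hypothetical and exist only for illustration; this is not klog's implementation.

```go
package main

import (
	"fmt"
	"runtime"
	"strings"
)

// callerName returns the short name of the function `depth` frames above
// whoever called callerName (depth 0 is that direct caller).
//
//go:noinline
func callerName(depth int) string {
	pcs := make([]uintptr, 1)
	// Skip runtime.Callers and callerName itself, plus the requested depth.
	if runtime.Callers(depth+2, pcs) == 0 {
		return "unknown"
	}
	frame, _ := runtime.CallersFrames(pcs).Next()
	return frame.Function[strings.LastIndex(frame.Function, ".")+1:]
}

// infoDepth is a hypothetical stand-in for a Depth-style logging call. It
// attributes the message to the frame `depth` levels above its own caller.
//
//go:noinline
func infoDepth(depth int, msg string) string {
	return callerName(depth+1) + ": " + msg
}

// logHelper is a logging utility. Passing depth 1 attributes the message to
// the function that called the helper instead of the helper itself.
//
//go:noinline
func logHelper(msg string) string {
	return infoDepth(1, msg)
}

// logHelperNoDepth shows what happens without the depth adjustment: the
// helper itself is reported as the source of the message.
//
//go:noinline
func logHelperNoDepth(msg string) string {
	return infoDepth(0, msg)
}

//go:noinline
func businessLogic() (withDepth, withoutDepth string) {
	return logHelper("starting"), logHelperNoDepth("starting")
}

func main() {
	withDepth, withoutDepth := businessLogic()
	fmt.Println(withDepth)
	fmt.Println(withoutDepth)
}
```

The depth value is only correct as long as the number of frames between the helper and the real call site stays fixed.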

So, the list is:

directxman12@💻 ~[go:kubernetes] $ git rev-parse HEAD
02637bb25016c5362dc945406df0b2114868ecf2
directxman12@💻 ~[go:kubernetes] $ ag 'klog\.\w+Depth'
staging/src/k8s.io/component-base/logs/logs.go
49: klog.InfoDepth(1, string(data))

staging/src/k8s.io/kubectl/pkg/util/logs/logs.go
42: klog.InfoDepth(1, string(data))

staging/src/k8s.io/kubectl/pkg/cmd/util/helpers.go
93: klog.FatalDepth(2, msg)

staging/src/k8s.io/client-go/tools/clientcmd/config.go
486: klog.FatalDepth(1, err)

staging/src/k8s.io/apiserver/pkg/storage/etcd3/logger.go
35: klog.InfoDepth(klogWrapperDepth, args...)
39: klog.InfoDepth(klogWrapperDepth, fmt.Sprintln(args...))
43: klog.InfoDepth(klogWrapperDepth, fmt.Sprintf(format, args...))
47: klog.WarningDepth(klogWrapperDepth, args...)
51: klog.WarningDepth(klogWrapperDepth, fmt.Sprintln(args...))
55: klog.WarningDepth(klogWrapperDepth, fmt.Sprintf(format, args...))
59: klog.ErrorDepth(klogWrapperDepth, args...)
63: klog.ErrorDepth(klogWrapperDepth, fmt.Sprintln(args...))
67: klog.ErrorDepth(klogWrapperDepth, fmt.Sprintf(format, args...))
71: klog.FatalDepth(klogWrapperDepth, args...)
75: klog.FatalDepth(klogWrapperDepth, fmt.Sprintln(args...))
79: klog.FatalDepth(klogWrapperDepth, fmt.Sprintf(format, args...))

staging/src/k8s.io/apiserver/pkg/server/httplog/httplog.go
161: klog.InfoDepth(1, fmt.Sprintf("verb=%q URI=%q latency=%v resp=%v UserAgent=%q srcIP=%q: %v%v",
168: klog.InfoDepth(1, fmt.Sprintf("verb=%q URI=%q latency=%v UserAgent=%q srcIP=%q: hijacked",
198: klog.InfoDepth(1, fmt.Sprintf("Unable to convert %+v into http.Flusher", rl.w))

staging/src/k8s.io/apimachinery/pkg/util/runtime/runtime.go
114: klog.ErrorDepth(2, err)

pkg/kubelet/kubeletconfig/util/log/log.go
36: klog.ErrorDepth(1, fmt.Sprintf(logFmt, s))
48: klog.InfoDepth(1, fmt.Sprintf(logFmt, s))

Going through those,

1. component-base is an implementation of io.Writer --> klog that preserves line information. Mainly used to capture stdlib logs to klog.
2. The first kubectl is a copy of component-base (actually, I think it was vice-versa, but whatever).
3. Conditionally calls normal logger if the log verbosity is high enough.
4. GetConfigOrDie, attempting to provide useful info as to where the error came from. Better off calling ErrorS with a sink that produces stack traces.
5. Extra wrapper for the etcd client to log to klog.
6. Helper "middleware" for logging http requests. Trying to report caller of the logging helper.
7. Helper for runtime.HandleError. Just use an implementation that reports stack traces with errors. Take care not to contemplate runtime.HandleError too deeply, or it will make you very, very sad.
8. Inserts a prefix at the beginning of every line. Really should just go away with proper named loggers.

Several of those are semi-global logging wrappers (1, 2, 5) that are better off just having an implementation that can do caller-skip (e.g. zap). I'm not sure we want the caller-skip at the interface level, though.

3, 4, and 7 probably just need to go away -- they're better dealt with by implementations that capture backtraces on error.

8 should be replaced by proper prefix/logger name support, so that a wrapper is no longer needed.

That just leaves 6, the middleware logger. @thockin's suggestion @ https://github.com/kubernetes/community/pull/4793/files#r431353200 is a good one, but removes the middleware part (a comment in the file suggests that the middleware-link functionality is probably superfluous). Along the same lines, something like klog.InfoS("http request", httplog.Request(req)) would work too, where httplog.Request is analogous to klog.KObj.
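A rough sketch of what such a request helper could look like, using only the standard library. `requestRef`, `Request`, `exampleRequest`, and the `infoS` stand-in are hypothetical names for illustration; the real `httplog` package has no such helper in this discussion, and the key `"request"` is an assumption.

```go
package main

import (
	"fmt"
	"net/http"
)

// requestRef is a hypothetical value type that carries the request fields
// worth logging, in the same spirit as an object reference for API objects.
type requestRef struct {
	Verb      string
	URI       string
	UserAgent string
}

// String gives a compact form for text-based log output.
func (r requestRef) String() string {
	return r.Verb + " " + r.URI
}

// Request extracts the loggable fields from an *http.Request.
func Request(req *http.Request) requestRef {
	return requestRef{
		Verb:      req.Method,
		URI:       req.URL.RequestURI(),
		UserAgent: req.UserAgent(),
	}
}

// infoS is a stand-in for a structured logging call; it only renders the
// message and key/value pairs so the sketch stays free of dependencies.
func infoS(msg string, keysAndValues ...interface{}) string {
	out := fmt.Sprintf("%q", msg)
	for i := 0; i+1 < len(keysAndValues); i += 2 {
		out += fmt.Sprintf(" %v=%q", keysAndValues[i], fmt.Sprint(keysAndValues[i+1]))
	}
	return out
}

// exampleRequest builds a request purely for demonstration.
func exampleRequest() *http.Request {
	req, err := http.NewRequest("GET", "http://example.com/api/v1/pods?limit=1", nil)
	if err != nil {
		panic(err)
	}
	req.Header.Set("User-Agent", "kubectl/v1.18")
	return req
}

func main() {
	// The log call stays at the call site, so no caller depth is needed.
	fmt.Println(infoS("http request", "request", Request(exampleRequest())))
}
```

Because the helper only builds a value and the logging call remains in the handler, the reported source location is the handler's without any depth adjustment.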

> Several of those are semi-global logging wrappers (1, 2, 5) that are better off just having an implementation that can do caller-skip (e.g. zap). I'm not sure we want the caller-skip at the interface level, though.

These ones seem like valid use-cases to me. In fact, I wrote the first version of (5), I think. The ability to strip frames from the callstack seems pretty necessary for any sort of logging util helper or adapter.

I don't quite grok your comment. How would an implementation that can skip callframes be utilized under something like logr without a function for it in the interface?

In other words, should logr and klog grow new methods for InfoDepth(depth, msg, string...) and ErrorDepth(depth, err, msg, string...) ?

Hmm... let me rephrase a bit. I think I misinterpreted some things, and miscommunicated others. I've rewritten this comment a few times, so it's possible there's something I'm not quite articulating correctly.

I agree that they're valid use cases, but:

All the mentioned numbers follow a pattern of "here's a set of helper functions where all the functions have a consistent caller skip". The caller skip isn't varying per-function in the file. I was originally (above) suggesting something like

    var logWrapper = klog.WithSkip(4)

    func Info(msg string) {
        logWrapper.InfoS(prefix + msg)
    }

However, that only works when we're not redirecting to logr. That's not too big a deal -- it preserves the status quo, punts on putting things into our interface, and should be doable with the way the internals of klog work.

We could consider adding the WithSkip to logr, which makes the caller skip compose and feels a lot more like the rest of logr's interface. However, this is a bit annoying to implement in some logging backends (see below the fold), and still feels a bit brittle (again, below the fold).
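As a sketch of why a logger-level skip composes where per-call depth does not, the following standard-library example lets each wrapper layer add its own frame once, at construction time. `skipLogger`, `WithSkip`, `adapter`, and `retrying` are hypothetical types for illustration only, not part of klog or logr.

```go
package main

import (
	"fmt"
	"runtime"
	"strings"
)

// skipLogger is a hypothetical logger that remembers how many extra frames
// to skip when it works out who the caller is.
type skipLogger struct{ skip int }

// WithSkip returns a copy that skips n additional frames, so nested
// wrappers add up instead of overwriting each other.
func (l skipLogger) WithSkip(n int) skipLogger {
	return skipLogger{skip: l.skip + n}
}

// Info returns "<caller>: <msg>" instead of printing, to keep the sketch
// easy to check.
//
//go:noinline
func (l skipLogger) Info(msg string) string {
	pcs := make([]uintptr, 1)
	// Skip runtime.Callers and Info itself, plus the configured frames.
	if runtime.Callers(2+l.skip, pcs) == 0 {
		return "unknown: " + msg
	}
	frame, _ := runtime.CallersFrames(pcs).Next()
	return frame.Function[strings.LastIndex(frame.Function, ".")+1:] + ": " + msg
}

// adapter is one wrapper layer. It accounts for its own frame exactly once.
type adapter struct{ log skipLogger }

func newAdapter(base skipLogger) adapter {
	return adapter{log: base.WithSkip(1)}
}

//go:noinline
func (a adapter) Info(msg string) string {
	return a.log.Info("[etcd] " + msg)
}

// retrying wraps adapter. It adds one frame for itself and lets adapter add
// its own, without knowing how deep adapter is.
type retrying struct{ inner adapter }

func newRetrying(base skipLogger) retrying {
	return retrying{inner: newAdapter(base.WithSkip(1))}
}

//go:noinline
func (r retrying) Info(msg string) string {
	return r.inner.Info("retry: " + msg)
}

//go:noinline
func callViaAdapter() string {
	return newAdapter(skipLogger{}).Info("compacting")
}

//go:noinline
func callViaRetrying() string {
	return newRetrying(skipLogger{}).Info("compacting")
}

func main() {
	fmt.Println(callViaAdapter())
	fmt.Println(callViaRetrying())
}
```

In both cases the reported caller is the function that invoked the outermost wrapper, and no method has to repeat a depth argument.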

[below the fold] Why I'm kinda against adding the extra Depth methods

My objections/uneasiness with this has two parts:

The first is that adding depth functions feels a lot like leaking implementation details through to the user. This kinda paints us into the corner in terms of always explicitly supporting caller skip like this -- I could see a world where we decide to drop caller in favor of logger names + stack traces for errors or a world where we decide to do "capture caller into context".

The second is that per-function caller skip is an easy interface to accidentally misuse:

* It's easy to miss/forget the Depth in one of your functions.
* It doesn't compose (you can't wrap helpers in helpers and still get useful information, which is something I've seen with similar types of functionality in codebases like controller-runtime).
* It's sufficiently brittle that I'd actively encourage folks not to use it -- it relies on exact code structure not changing, which isn't something we can enforce with the compiler, lint for, etc. In the case of the etcd logger, if the internals of the etcd client change, we suddenly lose the usefulness of our logging information. That's not really an interface I want people writing against.

Moving on to practical matters: how much do we care about people actually being able to implement this, or would we rather they just ignore it? Zap doesn't support per-function caller skip OOTB, but it's doable with some hacking (you'd call entry := logger.Check(...); entry.Caller = /* calculate caller yourself */, or maybe could finagle a custom zapcore.Core to do it), logrus doesn't even do caller skip AFAICT, etc. Do we want to continue to support delegating to logr? It's not clear how much of a goal that is from the KEP (I think we should, or at least have some similar functionality).

> We could consider adding the WithSkip to logr

The problem I have with this is that it doesn't feel like a property of the
logger object, but of the call-site. I guess if you are wrapping a Logger in
some other type, you can argue that this IS part of the logger (or as you
characterized - it doesn't change). If there's a one-off logging helper
function, you can always make a temporary object (though that can be
allocation-heavy if it is done a lot). E.g.

    func logHelper(log logr.Logger, foo Foo) {
        log.SkipFrames(1).Info("foo message", "address", foo.Addr(), "phone", foo.Phone)
    }

> per-function caller skip is an easy interface to accidentally misuse:

OK, you convinced me.

> how much we care about people actually being able to implement this, or would we rather they just ignore it?

I suspect some log libs will handle it and some will no-op it. I think that's
fine, honestly. There will be pressure or there won't, but at least it is
possible to DTRT

> it can be alloc-heavy if it is done a lot

I would hope escape analysis and friends would prevent a heap allocation here.

contributors/devel/sig-instrumentation/migration-to-structured-logging.md

DirectXMan12 · 2020-05-28T00:54:33Z

contributors/devel/sig-instrumentation/migration-to-structured-logging.md

contributors/devel/sig-instrumentation/migration-to-structured-logging.md

DirectXMan12 · 2020-05-28T00:57:02Z

contributors/devel/sig-instrumentation/migration-to-structured-logging.md

+because in [logr] Error is used to inform users about errors received from subordinate function calls.
+This means that `klog.ErrorS` should only be used when there is a obvious Golang `error` object available.
+
+Creating an error just to call `klog.ErrorS` or passing `nil` is discouraged, please consider using `klog.InfoS` instead.


I'm curious as to why that is -- ErrorS in logr has some semantic differences (e.g. it captures stack traces). I think passing nil is semi-acceptable when this is an error condition that deserves a stack trace, but this is the origin point of the error.

Makes sense, I will apply your suggestion.

It seems that the current implementation of klog.ErrorS() calls logr.Info() if the err is nil. If we want to encourage people to pass nil I believe that this should be changed so that logr.Error() would be called.
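The difference between the behavior described here and the proposed change can be sketched with a small recording backend. Everything below is a hypothetical stand-in (`sink`, `recordingSink`, `errorSCurrent`, `errorSProposed`, `route`); it mirrors only what this comment says about `klog.ErrorS`, not klog's actual source.

```go
package main

import (
	"errors"
	"fmt"
)

// sink is a hypothetical, minimal stand-in for a logr-style backend.
type sink interface {
	Info(msg string, keysAndValues ...interface{})
	Error(err error, msg string, keysAndValues ...interface{})
}

// recordingSink remembers which method was called so the routing is visible.
type recordingSink struct{ calls []string }

func (s *recordingSink) Info(msg string, keysAndValues ...interface{}) {
	s.calls = append(s.calls, "Info: "+msg)
}

func (s *recordingSink) Error(err error, msg string, keysAndValues ...interface{}) {
	s.calls = append(s.calls, fmt.Sprintf("Error(%v): %s", err, msg))
}

// errorSCurrent mirrors the behavior described in the comment: a nil error
// is routed to Info, so the backend cannot treat it as an error.
func errorSCurrent(s sink, err error, msg string, keysAndValues ...interface{}) {
	if err == nil {
		s.Info(msg, keysAndValues...)
		return
	}
	s.Error(err, msg, keysAndValues...)
}

// errorSProposed always calls Error, leaving it to the backend to decide
// what to do with a nil error (for example, still attach a stack trace).
func errorSProposed(s sink, err error, msg string, keysAndValues ...interface{}) {
	s.Error(err, msg, keysAndValues...)
}

// errBoom is a sample error used by the demonstration below.
var errBoom = errors.New("boom")

// route runs one logging function against a fresh sink and reports which
// backend method it reached.
func route(logFn func(sink, error, string, ...interface{}), err error) string {
	s := &recordingSink{}
	logFn(s, err, "sync failed")
	return s.calls[0]
}

func main() {
	fmt.Println(route(errorSCurrent, nil))
	fmt.Println(route(errorSProposed, nil))
	fmt.Println(route(errorSCurrent, errBoom))
}
```

With the proposed routing, a backend that adds stack traces or error reporting in `Error` gets the chance to do so even at the origin point of an error.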

Added kubernetes/klog#153

One note: klog.ErrorS is not guaranteed to generate a stack trace, as by default it still calls klog.print.

Yep, we just kinda note in logr that Error may take special action to deal with errors in logr (see below). For instance, I could see an impl that formatted errors for capture by special error recording tools (e.g. stackdriver error reporting or somesuch), ones that print stack traces (like zap), and so on. It's nice because it leaves us a way to do cool stuff like that in the future, even if we don't do it by default now.

FWIW, logr docs say:

> Furthermore, certain implementations may choose to attach additional information (such as stack traces) on calls to Error, so it's preferred to use Error to log errors.

contributors/devel/sig-instrumentation/migration-to-structured-logging.md

DirectXMan12 · 2020-05-29T00:24:44Z

contributors/devel/sig-instrumentation/migration-to-structured-logging.md

+As part of structured logging migration we want to ensure that kubernetes objects references are consistent within the
+codebase. Two new utility functions were introduced to klog `klog.KObj` and `klog.KRef`. Any reference
+(name, uid, namespace) to Kubernetes Object (Pod, Node, Deployment, CRD) should be rewritten to utilize those functions.
+In situations when object `UID` is would be beneficial for log, it should be added as separate field with `_uid` suffix.


why not just have UID be a field in the output of KObj or have KObjWithUID?
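For comparison with the separate `_uid` field approach, a reference type that optionally carries the UID might look roughly like this. `objectRef`, `kObj`, and `kObjWithUID` are hypothetical names sketched with the standard library; they are not klog's `KObj` or `KRef`, and the output format is an assumption.

```go
package main

import "fmt"

// objectRef is a hypothetical reference to a Kubernetes object. UID is left
// empty when the log line does not need it.
type objectRef struct {
	Name      string
	Namespace string
	UID       string
}

// String renders "namespace/name", or just "name" for cluster-scoped
// objects, and appends the UID only when one was provided.
func (r objectRef) String() string {
	s := r.Name
	if r.Namespace != "" {
		s = r.Namespace + "/" + r.Name
	}
	if r.UID != "" {
		s += " (uid=" + r.UID + ")"
	}
	return s
}

// kObj builds a reference without a UID.
func kObj(namespace, name string) objectRef {
	return objectRef{Name: name, Namespace: namespace}
}

// kObjWithUID builds a reference that also carries the UID, so callers do
// not need a separate "<key>_uid" field.
func kObjWithUID(namespace, name, uid string) objectRef {
	return objectRef{Name: name, Namespace: namespace, UID: uid}
}

func main() {
	fmt.Println(kObj("kube-system", "coredns"))
	fmt.Println(kObj("", "node-1"))
	fmt.Println(kObjWithUID("kube-system", "coredns", "1234"))
}
```

The trade-off is that folding the UID into the reference changes the value's shape, whereas a separate field keeps every reference uniform.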

contributors/devel/sig-instrumentation/migration-to-structured-logging.md

DirectXMan12

a couple of minor comments, but I think this looks pretty good.

I'm curious to see how things like UID practically play out -- can always adjust the plan when we see how relevant that is.

DirectXMan12 · 2020-06-01T22:26:36Z

contributors/devel/sig-instrumentation/migration-to-structured-logging.md

+Functions with depth (`klog.InfoDepth`, `klog.WarningDepth`, `klog.ErrorDepth`, `klog.FatalDepth`) are used to indicate
+that the source of the log (added as metadata in log) is different than the invocation of logging library. This is
+usually used when implementing logging util functions. As logr interface doesn't support depth, those functions should 
+return logging arguments instead of calling `klog` directly.


might be worth noting that there are situations where this isn't possible (mainly "adapters" and runtime.HandleError -- see my comment above), even if we decide to punt down the road on how to handle those cases

maybe add a todo about expanding this section with the discussion above (we'll need it before beta)?
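The guidance under review, that helpers return logging arguments instead of calling `klog` directly, can be sketched as follows. `podFields`, `reconcile`, and the `infoS` stand-in are hypothetical names built on the standard library; the stand-in records its direct caller only to show that the call site is preserved.

```go
package main

import (
	"fmt"
	"runtime"
	"strings"
)

// podFields is a hypothetical helper that returns key/value pairs instead
// of logging them itself, so it never appears as the source of a log line.
func podFields(namespace, name string, restarts int) []interface{} {
	return []interface{}{"pod", namespace + "/" + name, "restarts", restarts}
}

// infoS is a stand-in for a structured logging call. It reports its direct
// caller, the way a logger without depth support would.
//
//go:noinline
func infoS(msg string, keysAndValues ...interface{}) string {
	caller := "unknown"
	pcs := make([]uintptr, 1)
	// Skip runtime.Callers and infoS itself.
	if runtime.Callers(2, pcs) > 0 {
		frame, _ := runtime.CallersFrames(pcs).Next()
		caller = frame.Function[strings.LastIndex(frame.Function, ".")+1:]
	}
	out := fmt.Sprintf("%s: %q", caller, msg)
	for i := 0; i+1 < len(keysAndValues); i += 2 {
		out += fmt.Sprintf(" %v=%#v", keysAndValues[i], keysAndValues[i+1])
	}
	return out
}

// reconcile logs at its own call site and only borrows the arguments from
// the helper, so no depth adjustment is required.
//
//go:noinline
func reconcile() string {
	return infoS("Pod restarted", podFields("kube-system", "coredns", 3)...)
}

func main() {
	fmt.Println(reconcile())
}
```

This pattern does not cover adapters that must implement another library's logging interface, which is the gap noted in the comment above.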

serathius · 2020-06-03T10:50:25Z

/assign @logicalhan

Co-authored-by: Md. Tahsin Rahman <[email protected]>

serathius · 2020-06-04T08:29:23Z

ping @logicalhan

logicalhan · 2020-06-05T17:06:58Z

/lgtm
/approve

k8s-ci-robot · 2020-06-05T17:07:15Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: DirectXMan12, logicalhan, serathius

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~contributors/devel/sig-instrumentation/OWNERS~~ [logicalhan]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

k8s-ci-robot requested review from 44past4, brancz, DirectXMan12 and thockin May 20, 2020 14:26

serathius force-pushed the migration branch from 003b01f to 70f45f9 Compare May 25, 2020 15:32

serathius changed the title ~~[WIP] Add instructions for structured logging migration~~ Add instructions for structured logging migration May 25, 2020

k8s-ci-robot removed the do-not-merge/work-in-progress label ("Indicates that a PR should not merge because it is a work in progress.") May 25, 2020

44past4 reviewed May 26, 2020

serathius force-pushed the migration branch 3 times, most recently from fe85beb to 7b8f926 Compare May 26, 2020 13:00

serathius mentioned this pull request May 26, 2020

Structured logging kubernetes/enhancements#1602

Open

39 tasks

github-actions bot mentioned this pull request May 27, 2020

Week Ending May 24, 2020 dev-obs/actus#155

Closed

thockin reviewed May 27, 2020

DirectXMan12 suggested changes May 28, 2020

serathius force-pushed the migration branch from 7b8f926 to a0b4809 Compare May 28, 2020 17:17

DirectXMan12 reviewed May 29, 2020

tahsinrahman reviewed May 29, 2020

tahsinrahman reviewed May 29, 2020

tahsinrahman reviewed May 29, 2020

DirectXMan12 approved these changes Jun 1, 2020

serathius force-pushed the migration branch from fc2bfe4 to 720c699 Compare June 3, 2020 10:20

k8s-ci-robot assigned logicalhan Jun 3, 2020

Add instructions for structured logging migration

5e71635

Co-authored-by: Md. Tahsin Rahman <[email protected]>

serathius force-pushed the migration branch from 720c699 to 5e71635 Compare June 3, 2020 14:03

k8s-ci-robot added the lgtm label ("Looks good to me", indicates that a PR is ready to be merged.) Jun 5, 2020

k8s-ci-robot added the approved label ("Indicates a PR has been approved by an approver from all required OWNERS files.") Jun 5, 2020

k8s-ci-robot merged commit 894a89a into kubernetes:master Jun 5, 2020
