Add rolling-file appender to core logging #84735
Conversation
self-review
src/core/server/logging/appenders/rolling_file/policies/size_limit/size_limit_policy.ts
src/core/server/logging/appenders/rolling_file/rolling_file_appender.test.ts
```ts
try {
  await this.strategy.rollout();
  await this.fileManager.closeStream();
} catch (e) {
  // eslint-disable-next-line no-console
  console.log('Error while rolling file: ', e);
}
```
I'm forced to try/catch this, else it could result in an unhandled promise rejection from the call in `append`.

I'm not sure what to do with the error though. Crashing the server if the rollout failed seems extreme, but I'm not sure `console.log` is enough. I could eventually keep an internal `ConsoleAppender` and use it to log the error instead of relying on `console.log`, WDYT?
Would a failed rollout never, sometimes, or always prevent future logs from being written? My guess is sometimes?

Eventually keeping an internal `ConsoleAppender` sounds like a good idea to me. In the short term, I don't have many other suggestions, other than changing `console.log` to `console.error`, so that we can at least see this in stderr instead of stdout.
In the context of audit logging specifically, we've discussed optionally halting the server to prevent users from interacting with Kibana when we can't audit their actions: #60636. This is out of scope for this PR, but it might inform how we decide to handle these failures.
In the hypothetical future, we could leverage the notification center to alert administrators about this as well.
> Would a failed rollout never, sometimes, or always prevent future logs from being written?

It's never "never" when you interact with the filesystem, but barring a hard I/O failure, I think the worst case is that we fail to roll the log file, and the next rolling cycle still writes to the previous (non-renamed) file. We basically skip one rolling cycle, which seemed acceptable (also, we can't really do anything about it except warn about the failure).
> I'm not sure what to do with the error though. Crashing the server if the rollout failed seems extreme, but I'm not sure console.log is enough.

It seems Elasticsearch logs a caught exception without taking any additional measures (we can check with them further). See elastic/elasticsearch#45523
Pinging @elastic/kibana-core (Team:Core)
I tested this new appender for Kibana's new audit log, and it's working very nicely! This is a really impressive PR, thanks again for putting this up so quickly.
```ts
  this.currentFileTime = birthtime.getTime();
  this.currentFileSize = size;
} catch (e) {
  this.currentFileTime = Date.now();
```
Are there only certain failure scenarios that should cause us to reset the file info like this? Or do we really want to reset whenever any error is caught?
Ideally, we would only catch `ENOENT` (or similar) errors. But this is kind of the first time we've had a logger/appender implementation complex enough to potentially throw errors, and I'm not really sure how we should handle failures internal to the logging system or its appenders, so for safety I used a catch-all instead.
> Ideally, we would only catch ENOENT (or such) errors. But that's kinda the first time we got a logger/appender implementation complex enough to potentially throw errors,

Shouldn't we at least log an exception other than `ENOENT` to the console?
Good point, I will add an `ENOENT` check and log the error.

The remaining question (same as in #84735 (comment)) is, should we:

- use `console.log`
- use `console.error`
- create a `ConsoleAppender` in the `RollingFileAppender` and use it internally for all error logging (we would use the same `layout` as the one provided to the rolling-file appender). In that case, it will log to `stdout` as if we were using `console.log`.

wdyt?
The 3rd option looks fine. However, I'd expect it to use the default `layout` that a user specifies for the `root` context.

If the 3rd option is too hard to implement, the 2nd one is good enough.
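A minimal sketch of what the third option could look like. All class and method names here are invented for illustration; this is not the actual Kibana implementation, just the shape of "route the appender's own errors through an internal console appender instead of bare `console.log`":

```javascript
// Hypothetical internal fallback appender: formats records with a layout
// and writes them to stderr.
class InternalConsoleAppender {
  constructor(layout) {
    this.layout = layout;
  }
  append(record) {
    // eslint-disable-next-line no-console
    console.error(this.layout.format(record));
  }
}

class RollingFileAppenderSketch {
  constructor(layout) {
    // Reuse the layout the rolling-file appender itself was configured with
    // (whether to use this one or the root context's is the open question above).
    this.errorAppender = new InternalConsoleAppender(layout);
  }

  async rollFile(rollout) {
    try {
      await rollout();
    } catch (e) {
      // Neither crash the server nor leak an unhandled rejection:
      // route the failure through the internal console appender.
      this.errorAppender.append({
        level: 'error',
        message: `Error while rolling file: ${e.message}`,
      });
    }
  }
}
```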
```ts
export { RollingStrategy } from './strategy';
export type RollingStrategyConfig = NumericRollingStrategyConfig;

export const rollingStrategyConfigSchema = schema.oneOf([numericRollingStrategyConfigSchema]);
```
Since we only have a single strategy available, what do you think about providing a default value, so that consumers aren't forced to specify it? I mistakenly omitted this from my yml, and the `@kbn/config-schema` error message was hard to decipher at best.
Yea, I even added default values for both the policy and the strategy at some point, so that

```yaml
rolling-file:
  kind: rolling-file
  path: /Users/pierregayvallet/Documents/test-log/kibana.log
  layout:
    kind: pattern
```

would be enough. Then I reverted it, thinking that we don't do that for other appenders (e.g. you must always specify `layout`; there isn't a default value for it).
@elastic/kibana-core do you have any opinion on this?
I don't see a reason not to add defaults; we already have them for some values:

kibana/src/core/server/logging/layouts/pattern_layout.ts, lines 36 to 48 in ab92bbb:

```ts
const DEFAULT_PATTERN = `[%date][%level][%logger]%meta %message`;

export const patternSchema = schema.string({
  validate: (string) => {
    DateConversion.validate!(string);
  },
});

const patternLayoutSchema = schema.object({
  highlight: schema.maybe(schema.boolean()),
  kind: schema.literal('pattern'),
  pattern: schema.maybe(patternSchema),
});
```
src/core/server/logging/appenders/rolling_file/strategies/numeric/pattern_matcher.ts
src/core/server/logging/appenders/rolling_file/strategies/numeric/pattern_matcher.ts
src/core/server/logging/appenders/rolling_file/strategies/numeric/pattern_matcher.ts
src/core/server/logging/appenders/rolling_file/strategies/numeric/rolling_tasks.ts
src/core/server/logging/appenders/rolling_file/strategies/numeric/rolling_tasks.ts
```ts
export const sizeLimitTriggeringPolicyConfigSchema = schema.object({
  kind: schema.literal('size-limit'),
  size: schema.byteSize({ min: '1b', defaultValue: '100mb' }),
```
Have you discussed the defaults with Cloud? The legacy appender has other limits, and we might want a higher minimum and an explicit maximum limit:

kibana/packages/kbn-legacy-logging/src/schema.ts, lines 77 to 82 in e176def:

```ts
  // > 1MB
  .greater(1048576)
  // < 1GB
  .less(1073741825)
  // 10MB
  .default(10485760),
```
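For readability, the raw byte values in that legacy schema decode as follows (plain arithmetic, nothing assumed beyond the snippet above):

```javascript
// Decoding the legacy schema's byte thresholds.
const oneMB = 1024 * 1024;
console.log(oneMB);                  // 1048576    -> the "> 1MB" lower bound
console.log(1024 * 1024 * 1024 + 1); // 1073741825 -> the "< 1GB" upper bound
console.log(10 * oneMB);             // 10485760   -> the 10MB default
```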
I haven't. @Kushmaro maybe you can help us here?
@pgayvallet By our log delivery feature design (see the storage section), we concluded that the smallest Kibana instance in Cloud will have around 1GB of storage space.

Is the 100mb referring to the log size rotation limit? Cloud also enforces a quota on the log path, and I *think* that quota is 100mb total, so maybe a more sensible rotation limit would be 10mb.
@s-nel for awareness and further guidelines (he's on PTO though so may take a while)
> is the 100mb referring to the log size rotation limit

That's the maximum size the log file can reach before it gets rolled over.
We use 100MB rollover size for ES audit logs and keep 2 files. We might choose something smaller for Kibana since the instances don't have much storage.
src/core/server/logging/appenders/rolling_file/strategies/fs.ts
src/core/server/logging/appenders/rolling_file/strategies/fs.ts
src/core/server/logging/appenders/rolling_file/policies/time_interval/time_interval_policy.ts
src/core/server/logging/appenders/rolling_file/strategies/numeric/numeric_strategy.ts
src/core/server/logging/integration_tests/rolling_file_appender.test.ts
```ts
const logger = root.logger.get('test.rolling.file');

// size = 100b, message.length ~= 40b, should roll every 3 message
```
We can create a buffer of fixed size to make sure we have better control over the rolling process:

```ts
const asciiBuf = Buffer.alloc(5, 'a', 'utf8');
```
I'm only using ASCII chars in the message, so we kinda know that `sizeInBytes === msg.length`, but I can use a buffer to add more control if we want to.
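A small standalone check (not from the PR) of that assumption: byte size equals string length only for single-byte characters, which is why a pre-allocated buffer gives tighter control over the rolling threshold.

```javascript
const asciiMsg = 'a'.repeat(40);
// For pure-ASCII strings, UTF-8 byte size equals string length.
console.log(Buffer.byteLength(asciiMsg, 'utf8') === asciiMsg.length); // true

const accented = 'é'.repeat(40);
// Multibyte characters break the assumption: 'é' is 2 bytes in UTF-8.
console.log(Buffer.byteLength(accented, 'utf8') === accented.length); // false

// A pre-allocated buffer has an exact, known byte size.
const asciiBuf = Buffer.alloc(40, 'a', 'utf8');
console.log(asciiBuf.length); // 40
```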
@restrry I addressed most of your comments. The points I still need input on are
PTAL
We can change `size-limit` policy defaults in a follow-up.
💚 Build Succeeded
* You need to start somewhere
* revert comment
* rename default strategy to numeric
* add some tests
* fix some tests
* update documentation
* update generated doc
* change applyBaseConfig to be async
* fix integ tests
* add integration tests
* some renames
* more tests
* more tests
* nits on README
* some self review
* doc nits
* self review
* use `escapeRegExp` from lodash
* address some review comments
* a few more nits
* extract `isDevCliParent` check outside of LoggingSystem.upgrade
* log errors from context
* add defaults for policy/strategy
* master: (53 commits)
  * Fixing recovered instance reference bug (elastic#85412)
  * Switch to new elasticsearch client for Visualizations (elastic#85245)
  * Switch to new elasticsearch client for TSVB (elastic#85275)
  * Switch to new elasticsearch client for Vega (elastic#85280)
  * [ILM] Add shrink field to hot phase (elastic#84087)
  * Add rolling-file appender to core logging (elastic#84735)
  * [APM] Service overview: Dependencies table (elastic#83416)
  * [Uptime] Update empty message for certs list (elastic#78575)
  * [Graph] Fix graph saved object references (elastic#85295)
  * [APM] Create new API's to return Latency and Throughput charts (elastic#85242)
  * [Advanced settings] Reset to default for empty strings (elastic#85137)
  * [SECURITY SOLUTION] Bundles _source -> Fields + able to sort on multiple fields in Timeline (elastic#83761)
  * [Fleet] Update agent listing for better status reporting (elastic#84798)
  * [APM] enable 'sanitize_field_names' for Go (elastic#85373)
  * Update dependency @elastic/charts to v24.4.0 (elastic#85452)
  * Introduce external url service (elastic#81234)
  * Deprecate disabling the security plugin (elastic#85159)
  * [FLEET] New Integration Policy Details page for use in Integrations section (elastic#85355)
  * [Security Solutions][Detection Engine] Fixes one liner access control with find_rules REST API
  * chore: 🤖 remove extraPublicDirs (elastic#85454)
  * ...
Summary

Fix #56291

This PR adds a new `rolling-file` appender to core's logging system. The appender currently supports two triggering policies: time-based (`time-interval`) and size-based (`size-limit`), and one rolling strategy: appending a numeric suffix to the end of the file when rolling it (`numeric`). See `src/core/server/logging/README.md` in the PR for more documentation.

Examples
Rolling every 24h

This appender is configured to roll the file every 24 hours (`interval: 24h`), every day at 0:00am (`modulate: true`). It will retain a maximum of 2 rolled files (`max: 2`):

- A new kibana.log file is created and starts being written to.
- kibana.log is renamed to kibana-1.log. A new kibana.log file is created and starts being written to.
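The configuration block for this example did not survive extraction. Below is a hypothetical reconstruction based only on the appender/policy/strategy kinds and option names mentioned in this PR (`rolling-file`, `pattern`, `time-interval`, `numeric`, `interval`, `modulate`, `max`); the exact nesting is an assumption, and the PR's README is the authoritative reference. The path is a placeholder.

```yaml
logging:
  appenders:
    rolling-file:
      kind: rolling-file
      path: /var/log/kibana/kibana.log
      layout:
        kind: pattern
      policy:
        kind: time-interval
        interval: 24h
        modulate: true
      strategy:
        kind: numeric
        max: 2
```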
Rolling once the file reaches 50mb

This appender is configured to roll the file once it reaches `50mb`. The same rolling strategy as in the previous example applies.

Checklist