Support UTF-8 label matchers: Add new parser #3453

grobinson-grafana · 2023-08-09T09:31:46Z

What this pull request does

This pull request adds the new label matchers parser as proposed in #3353. Included is a number of compliance tests comparing the grammar supported in the new parser with the existing parser in pkg/labels. The compliance tests can be run passing the "compliance" tag when running go test.

Motivation

The original motivation for writing this parser was to add support for matching label names containing . and spaces to grafana/grafana. However, about the same time I learned that Prometheus maintainers agreed to add support for UTF-8 labels in Alertmanager, and so I decided to further the work to see if it could be upstreamed to Alertmanager instead.

The original source code can be found at grobinson-grafana/matchers.

Supported grammar

This LL(1) parser in its current version is not 100% compatible with the existing regular expression, although it is close and can be modified if required. The grammar can be understood as follows:

<expr>        ::= "{" <sequence> "}" | <sequence>
<sequence>    ::= <matcher> | <sequence> "," <matcher>
<matcher>     ::= <label_name> <operator> <label_value>
<label_name>  ::= <quoted> | <unquoted>
<operator>    ::= "=" | "=~" | "!=" | "!~"
<label_value> ::= <quoted> | <unquoted>
<quoted>      ::= "\"" /.*/ "\""
<unquoted>    ::= /^[^{}!=~,"'` \]+$/

Here are some examples of valid inputs:

{}
foo=bar
{foo=bar}
{foo=bar🙂}
{foo!=bar}
{foo="bar"}
{foo="bar🙂"}
{foo=~[a-zA-Z0-9]+}
{foo=~"[a-zA-Z0-9]+"}
{"foo"!~"[0-9]+"}
{ "foo with spaces" = "bar with spaces" }
{foo="bar",bar="foo 🙂","baz"!=qux,qux!="baz 🙂"}

and some examples of invalid inputs:

=
foo
{
{foo
{foo=
{foo=bar
{foo=bar,
{foo=bar,}
{foo=bar 🙂}
{foo with spaces=bar with spaces}

Breaking changes

All ^{}!=~,"'`\ and whitespace must be double quoted

It is possible to use UTF-8 on both sides of the expression. However, label names and label values that contain one or more ^{}!=~,"'` characters or whitespace must be double quoted.

~~#### Expressions must start and end with open and closing braces~~

~~All expressions must start and end with { and }, although this can be relaxed if required. For example foo=bar is not valid, it must be {foo=bar}.~~

~~#### Trailing commas are not permitted~~

~~Trailing commas are not permitted. For example {foo=bar,} is not valid, it must be {foo=bar}.~~

~~#### All non [a-zA-Z_:][a-zA-Z0-9_:]* values must be double quoted~~

The set of unquoted characters is now the same on both sides of the expression. In other words, both label names and label values without double quotes must match the regular expression [a-zA-Z_:][a-zA-Z0-9_:]*. For example {foo=!bar} is not valid, it must be {foo="!bar"}. In current versions of Alertmanager, unquoted label values can contain all UTF-8 code points with the exception of comma, such as {foo=!bar}.

~~There are two reasons for this:~~

~~1. It's no longer possible to write ambiguous matchers which I feel is something Alertmanager should fix. For example is {foo=~} equivalent to {foo="~"} or {foo=~""}?~~

2. If we restrict the =, !, ~ characters to double quotes we can keep the grammar LL(1). Without this restriction lookahead/backtrack is required to parse matchers such as {foo==~!=!~bar} which are valid in current versions of Alertmanager.

Errors

One of the goals with this LL(1) parser is to provide better error messages than what is possible using just a regular expression. For example:

{foo
0:4: end of input: expected an operator such as '=', '!=', '=~' or '!~'

{foo=bar
0:8: end of input: expected close paren

foo=bar}
0:8: }: expected opening paren

{foo=bar,,}
9:10: unexpected ,: expected a matcher or close paren after comma

{foo=bar 🙂}
9:13: 🙂: invalid input: expected comma or closing '}'

{foo with spaces=bar with spaces}
5:9: unexpected with: expected an operator such as '=', '!=', '=~' or '!~'

Benchmarks

I've also provided a number of benchmarks of both the LL(1) parser and regex parser that supports UTF-8. These can be found at grobinson-grafana/matchers-benchmarks. However, to run them go.mod must be updated to use the branch https://github.com/grafana/prometheus-alertmanager/tree/yuri-tceretian/utf-8-label-names here.

BenchmarkMatchersSimple, BenchmarkPrometheusSimple
{foo="bar"}

BenchmarkMatchersComplex, BenchmarkPrometheusComplex
{foo="bar",bar="foo 🙂","baz"!=qux,qux!="baz 🙂"}

BenchmarkMatchersRegexSimple, BenchmarkPrometheusRegexSimple
{foo=~"[a-zA-Z_:][a-zA-Z0-9_:]*"}

BenchmarkMatchersRegexComplex, BenchmarkPrometheusRegexComplex
{foo=~"[a-zA-Z_:][a-zA-Z0-9_:]*",bar=~"[a-zA-Z_:]","baz"!~"[a-zA-Z_:][a-zA-Z0-9_:]*",qux!~"[a-zA-Z_:]"}

go test -bench=. -benchmem
goos: darwin
goarch: arm64
pkg: github.com/grobinson-grafana/matchers-benchmarks
BenchmarkMatchersRegexSimple-8      	  488295	      2425 ns/op	    3248 B/op	      49 allocs/op
BenchmarkMatchersRegexComplex-8     	  138081	      9074 ns/op	   11448 B/op	     169 allocs/op
BenchmarkPrometheusRegexSimple-8    	  329244	      3496 ns/op	    3531 B/op	      58 allocs/op
BenchmarkPrometheusRegexComplex-8   	   95188	     12554 ns/op	   12619 B/op	     204 allocs/op
BenchmarkMatchersSimple-8           	 2888340	       414.9 ns/op	      56 B/op	       2 allocs/op
BenchmarkMatchersComplex-8          	  741590	      1628 ns/op	     248 B/op	       7 allocs/op
BenchmarkPrometheusSimple-8         	 1919209	       613.9 ns/op	     233 B/op	       8 allocs/op
BenchmarkPrometheusComplex-8        	  425430	      2803 ns/op	    1015 B/op	      31 allocs/op
PASS
ok  	github.com/grobinson-grafana/matchers-benchmarks	11.766s

grobinson-grafana · 2023-08-09T09:32:06Z

I forgot to signoff the commit, forced pushed.

matchers/compliance/compliance_test.go

gotjosh

Dropping the first half of my review - it's mostly nits. I still need to go through parse.go and its tests but so far so good (with the caveat that I don't know much about parsers but hope that by the end of this feature, I'm more educated on the matter 😄 )

matchers/parse/lexer.go

matchers/parse/lexer_test.go

matchers/compliance/compliance_test.go

matchers/parse/lexer_test.go

matchers/parse/token.go

matchers/parse/lexer.go

grobinson-grafana · 2023-08-09T17:15:45Z

I changed the name of the PR to start with "Support UTF-8 matchers". I will use this on all PRs for this ongoing work to make it easier to find PRs related to this epic.

gotjosh

Great job so far, your test coverage is truly impressive - I played around modifying things as I went along and every time I changed something there were justified cases against them.

I've dropped you a set of extra nits that I'd love to discuss - I'm not done with parse.go as it's very dense but we're getting there.

matchers/parse/parse.go

matchers/parse/token.go

gotjosh · 2023-08-11T12:36:26Z

matchers/parse/parse.go

+// and TokenNone is not one of the accepted kinds. It is possible to use either
+// Scan() or Peek() as fn depending on whether accept should consume or peek
+// the next token.
+func (p *Parser) accept(fn func() (Token, error), kind ...TokenKind) (bool, error) {


I understand the need to try and keep some level of parity between accept and expect but I don't see the correlation between them.

My understanding of the current usage of accept is Peek at the next token and tell me if it's the kind I expect. As such, it feels clear to me to just inline this as it's only relevant on the parseOpenParen function and there's no need for an extraction.

I propose the following:

// In parse.go func (p *Parser) PeekNext() (Token, error) { t, err := p.lexer.Peek() if err != nil { return Token{}, err } if t.Kind == TokenNone { return Token{}, fmt.Errorf("0:%d: %w", len(p.input), ErrEOF) } return t, nil } // in token.go // IsAny verifies that the token is of any specified TokenKinds. func (t Token) IsAny(kinds ...TokenKind) bool { for _, k := range kinds { if t.Kind == k { return true } } return false }

And finally, change parseOpenParen

func (p *Parser) parseOpenParen(l *Lexer) (parseFn, error) { // Can start with an optional open brace. currentToken, err := p.peekNext() if err != nil { if errors.Is(err, ErrEOF) { return p.parseEOF, nil } return nil, err } p.hasOpenParen = currentToken.IsAny(TokenOpenBrace) // If the token was an open brace it must be scanned so the token // following it can be peeked. if p.hasOpenParen { if _, err = l.Scan(); err != nil { panic("Unexpected error scanning open brace") } // If the next token is a close brace there are no matchers in the input, // and we can just parse the close brace. currentToken, err = p.peekNext() if err != nil { return nil, fmt.Errorf("%s: %w", err, ErrNoCloseBrace) } if currentToken.IsAny(TokenCloseBrace) { return p.parseCloseParen, nil } } if currentToken.IsAny(TokenCloseBrace) { return p.parseCloseParen, nil } return p.parseLabelMatcher, nil }

Updated. I'm thinking if I should just update the lexer to return an ErrorEOF error at the same time as TokenNone if there is no more input. I think doing so would remove the need for peekNext and instead we can just call l.Peek().

Discussed offline, agreed that we'll move the ErrorEOF errors into the lexer to simplify what's going on the parser. I'll wait on that before I continue with the review.

I don't think I'm going to continue with moving ErrorEOF into the lexer, it doesn't work quite as well as I had expected. I think instead I'm going to go with the other option where the lexer makes its position available via a public method and we remove input from the parser.

matchers/parse/parse.go

beorn7 · 2023-08-15T19:13:57Z

Without having done a thorough review, just a thought:

All non [a-zA-Z_:][a-zA-Z0-9_:]* values must be double quoted

I like this as it matches the Prometheus naming restrictions. And even with the upcoming introduction of UTF-8 strings as names, the requirement to quote the name would be exactly the same. Yay, consistency! However, the AM matchers aren't really fully consistent with PromQL anyway (e.g. single quotes and backticks aren't allowed). How about going the other way and only require double-quoting if a specific subset of characters is contained? Which would be something like {}"=!~, and any whitespace (and maybe some others to be prepared for future extensions – for example, I could see AM matchers supporting backticks in the future to switch off escape sequence expansion as in PromQL).

beorn7 · 2023-08-15T19:17:20Z

Speaking of escape sequences: I see cases like foo="\\\"" in the test cases, so they seem to be supported, but I don't see that mentioned in the grammar. Am I missing something?

grobinson-grafana · 2023-08-16T09:41:27Z

However, the AM matchers aren't really fully consistent with PromQL anyway (e.g. single quotes and backticks aren't allowed).

Single quotes and backticks can be supported if needed. The strings.Unquote function from Go's string package supports double-quoted, single quoted and backtick strings - so we'd just need to update the lexer to lex strings with either delimiter.

How about going the other way and only require double-quoting if a specific subset of characters is contained? Which would be something like {}"=!~

I don't think it's that useful because in the case of regex there will be some regexes that work without double quoting and others that don't, depending on the contents of the regex. For example foo=~[a-z+] will parse but foo=~[a-z+!] won't because there is an unquoted !. For that reason I chose to enforce double quoting of all non [a-zA-Z_][a-zA-Z0-9_]* inputs.

beorn7 · 2023-08-16T10:21:44Z

The strings.Unquote function from Go's string package supports double-quoted, single quoted and backtick strings - so we'd just need to update the lexer to lex strings with either delimiter.

And you also needed to change whether escape sequences are honored or not. `foo\tbar` and "foo\tbar" are different strings in PromQL.

I don't think it's that useful because in the case of regex there will be some regexes that work without double quoting and others that don't, depending on the contents of the regex.

I think it's mostly useful because fewer current use cases would break. I leave it to you and @gotjosh to make the trade-off here.

Just saying that technically, every string is a regexp, even without any characters that have a special meaning in regexps. So even with the more restrictive character set as in this PR, abc is a regexp that works without quoting and 0abc is a regexp that doesn't work. While I appreciate the consistency with the Prometheus special character definition (which is the strongest argument in favor IMHO), for those not familiar with Prometheus, it will be anyway confusing when quoting is required and when not. So might as well do it as convenient as possible…

I'm not feeling strongly either way. I just want you to make an informed call. "Consistency with Prometheus" and "break as few existing use cases as possible" are arguments in different directions, and I would weigh each one much heavier than "characters with special meaning in regexps should always require quoting".

grobinson-grafana · 2023-08-16T11:37:06Z

And you also needed to change whether escape sequences are honored or not. `foo\tbar` and "foo\tbar" are different strings in PromQL.

Ah! Yes, I remember! Perhaps we can add that in future if there is demand for it, as it won't be a breaking change.

Just saying that technically, every string is a regexp, even without any characters that have a special meaning in regexps.

The concern I have here is that I don't think we can give useful error messages to users should they change their regex from alertname=abc to alertname=abc! because the lexer just lexes the input into a sequence of tokens and doesn't know about the structure of matchers.

For example, the error message will be something like:

unexpected end of input, expected one of '=~'

The user will then change their regex to alertname=abc!= which will fix the lexer error, but will then encounter a parser error because the parser has found an operator != at the end:

unexpected !=: expected a comma or close paren

It doesn't know if the users intention was to have a regex abc!= or if it they had made a copy/paste error and accidentally added != to the end by mistake.

What do you think @gotjosh?

beorn7 · 2023-08-16T12:06:50Z

Are you saying that the current state of this PR always requires quoting for regexps? E.g. foo=~bar would be invalid?

grobinson-grafana · 2023-08-16T12:12:50Z

Yes that's correct! It should all be mentioned under Breaking changes at the top of the PR (see All non [a-zA-Z_:][a-zA-Z0-9_:]* values must be double quoted).

grobinson-grafana · 2023-08-16T12:31:38Z

The reasoning behind this is that the original goals of this project were 1. to eliminate parsing ambiguities in the current parser and 2. to support UTF-8.

If we allow unquoted control characters we have parsing ambiguities. For example foo=~=bar. If we disallow them then we no longer have parsing ambiguities, but the parser as designed cannot provide helpful error messages as the lexer was structured around control characters being double-quoted. It will need to be rewritten if we want to facilitate helpful error messages in this case.

beorn7 · 2023-08-16T12:46:24Z

But my counter proposal does require quoting for all characters that are potentially part of the comparison operator ({}"=!~). What is kind of a pity is that even things like / or . require quoting.

grobinson-grafana · 2023-08-16T12:57:58Z

But my counter proposal does require quoting for all characters that are potentially part of the comparison operator ({}"=!~). What is kind of a pity is that even things like / or . require quoting.

Yes I understand (don't forget commas need to be quoted too). The concern I have is about error messages. If we relax the grammar so just {}"=!~, has to be double quoted then the grammar is still unambiguous, however the problem is that it creates a weird situation where a user can start with a valid matcher such as the following:

alertname=~foo

and then can change it to an invalid matcher by adding a comparison operator such as:

alertname=~~foo

giving the error message:

11:12: ~: invalid input: expected label value

The main discussion point for me is that I'm not sure this error message is useful enough to help the user understand they need to add double quotes. But because the lexer is designed around this being double quoted, to facilitate better error messages for this case it will need to be rewritten to be aware of the structure of a matcher rather than just a dumb tokenizer.

The options I see are the following:

Keep the grammar as it is, and require regexes to be double quoted
Change the grammar, keep the lexer as it is, and accept the error message isn't the best for this use case
Change the grammar, but rewrite the lexer to know about the structure of matchers, rather than be just a dumb tokenizer, so we can have better error messages for this use case

beorn7 · 2023-08-16T13:22:40Z

Maybe that error message isn't the worst. It says that a label value is expected, and what it got was a character that is invalid in a label value unless the label value is quoted.

In different news, I don't think this problem is linked to regexp matching. foo=bar is valid. foo=!bar yields "5:6: !: expected one of '=~': expected label value". That's at least as confusing as the error message above. I think if we want to accept unquoted label values at all, we either need to live with sub-optimal error messages or go the extra mile and code a smarter lexer.

BTW:
foo=.bar → ".: invalid input: expected label value"
foo=foo.bar → ".: invalid input: expected a comma or close brace"

None of these says that quoting is needed. And my proposal (to be more liberal with special characters and only ban those that create ambiguities) would actually make both examples pass, which would clearly be less confusing than a confusing error message.

grobinson-grafana · 2023-08-16T14:44:18Z

You make a good point about the error messages for the cases.

When I first started this work (this was before the proposal to allow UTF-8 in Prometheus/Alertmanager was accepted) I had started with a much stricter grammar where all text had to be double quoted. We later relaxed this to [a-zA-Z_][a-zA-Z0-9_]* to avoid breaking almost all existing use cases.

Given that the grammar has been relaxed before I think we can relax it further provided doing so does not add parsing ambiguities to the grammar - which this change should not. At this time I'm impartial to doing so and am willing to be persuaded for either case.

If we choose to relax the grammar as suggested, I suppose the question remaining is how can we add this to the lexer. To answer that question I think we can do something like the following:

diff --git a/matchers/parse/lexer.go b/matchers/parse/lexer.go
index cdf25161..a7546540 100644
--- a/matchers/parse/lexer.go
+++ b/matchers/parse/lexer.go
@@ -32,6 +32,10 @@ func isNum(r rune) bool {
 	return r >= '0' && r <= '9'
 }

+func isReserved(r rune) bool {
+	return unicode.IsSpace(r) || strings.ContainsRune("{}!=~,", r)
+}
+
 // ExpectedError is returned when the next rune does not match what is expected.
 type ExpectedError struct {
 	input       string
@@ -168,7 +172,7 @@ func (l *Lexer) Scan() (Token, error) {
 			l.rewind()
 			tok, l.err = l.scanQuoted()
 			return tok, l.err
-		case r == '_' || isAlpha(r):
+		case !isReserved(r):
 			l.rewind()
 			tok, l.err = l.scanIdent()
 			return tok, l.err
@@ -191,7 +195,7 @@ func (l *Lexer) Scan() (Token, error) {

 func (l *Lexer) scanIdent() (Token, error) {
 	for r := l.next(); r != eof; r = l.next() {
-		if !isAlpha(r) && !isNum(r) && r != '_' && r != ':' {
+		if isReserved(r) {
 			l.rewind()
 			break
 		}

grobinson-grafana · 2023-08-16T14:47:41Z

This would also allow unquoted matchers in non-Latin alphabets, such as:

alertname=测试

which if Google translate is correct is the Chinese word for test.

Signed-off-by: George Robinson <[email protected]>

gotjosh · 2023-09-05T10:33:16Z

Thank you very much for your contribution.

grobinson-grafana · 2023-10-25T11:23:44Z

I did some additional verification this morning and found a couple of examples we might want to fix.

foo=\n is not un-escaped as we don't support openmetrics escape sequences outside of double quotes. I propose that we do not allow backslashes outside of double quotes.
All escape sequences that are supported in strconv.Unquote are supported in double quotes, where as in the classic matchers parser just the openmetrics escape sequences are supported. I'm not sure how much of an issue this is, but it might be something we might want to restrict in future.

Input	UTF-8	Classic
`foo=bar`	`{foo="bar"}`	`{foo="bar"}`
`foo==bar`	error	`{foo="=bar"}`
`foo=bar🙂`	`{foo="bar🙂"}`	`{foo="bar🙂"}`
`foo🙂=bar`	`{foo🙂="bar"}`	error
`foo=`	error	`{foo=""}`
`foo=\n`	`{foo="\\n"}`	`{foo="\n"}`
`foo=\t`	`{foo="\\t"}`	`{foo="\\t"}`
`foo=\`	`{foo="\\"}`	`{foo="\\"}`
`foo=\"`	error	`{foo="\""}`
`foo=\r`	`{foo="\r"}`	`{foo="\\r"}`
`foo=bar,`	`{foo="bar"}`	`{foo="bar"}`
`foo=bar,,`	error	error
`foo=,bar`	error	error
`foo=""`	`{foo=""}`	`{foo=""}`
`foo="\n"`	`{foo="\n"}`	`{foo="\\n"}`
`foo="\t"`	`{foo=" "}`	`{foo="\t"}`
`foo="\r"`	carriage return	`{foo="\\r"}`
`foo="bar,"`	`{foo="bar,"}`	`{foo="bar,"}`
`foo="bar,,"`	`{foo="bar,,"}`	`{foo="bar,,"}`
`"foo "=bar`	`{foo ="bar"}`	error
`今日は=世界`	`{今日は="世界"}`	error

beorn7 · 2023-10-25T12:24:46Z

I propose that we do not allow backticks outside of double quotes.

Do you mean backslash?

About the OM vs. strconv.Unquote dissonance: Note that this is nothing new. PromQL uses strconv.Unquote, and the OM escaping is used in the exposition format. I would tend towards strconv.Unquote because the OM escaping is rather specific (and specifically designed for the use case of an exposition format).

grobinson-grafana · 2023-10-25T12:30:41Z

I propose that we do not allow backticks outside of double quotes.

Do you mean backslash?

About the OM vs. strconv.Unquote dissonance: Note that this is nothing new. PromQL uses strconv.Unquote, and the OM escaping is used in the exposition format. I would tend towards strconv.Unquote because the OM escaping is rather specific (and specifically designed for the use case of an exposition format).

Yes that's what I meant! 😄

grobinson-grafana · 2023-10-25T12:31:10Z

Thanks for the clarification on strconv though, this is super helpful. In such case, I'm inclined to keep it.

grobinson-grafana · 2023-10-26T09:15:47Z

Here is the PR that rejects backslashes outside double quotes #3571 for reference.

* [CHANGE] Deprecate and remove api/v1/ #2970 * [CHANGE] Remove unused feature flags #3676 * [CHANGE] Newlines in smtp password file are now ignored #3681 * [CHANGE] Change compat metrics to counters #3686 * [CHANGE] Do not register compat metrics in amtool #3713 * [CHANGE] Remove metrics from compat package #3714 * [CHANGE] Mark muted alerts #3793 * [FEATURE] Add metric for inhibit rules #3681 * [FEATURE] Support UTF-8 label matchers #3453, #3507, #3523, #3483, #3567, #3568, #3569, #3571, #3595, #3604, #3619, #3658, #3659, #3662, #3668, 3572 * [FEATURE] Add counter to track alerts dropped outside of time_intervals #3565 * [FEATURE] Add date and tz functions to templates #3812 * [FEATURE] Add limits for silences #3852 * [FEATURE] Add time helpers for templates #3863 * [FEATURE] Add auto GOMAXPROCS #3837 * [FEATURE] Add auto GOMEMLIMIT #3895 * [FEATURE] Add Jira receiver integration #3590 * [ENHANCEMENT] Add the receiver name to notification metrics #3045 * [ENHANCEMENT] Add the route ID to uuid #3372 * [ENHANCEMENT] Add duration to the notify success message #3559 * [ENHANCEMENT] Implement webhook_url_file for discord and msteams #3555 * [ENHANCEMENT] Add debug logs for muted alerts #3558 * [ENHANCEMENT] API: Allow the Silences API to use their own 400 response #3610 * [ENHANCEMENT] Add summary to msteams notification #3616 * [ENHANCEMENT] Add context reasons to notifications failed counter #3631 * [ENHANCEMENT] Add optional native histogram support to latency metrics #3737 * [ENHANCEMENT] Enable setting ThreadId for Telegram notifications #3638 * [ENHANCEMENT] Allow webex roomID from template #3801 * [BUGFIX] Add missing integrations to notify metrics #3480 * [BUGFIX] Add missing ttl in pushhover #3474 * [BUGFIX] Fix scheme required for webhook url in amtool #3409 * [BUGFIX] Remove duplicate integration from metrics #3516 * [BUGFIX] Reflect Discord's max length message limits #3597 * [BUGFIX] Fix nil error in warn logs about incompatible matchers #3683 * [BUGFIX] Fix a small number of inconsistencies in compat package logging #3718 * [BUGFIX] Fix log line in featurecontrol #3719 * [BUGFIX] Fix panic in acceptance tests #3592 * [BUGFIX] Fix flaky test TestClusterJoinAndReconnect/TestTLSConnection #3722 * [BUGFIX] Fix crash on errors when url_file is used #3800 * [BUGFIX] Fix race condition in dispatch.go #3826 * [BUGFIX] Fix race conditions in the memory alerts store #3648 * [BUGFIX] Hide config.SecretURL when the URL is incorrect. #3887 * [BUGFIX] Fix invalid silence causes incomplete updates #3898 * [BUGFIX] Fix leaking of Silences matcherCache entries #3930 * [BUGFIX] Close SMTP submission correctly to handle errors #4006 Signed-off-by: SuperQ <[email protected]>

grobinson-grafana force-pushed the grobinson/label-matchers-parser branch from f446c6d to 3f0e90d Compare August 9, 2023 09:32

grobinson-grafana force-pushed the grobinson/label-matchers-parser branch from 3f0e90d to 24b7b52 Compare August 9, 2023 09:33

gotjosh reviewed Aug 9, 2023

View reviewed changes

matchers/compliance/compliance_test.go Outdated Show resolved Hide resolved

grobinson-grafana force-pushed the grobinson/label-matchers-parser branch 3 times, most recently from bb4e042 to e975ad6 Compare August 9, 2023 10:36

gotjosh reviewed Aug 9, 2023

View reviewed changes

grobinson-grafana force-pushed the grobinson/label-matchers-parser branch from da17465 to 687e615 Compare August 9, 2023 15:27

grobinson-grafana changed the title ~~Add label matchers parser~~ Support UTF-8 matchers: Add new matchers parser Aug 9, 2023

grobinson-grafana changed the title ~~Support UTF-8 matchers: Add new matchers parser~~ Support UTF-8 label matchers: Add new parser Aug 9, 2023

gotjosh reviewed Aug 11, 2023

View reviewed changes

grobinson-grafana commented Aug 14, 2023

View reviewed changes

matchers/parse/parse.go Outdated Show resolved Hide resolved

grobinson-grafana force-pushed the grobinson/label-matchers-parser branch from 5dd3efa to 858262d Compare August 15, 2023 07:36

armandgrillet mentioned this pull request Aug 18, 2023

Grafana 10 > Alerting > Add Silence > bad matcher format when label contains dot grafana/grafana#73369

Closed

grobinson-grafana force-pushed the grobinson/label-matchers-parser branch from 3e6453c to daa7cdf Compare August 19, 2023 12:23

grobinson-grafana added 16 commits September 1, 2023 17:08

Add comments for emit, next, rewind and skip

2122180

Signed-off-by: George Robinson <[email protected]>

Fix bug when skipping two or more columns

fad8236

Signed-off-by: George Robinson <[email protected]>

Feedback

2286467

Signed-off-by: George Robinson <[email protected]>

Fix comment

c67a5c1

Signed-off-by: George Robinson <[email protected]>

Fix comments

3fafba3

Signed-off-by: George Robinson <[email protected]>

Remove duplicated errors from parse.go

c24b758

Signed-off-by: George Robinson <[email protected]>

Remove check for open and closing braces in Matcher

ba09503

Signed-off-by: George Robinson <[email protected]>

Make Unquote a method of Token

222f85d

Signed-off-by: George Robinson <[email protected]>

Make all code package private

55e2cda

Signed-off-by: George Robinson <[email protected]>

Use position in errors

4d7acef

Signed-off-by: George Robinson <[email protected]>

Fix unwrapped errors

fca77b0

Signed-off-by: George Robinson <[email protected]>

Make errors package-private and remove function wrapping

c080f9d

Signed-off-by: George Robinson <[email protected]>

recover from panic on fatal error

47feed7

Signed-off-by: George Robinson <[email protected]>

Small non-functional changes to improve code structure and comments

ce06d33

Signed-off-by: George Robinson <[email protected]>

Fix mistake added in previous commit to accept

41d4fb6

Signed-off-by: George Robinson <[email protected]>

Consistent use of t as token variable

28c0d3a

Signed-off-by: George Robinson <[email protected]>

grobinson-grafana mentioned this pull request Sep 5, 2023

Support UTF-8 label matchers: Add test for PromQL braces when parsing lists of matchers #3507

Merged

gotjosh merged commit 353c0a1 into prometheus:main Sep 5, 2023
1 check passed

grobinson-grafana mentioned this pull request Oct 25, 2023

Support UTF-8 label matchers: Do not allow unquoted escape sequences #3571

Merged

armandgrillet mentioned this pull request Nov 30, 2023

Alerting: Having dot in label name prevents alert being delivered to prometheus alertmanager grafana/grafana#77872

Open

grobinson-grafana deleted the grobinson/label-matchers-parser branch April 16, 2024 14:44

grobinson-grafana mentioned this pull request Jul 5, 2024

Alerting: Support utf8_strict_mode: false in Mimir grafana/grafana#90092

Merged

SuperQ mentioned this pull request Oct 16, 2024

Release v0.28.0-rc.0 #4072

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support UTF-8 label matchers: Add new parser #3453

Support UTF-8 label matchers: Add new parser #3453

grobinson-grafana commented Aug 9, 2023 •

edited

Loading

grobinson-grafana commented Aug 9, 2023

gotjosh left a comment

grobinson-grafana commented Aug 9, 2023

gotjosh left a comment

gotjosh Aug 11, 2023

grobinson-grafana Aug 14, 2023

gotjosh Aug 15, 2023

grobinson-grafana Aug 22, 2023

beorn7 commented Aug 15, 2023

beorn7 commented Aug 15, 2023

grobinson-grafana commented Aug 16, 2023

beorn7 commented Aug 16, 2023

grobinson-grafana commented Aug 16, 2023

beorn7 commented Aug 16, 2023

grobinson-grafana commented Aug 16, 2023

grobinson-grafana commented Aug 16, 2023

beorn7 commented Aug 16, 2023 •

edited

Loading

grobinson-grafana commented Aug 16, 2023 •

edited

Loading

beorn7 commented Aug 16, 2023

grobinson-grafana commented Aug 16, 2023 •

edited

Loading

grobinson-grafana commented Aug 16, 2023

gotjosh commented Sep 5, 2023

grobinson-grafana commented Oct 25, 2023 •

edited

Loading

beorn7 commented Oct 25, 2023

grobinson-grafana commented Oct 25, 2023

grobinson-grafana commented Oct 25, 2023

grobinson-grafana commented Oct 26, 2023

Support UTF-8 label matchers: Add new parser #3453

Support UTF-8 label matchers: Add new parser #3453

Conversation

grobinson-grafana commented Aug 9, 2023 • edited Loading

What this pull request does

Motivation

Supported grammar

Breaking changes

All ^{}!=~,"'`\ and whitespace must be double quoted

Errors

Benchmarks

grobinson-grafana commented Aug 9, 2023

gotjosh left a comment

Choose a reason for hiding this comment

grobinson-grafana commented Aug 9, 2023

gotjosh left a comment

Choose a reason for hiding this comment

gotjosh Aug 11, 2023

Choose a reason for hiding this comment

grobinson-grafana Aug 14, 2023

Choose a reason for hiding this comment

gotjosh Aug 15, 2023

Choose a reason for hiding this comment

grobinson-grafana Aug 22, 2023

Choose a reason for hiding this comment

beorn7 commented Aug 15, 2023

beorn7 commented Aug 15, 2023

grobinson-grafana commented Aug 16, 2023

beorn7 commented Aug 16, 2023

grobinson-grafana commented Aug 16, 2023

beorn7 commented Aug 16, 2023

grobinson-grafana commented Aug 16, 2023

grobinson-grafana commented Aug 16, 2023

beorn7 commented Aug 16, 2023 • edited Loading

grobinson-grafana commented Aug 16, 2023 • edited Loading

beorn7 commented Aug 16, 2023

grobinson-grafana commented Aug 16, 2023 • edited Loading

grobinson-grafana commented Aug 16, 2023

gotjosh commented Sep 5, 2023

grobinson-grafana commented Oct 25, 2023 • edited Loading

beorn7 commented Oct 25, 2023

grobinson-grafana commented Oct 25, 2023

grobinson-grafana commented Oct 25, 2023

grobinson-grafana commented Oct 26, 2023

grobinson-grafana commented Aug 9, 2023 •

edited

Loading

beorn7 commented Aug 16, 2023 •

edited

Loading

grobinson-grafana commented Aug 16, 2023 •

edited

Loading

grobinson-grafana commented Aug 16, 2023 •

edited

Loading

grobinson-grafana commented Oct 25, 2023 •

edited

Loading