Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Grammer Endless Loop detector triggers inappropriately #145

Closed
msftrncs opened this issue Jan 22, 2021 · 3 comments · Fixed by #146
Closed

Grammer Endless Loop detector triggers inappropriately #145

msftrncs opened this issue Jan 22, 2021 · 3 comments · Fixed by #146
Labels
bug Issue identified by VS Code Team member as probable bug
Milestone

Comments

@msftrncs
Copy link
Contributor

msftrncs commented Jan 22, 2021

Trying to improve an IEC 61131 language grammar, I ran across a construct that incorrectly triggers the '[2] - Grammar is in an endless loop - Grammar pushed the same rule without advancing' debug.

The fact is that the grammar was NOT in an endless loop, but did push the same rule twice in a row, for completely different parts of the document.

The triggering of the debug is not dependent on the rule occurring twice on the same place in the document, only that the last rule on the stack is the same rule, and the rule did not capture any content (EDIT) due to both being an empty match and other rules having matched the document immediately in front of it.

CASE hello OF
    3:
        CASE hello OF
            4:
        END_CASE
END_CASE

In this example, the same rule is applied to both CASE keywords using a positive lookahead, thus the cursor doesn't advance when the rule triggers again for the second one. There are no other worthwhile rules to trigger within the statements, though rules do trigger for the expression and the OF, then again for the numeric evaluator and the :, but no other push rules to cover the stack. (EDIT) There is a rule in this grammar which has scoped the spaces preceeding the CASE and that is dependency in this issue.

The trigger only utilizes !hasAdvanced (which the begin rule did not meet because its a positive lookahead) and that the matching ruleID matches the previously stacked ruleId. I think the use of !hasAdvanced is incorrect here, as it only indicates if the current rule match moved, and does not indicate if the anchorPos has moved since the previous push of the same rule.

Interestingly, if one thinks about this, it would be possible to detect a multi-rule non-advancing loop by using the anchorPos values, by comparing the ruleId of all the rules on the stack that have the same non -1 value, if more than 1 rule on the stack have the same anchorPos (not -1) and ruleId then there has been a non-advancing loop. Maybe this loop detection can be written as a method similar to hasSameRuleAs method, since it would be needed for both BeginEnd and BeginWhile rules.

just a snippet of the grammar file:

{
	"begin": "\\b(?=CASE\\b)",
	"end": "\\b(?:END_CASE\\b|(?=END_(?:PROGRAM|INTERFACE|FUNCTION(?:_BLOCK)?|METHOD|PROPERTY|ACTION)\\b))",
	"endCaptures": {
		"0": {
			"name": "keyword.control.case.end.st"
		}
	},
	"name": "meta.switch.st",
	"patterns": [
		{
			"contentName": "meta.switch.expression.st",
			"begin": "\\G(?:CASE)",
			"beginCaptures": {
				"0": {
					"name": "keyword.control.case.st"
				}
			},
			"end": "\\b(?:OF\\b|(?=END_(?:CASE|PROGRAM|INTERFACE|FUNCTION(?:_BLOCK)?|METHOD|PROPERTY|ACTION)\\b))",
			"endCaptures": {
				"0": {
					"name": "keyword.control.case.of.st"
				}
			},
			"patterns": [
				{
					"include": "$self"
				}
			]
		},
		{
			"name": "punctuation.separator.st",
			"match": ":(?!=)"
		},
		{
			"include": "$self"
		}
	]
},
{
	"begin": "^(?= )",
	"end": "(?=[^ ])",
	"name": "meta.leading-space",
	"patterns": [
		{
			"captures": {
				"1": {
					"name": "meta.odd-tab.spaces"
				},
				"2": {
					"name": "meta.even-tab.spaces"
				}
			},
			"match": "(  )(  )?"
		}
	]
},
@msftrncs
Copy link
Contributor Author

Would the following, as a method on StackElement seem acceptable? I'll roll this in to a PR if there isn't some reason this cannot be used. Its working in the case described above allowing the successive matches on the same rule because they are on different lines, while making an intentional loop with a single rule properly triggers the debug.

public hasEndlessLoop(anchorPosition: number): boolean {
    if (anchorPosition === -1) {
        // method is unavailable
        return true;
    }

    let se = this as StackElement;

    while (anchorPosition === se._anchorPos && se.parent !== nul) {
        se = se.parent;
	if (this.hasSameRuleAs(se)) {
	    // endless loop detected
	    return true;
        }
    }
    return false;
}

msftrncs added a commit to msftrncs/vscode-textmate that referenced this issue Jan 22, 2021
For BeginEnd and BeginWhile rules, improves the endless loop detection,
detecting multiple rule deep loops and not falsing detecting successively
repeating rules that did not occur at the same place.

Fixes microsoft#145.
@msftrncs
Copy link
Contributor Author

While working on testing scenarios, I determined I missed a detail as to reproduction. I'll edit the original post to include this:

The issue is dependent on another BeginEnd rule ending immediately before the matching of the rule that is triggering the endless loop debug. In this case, the language has odd/even spaces rules that scope out odd and even spaces, and the rules end (?=[^ ]) so it ends in front of the CASE, which causes there to be no cursor movement before the positive lookahead match.

@msftrncs
Copy link
Contributor Author

To more generically describe the issue (and to better match test patterns I will include in a PR), assume the following document:

abc
test this line
not
    not
        not

With the grammar below (extended from the First-Mate test suite), three patterns of possible endless loops is demonstrated; 1 currently caught endless loop (base pattern), 1 multiple-rule endless loop not currently caught (test), and 1 rule incorrectly caught as an endless loop (not_a_problem).

{
	"name": "infinite-loop-grammar",
	"scopeName": "source.infinite-loop",
	"patterns": [
		{
			"name": "start",
			"begin": "\\A",
			"end": "$",
			"patterns": [
				{
					"name": "negative-look-ahead",
					"match": "(?!a)"
				}
			]
		},
		{
			"include": "#test"
		},
		{
			"include": "#not_a_problem"
		}
	],
	"repository": {
		"test": {
			"name": "test",
			"begin": "(?=test)",
			"end": "$",
			"patterns": [
				{
					"include": "#test_this"
				}
			]
		},
		"test_this": {
			"name": "test_this",
			"begin": "(?=test this)",
			"end": "$",
			"patterns": [
				{
					"include": "#test_this_line"
				}
			]
		},
		"test_this_line": {
			"name": "test_this_line",
			"begin": "(?=test this line)",
			"end": "$",
			"patterns": [
				{
					"include": "#test"
				}
			]
		},
		"spaces":
		{
			"name": "spaces",
			"begin": "^(?=\\s)",
			"end": "(?=\\S)"
		},
		"not_a_problem":
		{
			"name": "not_a_problem",
			"begin": "(?=not)",
			"end": "\\z",
			"patterns": [
				{
					"name": "not",
					"match": "\\Gnot"
				},
				{
					"include": "#not_a_problem"
				},
				{
					"include": "#spaces"
				}
			]
		}
	}
}

msftrncs added a commit to msftrncs/vscode-textmate that referenced this issue Jan 24, 2021
@alexdima alexdima added this to the February 2021 milestone Jan 29, 2021
@alexdima alexdima added the bug Issue identified by VS Code Team member as probable bug label Jan 29, 2021
sebthom added a commit to sebthom/tm4e that referenced this issue Apr 28, 2022
sebthom added a commit to eclipse-tm4e/tm4e that referenced this issue Apr 28, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Issue identified by VS Code Team member as probable bug
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants