internal/stack: Use control flow for state #110

abhinav · 2023-10-21T20:43:20Z

In anticipation of parsing more information from stack traces make the
stack trace parsing logic more manageable by moving it from a state
machine into a layout closer to a recursive descent parser.

That is, instead of a central loop that reads input line-by-line
and needs to manage its various states:

current, result := ...
for {
    input := read()
    if cond(input) {
        result.append(current)
        current = startNew(input)
    } else {
        current = accumulate(input)
    }
}
result = flush(current)

Break it down so that parsing of individual results is its own function,
representing the state machine via control flow.

result := ...
for {
    input := read()
    if cond(input) {
        result.append(parseOne())
    }
}

// where

func parseOne(input) {
    value := ...
    for ; !cond(input); input = read() {
        value = accumulate(input)
    }
    return value
}

The net effect of this is to make the parsing logic more maintainable
once it gets more complex -- adds more states.

For example, to parse more information for individual stacks
with a state machine, we'd have to make the main loop more complex.
State for an individual stack (e.g. "all the functions in the stack")
will leak into the state management for the whole state machine.
On the other hand, with this method, we'll only modify parseStack,
keeping its responsiblity encapsulated to parsing a single stack trace.

This idea was also demonstrated recently in the first section of
Storing Data in Control flow by Russ Cox.

To make it easy to write this parser, we switch from bufio.Reader
to bufio.Scanner, and wrap it with the ability to "Unscan":
basically "don't move forward on next Scan()".

Lastly, we need to bump the go directive in go.mod to Go 1.20
to allow use of errors.Join.

In anticipation of parsing more information from stack traces make the stack trace parsing logic more manageable by moving it from a state machine into a layout closer to a recursive descent parser. That is, instead of a central loop that reads input line-by-line and needs to manage its various states: current, result := ... for { input := read() if cond(input) { result.append(current) current = startNew(input) } else { current = accumulate(input) } } result = flush(current) Break it down so that parsing of individual results is its own function, representing the state machine via control flow. result := ... for { input := read() if cond(input) { result.append(parseOne()) } } // where func parseOne(input) { value := ... for ; !cond(input); input = read() { value = accumulate(input) } return value } The net effect of this is to make the parsing logic more maintainable once it gets more complex -- adds more states. For example, to parse more information for individual stacks with a state machine, we'd have to make the main loop more complex. State for an individual stack (e.g. "all the functions in the stack") will leak into the state management for the whole state machine. On the other hand, with this method, we'll only modify parseStack, keeping its responsiblity encapsulated to parsing a single stack trace. This idea was also demonstrated recently in the first section of [Storing Data in Control flow by Russ Cox][1]. [1]: https://research.swtch.com/pcdata#step --- To make it easy to write this parser, we switch from bufio.Reader to bufio.Scanner, and wrap it with the ability to "Unscan": basically "don't move forward on next Scan()".

This is needed for use of `errors.Join`.

codecov · 2023-10-21T20:44:37Z

Codecov Report

Merging #110 (aee45b6) into master (f995fdb) will increase coverage by 1.60%.
The diff coverage is 93.50%.

@@            Coverage Diff             @@
##           master     #110      +/-   ##
==========================================
+ Coverage   96.58%   98.18%   +1.60%     
==========================================
  Files           5        6       +1     
  Lines         234      276      +42     
==========================================
+ Hits          226      271      +45     
+ Misses          5        4       -1     
+ Partials        3        1       -2

Files	Coverage Δ
internal/stack/scan.go	`100.00% <100.00%> (ø)`
internal/stack/stacks.go	`95.14% <92.53%> (+6.41%)`	⬆️

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

internal/stack/scan_test.go

internal/stack/stacks.go

prashantv · 2023-10-22T13:00:42Z

internal/stack/stacks.go

-				fullStack: &bytes.Buffer{},
+			stack, err := p.parseStack(line)
+			if err != nil {
+				p.errors = append(p.errors, err)


does it make sense to continue scanning once we hit an error? especially since any error means we don't use the results

The only difference is that it'll report all the errors in all the traces instead of stopping at the first one.
I don't feel strongly about doing it this way. Do you think it's better not to?

Only reason not to do this is if a failure here can cause subsequent parses to fail. However, since this method only looks for the goroutine prefix, which wouldn't be affected by where we stopped parsing, I think this is fine.

internal/stack/scan.go

Adds support to the stack parser for reading the full list of functions for a stack trace. NOTE: The function that created the goroutine is NOT considered part of the stack. We don't maintain the order of the functions since that's not something we need at this time. The functions are all placed in a set. This unblocks #41 and allows implementing an IgnoreAnyFunction option (similar to the stalled #80 PR). Depends on #110

abhinav added 2 commits October 21, 2023 13:41

go.mod: Bump to Go 1.20

2f2c1d5

This is needed for use of `errors.Join`.

abhinav mentioned this pull request Oct 21, 2023

stack: Parse all functions #111

Merged

prashantv approved these changes Oct 22, 2023

View reviewed changes

abhinav added 4 commits October 22, 2023 09:16

test: Verify Unscan returns the same token

7de2b92

getStacks: Explain the panic, include the stack trace

7973cbb

stackParser.Parse: Don't nest success case

6396c7f

doc(Unscan): Clarify that it doesn't move the token

aee45b6

abhinav merged commit 25cbb67 into master Oct 22, 2023
6 of 7 checks passed

abhinav deleted the parse-flow branch October 22, 2023 18:15

mway mentioned this pull request Oct 24, 2023

Release v1.3.0 #115

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

internal/stack: Use control flow for state #110

internal/stack: Use control flow for state #110

abhinav commented Oct 21, 2023

codecov bot commented Oct 21, 2023 •

edited

Loading

prashantv Oct 22, 2023

abhinav Oct 22, 2023

prashantv Oct 22, 2023

internal/stack: Use control flow for state #110

internal/stack: Use control flow for state #110

Conversation

abhinav commented Oct 21, 2023

codecov bot commented Oct 21, 2023 • edited Loading

Codecov Report

prashantv Oct 22, 2023

Choose a reason for hiding this comment

abhinav Oct 22, 2023

Choose a reason for hiding this comment

prashantv Oct 22, 2023

Choose a reason for hiding this comment

codecov bot commented Oct 21, 2023 •

edited

Loading