[ refactor ] Multiline error report #1155

andrevidela · 2021-03-04T05:12:59Z

Parser errors can now take bounds for better diagnostics.
Fixes "Reporting single-quoted multi-line strings" #1133.

andrevidela · 2021-03-05T20:15:00Z

ready for review @gallais

tests/idris2/perror008/expected

andylokandy · 2021-03-06T11:36:31Z

I'm still confused about why WithBounds tok, which could be NoBounds, is enforced, when you could return tok directly when no bounds are provided.

andrevidela · 2021-03-06T12:18:25Z

It's explained in the commit message. Essentially, WithBounds used to always carry bounds, but it also had a flag indicating whether the bounds were valid. This change makes it so that we know from the type whether a term has valid bounds, with the additional invariant that bounds are only valid if the grammar consumes its input. That is, I assume that the token does not come from the input if the grammar is not guaranteed to consume. This is not quite true, because peek and alt can both return valid bounds without being guaranteed to consume, but it's enough for most uses of the bound constructor. Bounds from non-consuming grammars can be recovered with clever use of position.

Almost all signatures return WithBounds True, which means the parsed terms need to have valid bounds. That does not mean the flag is unnecessary: it means that every term that does not have valid bounds needs to be either merged with one that does (like in https://github.com/idris-lang/Idris2/pull/1155/files#diff-28f7a243932138226d35676705da183ae3ff559d1683ef8afb779e850004f378R692-R693) or given bounds (like in https://github.com/idris-lang/Idris2/pull/1155/files#diff-28f7a243932138226d35676705da183ae3ff559d1683ef8afb779e850004f378R855).

Edit: I just realised you're suggesting returning the token directly when there are no bounds. I haven't tried that, but I suspect it would make mergeBounds harder to use because it would then need to know in advance what the bounds are:

current:

mergeBounds : WithBounds lb ty -> WithBounds rb ty' -> WithBounds (lb || rb) ty'

updated:

mergeBounds : Either tok (WithBounds tok) -> Either tok (WithBounds tok) -> Either tok (WithBounds tok)

It doesn't even capture the invariant properly, since if either argument has valid bounds the result should have valid bounds. We could update this to carry proofs about Either, but at that point the original definition of mergeBounds seems better suited.
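
To make the comparison concrete, here is a minimal, self-contained sketch of a Bool-indexed mergeBounds. It is not the code from this PR: the Bounds record layout, the HasBounds constructor name, the cover helper, and the example values are assumptions made for illustration. Only NoBounds and the mergeBounds signature come from the discussion above.

```idris
-- Simplified stand-in for source bounds (assumed layout, not the PR's code).
record Bounds where
  constructor MkBounds
  startLine : Int
  startCol : Int
  endLine : Int
  endCol : Int

-- The Bool index records whether the payload carries valid bounds.
-- HasBounds is a hypothetical constructor name.
data WithBounds : Bool -> Type -> Type where
  NoBounds : ty -> WithBounds False ty
  HasBounds : Bounds -> ty -> WithBounds True ty

-- Span from the start of the left bounds to the end of the right bounds.
-- Assumes the left argument comes first in the source.
cover : Bounds -> Bounds -> Bounds
cover (MkBounds sl sc _ _) (MkBounds _ _ el ec) = MkBounds sl sc el ec

-- The result has valid bounds as soon as either argument does.
mergeBounds : WithBounds lb ty -> WithBounds rb ty' -> WithBounds (lb || rb) ty'
mergeBounds (NoBounds _) (NoBounds y) = NoBounds y
mergeBounds (NoBounds _) (HasBounds r y) = HasBounds r y
mergeBounds (HasBounds l _) (NoBounds y) = HasBounds l y
mergeBounds (HasBounds l _) (HasBounds r y) = HasBounds (cover l r) y

-- A keyword token that has bounds, merged with an empty, unbounded block.
keyword : WithBounds True ()
keyword = HasBounds (MkBounds 0 0 0 2) ()

emptyBlock : WithBounds False (List String)
emptyBlock = NoBounds []

-- Accepted because True || False reduces to True during type checking.
merged : WithBounds True (List String)
merged = mergeBounds keyword emptyBlock
```

In this sketch the caller never states which argument has bounds; the index of the result is computed from the indices of the arguments.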

tests/idris2/interface016/expected

src/Idris/Parser.idr

andylokandy · 2021-03-06T16:49:20Z

It doesn't even capture the invariant properly since if either the bounds are valid the result should have valid bounds, we could update this to carry proofs about Either but at this point it seems the original definition of mergeBounds is better suited.

Is there any concrete case where the proof is used?

andrevidela · 2021-03-06T20:01:12Z

Basically, everywhere the parser uses mergeBounds it expects valid bounds, but either argument might lack bounds. Typically, when parsing do blocks, we have the symbol do, which always has bounds, but the block might be empty and have no bounds. The entire parsed term still has bounds because we merge the block's bounds with those of the keyword do: https://github.com/idris-lang/Idris2/pull/1155/files#diff-28f7a243932138226d35676705da183ae3ff559d1683ef8afb779e850004f378R693

andylokandy · 2021-03-07T14:26:24Z

I still can't figure out why the type-level proof of valid bounds is necessary. (1) mergeBounds worked before this PR. (2) I can't find anything that uses WithBounds False in this PR.

andrevidela · 2021-03-07T17:30:53Z

The type-level proof is necessary in order to have a correct implementation of .getBounds.
.getBounds is necessary in order to get the bounds of a symbol and forward them to failLoc, which gives new bounds to an error.
This is necessary because, with the previous implementation, errors were reported with the bounds of the next token to be parsed.
Intuitively, you would think that the old implementation of Fail could have taken a WithBounds tok (instead of Bounds) and manually added it as the next token. However, this does not work because the type tok does not necessarily match the type of the token parsed. Typically, symbols return (), which is not PTerm.
Therefore we need to use Bounds, therefore we need a reliable way to get a Bounds, therefore we need an indication of when they are valid, hence the Bool flag.

I can't find anything that uses WithBounds False in this PR.

Every call to the constructor NoBounds is of type WithBounds False. Typically, the position primitive of the parser returns WithBounds False Bounds, because the value returned does not come from a valid position in the file being parsed.
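
The chain of reasoning above can be sketched as follows. This is an illustration under stated assumptions, not the PR's implementation: HasBounds, ParseError, failAt (a stand-in for failLoc), and the example token are hypothetical names, and getBounds is written as a plain function rather than the .getBounds projection.

```idris
-- Simplified stand-in for source bounds (assumed layout).
record Bounds where
  constructor MkBounds
  startLine : Int
  startCol : Int
  endLine : Int
  endCol : Int

-- Bool-indexed wrapper; HasBounds is a hypothetical constructor name.
data WithBounds : Bool -> Type -> Type where
  NoBounds : ty -> WithBounds False ty
  HasBounds : Bounds -> ty -> WithBounds True ty

-- Only callable when the index is True, so no fallback value is needed.
getBounds : WithBounds True ty -> Bounds
getBounds (HasBounds b _) = b

-- Hypothetical error type: a message plus the span it refers to.
record ParseError where
  constructor MkParseError
  message : String
  location : Bounds

-- Stand-in for failLoc: it takes Bounds out of any bounded value,
-- so the payload type (here often ()) does not matter.
failAt : WithBounds True ty -> String -> ParseError
failAt tok msg = MkParseError msg (getBounds tok)

-- A symbol token carries () as its payload, yet still yields bounds.
openQuote : WithBounds True ()
openQuote = HasBounds (MkBounds 3 4 3 5) ()

unterminated : ParseError
unterminated = failAt openQuote "unterminated string literal"
```

With these definitions, an application such as getBounds (NoBounds ()) is rejected by the type checker, because WithBounds False () does not match WithBounds True ty.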

zhongzc · 2021-03-08T16:25:45Z

I think it should be fine to return an empty (col 0, row 0) Bound when no bound is given, just like isIrrelevant in the previous implementation. It's rare for a term to have no bound, and eoi in blockEntry can be assigned a bound at the last row and the last column plus one. So there is no need to carry the proof at the type level.

andrevidela · 2021-03-08T18:54:02Z

Yes, this is correct. Technically speaking, you don't need the additional invariant because you can return a sentinel value just fine. In fact, this is what most software does in practice: programming languages like Go or C do this routinely since they do not have ADTs. This is also what the previous implementation did by returning bounds of (-1, -1).

However, I don't believe this is the right move when developing software. We have the opportunity to both document and enforce an invariant that, if broken, we know will result in inaccurate diagnostics (which are particularly hard to notice since they don't cause a runtime error). Maybe you're worried about runtime performance, but it should be the same, because the flag has simply moved position in the struct that the codegen uses for WithBounds. It is now handled automatically by the compiler when matching on it, rather than checked manually by the programmer. If you are really worried about runtime performance we can try to erase the flag, but since every operation on it is O(1) I do not expect any measurable gain.
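
For contrast, the sentinel approach can be sketched like this. It is a simplified illustration, not a copy of the previous implementation: the field name isIrrelevant and the -1 placeholder come from this thread, while the record layout, MkBounded, irrelevantBounds, checkedBounds, and uncheckedBounds are assumed names.

```idris
-- Simplified stand-in for source bounds (assumed layout).
record Bounds where
  constructor MkBounds
  startLine : Int
  startCol : Int
  endLine : Int
  endCol : Int

-- Unindexed variant: validity is a runtime field rather than a type index.
record WithBounds ty where
  constructor MkBounded
  val : ty
  isIrrelevant : Bool
  bounds : Bounds

-- Values without a source position get placeholder bounds of -1.
irrelevantBounds : ty -> WithBounds ty
irrelevantBounds x = MkBounded x True (MkBounds (-1) (-1) (-1) (-1))

-- Callers must remember to check the flag; nothing enforces it.
checkedBounds : WithBounds ty -> Maybe Bounds
checkedBounds (MkBounded _ True _) = Nothing
checkedBounds (MkBounded _ False b) = Just b

-- Forgetting the check still type-checks and silently yields the sentinel.
uncheckedBounds : WithBounds ty -> Bounds
uncheckedBounds (MkBounded _ _ b) = b
```

Both accessors compile in this sketch, and that is the trade-off under discussion: the sentinel version is lighter on syntax, but a diagnostic built from uncheckedBounds would point at line -1 without any error being raised.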

andylokandy · 2021-03-09T08:32:04Z

I agree with @zhongzc, but my concern is more about the balance between making assumptions explicit and development efficiency. Even if Idris is more expressive than other languages, we should still think twice before using that power: is it necessary? Does it bring runtime cost? Is it harder to understand? In this case, I don't think the overwhelming syntactic noise is justified by the single (maybe nonexistent) special case. Moreover, I think the special case is still handled well by the internal dynamic isIrrelevant field in WithBounds, which just moves the type-level Bool to runtime.

andrevidela · 2021-03-13T12:50:39Z

I couldn't figure out what was wrong with the out-of-files bounds so I've reverted the changes to WithBounds @gallais

src/Libraries/Text/Bounded.idr

andylokandy · 2021-03-15T15:36:10Z

@andrevidela A bug breaking the CI has been fixed by #1189. You need to rebase on master.

andrevidela force-pushed the multiline-error-report branch from 24c2828 to 1d692da, March 5, 2021 18:58

andrevidela changed the title ~~WIP - Multiline error report~~ Multiline error report Mar 5, 2021

andrevidela requested a review from gallais March 5, 2021 20:15

andylokandy reviewed Mar 6, 2021

tests/idris2/perror008/expected (outdated)

gallais reviewed Mar 6, 2021

tests/idris2/interface016/expected (outdated)

ShinKage reviewed Mar 6, 2021

src/Idris/Parser.idr (outdated)

andrevidela force-pushed the multiline-error-report branch from 1d692da to 359667a, March 13, 2021 12:49

gallais reviewed Mar 15, 2021

src/Libraries/Text/Bounded.idr (outdated)

gallais added enhancement error: reporting labels Mar 15, 2021

andrevidela force-pushed the multiline-error-report branch from 359667a to 05cc5b7, March 15, 2021 15:06

andrevidela force-pushed the multiline-error-report branch from 05cc5b7 to 485057c, March 15, 2021 15:38

Add location for parsing errors

36e2de9

andrevidela force-pushed the multiline-error-report branch from 485057c to 36e2de9, March 15, 2021 15:52

gallais changed the title ~~Multiline error report~~ [ refactor ] Multiline error report Mar 16, 2021

gallais merged commit 405c266 into idris-lang:master Mar 16, 2021
