Rustdoc doctest attribute splitting is probably too liberal #78344

casey · 2020-10-25T06:26:26Z

I was thinking about proposing a new doctest attribute of the form name=TEXT that would allow giving a documentation tests names which would be printed when they run.

I was looking at what it would take to implement this change, so I checked out the way that doctest attributes are parsed from markdown info strings, and found that they are split using the following code:

let tokens = string.split(|c: char| !(c == '_' || c == '-' || c.is_alphanumeric()));

("info string" is the name that common mark and GFM use for the text that follows a ```.)

This conflicts with the name=foo syntax that I was hoping to use, since the attribute would be split on the =.

I think this behavior also conflicts with how people think the feature works, since a few people told me that attributes are split on ,, which is also what Clippy does.

My feeling is that the current attribute splitting is probably too liberal in what it accepts, and prevents otherwise desirable attributes, such as attributes of the form foo=bar, or attributes with more freeform text, from being possible.

A few thoughts:

Is this worth worrying about at all? It's unfortunate, but it's not a huge deal, since workarounds like having attributes of the form foo-bar are possible.
If nobody (according to crater) is relying on the liberal splitting, would it be acceptable to consider this to be a bug, and change it to just split on ,? Of course, this could break non-public code.
Would it be worth doing as part of an edition? It would be pretty easy to automatically transform info strings from being whatever-split to being comma split as part of an edition upgrade. It would also be easy to create a lint or warning for it, just check that the current splitting rules and comma splitting produce the same attributes, and warn if they don't.

The text was updated successfully, but these errors were encountered:

jyn514 · 2020-10-25T09:25:09Z

cc @rust-lang/rustdoc: this proposes changing the doctest attribute parsing to only split on ,, not anything else. Personally I'm in favor - what do you think?

GuillaumeGomez · 2020-10-25T11:11:24Z

The handling of ignore will need a small update but otherwise sounds good to me. Also, being able to name a doctest is a good idea.

casey · 2020-10-26T22:40:35Z

I'm happy to submit a PR to change this, but I wonder if perhaps this is something that should be brought to the attention of more people, considering the possibility for breakage.

jyn514 · 2020-10-26T22:42:37Z

@casey go ahead and make the PR and we can start an FCP for the breaking change there.

casey · 2020-10-27T02:27:18Z

@jyn514 Sounds good, I just opened #78429 with the change and some notes.

jyn514 added the T-rustdoc Relevant to the rustdoc team, which will review and decide on the PR/issue. label Oct 25, 2020

jyn514 added the T-lang Relevant to the language team, which will review and decide on the PR/issue. label Oct 25, 2020

camelid added the A-doctests Area: Documentation tests, run by rustdoc label Oct 25, 2020

casey mentioned this issue Oct 27, 2020

[librustdoc] Only split lang string on ,, , and \t #78429

Merged

jyn514 added the A-markdown-parsing Area: Markdown parsing for doc-comments label Nov 12, 2020

bors closed this as completed in d95d304 Feb 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rustdoc doctest attribute splitting is probably too liberal #78344

Rustdoc doctest attribute splitting is probably too liberal #78344

casey commented Oct 25, 2020 •

edited

Loading

jyn514 commented Oct 25, 2020

GuillaumeGomez commented Oct 25, 2020

casey commented Oct 26, 2020

jyn514 commented Oct 26, 2020

casey commented Oct 27, 2020

Rustdoc doctest attribute splitting is probably too liberal #78344

Rustdoc doctest attribute splitting is probably too liberal #78344

Comments

casey commented Oct 25, 2020 • edited Loading

jyn514 commented Oct 25, 2020

GuillaumeGomez commented Oct 25, 2020

casey commented Oct 26, 2020

jyn514 commented Oct 26, 2020

casey commented Oct 27, 2020

casey commented Oct 25, 2020 •

edited

Loading