Fix multi-line comment bug #3

slorber · 2023-06-22T13:05:23Z

The tokenizer had a problem handling comment openings immediately followed by a line break.

Note, I did not update one of the tests (It renders within HTML elements) because I believe it does not seem to support multi-line comments in the first place (starting with line breaks or not): that is probably a separate bug to fix.

Previous text before edit:

WIP: for now it's just a unit test proof that this library has the bug reported here: facebook/docusaurus#9084

No CI, so proof of local failure with a simple test case change:

slorber · 2023-06-22T14:32:50Z

@leebyron PR is ready for review, tests are passing

cc @wooorm I'm not super comfortable with this micromark parsing logic so please let me know if you see any possible problem 🤪 that was a good opportunity to learn a bit.

Please let me know if you can review/merge/publish this soon because I need it for Docusaurus (facebook/docusaurus#9084).

Otherwise, I can publish my fork.
Or you can also give me npm/github permissions (slorber on both) in case you want another maintainer.

I tested the change on Docusaurus and it fixes our issue:

slorber · 2023-06-22T14:34:49Z

index.js

+    if (markdownLineEnding(code)) {
+      return atLineEnding(code);
+    }
+


This is the fix.

Not sure what I'm doing 🤪
But I assume we shouldn't effects.enter(types.data) if there's no data to consume on that line

blank lines (only occurs in the block/flow version) probably are broken from a quick glance. Otherwise looks good. See also https://github.com/micromark/micromark/blob/e10f892185d5616db6a9efad3a557ca1845d1843/packages/micromark-core-commonmark/dev/lib/html-text.js#L129

Thanks 👍

Not sure what you mean by "blank lines" 🤪 do you have an example I could use in a test?

, 

I tried these samples in tests and not sure to understand what you mean by "probably are broken". What do you think it the bad behavior these samples produce?

According to my local tests:

Those comments are removed when they should

Those comments are serialized back in their original form when they should

Just added those to the test in case you want to take a look.

Might have to run the tests with --conditions development to get instrumented code that checks if the extension works (so node test.mjs -> node --conditions development test.mjs). That’s not loaded normally because it would slow everyone down and increase the bundle size. See also: https://github.com/micromark/micromark#size--debug

My hunch is that there are empty tokens, which should not exist.

If that works, it’s all good.

Thanks, I'm running DEBUG="*" node --conditions development test/test.mjs and seeing the debug statements now 👍

The current code (even before my changes) has this assertion error at the comment end step:

Assertion: expected last token to be open. code=62

It looks like it does not like to consume just after an exit:

if (code === codes.greaterThan) { effects.exit(types.data); effects.consume(code); effects.exit("comment"); return ok(code); }

I'm not sure to understand the issue, that looks fine to me 🤔

Anyway, if I do this (which kind of feel useless?), now both tests and assertions are all passing:

if (code === codes.greaterThan) { effects.exit(types.data); effects.enter("commentEnd"); // NEW effects.consume(code); effects.exit("commentEnd"); // NEW effects.exit("comment"); return ok(code); }

Does it make sense to add the code above?

My hunch is that there are empty tokens, which should not exist.

Not sure how I can see those empty tokens. If there are no more assertion failures, does it means there are no empty tokens?

FYI this project we‘re discussing on has exactly one commit. It’s very likely that it doesn’t work well, is not used a lot in practise, and might be abandoned.

I'm not sure to understand the issue, that looks fine to me 🤔

It’s not. That’s why there’s an error: every byte has to be in something specific.

Anyway, if I do this (which kind of feel useless?), now both tests and assertions are all passing:

Yep, that’s good! That’s the important part: putting every byte into something. For remark, which has ASTs, that’s indeed useless. But micromark can be used to make CSTs, where every character is present.

Not sure how I can see those empty tokens. If there are no more assertion failures, does it means there are no empty tokens?

If there are no more errors, including for those blank line fixtures (), it’s good! 👍

If there are no more errors, including for those blank line fixtures (), it’s good! 👍

Thanks!

FYI this project we‘re discussing on has exactly one commit. It’s very likely that it doesn’t work well, is not used a lot in practise, and might be abandoned.

Yes I understand that, and as I wasn't sure Lee would answer/merge fast I just published @slorber/remark-comment with these PR fixes.

We'll use it on Docusaurus mostly to make the migration easier, according to what I see it seems good enough as a transitory measure. We have a flag for users to opt-out of this plugin once they have fully migrated to MDX comments.

slorber · 2023-06-22T14:35:26Z

test/test.mjs

+<!--
+has a multi-line comment 
+-->
+
+<!-- another 
+multi-line 
+comment -->


Apart the refactor to a template literal, only those lines were added and the rest remains unchanged

slorber · 2023-06-22T14:36:23Z

test/test.mjs

 and a paragraph
 `,
    { ast: true }
  ),
-  '<h1>This document</h1>\n\n<p>and a paragraph</p>'
+  '<h1>This document</h1>\n\n\n\n<p>and a paragraph</p>'


See your own test comment: the extra line breaks are expected

slorber · 2023-06-22T16:39:57Z

test/test.mjs

+<!--\\n\\n-->
+
+<!--\\na\\n\\nb\\n-->


those are successfully removed @wooorm

slorber · 2023-06-22T16:40:38Z

test/test.mjs

+<!--\\n\\n-->
+
+<!--\\na\\n\\nb\\n-->


those are successfully printed back @wooorm

proof of multiline comment bug

51f4ca3

slorber mentioned this pull request Jun 22, 2023

Does not support multi-line HTML comments? #2

Open

fix multi-line comments edge case

457a2f8

slorber marked this pull request as ready for review June 22, 2023 14:29

slorber commented Jun 22, 2023

View reviewed changes

slorber mentioned this pull request Jun 22, 2023

Multiline HTML comments throw a MDX compilation error in canary (3.0.0-alpha.0) facebook/docusaurus#9084

Closed

7 tasks

add "blank line" tests

5b69693

slorber commented Jun 22, 2023

View reviewed changes

slorber added 2 commits June 23, 2023 18:59

Add recommended fix from Titus

f496b9d

test with dev assertions

426dde7

slorber mentioned this pull request Oct 22, 2024

HTML comments hidden in texts but displayed in DocCard facebook/docusaurus#10589

Closed

7 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix multi-line comment bug #3

Fix multi-line comment bug #3

slorber commented Jun 22, 2023 •

edited

Loading

slorber commented Jun 22, 2023

slorber Jun 22, 2023

wooorm Jun 22, 2023

slorber Jun 22, 2023

wooorm Jun 22, 2023 •

edited

Loading

slorber Jun 22, 2023

wooorm Jun 22, 2023

slorber Jun 23, 2023

wooorm Jun 23, 2023

slorber Jun 23, 2023

slorber Jun 22, 2023

slorber Jun 22, 2023

slorber Jun 22, 2023

slorber Jun 22, 2023

		<!--\\n\\n-->

		<!--\\na\\n\\nb\\n-->

		<!--\\n\\n-->

		<!--\\na\\n\\nb\\n-->

Fix multi-line comment bug #3

Are you sure you want to change the base?

Fix multi-line comment bug #3

Conversation

slorber commented Jun 22, 2023 • edited Loading

slorber commented Jun 22, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

wooorm Jun 22, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

slorber commented Jun 22, 2023 •

edited

Loading

wooorm Jun 22, 2023 •

edited

Loading