Refactor directive and role parsing #181

fwkoch · 2023-02-03T20:18:49Z

Context:
Over the past year we have been coming up against some of the limitations of early design around the markdown-it tokenizer. There were many things that are in that library that are now either duplicated or deprecated (e.g. references and state management), there are also challenges in having errors being reported to the CLI, jupyter or web context in a consistent way. Some errors are not possible to report until much later (i.e. after transformations), and some errors need to be much more lenient (e.g. parser errors). We also need to start thinking about adding myst language extensions (e.g. tabs, diagrams) in a way that work across all serializers and are agnostic to the tokenizer being used (e.g. unified or markdown-it).

Current state:

markdown-it-docutils - markdown-it plugin to handle tokenizing roles and directives. In addition to generic roles/directives, it introduces special token stream behaviour for admonitions, code, math, etc etc.
markdown-it-myst-extras - markdown-it parsing support for additional markdown syntax features including colon fence, block breaks, targets, comments
mystjs - creates a parser with markdown-it-docutils plugin and extensibility to add more markdown-it tokenizers for new directives/roles, defines tokens-to-myst, including the special roles/directives from markdown-it-docutils, defines basic transforms, includes myst-to-hast
myst-cli - consumes and extends mystjs, makes it usable from CLI, defines a bunch of new directives/roles (in a markdown-it, token-y way, i.e. has to define parsing to token stream then transforming to mdast)
myst-transforms - a bunch of additional mdast transforms consumed by myst-cli
myst-ext-card/grid/tabs - directives previously defined in myst-cli, pulled into separate subpackages

Desired state:

markdown-it-docutils - continues to exist as-is to support existing vscode integration, unused in mystjs
markdown-it-myst-extras - unchanged, continues to be used by mystjs as-is
markdown-it-myst - pulls out basic role/directive tokenizing from markdown-it-docutils. No custom tokens for any specific roles/directives, instead, they are all just mystRole/Directive with args, options, value and parsed_args/options/value
mystjs - no longer exists in it's previous capacity. Lets us potentially rename myst-cli to mystjs?
myst-parser - this does what mystjs used to do for converting markdown-it token stream to mdast. However, it moves the plugin functionality for new directives to come after that conversion. I.e. all directives/roles become some sort of rawMystDirective node with mystDirectiveArgs, mystDirectiveOptions, etc. children... then new directives/roles are just transforms of these nodes. We do not want to have to make any decisions about parsing, nor ever have new directives/roles touch the token stream.
myst-directives / myst-roles - home for the core directives and roles currently defined in markdown-it-docutils. These will look very different since they are dealing in mdast transforms, not markdown-it tokenizers. These will come into myst-parser as defaults.
myst-to-html - stashes mdast-to-hast stuff from mystjs
myst-ext-* - structured definitions of new roles/directives, info about arguments, options defined as data, and functions for "validate" and "transform." Eventually this will be a place for additional, directive/role-specific myst-to-* functionality for all the new node types that are created by "transform."

The text was updated successfully, but these errors were encountered:

rowanc1 · 2023-02-16T21:48:22Z

This has largely been completed in #184, I think to close this issue we should:

add/update the readmes of myst-parser with the above information
update anything from bringing this into other contexts like JupyterLab and the theme demo.

rowanc1 · 2023-02-18T20:56:42Z

This has fully landed with myst v0.1.15. 🚀

tavin · 2023-05-09T09:59:04Z

If you have to work with markdown-it and markdown-it-myst how do you cause directives (e.g. admonitions) to actually be rendered?

rowanc1 · 2023-05-13T14:44:19Z

I think the best path for now is either (1) sticking with markdown-it-docutils for now; (2) introduce a tokenizer transformer on top of the markdown-it-myst layer that modifies the token stream back to an HTML-focused export for use inside of markdown-it; or (3) if you are in control of the render process (you might not be depending on the use case), you can use something like myst-to-html after you get an AST out.

I think the best path is probably (2), but it is also probably a decent amount of work. Sticking with (1) should be mostly fine, there haven't been substantial changes at that level, mostly just allowing errors to propagate to the CLI and changing/simplifying the extension mechanism.

tavin · 2023-05-26T18:55:44Z

Sticking with (1) is already infeasible due to obsolescence :)

fwkoch added the enhancement New feature or request label Feb 3, 2023

fwkoch self-assigned this Feb 3, 2023

This was referenced Feb 3, 2023

🔌 Add card, grid, tabs directive plugins executablebooks/jupyterlab-mystjs#25

Merged

💥 Role / Directive Refactor #184

Merged

rowanc1 mentioned this issue Feb 7, 2023

Audit of Features missing in JupyterBook #189

Open

71 tasks

fwkoch mentioned this issue Feb 9, 2023

👩‍💻 Implement new directive / role specs #206

Merged

rowanc1 mentioned this issue Feb 15, 2023

Text formatting role with {underline}hello not working in online editor #221

Closed

rowanc1 mentioned this issue Feb 17, 2023

Inline code within the caption of a code-block directive does not render #230

Closed

This was referenced Feb 18, 2023

📖 Update package lists in README #240

Merged

🧜‍♀️ Upgrade to new MyST markdown parsers jupyter-book/jupyterlab-myst#88

Merged

rowanc1 mentioned this issue May 13, 2023

What is the status of this project? executablebooks/markdown-it-docutils#48

Open

tavin mentioned this issue May 26, 2023

Proposal: markdown-it-htmyst #396

Draft

cmarmo added this to Jupyterlab for education pyData Paris 2024 sprint Sep 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor directive and role parsing #181

Refactor directive and role parsing #181

fwkoch commented Feb 3, 2023 •

edited by rowanc1

Loading

rowanc1 commented Feb 16, 2023 •

edited

Loading

rowanc1 commented Feb 18, 2023

tavin commented May 9, 2023

rowanc1 commented May 13, 2023

tavin commented May 26, 2023

Refactor directive and role parsing #181

Refactor directive and role parsing #181

Comments

fwkoch commented Feb 3, 2023 • edited by rowanc1 Loading

rowanc1 commented Feb 16, 2023 • edited Loading

rowanc1 commented Feb 18, 2023

tavin commented May 9, 2023

rowanc1 commented May 13, 2023

tavin commented May 26, 2023

fwkoch commented Feb 3, 2023 •

edited by rowanc1

Loading

rowanc1 commented Feb 16, 2023 •

edited

Loading