You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Nokogiri gem doesn’t handle HTML entities other than &, < ,> , " , and ', the rest of the entities are ignored/replaced, but they are valid input in MathML.
Why do we have to use any HTML entities when we can use the character codes?
@ronaldtse, we do not need to use HTML entities, but MathML editors (MathJax for example) does support HTML entities and some examples also contain HTML entities (∑ and ∏ for example).
Also, µ is available in the prefixes.yaml file in UnitsDB for HTML reference, which is used for MathML conversion in Unitsml-Ruby.
I see, so this is purely for supporting bad XML (bad MathML editors): MathML that contains HTML entities.
When Plurimath parses HTML or MathML, sure it can accept HTML entities. But when it outputs MathML, there is no reason for it to output HTML entities, which is unsupported in XML.
I don’t know how we can make Nokogiri support them, in my memory the Nokogiri HTML parser is needed.
HTML Entities have caused me issues in the past, because they will turn up in markup and they are not guaranteed to be supported by Nokogiri at all: I did indeed need to use the Nokogiri HTML parser in Metanorma, and when Nokogiri forced me to stop doing so, I instead converted all HTML entities in Metanorma Asciidoc to XML entities in preprocessing: metanorma/metanorma-iso#666
And HTML entities will turn up in markup. Declining to support them in reading documents is not an option.
Nokogiri gem doesn’t handle HTML entities other than
&
,<
,>
,"
, and'
, the rest of the entities are ignored/replaced, but they are valid input in MathML.Issue faced while MathML parsing in plurimath/mml#2.
@ronaldtse @HassanAkbar should we consider Ox for this issue or is this implementable in Lutaml-Model?
The text was updated successfully, but these errors were encountered: