From 737aa68967b1a23cf4596f558014ed3b17b31e97 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Andr=C3=A9=20Letterer?= <48132449+111andre111@users.noreply.github.com> Date: Sat, 23 May 2020 00:47:05 +0200 Subject: [PATCH 1/2] non valid XML Content documentation Added documentation regarding non valid characters --- docs/index.asciidoc | 9 +++++++++ 1 file changed, 9 insertions(+) diff --git a/docs/index.asciidoc b/docs/index.asciidoc index 437eb52..2bcde38 100644 --- a/docs/index.asciidoc +++ b/docs/index.asciidoc @@ -99,6 +99,15 @@ By default the parser is not strict and thus accepts some invalid content. Currently supported options are: - `strict` - forces the parser to fail early instead of accumulating errors when content is not valid xml. + +This still means that non allowed control characters like for instance ASCII 0x0 always mean a non valid XML. + +XML 1.0 Spec: https://www.w3.org/TR/2008/REC-xml-20081126/#charsets + +XML 1.1 Spec: https://www.w3.org/TR/xml11/#charsets + +In case whenever XML Content is not valid, it will be tagged with a tag `_xmlparsefailure`. + [id="plugins-{type}s-{plugin}-remove_namespaces"] ===== `remove_namespaces` From be37382643e5a146c175d03aa505502124bcc1a5 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Andr=C3=A9=20Letterer?= <48132449+111andre111@users.noreply.github.com> Date: Fri, 3 Dec 2021 23:34:40 +0100 Subject: [PATCH 2/2] Update docs/index.asciidoc Co-authored-by: Karen Metts <35154725+karenzone@users.noreply.github.com> --- docs/index.asciidoc | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/index.asciidoc b/docs/index.asciidoc index 2bcde38..b817be7 100644 --- a/docs/index.asciidoc +++ b/docs/index.asciidoc @@ -100,7 +100,7 @@ Currently supported options are: - `strict` - forces the parser to fail early instead of accumulating errors when content is not valid xml. -This still means that non allowed control characters like for instance ASCII 0x0 always mean a non valid XML. +Control characters such as ASCII 0x0 are not allowed and _always_ result in non-valid XML. XML 1.0 Spec: https://www.w3.org/TR/2008/REC-xml-20081126/#charsets