You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
With html5lib 0.95 it produces the following traceback:
python test.py
Traceback (most recent call last):
File "test.py", line 13, in <module>
test(html)
File "test.py", line 10, in test
html5lib.treebuilders.dom.dom2sax(dom, handler)
File "/home/vkuznets/packages/html5lib-0.95/html5lib-0.95/html5lib/treebuilders/dom.py", line 271, in dom2sax
for child in node.childNodes: dom2sax(child, handler, nsmap)
File "/home/vkuznets/packages/html5lib-0.95/html5lib-0.95/html5lib/treebuilders/dom.py", line 256, in dom2sax
del attributes[(attr.namespaceURI, attr.nodeName)]
KeyError: (None, u'xml:lang')
With previous versions(at least 0.11) there's no any error. I assume this attribute may be invalid in the xml namespace, but anyway I don't think it is ok for parser just to crash. I've seen A LOT of html documents that has such attribute in the real world.
Tested it with Python 2.6.5, Linux
Please advise.
Thanks,
--Vladimir
The text was updated successfully, but these errors were encountered:
…alker
This moves the function to a new treeadapters module (where later
the adapters from test_treewalker.py will get moved). dom2sax
remains for backwards-compatibility, calling the new function.
gsnedders
added a commit
to gsnedders/html5lib-python
that referenced
this issue
Jun 16, 2013
This moves the functionality to a new treeadapters module (where
later the adapters from test_treewalker.py will get moved) and
removes the previous dom2sax function.
http://code.google.com/p/html5lib/issues/detail?id=200
Reported by vovanec, Mar 6, 2012
The text was updated successfully, but these errors were encountered: