libhtml-html5-sanity-perl

Make HTML5 DOM trees less insane
  http://search.cpan.org/dist/HTML-HTML5-Sanity/



HTML::HTML5::Sanity is a Perl library that tidies the DOM trees produced by HTML::HTML5::Parser.

The Document Object Model (DOM) generated by HTML::HTML5::Parser meets the requirements of the HTML5 spec, but will probably catch a lot of people by surprise.

The main oddity is that elements and attributes which appear to be namespaced are not really namespaced. For example, the following element:

...

looks like it should be parsed so that it has an attribute "lang" in the XML namespace. Not so: it will really be parsed as having the attribute "xml:lang" in the null namespace.
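
The difference between those two readings can be sketched with a generic DOM API. The snippet below is only an illustration: it uses Python's standard xml.dom.minidom rather than HTML::HTML5::Parser or HTML::HTML5::Sanity, and the "div" element and "fr" value are made up for the example. It builds one element carrying "xml:lang" in the null namespace and one carrying "lang" in the XML namespace, then shows that a namespace-aware lookup finds only the second.

```python
from xml.dom.minidom import getDOMImplementation

# The namespace URI that the "xml" prefix is bound to.
XML_NS = "http://www.w3.org/XML/1998/namespace"

doc = getDOMImplementation().createDocument(None, "div", None)

# Case 1: an attribute whose name is literally "xml:lang", in the null
# namespace. This mirrors the surprising result described above.
plain = doc.createElement("div")
plain.setAttribute("xml:lang", "fr")

# Case 2: an attribute "lang" in the XML namespace. This is what most
# people expect to get.
namespaced = doc.createElement("div")
namespaced.setAttributeNS(XML_NS, "xml:lang", "fr")

# A namespace-aware lookup only succeeds in the second case.
plain_has_ns_lang = plain.hasAttributeNS(XML_NS, "lang")
namespaced_has_ns_lang = namespaced.hasAttributeNS(XML_NS, "lang")

# The namespace recorded on each attribute node differs as well.
plain_attr_ns = plain.getAttributeNode("xml:lang").namespaceURI
namespaced_attr_ns = namespaced.getAttributeNodeNS(XML_NS, "lang").namespaceURI

print(plain_has_ns_lang, namespaced_has_ns_lang)
print(plain_attr_ns, namespaced_attr_ns)
```

Code that queries attributes by namespace, as in the second case, will not find an attribute stored the first way, which is why the parser's output can be surprising.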