libhtmlcxx3 binary package in Ubuntu Trusty armhf
htmlcxx is a simple non-validating CSS1 and HTML parser for C++. Although
there are several other HTML parsers available, htmlcxx has some
characteristics that make it unique:
.
* STL-like navigation of the DOM tree, using the excellent tree.hh library
from Kasper Peeters
* It is possible to reproduce exactly, character by character, the original
document from the parse tree
* Bundled CSS parser
* Optional parsing of attributes
* C++ code that looks like C++ (not so true anymore)
* Offsets of tags/elements in the original document are stored in the nodes
of the DOM tree
.
The parsing policies of htmlcxx were designed to mimic Mozilla Firefox
(http://
those created by Firefox. Unlike Firefox, however, htmlcxx does not
insert content that is not present in your HTML. Therefore, serializing the DOM tree
gives exactly the same bytes contained in the original HTML document.
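
The round-trip and offset properties described above can be illustrated with a minimal, self-contained sketch. It does not use htmlcxx or its API: `Span`, `split_spans`, and `serialize` are hypothetical names invented for this example, and the tokenizer is deliberately naive (it ignores comments, scripts, and `>` characters inside quoted attribute values). The only point is that if every node records its offset and length in the original document, and nothing is inserted or normalized, serialization reproduces the input byte for byte.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical node type for illustration; this is NOT htmlcxx's node class.
struct Span {
    bool is_tag;         // true for "<...>" markup, false for text between tags
    std::size_t offset;  // byte offset of the span in the original document
    std::size_t length;  // number of bytes the span covers
};

// Split the input into tag and text spans. Nothing is added, dropped, or
// normalized, so the spans tile the input exactly.
std::vector<Span> split_spans(const std::string& html) {
    std::vector<Span> spans;
    std::size_t pos = 0;
    while (pos < html.size()) {
        std::size_t end;
        bool is_tag = false;
        if (html[pos] == '<') {
            std::size_t close = html.find('>', pos);
            if (close == std::string::npos) {
                end = html.size();  // unterminated '<': keep the rest as text
            } else {
                end = close + 1;
                is_tag = true;
            }
        } else {
            end = html.find('<', pos);
            if (end == std::string::npos) end = html.size();
        }
        spans.push_back({is_tag, pos, end - pos});
        pos = end;
    }
    return spans;
}

// Rebuild the document by copying each span's bytes from the original.
std::string serialize(const std::string& html, const std::vector<Span>& spans) {
    std::string out;
    for (const Span& s : spans) out.append(html, s.offset, s.length);
    return out;
}
```

Calling `serialize(html, split_spans(html))` returns a string equal to `html`, including unusual spacing or letter case inside tags, because each span only points back into the original bytes.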
Publishing history
| Date | Status | Target | Pocket | Component | Section | Priority | Phased updates | Version |
|---|---|---|---|---|---|---|---|---|
| 2014-02-17 16:18:34 UTC | Published | Ubuntu Trusty armhf | release | universe | libs | Extra | | 0.85-3 |
| | Deleted | Ubuntu Trusty armhf | proposed | universe | libs | Extra | | 0.85-3 |
| 2014-02-17 16:20:04 UTC | Superseded | Ubuntu Trusty armhf | release | universe | libs | Extra | | 0.85-2ubuntu1 |
| 2014-02-18 18:10:11 UTC | Deleted | Ubuntu Trusty armhf | proposed | universe | libs | Extra | | 0.85-2ubuntu1 |
| 2014-02-16 20:13:31 UTC | Superseded | Ubuntu Trusty armhf | release | universe | libs | Extra | | 0.85-2 |