libtagsoup-java binary package in Ubuntu Precise amd64

 TagSoup, a SAX-compliant parser written in Java that, instead of parsing
 well-formed or valid XML, parses HTML as it is found in the wild: poor,
 nasty and brutish, though quite often far from short. TagSoup is designed
 for people who have to process this stuff using some semblance of a
 rational application design.
 .
 By providing a SAX interface, it allows standard XML tools to be applied
 to even the worst HTML. TagSoup also includes a command-line processor
 that reads HTML files and can generate either clean HTML or well-formed
 XML that is a close approximation to XHTML.
 .
 TagSoup is designed as a parser, not a whole application; it isn't
 intended to permanently clean up bad HTML, as HTML Tidy does, only to
 parse it on the fly. Therefore, it does not convert presentation HTML
 to CSS or anything similar. It does guarantee well-structured results:
 tags will wind up properly nested, default attributes will
 appear appropriately, and so on.

Publishing history

Date Status Target Pocket Component Section Priority Phased updates Version
  2011-12-15 22:04:33 UTC Published Ubuntu Precise amd64 release universe libs Optional 1.2.1-1
  • Published
  • Copied from ubuntu precise-release i386 in Primary Archive for Ubuntu
  2011-12-15 22:06:00 UTC Superseded Ubuntu Precise amd64 release universe libs Optional 1.2-2ubuntu1
  • Removed from disk .
  • Removal requested .
  • Superseded by i386 build of tagsoup 1.2.1-1 in ubuntu precise RELEASE
  • Published
  • Copied from ubuntu natty-release i386 in Primary Archive for Ubuntu

Source package