Binary package “libhtml-tableextract-perl” in Ubuntu Precise
module for extracting the content contained in HTML tables
HTML::TableExtract is a module that simplifies the extraction of information
contained in tables within HTML documents, either as text or encoded element
trees.
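
As a rough illustration of the text-mode extraction described above, the following minimal sketch parses a small HTML string and prints each matching table row. The sample HTML, the column names "Name" and "Price", and the variable names are hypothetical and chosen only for this example; the calls shown (new with a headers option, parse, tables, coords, rows) follow the module's documented interface, but check the documentation of the installed version before relying on them.

```perl
use strict;
use warnings;
use HTML::TableExtract;

# Hypothetical sample document; any HTML string containing a table will do.
my $html = <<'HTML';
<html><body>
<table>
  <tr><th>Name</th><th>Price</th></tr>
  <tr><td>Widget</td><td>9.99</td></tr>
  <tr><td>Gadget</td><td>19.50</td></tr>
</table>
</body></html>
HTML

# Select only tables whose header row contains these column labels.
my $te = HTML::TableExtract->new( headers => [qw(Name Price)] );
$te->parse($html);

# Walk every matching table and print its rows as comma-separated text.
foreach my $ts ( $te->tables ) {
    print "Table found at ", join( ',', $ts->coords ), ":\n";
    foreach my $row ( $ts->rows ) {
        # Cells can be undefined when a row has fewer columns than requested.
        print join( ', ', map { defined $_ ? $_ : '' } @$row ), "\n";
    }
}
```

Selecting tables by header labels, rather than by position in the document, tends to keep a script working when the surrounding page layout changes.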
For extracting a tree structure of element objects, the additional package
libhtml-