Phtx - Peter's HTML Table Data Extractor

phtx is a command line tool that extract data from tables in HTML-encoded files (possible downloaded with wget or curl).

It will strip the data from HTML tags and (if told so) extra whitespace, and output it as CSV data (on stdout by default). It should handle multiple, recursive HTML tables in a (hopefully) sane way. If you find bugs in this program, please notify the author.

AUTHOR:

Peter Eriksson <pen@lysator.liu.se>

LICENSE:

See the GNU General Public License version 3.

REQUIREMENTS:

An ANSI C Compiler.

DOWNLOAD:

Main FTP site: ftp://ftp.ifm.liu.se/pub/unix/phtx