Way to Parse HTML fields to Excel Spreadsheet

Hi,

I've got a text file that contains the HTML of approximately 514 web
pages, concatenated into one file. Each HTML page has approximately the
same format, with data in tables. Can anyone recommend a way I can
parse the document to get the fields as variables and write them out
as an Excel spreadsheet? [e.g. using Win32::OLE]

Thanks,

George

ghadley_00 [at] yahoo.com
ghadley_00 [ Mon, 14 January 2008 22:59 ] [ ID #1907779 ]

Re: Way to Parse HTML fields to Excel Spreadsheet

ghadley_00 [at] yahoo.com wrote:
> Hi,
>
> I've got a text file that contains the HTML of approximately 514 web
> pages, concatenated into one file. Each HTML page has approximately the
> same format, with data in tables. Can anyone recommend a way I can
> parse the document to get the fields as variables and write them out
> as an Excel spreadsheet? [e.g. using Win32::OLE]

Parse the HTML. There are many modules for that; HTML::TableExtract
might be useful, or a regular expression might be enough.
Then write out the data.
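
As a rough sketch of the regular-expression route, assuming every page
ends with </html> and the tables are simple (no nested tables, every
cell properly closed). The sample data, the sub names extract_rows and
csv_line, and the leading page-number column are all made up for
illustration. It also writes CSV, which Excel opens directly, rather
than driving Excel through Win32::OLE.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical sample: two concatenated pages, one simple table each.
my $html = <<'END_HTML';
<html><body><table>
<tr><td>Name</td><td>Alice</td></tr>
<tr><td>City</td><td>Boston, MA</td></tr>
</table></body></html>
<html><body><table>
<tr><td>Name</td><td>Bob</td></tr>
<tr><td>City</td><td>Denver</td></tr>
</table></body></html>
END_HTML

# Return one array ref per <tr>, holding the text of its <td>/<th> cells.
sub extract_rows {
    my ($text) = @_;
    my @rows;
    while ( $text =~ m{<tr\b[^>]*>(.*?)</tr>}gis ) {
        my $tr = $1;
        my @cells;
        while ( $tr =~ m{<t[dh]\b[^>]*>(.*?)</t[dh]>}gis ) {
            my $cell = $1;
            $cell =~ s/<[^>]+>//g;      # drop any tags inside the cell
            $cell =~ s/^\s+|\s+$//g;    # trim surrounding whitespace
            push @cells, $cell;
        }
        push @rows, \@cells if @cells;
    }
    return @rows;
}

# Join fields into one CSV line, quoting fields that need it.
sub csv_line {
    my @fields = @_;                    # copy, so the caller's data is untouched
    for (@fields) {
        s/"/""/g;                       # double any embedded quotes
        $_ = qq{"$_"} if /[",\r\n]/;    # quote fields with quotes, commas, newlines
    }
    return join( ',', @fields ) . "\n";
}

# Assumption: each page ends with </html>, so split the file there.
my @pages = grep { /\S/ } split m{</html\s*>}i, $html;

my $csv     = '';
my $page_no = 0;
for my $page (@pages) {
    $page_no++;
    # One CSV line per table row, prefixed with the page number.
    $csv .= csv_line( $page_no, @$_ ) for extract_rows($page);
}

print $csv;
```

For the sample above this prints four lines, "1,Name,Alice" through
"2,Denver" style rows with the page number first, and the
"Boston, MA" field quoted because of its comma. Redirect the output to
a .csv file to open it in Excel. It does not decode HTML entities such
as &amp;, and it will break on nested tables, which is where a real
parser such as HTML::TableExtract earns its keep.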

To really answer your question you should show us an example of the data
(NOT the entire file), and what you've tried. See the Posting Guidelines.
glex_no-spam [ Mon, 14 January 2008 23:17 ] [ ID #1907780 ]