[NTLUG:Discuss] utility to convert doc and xls to text, xml, or html

MadHat madhat at unspecific.com
Tue Feb 3 17:38:15 CST 2004


On Feb 3, 2004, at 4:13 PM, robert apodaca wrote:
> Does anyone know of a linux utility, perl module, anything that can 
> convert microsoft word files into either plain text, xml, html or even 
> some other easily parsed format?
> Also need the same for excel files to csv, xml, html, or anything else.
> I know there are programs like abiword and open office which can 
> convert these, but I'm looking for something I can call from a script.

For word Doc, I have used catdoc and the same guy has a xsl2csv to 
convert it to a comma separated volume

http://www.45.free.net/~vitus/ice/catdoc/

There are some perl Modules, such as
         Spreadsheet::ParseExcel
         XML::Excel
I am not aware of anything for Word docs, but have not looked that hard.

Excel::Template, is not what you are looking for BTW.




More information about the Discuss mailing list