[NTLUG:Discuss] bash question
Frank Lewis Jr.
frank.lewis at netzero.net
Wed Sep 12 12:48:40 CDT 2001
#!/bin/bash
for i in `ls *.pdf | awk -F. '{print $1}'` ; do
pdf2text "$i.pdf" > "$i.txt"
# Not sure if you have to escape the quotes here or
# not. If so uncomment the line with \'s and comment
# out the previous pdf2text line.
# pdf2text \"$i.pdf\" > \"$i.txt\"
done
# You may need to correct the usage of the pdf2text
# command. I've never used pdf2text before, but the
# variable declariations should be exactly what you
# are looking for.
#-Frank
-----Original Message-----
From: discuss-admin at ntlug.org [mailto:discuss-admin at ntlug.org] On Behalf
Of Gregory L. Camp
Sent: Wednesday, September 12, 2001 11:54 AM
To: 'discuss at ntlug.org'
Subject: RE: [NTLUG:Discuss] bash question
I'm pretty sure you can just surround the file name with quotes (you
don't
have to change them to _ or the like). I'll let someone else answer the
scripting part! :)
> -----Original Message-----
> From: Wrenn, Bobby J. [mailto:Bobby.Wrenn at banctec.com]
> Sent: Wednesday, September 12, 2001 11:41 AM
> To: 'discuss at ntlug.org'
> Subject: [NTLUG:Discuss] bash question
>
>
> If I can get an answer to this I will finally be able to use
> Linux at work.
>
> I need to take 209 pdf files with spaces in the file names.
> and convert them
> into text. I am very new to scripting and know nothing about regular
> expressions. Is there an easy way to remove the spaces from
> the file names?
> Then how do I recursively submit the files to pdftotext with
> the same name
> except for the .pdf changed to .txt?
>
> Just getting that much done will be a big help. The next step may be
> trickier. I need to extract a name, address, and equipment
> list from each of
> the files and get it into some kind of database where I can
> query for total
> by item or item by location.
>
> I'm a database beginner but a quick learner. TIA for any help. Please
> contact me directly if this is not appropriate for the list.
>
> TIA
> Bobby Wrenn
> Sr. Service Planner
> BancTec, Inc.
> 972.450.7832
>
> _______________________________________________
> http://www.ntlug.org/mailman/listinfo/discuss
>
_______________________________________________
http://www.ntlug.org/mailman/listinfo/discuss
More information about the Discuss
mailing list