[NTLUG:Discuss] bash question

Frank Lewis Jr. frank.lewis at netzero.net
Wed Sep 12 12:48:40 CDT 2001


#!/bin/bash

for i in `ls *.pdf | awk -F. '{print $1}'` ; do
	pdf2text "$i.pdf" > "$i.txt"
# Not sure if you have to escape the quotes here or
# not. If so uncomment the line with  \'s and comment
# out the previous pdf2text line.
#	pdf2text \"$i.pdf\" > \"$i.txt\"
done

# You may need to correct the usage of the pdf2text 
# command. I've never used pdf2text before, but the 
# variable declariations should be exactly what you 
# are looking for.

#-Frank

-----Original Message-----
From: discuss-admin at ntlug.org [mailto:discuss-admin at ntlug.org] On Behalf
Of Gregory L. Camp
Sent: Wednesday, September 12, 2001 11:54 AM
To: 'discuss at ntlug.org'
Subject: RE: [NTLUG:Discuss] bash question

I'm pretty sure you can just surround the file name with quotes (you
don't
have to change them to _ or the like).  I'll let someone else answer the
scripting part!  :)

> -----Original Message-----
> From: Wrenn, Bobby J. [mailto:Bobby.Wrenn at banctec.com]
> Sent: Wednesday, September 12, 2001 11:41 AM
> To: 'discuss at ntlug.org'
> Subject: [NTLUG:Discuss] bash question
> 
> 
> If I can get an answer to this I will finally be able to use 
> Linux at work.
> 
> I need to take 209 pdf files with spaces in the file names. 
> and convert them
> into text. I am very new to scripting and know nothing about regular
> expressions. Is there an easy way to remove the spaces from 
> the file names?
> Then how do I recursively submit the files to pdftotext with 
> the same name
> except for the .pdf changed to .txt?
> 
> Just getting that much done will be a big help. The next step may be
> trickier. I need to extract a name, address, and equipment 
> list from each of
> the files and get it into some kind of database where I can 
> query for total
> by item or item by location.
> 
> I'm a database beginner but a quick learner. TIA for any help. Please
> contact me directly if this is not appropriate for the list.
> 
> TIA
> Bobby Wrenn
> Sr. Service Planner
> BancTec, Inc.
> 972.450.7832 
> 
> _______________________________________________
> http://www.ntlug.org/mailman/listinfo/discuss
> 
_______________________________________________
http://www.ntlug.org/mailman/listinfo/discuss




More information about the Discuss mailing list