[NTLUG:Discuss] bash question
Fred James
fredjame at concentric.net
Wed Sep 12 13:01:20 CDT 2001
I must say, I like that one.
Frank Lewis Jr. wrote:
> #!/bin/bash
>
> for i in *.pdf ; do
>     # Skip the unexpanded pattern when no .pdf files exist.
>     [ -e "$i" ] || continue
>     # This section removes the spaces from the filenames.
>     base="${i%.pdf}"
>     file="${base// /}"
>     # Rename only when the name actually changes.
>     [ "$base" = "$file" ] || mv -- "$i" "$file.pdf"
>     # This section handles the pdftotext stuff. The quotes are
>     # ordinary shell quoting and must not be escaped.
>     pdftotext "$file.pdf" "$file.txt"
> done
>
> # You may need to correct the usage of the pdftotext
> # command. I've never used pdftotext before, but the
> # variable declarations should be exactly what you
> # are looking for.
>
> #-Frank
>
>
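A note on looping over file names: a loop fed by the output of ls is split on every space, while a bare glob hands each name over whole. The sketch below counts both ways. The scratch directory, the sample file names, and the two function names are made up for the demonstration.

```shell
#!/bin/bash
# Sketch: compare looping over `ls` output with looping over a glob.
# Function names, directory, and file names are invented for the demo.

count_ls_words() {
    # Unquoted command substitution splits the listing on whitespace,
    # so a name with spaces becomes several loop items.
    local n=0 w
    for w in $(ls "$1"); do
        n=$((n + 1))
    done
    echo "$n"
}

count_glob_files() {
    # A glob expands to one word per file, spaces included.
    local n=0 f
    for f in "$1"/*.pdf; do
        [ -e "$f" ] || continue
        n=$((n + 1))
    done
    echo "$n"
}

demo_dir=$(mktemp -d)
touch "$demo_dir/service report one.pdf" "$demo_dir/two.pdf"
echo "ls words:   $(count_ls_words "$demo_dir")"
echo "glob files: $(count_glob_files "$demo_dir")"
rm -rf "$demo_dir"
```

With the two sample files, the ls loop runs four times and the glob loop twice.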
> -----Original Message-----
> From: discuss-admin at ntlug.org [mailto:discuss-admin at ntlug.org] On Behalf Of George Lass
> Sent: Wednesday, September 12, 2001 12:25 PM
> To: discuss at ntlug.org
> Subject: Re: [NTLUG:Discuss] bash question
>
> On newer versions of RedHat (7.1 anyway) you can also escape the
> space(s):
>
> "file name with spaces.pdf"
>
> file\ name\ with\ spaces.pdf
>
> George
>
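To illustrate the point about escaping: the quoted and the backslash-escaped spellings reach a command as the same single argument, while the bare spelling is split into several. As far as I know this is ordinary Bourne-shell quoting and not tied to any particular RedHat release. The helper count_args is made up for the example.

```shell
#!/bin/bash
# count_args is a hypothetical helper: it prints how many arguments it received.
count_args() {
    echo "$#"
}

quoted="file name with spaces.pdf"
escaped=file\ name\ with\ spaces.pdf

count_args "file name with spaces.pdf"    # one argument
count_args file\ name\ with\ spaces.pdf   # one argument
count_args file name with spaces.pdf      # four arguments

# Both spellings produce the identical string.
[ "$quoted" = "$escaped" ] && echo "same name"
```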
>
> "Wrenn, Bobby J." wrote:
>
>>If I can get an answer to this I will finally be able to use Linux at
>>work.
>>
>>I need to take 209 pdf files with spaces in the file names and convert
>>them into text. I am very new to scripting and know nothing about regular
>>expressions. Is there an easy way to remove the spaces from the file
>>names? Then how do I recursively submit the files to pdftotext with the
>>same name except for the .pdf changed to .txt?
>>
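One possible reading of "recursively" is every .pdf below a starting directory. A minimal sketch under that assumption follows. The names txt_name and convert_tree are made up, pdftotext is assumed to be on the PATH and to take the output file as its second argument, and the file names are left untouched because quoting copes with the spaces.

```shell
#!/bin/bash
# txt_name is a hypothetical helper: "dir/my file.pdf" -> "dir/my file.txt".
txt_name() {
    printf '%s\n' "${1%.pdf}.txt"
}

# convert_tree is a hypothetical helper: walk a directory (default: the
# current one) and run pdftotext on every PDF found below it.
convert_tree() {
    local pdf
    # -print0 and read -d '' keep names with spaces or newlines intact.
    find "${1:-.}" -type f -name '*.pdf' -print0 |
    while IFS= read -r -d '' pdf; do
        pdftotext "$pdf" "$(txt_name "$pdf")"
    done
}
```

After sourcing the file, something like `convert_tree /path/to/reports` would leave a .txt next to each .pdf; the path here is only a placeholder.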
>>Just getting that much done will be a big help. The next step may be
>>trickier. I need to extract a name, address, and equipment list from
>>each of the files and get it into some kind of database where I can
>>query for total by item or item by location.
>>
>>I'm a database beginner but a quick learner. TIA for any help. Please
>>contact me directly if this is not appropriate for the list.
>>
>>TIA
>>Bobby Wrenn
>>Sr. Service Planner
>>BancTec, Inc.
>>972.450.7832
>>
>>_______________________________________________
>>http://www.ntlug.org/mailman/listinfo/discuss
--
...make every program a filter...