[NTLUG:Discuss] Simple file dedupe script

Fred James fredjame at fredjame.cnc.net
Mon Jul 19 14:22:18 CDT 2010


Chris Cox wrote:
> Ok... my changes (and I'll update the file link as well):
>
> find $* -type f -print0 |
> xargs -0 ls -sd * | sort -k1bn |
> awk '{num=$1;$1="";printf("%10d%s\n",num,$0);}' | uniq -w 10 -D |
> sed 's/^[ ]*[0-9]* //' | tr '\012' '\000' |
> xargs -0 ls -id | sort |
> awk '{num=$1;$1="";printf("%10d%s\n",num,$0);}' | uniq -w 10 |
> sed 's/^[ ]*[0-9]* //' | tr '\012' '\000' |
> xargs -0 md5sum | sort | uniq -w 32 -D
>
>
> Yes... I know.. I used awk... sigh....
>   
(omissions for brevity)
Chris Cox
Awk is good Awk is good
Regards
Fred James




More information about the Discuss mailing list