[clue-tech] CLUE Talk Mailing list mbox file too big to rsynch
David L. Anselmi
anselmi at anselmi.us
Sun Feb 1 20:21:09 MST 2009
Angelo Bertolli wrote:
> I wrote a findd (find duplicate) script for this sort of thing. It's old
> and probably makes excessive use of files, but it works.
If you can stand to have less verbose output you could write that as:
find . -type f -print0 | xargs -0 md5sum | sort | uniq -D -w 32
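xargs is frequently as good as a loop. uniq can show dupes (-D prints
every line that has a duplicate), and can limit how much of a line it
compares (-w 32 matches the 32 hex characters of an md5sum digest; cksum
would need a different width). Both -D and -w are GNU uniq options.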
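To make the stages easier to follow, here is the same pipeline spread over several lines, as a rough sketch. The function name find_dupes is made up for illustration, and it assumes GNU find, xargs and uniq.

```shell
#!/bin/sh
# find_dupes is a hypothetical wrapper name, not part of the original post.
# Assumes GNU tools: uniq -D and -w are GNU extensions.
find_dupes() {
    dir=${1:-.}                    # directory to scan, defaults to "."
    find "$dir" -type f -print0 |  # NUL-terminated names survive spaces in paths
        xargs -0 md5sum |          # one "digest  path" line per file
        sort |                     # identical digests end up on adjacent lines
        uniq -D -w 32              # compare only the 32-char digest, print all repeats
}
```

Running "find_dupes some/dir" should list each group of identical files on adjacent lines, digest first. Files with unique contents are not printed at all.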
The difference isn't just in getting rid of temp files. It's also in
taking advantage of more features of the commands you used.
Of course in Jed's case he probably prefers diff output (he's looking
for differences, not dupes). And the directories to compare are on
different machines.
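For that case, one possible approach (a sketch, not something tested against Jed's setup) is to build a checksum manifest on each machine and diff the two manifests. The helper name manifest is hypothetical, and the sketch assumes md5sum and GNU find/xargs are available on both sides.

```shell
#!/bin/sh
# manifest is a hypothetical helper, not from the original thread.
# It prints "digest  ./relative/path" for every file under a directory,
# sorted by path so that two manifests line up for diff.
manifest() {
    (
        cd "$1" &&
            find . -type f -print0 |
            xargs -0 md5sum |
            LC_ALL=C sort -k 2     # fixed locale so both machines sort identically
    )
}
```

Run "manifest /some/dir > local.md5" on one machine and the same on the other (for example over ssh, redirecting to remote.md5), copy one file across, then "diff local.md5 remote.md5". The directory and file names here are placeholders. Lines that appear in the diff are files that are missing on one side or whose contents differ.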
Dave