[ale] File De-duplication

JD jdp at algoloma.com
Fri Oct 18 12:59:33 EDT 2013


Slashdot had a question about this 1-2 yrs ago.  Lots of people suggested
scripting it; others pointed out some C code on SourceForge.

I had a few hrs free that day and wrote some Perl (200+ LOC). I use it all the
time, but I'd probably go with the C tool for any very large datasets.  Mine
doesn't automatically remove anything and is far from perfect, that is certain.
It is relatively fast on most types of files, however.
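
For anyone who wants to script it, here is a minimal sketch of checking
content by checksum, in Python. It is not the Perl script mentioned above
and not the C tool, just an illustration under some assumptions: the names
find_duplicates and file_digest are made up for this example, SHA-256 is an
arbitrary choice of checksum, and files are grouped by size first so that
only same-sized files get hashed. It only reports duplicates and never
deletes anything.

```python
import hashlib
import os
from collections import defaultdict


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicates(root):
    """Return a list of groups (sorted path lists) with identical content."""
    # Pass 1: bucket regular files by size; a unique size cannot be a duplicate.
    by_size = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if os.path.islink(path) or not os.path.isfile(path):
                continue  # skip symlinks and special files
            try:
                by_size[os.path.getsize(path)].append(path)
            except OSError:
                continue  # file vanished or is unreadable

    # Pass 2: hash only the files that share a size with another file.
    groups = []
    for paths in by_size.values():
        if len(paths) < 2:
            continue
        by_hash = defaultdict(list)
        for path in paths:
            try:
                by_hash[file_digest(path)].append(path)
            except OSError:
                continue
        groups.extend(sorted(g) for g in by_hash.values() if len(g) > 1)
    return groups
```

Calling find_duplicates("/some/dir") returns one list of paths per set of
identical files; what to do with them (delete, hard-link, review by hand) is
left to the caller. Hard links to the same file will show up as duplicates
of each other, so check before removing anything.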

On 10/18/2013 12:34 PM, Calvin Harrigan wrote:
> Good Afternoon,
>     I'm looking for a little advice/recommendation on file de-duplication
> software. I have a disk filled with files that most certainly have
> duplicates.  What's the best way to get rid of the duplicates?  I'd like to
> check deeper than just file name/date/size.  If possible I'd like to check
> content (checksum?).  Are you aware of anything like that?  Linux or Windows is
> fine.  Thanks