[ale] File De-duplication

Derek Atkins derek at ihtfp.com
Fri Oct 18 13:24:03 EDT 2013


Have you tried the "hardlink" utility? See "man hardlink" for details.
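
For anyone who would rather script the content check asked about in the original question, here is a minimal sketch in Python. It is not how "hardlink" or the Perl script mentioned below works internally; the function names (file_digest, find_duplicates) are made up for illustration, and the choice of SHA-256 is an assumption. It groups files by size first, hashes only the files that share a size, and reports the matches without deleting or linking anything.

```python
import hashlib
import os
from collections import defaultdict


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicates(root):
    """Return sorted lists of paths under root with identical contents.

    Hypothetical helper: it only reports duplicates and never removes them.
    """
    # Pass 1: group regular files by size, since files of different
    # sizes cannot be identical. Symlinks are skipped.
    by_size = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if os.path.islink(path) or not os.path.isfile(path):
                continue
            by_size[os.path.getsize(path)].append(path)

    # Pass 2: hash only the files that share a size with another file.
    groups = []
    for paths in by_size.values():
        if len(paths) < 2:
            continue
        by_hash = defaultdict(list)
        for path in paths:
            by_hash[file_digest(path)].append(path)
        groups.extend(sorted(g) for g in by_hash.values() if len(g) > 1)
    return sorted(groups)
```

Calling find_duplicates on a directory returns one list per set of identical files, so the results can be reviewed by hand before anything is removed. Files that are already hard-linked to each other will also show up as duplicates in this sketch.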

-derek

On Fri, October 18, 2013 12:59 pm, JD wrote:
> Slashdot had a question about this 1-2 years ago. Lots of people suggested
> scripting it; others pointed to some C code on SourceForge.
>
> I had a few hours free that day and wrote some Perl (200+ LOC). I use it all
> the time, but I'd probably go with the C tool for any very large datasets.
> Mine doesn't automatically remove anything and is certainly far from perfect.
> It is relatively fast on most types of files, however.
>
> On 10/18/2013 12:34 PM, Calvin Harrigan wrote:
>> Good Afternoon,
>>     I'm looking for a little advice/recommendation on file de-duplication
>> software. I have a disk filled with files that most certainly have
>> duplicates. What's the best way to get rid of the duplicates? I'd like to
>> check deeper than just file name/date/size. If possible, I'd like to check
>> content (checksum?). Are you aware of anything like that? Linux or Windows
>> is fine. Thanks


-- 
       Derek Atkins                 617-623-3745
       derek at ihtfp.com             www.ihtfp.com
       Computer and Internet Security Consultant


