A tool to find duplicate files in complex directory structures. The following features will be offered:
Identify duplicate files inside one or more directory tree(s).
Offer various options for deciding when two file are identical.
Generate reports about duplicate files suitable for human consumption.
Generate scripts for various operations, for example deleting all redundant copies.
The purpose of this tool is helping users cleanup directories in order to reduce the amount of duplicate files stored. The tool can be run periodically to scan directory trees, identify and delete duplicate files, without the user having to manually go through the directories themselves.