.. -*- coding: utf-8; -*-

==============
 Duplication.
==============
.. contents::
   :local:

Search for duplicate lines.
===========================

http://en.wikipedia.org/wiki/Duplicate_code
  Wiki page.
http://students.cis.uab.edu/tairasr/clones/literature/
  Code Clones Literature.
https://moz.com/devblog/near-duplicate-detection/
  MinHash vs SimHash algorithm explanation.

Open source or free licence:

* http://duplo.sourceforge.net/
* http://clonedigger.sourceforge.net/
* http://www.ccfinder.net/ccfinderxos.html

Proprietary or restricted licence:

* http://www.txl.ca/nicaddownload.html
* http://www.harukizaemon.com/simian/index.html
* http://getatomiq.com/

http://stackoverflow.com/questions/191614/how-to-detect-code-duplication-during-development
  How to detect code duplication during development?
https://en.wikipedia.org/wiki/List_of_tools_for_static_code_analysis
  List of tools for static code analysis.
http://askubuntu.com/questions/434545/identify-duplicate-lines-in-a-file-without-deleting-them
  Identify duplicate lines in a file without deleting them?
http://stackoverflow.com/questions/13046791/how-to-delete-the-repeat-lines-in-emacs
  How to delete the repeat lines in emacs.
http://emacs.stackexchange.com/questions/13092/how-can-i-highlight-duplicate-lines
  How can I highlight duplicate lines?
https://www.emacswiki.org/emacs/DuplicateLines
  Duplicate Lines.

Search for duplicate files.
===========================

These utilities only search for duplicate files:

http://duff.sourceforge.net/
  duff home page
http://freedup.org/
  freedup home page
http://dupedit.com/
  dupedit home page
http://rdfind.pauldreik.se/
  Rdfind home page
http://code.google.com/p/softenido/wiki/FindRepe
  FindRepe home page

fdupes
======
::

  $ sudo apt-get install fdupes

See:

http://code.google.com/p/fdupes/
  fdupes home page
http://ru.wikipedia.org/wiki/Fdupes
  fdupes wiki page
http://packages.debian.org/search?keywords=fdupes
  fdupes Debian package

freedups
========

Freedups searches through the
directories you specify. When it finds two
identical files, it hard links them together. Now the two or more files still
exist in their respective directories, but only one copy of the data is stored
on disk; both directory entries point to the same data blocks.

http://www.stearns.org/freedups/
  freedups home page

dupmerge
========

Dupmerge reads a list of files from standard input (e.g., as produced by "find .
-print") and looks for identical files. When it finds two or more identical
files, all but one are unlinked to reclaim the disk space and recreated as hard
links to the remaining copy.

https://sourceforge.net/projects/dupmerge/
  dupmerge home page

ssdeep
======

ssdeep is a program for computing context triggered piecewise hashes (CTPH).
Also called fuzzy hashes, CTPH can match inputs that have homologies. Such
inputs have sequences of identical bytes in the same order, although bytes in
between these sequences may be different in both content and length.

http://ssdeep.sourceforge.net/
  ssdeep home page

comparator
==========

Available under Cygwin.

Find duplication in source files::

  $ comparator -s 5 $dir1 $dir2
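
For illustration only, the idea behind the hard-linking tools described above
(freedups, dupmerge) can be sketched in a few lines of Python. This is not how
any of those tools is actually implemented. The function names
``file_digest``, ``find_duplicates`` and ``hardlink_duplicates`` are
hypothetical, files with equal SHA-256 digests are assumed to be identical
(no byte-by-byte check), and all files are assumed to live on one file system
that supports hard links.

```python
import hashlib
import os
from collections import defaultdict


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_duplicates(root):
    """Return sorted groups of paths under root with equal content digests."""
    by_size = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            # Skip symlinks and anything that is not a regular file.
            if os.path.islink(path) or not os.path.isfile(path):
                continue
            by_size[os.path.getsize(path)].append(path)

    groups = []
    for paths in by_size.values():
        if len(paths) < 2:
            continue  # a file with a unique size cannot have a duplicate
        by_hash = defaultdict(list)
        for path in paths:
            by_hash[file_digest(path)].append(path)
        groups.extend(sorted(g) for g in by_hash.values() if len(g) > 1)
    return sorted(groups)


def hardlink_duplicates(groups):
    """Replace every file after the first in each group with a hard link."""
    for keep, *rest in groups:
        for path in rest:
            if os.path.samefile(keep, path):
                continue  # already linked to the same data
            temp = path + ".hardlink-tmp"  # hypothetical temporary name
            os.link(keep, temp)     # create the link under a temporary name
            os.replace(temp, path)  # then swap it into place
```

A possible use is ``hardlink_duplicates(find_duplicates("/some/dir"))``.
Grouping by size first avoids hashing files that cannot match, and linking
under a temporary name before renaming means the original file is only
replaced once the new link exists.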