Testing various clustering algorithms and analyzing their results to find a suitable match for our task (determining which modules are similar and possible candidates to be merged).
Also contains a brief literature review of code similarity detection. List of possible candidates for improvement of clustering using better algorithms.
to share – to copy, distribute and transmit the work
to remix – to adapt the work
Under the following conditions:
attribution – You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
share alike – If you remix, transform, or build upon the material, you must distribute your contributions under the same or compatible license as the original.