articleIEEE Transactions on Software EngineeringJul 1, 2002GREEN OA

CCFinder: a multilinguistic token-based code clone detection system for large scale source code

Osaka University

Indexed incrossref

Abstract

A code clone is a code portion in source files that is identical or similar to another. Since code clones are believed to reduce the maintainability of software, several code clone detection techniques and tools have been proposed. This paper proposes a new clone detection technique, which consists of the transformation of input source text and a token-by-token comparison. For its implementation with several useful optimization techniques, we have developed a tool, named CCFinder (Code Clone Finder), which extracts code clones in C, C++, Java, COBOL and other source files. In addition, metrics for the code clones have been developed. In order to evaluate the usefulness of CCFinder and metrics, we conducted…

Citation impact

1,721
total citations
FWCI
59.72
Percentile
100%
References
19
Citations per year

Authors

3

Topics & keywords

Keywords
  • Computer science
  • Source code
  • Security token
  • clone (Java method)
  • Java
  • Codebase
  • Programming language
  • Code (set theory)
No related works found for this paper.