Contributor(s): The Pennsylvania State University CiteSeerX Archives
Abstract: The paper presents a plagiarism detection framework that aims to determine the degree of similarity between program source codes. Plagiarism detection has previously been studied for written material such as student essays, and text-based algorithms have been published for that purpose. We argue that for comparing program code, structure-based techniques may be much more efficient. The main idea is to transform the source code into mathematical objects, apply appropriate reduction and comparison methods to them, and interpret the results. We have designed a generic program structure comparison framework and implemented it for the Prolog and SML programming languages. We have used the implementation to successfully detect plagiarism in student assignments since 2000.
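
The transform, reduce, and compare idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' framework: it targets Python source rather than Prolog or SML, it assumes the "mathematical object" is simply a sequence of syntax-tree node type names, and it swaps in the standard library's `difflib.SequenceMatcher` as the comparison method. The function names `structure_signature` and `structural_similarity` are hypothetical and chosen only for illustration.

```python
import ast
from difflib import SequenceMatcher
from typing import List


def structure_signature(source: str) -> List[str]:
    """Reduce source code to a list of AST node type names.

    Assumption for this sketch: identifiers, literal values, comments, and
    layout are irrelevant to structure, so only node types are kept.
    """
    tree = ast.parse(source)
    # ast.walk visits every node in the tree; we record only each node's type.
    return [type(node).__name__ for node in ast.walk(tree)]


def structural_similarity(source_a: str, source_b: str) -> float:
    """Return a similarity ratio in [0, 1] between two structure signatures."""
    sig_a = structure_signature(source_a)
    sig_b = structure_signature(source_b)
    # SequenceMatcher stands in for the comparison step; it is not the
    # method used in the paper.
    return SequenceMatcher(None, sig_a, sig_b).ratio()
```

Because names are discarded during reduction, two submissions that differ only in identifier names produce identical signatures and a ratio of 1.0, while a purely text-based comparison would report differences. How a given ratio should be interpreted (for example, what threshold suggests copying) is left open here, as it is in the abstract.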