Contributor(s)The Pennsylvania State University CiteSeerX Archives
Full recordShow full item record
AbstractPlagiarism detection systems are known for years in the university community. However, most of the existing detectors for the natural language texts use rather simple comparison methods that make the instances of plagiarism easy to hide. The software, designed for plagiarism detection in computer programs, utilizes far more advanced techniques. We propose a method, which adds functionalities similar to tokenization and tree matching, to the natural language texts-oriented detectors. This method requires noticeable work to be applied in practice, but also makes use of the existing software for parsing and word sense disambiguation. Keywords machine semantic analysis, plagiarism detection, string matching, plagiarism, computational linguistics. 1.