University of Warwick
  • Study
  • |
  • Research
  • |
  • Business
  • |
  • Alumni
  • |
  • News
  • Sign in
  • Computer Science Repository
  • More…

    Computer Science Repository

    • Home
    • About
    • Browse by Year
    • Browse by Subject
    • Browse by Division
    • Browse by Author
      • Login

    Sentence-Based Natural Language Plagiarism Detection

    White, D.R. and Joy, M.S. (2004) Sentence-Based Natural Language Plagiarism Detection. ACM Journal on Educational Resources in Computing, 4 (4). pp. 1-20.

    [img]
    Preview
    PDF - Accepted Version
    Download (2149Kb) | Preview
      [img] PDF - Published Version
      Restricted to Repository staff only

      Download (2148Kb)

        Abstract

        With the increasing levels of access to higher education in the United Kingdom, larger class sizes make it unrealistic for tutors to be expected to identify instances of peer-to-peer plagiarism by eye and so automated solutions to the problem are required. This document details a novel algorithm for comparison of suspect documents at a sentence level and has been implemented as a component of plagiarism detection software for detecting similarities in both natural language documents and comments within program source-code. The algorithm is capable of detecting sophisticated obfuscation (such as paraphrasing, reordering, merging, and splitting sentences) as well as direct copying. The implemented algorithm has also been used to successfully detect plagiarism on real assignments at the university. The software has been evaluated by comparison with other plagiarism detection tools.

        Item Type: Article
        Uncontrolled Keywords: ias edtech educational technology plagiarism
        Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
        Q Science > QA Mathematics > QA76 Computer software
        Divisions: Faculty of Science > Computer Science
        Depositing User: Dr Mike Joy
        Date Deposited: 06 Nov 2010 16:53
        Last Modified: 26 Jul 2011 12:16
        URI: http://eprints.dcs.warwick.ac.uk/id/eprint/302

        Actions (login required)

        View Item
        Close this email form
        Page contact: Repository administrator Last revised: Wed 21 Mar 2012
        • Sign in
        • | Powered by EPrints 3