Docode 5: Building a real world plagiarism detection system
Author
dc.contributor.author
Pizarro, Gaspar
Author
dc.contributor.author
Velásquez Silva, Juan
Admission date
dc.date.accessioned
2018-06-29T14:45:30Z
Available date
dc.date.available
2018-06-29T14:45:30Z
Publication date
dc.date.issued
2017
Cita de ítem
dc.identifier.citation
Engineering Applications of Artificial Intelligence, 64 (2017): 261–271
es_ES
Identifier
dc.identifier.other
10.1016/j.engappai.2017.06.001
Identifier
dc.identifier.uri
https://repositorio.uchile.cl/handle/2250/149346
Abstract
dc.description.abstract
Plagiarism refers to the appropriation of someone else's ideas and expression. Its ubiquity makes it necessary to counter it, and invites the development of commercial systems to do so. In this document we introduce Docode 5, a system for plagiarism detection that can perform analyses on the World Wide Web and on user-defined collectionsj and can be used as a decision support system. Our contribution in this document is to present this system in all its range of components, from the algorithms used in it to the user interfaces, and the issues with deployment on a commercial scale at an algorithmic and architectural level. We ran performance tests on the plagiarism detection algorithm showing an acceptable performance from an academic and commercial point of view, and load tests on the deployed system, showing that we can benefit from a distributed deployment. With this, we conclude we can adapt algorithms made for small-scale plagiarism detection to a commercial-scale system.
es_ES
Patrocinador
dc.description.sponsorship
Millennium Institute on Complex Engineering Systems
ICM: P-05-004-F
CONICYT: FBO16