Antiplag Web Save Abandoned

A text-similarity computation software (web version) for the codes and documents of assignment.

Project README

antiplag-web

A text-similarity computation software (web version) for the codes and documents of assignment.

This system is a web version based on antiplag.

requirement

jdk12 and above
Browser, such as chrome
platform: mac, linux. (not tested windows platform)

ScreenShot

how to use

download the source zip file and unzip.
backstage: configure your database infomation with file pom.xml & application.porperties.
frontend: configure your host in file main.js.
package backstage by maven, build frontend by vue-cli3. then deploy and enjoy it.

theory

The main techniques used by the system are string similarity comparison algorithms, code lexical grammar parsing, and word segmentation in natural language processing (nlp).

The similarity comparison of program text is based on 3 open systems:

One is MOSS system based on web services (Stanford University's open system that supports the similarity comparison of multiple programming language codes);
The second is the sim system executed locally (supporting text similarity comparison in languages such as java, c).
The third is jplag system which is executed locally (supporting text similarity comparison in languages such as java, c / c ++, python).

The system has been developed and packaged on the basis of them. For the moss system, a client access module has been developed to implement code file submission, result acquisition and analysis, and result sequencing. For sim and jplag, the It is integrated into the system and can be used as a replacement product when moss is not available due to network failure or other reasons.

Comparison of Chinese and English document assignment similarity provides two algorithms:

The first is based on shinglecloud algorithm (a language fingerprint-based, language-independent similarity Fast calculation method), the main process of the document is as follows:

Use tika to read the text content in different encoding files in different formats (txt, doc, docx, pdf, html, etc.) and convert it into text that can be processed uniformly;
Use hanlp to preprocess and segment the text;
Calculate the similarity between texts using the singlecloud algorithm;
Sort according to similarity and output comparison results.

The second is based on jplag's GST algorithm, which has been expanded in functionality. The added "doc" language type can perform similarity calculations on various documents and provide a web-based visual comparison function.

references:

Software Plagiarism Detection Techniques: A Comparative Study
JPlag: Finding plagiarisms among a set of programs
Winnowing: Local Algorithms for Document Fingerprinting The core algorithm used by the moss system
Summary of software plagiarism detection research

Open Source Agenda is not affiliated with "Antiplag Web" Project. README Source: mooyyu/antiplag-web

Stars

Open Issues

Last Commit

1 year ago

License

GPL-3.0

Open Source Agenda Badge

<a href="https://www.opensourceagenda.com/projects/antiplag-web"><img src="https://www.opensourceagenda.com/projects/antiplag-web/reviews/badge.svg" alt="Open Source Agenda"></a>

Submit Review Review Your Favorite Project

Submit Resource Articles, Courses, Videos

Submit Article Submit a post to our blog

From the blog

Dec 11, 2022

How to Choose Which Programming Language to Learn First?

From the blog

Dec 11, 2022