Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Plagiarism Detector Introduction In this homework you will construct a system that checks the similarity rate between a specific document and a set of documents

Plagiarism Detector

Introduction In this homework you will construct a system that checks the similarity rate between a specific document and a set of documents in java.

Implementation Input files

The program will get two inputs: 1) A set of documents (.txt files) under a folder, 2) The main file (.txt file) which will be checked for plagiarism. Outputs The program will display the following outputs:

1) The similarity rate between the main document and each document under the folder given as input,

2) The most similar 5 statements / sentences for each document.

For the evaluation of your implementation, during the demo that will be held in the last week of the semester you will be asked to run your program for a folder containing multiple .txt files named as document1.Txt, document2.Txt, etc. and a main .txt file called main_doc.txt.

1.1 Details - You can use any text searching algorithm and/or data structure you would like; you can even use algorithms not discussed in class. - The pattern matching algorithm must be written by yourself. - In addition to your algorithms similarity detection ability, you will also be graded on the speed of your code, so you should try to choose an efficient algorithm.

1.2 Important Instructions

Yourprogram shouldgive an output tothe consolein thefollowing format when executed:

1) {Similarity Rate with the First Document} {Most Similar Sentence} {Second Most Similar Sentence} {Third Most Similar Sentence} {Fourth Most Similar Sentence} {Fifth Most Similar Sentence}

2) {Similarity Rate with the Second Document} {Most Similar Sentence} {Second Most Similar Sentence} {Third Most Similar Sentence} {Fourth Most Similar Sentence} {Fifth Most Similar Sentence} 3) {Similarity Rate with the Third Document} {Most Similar Sentence} {Second Most Similar Sentence} {Third Most Similar Sentence} {Fourth Most Similar Sentence} {Fifth Most Similar Sentence}

------------------------------------------------------------------------------------------------

Dear chegg teacher you can use your own txt fles for input. I just need the code. Please name your txt files as mentioned above. With that, the code will be enough for me.

I will be so glad if you write comments that describes what is it doing.

Thank you from now.

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Database And Expert Systems Applications 31st International Conference Dexa 2020 Bratislava Slovakia September 14 17 2020 Proceedings Part 1 Lncs 12391

Authors: Sven Hartmann ,Josef Kung ,Gabriele Kotsis ,A Min Tjoa ,Ismail Khalil

1st Edition

303059002X, 978-3030590024

More Books

Students also viewed these Databases questions

Question

10:16 AM Sun Jan 29 Answered: 1 week ago

Answered: 1 week ago