Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

In this project you are going to play the role of software forensics expert to analyze the similarity between set of textual documents. Your program

In this project you are going to play the role of software forensics expert to analyze the similarity between set of textual documents. Your program need to produce kind of quantitative measures (e.g. Checksum Coefficients) that identify the similarity between the given documents based on sound algorithms that you can find over the Web (see below some good list of them). Your documents can reside in external files or be defined inside your clojure program: (def docs [["doc0" "This is a textual English document.. The quick brown fox jumps over the lazy dog. This is a document."] ["doc1" "This is a textual English document. The quick white wolf eats the lazy sheep."] ["doc2" "This is a textual English document. The slow brown fox jumps into the quizzical dog."] ["doc3" "This is a textual English document. The slow white wolf lays next to the lazy dog."] ["doc4" "This is a textual English document. The quick brown fox jumps over the lazy cat."] ["doc5" "This is a textual English document. The quick brown fox jumps over the lazy dog. This is a document.."]]) You have the freedom to choose one or more algorithm that help to quantify similarity between the given textual documents. You are also encouraged to test your program on variety of documents and compare the different algorithms affects. The way you are going to conduct your analysis and taking a position on which is more effective will reveal the kind of programming creativity that you have invested in this project. Some Helpful Algorithms that you may like to use: Cosine similarity Hamming distance DamerauLevenshtein distance Euclidean distance Mahalanobis distance Jaccard index SrensenDice coefficient Greedy String Tiling Vector-Distance Algorithm Smith-Waterman Algorithm Fingerprinting Algorithm Karp-Rabin String Matching

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Big Data, Mining, And Analytics Components Of Strategic Decision Making

Authors: Stephan Kudyba

1st Edition

1466568704, 9781466568709

More Books

Students also viewed these Databases questions

Question

3.What are the Importance / Role of Bank in Business?

Answered: 1 week ago