Question
In this project, we will work on text processing and analysis. Text analyzers could be used to identify the language in which a text has
In this project, we will work on text processing and analysis. Text analyzers could be used to identify the language in which a text has been written (language detection), to identify keywords in the text (keyword extraction) or to summarize and categorize a text. You will calculate the letter (character) frequency in a text. Letter frequency measurements can be used to identify languages as well as in cryptanalysis. You will also explore the concept of n-grams in Natural Language Processing. N-grams are sequential patterns of n-words that appear in a document. In this project, we are just considering uni-grams and bi-grams. Uni-grams are the unique words that appear in a text whereas bi-grams are patterns of two-word sequences that appear together in a document. Write a Java application that implements a basic Text Analyzer. The Java application will analyze text stored in a text file. The user should be able to select a file to analyze and the application should produce the following text metrics:
Number of characters in the text.
Relative frequency of letters in the text in descending order. (How the relative frequency that you calculated compares with relative letter frequencies in English already published?)
Number of words in the text.
The sizes of the longest and the shortest word.
The twenty most repeated uni-grams (single words) in the text in descending order.
The twenty most repeated bi-grams (pairs of words) in the text in descending order.
Step by Step Solution
3.35 Rating (161 Votes )
There are 3 Steps involved in it
Step: 1
Code to be copied CharCountjava Declare class Charcount public class CharCount implements Comparable delcare variables character char character double countrelativeFrequency construtor public CharCoun...Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started