Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Develop domain name and email address linkification app in python: Develop a python code to identify valid domain names and email addresses in the text

Develop domain name and email address linkification app in python: Develop a python code to identify valid domain names and email addresses in the text and automatically convert them into relevant links. This process is called linkification. 1. You will use the latest list of valid top-level domains, see https://data.iana.org/TLD/tlds-alpha-by-domain.txt, with the assumption that this list could be updated on a daily basis. 2. You would need to use the Unicode scripts.txt file at http://www.unicode.org/Public/UCD/latest/ucd/Scripts.txt. 3. For characters which are valid for domain names, set the latest list at https://www.iana.org/assignments/idna-tables-11.0.0/idna-tables-11.0.0.xml. DISALLOWED characters should not be part of a domain name.

Step-1: Recognize all ASCII domain names and email addresses in text Develop an application which inputs an HTML webpage which has content text in UTF-8 format. The application then scans its textual content within the relevant HTML tags. It identifies all domain names and email addresses in ASCII and converts these domain names or email addresses into clickable links. Finally it saves the updated version of the HTML webpage.

Step-2: Recognize all Arabic script domain names and Arabic email addresses in text Update the application to allow linkification of Arabic script domain names and Arabic email addresses from the input HTML page. The application scans textual content within the relevant HTML tags. It identifies all domain names and email addresses in ASCII and Arabic script and converts these domain names or email addresses into clickable links. Finally it saves the updated version of the HTML webpage.

Step-3: Recognize domain names and email addresses in any script in text Extend the application to allow for linkification of domain names and email addresses in any script supported by the Unicode.

(note: the code has to open index.html and scan the text to find domains and emails and edit it and paste the code in output.html)

sample input:

Linkification Test

Linkification Test

Here are some example domain names and email addresses: www.example.com www.example.org www.example.net john.doe@example.com jane.doe@example.org info@example.net

Here are some example domain names and email addresses in Arabic script: . . .@. .@.

sample output:

Linkification Test

Linkification Test

Here are some example domain names and email addresses: www.example.com www.example.org www.example.net john.doe@example.com jane.doe@example.org info@example.net

Here are some example domain names and email addresses in Arabic script: . . .@. .@.

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Pro Database Migration To Azure Data Modernization For The Enterprise

Authors: Kevin Kline, Denis McDowell, Dustin Dorsey, Matt Gordon

1st Edition

1484282299, 978-1484282298

More Books

Students also viewed these Databases questions