Question
Develop domain name and email address linkification app in python: Develop a python code to identify valid domain names and email addresses in the text
Develop domain name and email address linkification app in python: Develop a python code to identify valid domain names and email addresses in the text and automatically convert them into relevant links. This process is called linkification. 1. You will use the latest list of valid top-level domains, see https://data.iana.org/TLD/tlds-alpha-by-domain.txt, with the assumption that this list could be updated on a daily basis. 2. You would need to use the Unicode scripts.txt file at http://www.unicode.org/Public/UCD/latest/ucd/Scripts.txt. 3. For characters which are valid for domain names, set the latest list at https://www.iana.org/assignments/idna-tables-11.0.0/idna-tables-11.0.0.xml. DISALLOWED characters should not be part of a domain name.
Step-1: Recognize all ASCII domain names and email addresses in text Develop an application which inputs an HTML webpage which has content text in UTF-8 format. The application then scans its textual content within the relevant HTML tags. It identifies all domain names and email addresses in ASCII and converts these domain names or email addresses into clickable links. Finally it saves the updated version of the HTML webpage.
Step-2: Recognize all Arabic script domain names and Arabic email addresses in text Update the application to allow linkification of Arabic script domain names and Arabic email addresses from the input HTML page. The application scans textual content within the relevant HTML tags. It identifies all domain names and email addresses in ASCII and Arabic script and converts these domain names or email addresses into clickable links. Finally it saves the updated version of the HTML webpage.
Step-3: Recognize domain names and email addresses in any script in text Extend the application to allow for linkification of domain names and email addresses in any script supported by the Unicode.
(note: the code has to open index.html and scan the text to find domains and emails and edit it and paste the code in output.html)
sample input:
Linkification Test
Here are some example domain names and email addresses: www.example.com www.example.org www.example.net john.doe@example.com jane.doe@example.org info@example.net
Here are some example domain names and email addresses in Arabic script: . . .@. .@.
sample output:
Linkification Test
Here are some example domain names and email addresses: www.example.com www.example.org www.example.net john.doe@example.com jane.doe@example.org info@example.net
Here are some example domain names and email addresses in Arabic script: . . .@. .@.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started