
Question


The program has to be in Python and use regular expressions. Could you please help me find the problem in this code? I am trying to deidentify names and emails in a text file. So far the program deidentifies emails, but it does NOT deidentify any names that start with a prefix (Ms., Mr., Dr., Prof.). Please help!

# This program removes names and email addresses occurring in a given input file and saves it in an output file.

import re

def deidentify():
    infilename = input("Give the input file name: ")
    outfilename = input("Give the output file name: ")

    infile = open(infilename,"r")
    text = infile.read()
    infile.close()

    # replace names
    nameRE = "(Ms\.|Mr\.|Dr\.|Prof\.) [A-Z](\.|[a-z]+) [A-Z][a-z]+"
    deidentified_text = re.sub(nameRE,"**name**",text)

    emailRE = "(\S*@\S*\S?)"
    deidentified_text = re.sub(emailRE, "**email**", text)

    outfile = open(outfilename,"w")
    print(deidentified_text, file=outfile)
    outfile.close()

deidentify()
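
The likely cause is the second re.sub call: it runs on the original text rather than on deidentified_text, so the name replacements from the first call are overwritten. The sketch below keeps both patterns from the question (written as raw strings) and only chains the two substitutions. The helper name deidentify_text and the sample sentence are assumptions added for illustration, not part of the original program.

```python
import re

# Patterns copied from the question; raw strings keep the backslashes
# intact for the regex engine.
NAME_RE = r"(Ms\.|Mr\.|Dr\.|Prof\.) [A-Z](\.|[a-z]+) [A-Z][a-z]+"
EMAIL_RE = r"(\S*@\S*\S?)"


def deidentify_text(text):
    """Hypothetical helper: mask prefixed names, then email addresses."""
    # First pass: replace names in the original text.
    deidentified_text = re.sub(NAME_RE, "**name**", text)
    # Second pass: run on the result of the first pass, not on `text`,
    # so the name replacements are kept.
    deidentified_text = re.sub(EMAIL_RE, "**email**", deidentified_text)
    return deidentified_text


def deidentify():
    """Same file handling as the question, using the chained helper."""
    infilename = input("Give the input file name: ")
    outfilename = input("Give the output file name: ")

    with open(infilename, "r") as infile:
        text = infile.read()

    with open(outfilename, "w") as outfile:
        print(deidentify_text(text), file=outfile)


# Illustrative input only; not taken from the question.
sample = "Email john.smith@example.com or ask Dr. John Smith."
print(deidentify_text(sample))
```

Calling deidentify() at the end of the script, as in the original, runs the interactive version. Note that the name pattern still requires a prefix followed by a first name or initial and a last name, so "Dr. Smith" alone is not matched, and the email pattern also consumes punctuation attached directly to an address.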

