Answered step by step
Verified Expert Solution
Question
1 Approved Answer
python with machine learnining I'm trying to run this code from this link http://machinelearner.net/nb.html but the problem it gives me this type of error: UnicodeDecodeError:
python with machine learnining I'm trying to run this code from this link http://machinelearner.net/nb.html but the problem it gives me this type of error: UnicodeDecodeError: 'utf8' codec can't decode byte 0xc3 in position 0: unexpected end of data
I tried to import this list of codes but that did not fix the problem. I'm using ipython3.4 with Juypter. How can i solve the problem??
import sys reload(sys) sys.setdefaultencoding('utf-8')
# Feature Extraction example. In this example, we vectorize words into binary features used by the # classifiers. This is the format used by "Bag of Words" models. # DictVectorizer is used to do word vectorization from sklearn.feature_extraction import DictVectorizer from nltk.stem import PorterStemmer stemmer = PorterStemmer() # In R, this is done using the tm package. We could use the equivalent NLTK package in python # but we just do the feature extraction manually to show more python syntax def text2dict(text): words_filtered = [e for e in text.split() if (e.islower() and '@' not in e)] record = {} for word in words_filtered: w = stemmer.stem(word) if w not in record: record[w] = 1 return record predictors = classified_tweets.text.apply(text2dict) # R: Pandas also have "apply" like R. # R: More tm package equivalents to set up the predictor and response matrices v = DictVectorizer(sparse=False) X = v.fit_transform(predictors) y = np.asarray(classified_tweets['Sentiment2']) X.shape # R: dim(X)
error is:
UnicodeDecodeError: 'utf8' codec can't decode byte 0xc3 in position 0: unexpected end of data
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started