Answered step by step
Verified Expert Solution
Question
1 Approved Answer
As in demo 5A, I suggest using sklearn's implementation of LDA, which is the best option for generating topical relevance at the document level. We
As in demo 5A, I suggest using sklearn's implementation of LDA, which is the best option for generating topical relevance at the document level. We need to do a little tuning for our model. I recommend doing this on a smaller sample of 5,000 records to save time. You should try between 40 and 150 topics, counting by 10s. You should set topic_word_prior equal to 0.15 and doc_topic_prior to 25 divided by the number of topics. Finally, we will use the coherence score, u_mass, as in demo 5A to evaluate topic quality
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started