Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

As in demo 5A, I suggest using sklearn's implementation of LDA, which is the best option for generating topical relevance at the document level. We

As in demo 5A, I suggest using sklearn's implementation of LDA, which is the best option for generating topical relevance at the document level. We need to do a little tuning for our model. I recommend doing this on a smaller sample of 5,000 records to save time. You should try between 40 and 150 topics, counting by 10s. You should set topic_word_prior equal to 0.15 and doc_topic_prior to 25 divided by the number of topics. Finally, we will use the coherence score, u_mass, as in demo 5A to evaluate topic quality

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access with AI-Powered Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Principles of Auditing An Introduction to International Standards on Auditing

Authors: Rick Hayes, Philip Wallage, Hans Gortemaker

3rd edition

273768174, 978-0273768173

Students also viewed these General Management questions