Identify Survey topic #89

TiffanyAndrews · 2019-10-22T17:01:14Z

10x Qualitative Data User story

As a data scientist, I want to the number of topics so that the LDA algorithm can match them with keywords in the text data I provide.

To do:

choose the total number of topics(K)
go through each document and randomly assign each word in the comment to one the K topics
For each comment c
- For each word w in c
  A. For each topic t compute two things:
  1) p(topic t | comment c)
  2) p(word w | topic t)
  B. Reassign w a new topic, choosing t with probability p(topic t | comment c) * p(word w | topic t) {(Probability that topic t generated word w)}

TiffanyAndrews added the data science label Oct 22, 2019