DATA-PARALLEL PARAMETER ESTIMATION OF THE LATENT DIRICHLET ALLOCATION MODEL BY GREEDY GIBBS SAMPLING

  • US 20160210718A1
  • Filed: 01/16/2015
  • Published: 07/21/2016
  • Est. Priority Date: 01/16/2015
  • Status: Active Grant
First Claim

1. A method for identifying sets of correlated words comprising:

  • receiving information for a set of documents;

    wherein the set of documents comprises a plurality of words;

  • running a partially-collapsed Gibbs sampler over a Dirichlet distribution of the plurality of words in the set of documents to produce sampler result data, further comprising:

      calculating a mean of the Dirichlet distribution;

  • determining, from the sampler result data, one or more sets of correlated words;

    wherein the method is performed by one or more computing devices.
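
The steps recited in the claim can be illustrated with a small sketch. The Python below is not taken from the patent. It is a minimal illustration under stated assumptions: a standard LDA setup with symmetric priors, documents given as lists of integer word ids, and a "greedy" step that replaces each draw from a posterior Dirichlet with that distribution's mean (each parameter divided by the sum of the parameters). The names `dirichlet_mean`, `greedy_gibbs_lda`, and `top_words`, and the parameters `alpha`, `beta`, and `iters`, are hypothetical and chosen only for this example.

```python
import random


def dirichlet_mean(params):
    """Mean of Dirichlet(params): each parameter divided by their sum."""
    total = sum(params)
    return [p / total for p in params]


def greedy_gibbs_lda(docs, vocab_size, num_topics,
                     alpha=0.1, beta=0.01, iters=50, seed=0):
    """Illustrative sampler: topic assignments z are sampled, while the
    document-topic (theta) and topic-word (phi) distributions are set to
    the means of their posterior Dirichlets instead of being drawn.
    All names and defaults here are assumptions, not the patent's."""
    rng = random.Random(seed)
    # Random initial topic assignment for every token.
    z = [[rng.randrange(num_topics) for _ in doc] for doc in docs]
    theta, phi = [], []
    for _ in range(iters):
        # Rebuild the count tables from the current assignments.
        doc_topic = [[0] * num_topics for _ in docs]
        topic_word = [[0] * vocab_size for _ in range(num_topics)]
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                doc_topic[d][z[d][i]] += 1
                topic_word[z[d][i]][w] += 1
        # Greedy step: use posterior Dirichlet means, not random draws.
        theta = [dirichlet_mean([alpha + c for c in row]) for row in doc_topic]
        phi = [dirichlet_mean([beta + c for c in row]) for row in topic_word]
        # Given theta and phi, every token can be resampled independently,
        # which is what makes this step data-parallel in principle.
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                weights = [theta[d][k] * phi[k][w] for k in range(num_topics)]
                z[d][i] = rng.choices(range(num_topics), weights=weights)[0]
    return theta, phi


def top_words(phi, vocab, n=3):
    """For each topic, list the n most probable words: one simple way to
    read "sets of correlated words" off the sampler result data."""
    return [[vocab[w] for w in sorted(range(len(row)), key=lambda w: -row[w])[:n]]
            for row in phi]
```

For instance, with a hypothetical vocabulary `["cat", "dog", "stock", "bond"]` and `docs = [[0, 1, 0, 1], [2, 3, 2, 3]]`, calling `greedy_gibbs_lda(docs, vocab_size=4, num_topics=2)` returns per-document and per-topic probability rows, and `top_words(phi, vocab, n=2)` lists the most probable words per topic. The sketch runs sequentially for clarity; it only indicates where the independence between tokens arises.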
