×

Mining geographic knowledge using a location aware topic model

  • US 7,853,596 B2
  • Filed: 06/21/2007
  • Issued: 12/14/2010
  • Est. Priority Date: 06/21/2007
  • Status: Active Grant
First Claim
Patent Images

1. A method in a computing device for identifying a location associated with a first document, the method comprising:

  • providing a collection of documents of words, each document labeled with an associated location, the collection not including the first document;

    generating by the computing device collection level parameters for a latent Dirichlet allocation style model for the collection of documents that is based on latent topics and the location of each document, the collection level parameters indicating a probability that a document in the collection relates to each latent topic, a probability that each word of the collection relates to each latent topic, and a probability that each location of the collection relates to each latent topic, wherein a variational expectation maximization algorithm is used to estimate the collection level parameters that are a maximization of a lower bound on the collection level parameters represented by a summation for each document in the collection of the log of the conditional probability of the document and its location given the collection level parameters;

    for each location, estimating, using the collection level parameters, a probability that the location is associated with the first document based on an aggregation of, for each topic, the conditional probability of the location given the topic and the conditional probability of the topic given the document, the conditional probabilities being derived from the collection of documents in which each document is labeled with an associated location; and

    selecting the location with the highest probability as the location associated with the first document.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×