Knowledge discovery from citation networks

  • US 9,269,051 B2
  • Filed: 12/29/2014
  • Issued: 02/23/2016
  • Est. Priority Date: 12/06/2010
  • Status: Active Grant
First Claim

1. A method of modeling a set of documents within an automated computer system, each document within the set of documents comprising content, and at least a portion of the documents being linked to other documents through citations, comprising:

  • automatically defining a Bernoulli Process Topic model with the automated computer system, for the set of documents, representing each document as a mixture over latent topics, dependent on (1) an intrinsic content of each respective document, and (2) a content of other documents related to the respective document through a multi-level citation structure of direct and indirect linkages to the other documents;

    for each respective document, automatically determining with the automated computer system, a first set of latent topic characteristics based on an intrinsic content of the respective document;

    for each document, automatically determining with the automated computer system, a second set of latent topic characteristics based on a respective content of the other documents which are directly and indirectly linked, the indirectly linked documents contributing transitively to the latent topic characteristics of the respective document;

    automatically representing with the automated computer system, a third set of latent topics for the respective document based on a joint probability distribution of at least the first and second sets of latent topic characteristics, according to the Bernoulli Process Topic model; and

    outputting, by the automated computer system, at least one document in response to an input, based on at least the third set of latent topics.
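
The steps of the claim can be illustrated with a minimal sketch. This is not the patented implementation: the function names `combine_topics` and `retrieve`, the parameter `stop_prob`, the equal weighting of cited documents, and the fixed number of propagation rounds are all assumptions made for illustration. The sketch takes each document's intrinsic topic mixture as given, blends it with the mixtures of the documents it cites (so that indirectly cited documents contribute transitively), and ranks documents against a query expressed as a topic vector.

```python
def combine_topics(intrinsic, cites, stop_prob=0.5, rounds=10):
    """Blend each document's own topic mixture with those of cited documents.

    intrinsic: dict doc_id -> list of topic probabilities (first set).
    cites:     dict doc_id -> list of directly cited doc_ids.
    stop_prob: hypothetical Bernoulli parameter; the chance that a topic is
               drawn from the document itself rather than from a citation.
    rounds:    number of propagation passes (assumed, not from the claim).
    """
    combined = {doc: list(mix) for doc, mix in intrinsic.items()}
    for _ in range(rounds):
        updated = {}
        for doc, own in intrinsic.items():
            cited = [c for c in cites.get(doc, []) if c in intrinsic]
            if not cited:
                # No citations: the document keeps its intrinsic mixture.
                updated[doc] = list(own)
                continue
            # Second set: average of the cited documents' current mixtures,
            # which already include what those documents inherited.
            inherited = [
                sum(combined[c][k] for c in cited) / len(cited)
                for k in range(len(own))
            ]
            # Third set: mix of intrinsic and inherited characteristics.
            updated[doc] = [
                stop_prob * o + (1 - stop_prob) * h
                for o, h in zip(own, inherited)
            ]
        combined = updated
    return combined


def retrieve(query_topics, combined, top_n=1):
    """Return the doc_ids whose combined mixture best matches the query."""
    def score(doc):
        return sum(q * t for q, t in zip(query_topics, combined[doc]))
    return sorted(combined, key=score, reverse=True)[:top_n]


# Toy chain: A cites B, and B cites C. A never cites C directly.
intrinsic = {"A": [1.0, 0.0], "B": [1.0, 0.0], "C": [0.0, 1.0]}
cites = {"A": ["B"], "B": ["C"]}
combined = combine_topics(intrinsic, cites)
```

In the toy chain, document A picks up weight on the second topic even though only C discusses it, because C's content reaches A through B. A fuller model would learn the intrinsic mixtures and the Bernoulli parameter from data rather than take them as inputs.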
