×

KNOWLEDGE DISCOVERY FROM CITATION NETWORKS

  • US 20140188780A1
  • Filed: 01/14/2014
  • Published: 07/03/2014
  • Est. Priority Date: 12/06/2010
  • Status: Active Grant
First Claim
Patent Images

1. A method for characterizing a set of documents, comprising:

  • identifying a network of multilevel hierarchically related documents having direct and indirect references associated with content relationships;

    for each respective document, determining a set of latent topic characteristics based on an intrinsic content of the respective document and a set of latent topic characteristics based on a respective content of other documents which are directly referenced and indirectly referenced through at least one other document to the respective document;

    representing a set of latent topics for the respective document based on a joint probability distribution of at least the latent topic characteristics based on the intrinsic content and the respective content of other documents which are directly referenced and indirectly referenced through at least one other document to the respective document, dependent on the identified network and a random process; and

    storing, in a memory, the represented set of latent topics for the respective document.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×