×

System and method for facilitating evergreen discovery of digital information

  • US 8,706,678 B2
  • Filed: 04/23/2012
  • Issued: 04/22/2014
  • Est. Priority Date: 10/12/2007
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented system for facilitating evergreen discovery of digital information, comprising:

  • a hierarchy of topics for topically-limited subject areas, each of the subject areas comprising pages of electronically-stored digital information maintained in a storage device;

    a computer comprising a processor and memory within which code for execution by the processor is stored, comprising;

    a user interface of the computer configured to select seed words that are characteristic of each of the topics and to designate training material from the digital information that corresponds to the respective subject area of each of the topics;

    a topic modeler configured to form candidate topic models from the seed words, each candidate topic model comprising a pattern evaluable against the digital information;

    a topic tester configured to test an ability of each of the candidate topic models to identify such digital information matching the candidate topic model'"'"'s topic by matching the pattern in the candidate topic model to the training material;

    a topic rater configured to rate the respective abilities of the candidate topic models, comprising;

    a performance rater configured to rank each candidate topic model'"'"'s performance in matching the training material correctly for the corresponding topic;

    a simplicity rater configured to prefer those candidate topic models with simpler patterns over the patterns of other candidate topic models that correctly match the same training material; and

    a bias rater configured to assign a bias to those candidate topic models that comprise terms also found in the corresponding topic;

    a topic model selector configured to choose the candidate topic model for each topic that comprises the highest abilities with respect to the topic in performance, simplicity and bias; and

    an index builder configured to form an evergreen index by pairing the chosen candidate topic model to each topic in the hierarchy.

View all claims
  • 6 Assignments
Timeline View
Assignment View
    ×
    ×