×

Topic identification and use thereof in information retrieval systems

  • US 20030167252A1
  • Filed: 02/26/2002
  • Published: 09/04/2003
  • Est. Priority Date: 02/26/2002
  • Status: Active Grant
First Claim
Patent Images

1. A method to identify topics in a data corpus having a plurality of segments, comprising:

  • determining a segment-level actual usage value for one or more word combinations;

    computing a segment-level expected usage value for each of the one or more word combinations; and

    designating a word combination as a topic if the segment-level actual usage value of the word combination is substantially greater than the segment-level expected usage value of the word combination.

View all claims
  • 15 Assignments
Timeline View
Assignment View
    ×
    ×