×

Systems and methods for determining the topic structure of a portion of text

  • US 20030182631A1
  • Filed: 03/22/2002
  • Published: 09/25/2003
  • Est. Priority Date: 03/22/2002
  • Status: Active Grant
First Claim
Patent Images

1. A method for determining the topic structure of a portion of text, comprising:

  • identifying candidate segmentation points of the portion of text corresponding to locations between text blocks;

    applying a folding-in process to each text block to determine a distribution of probabilities over a plurality of latent variables for each text block;

    using the determined distributions to estimate a distribution of words for each text block;

    making comparisons of the distributions of words in adjacent text blocks using a similarity metric to determine similarity values; and

    selecting segmentation points from the candidate segmentation points of the portion of text based on the comparison to define a plurality of segments.

View all claims
  • 8 Assignments
Timeline View
Assignment View
    ×
    ×