Computer method and apparatus for segmenting text streams
First Claim
Patent Images
1. Computer apparatus for segmenting text streams, comprising:
- an input text stream formed of a series of words;
a probability member providing working probabilities of a group of words being of a topic selected from a plurality of predetermined topics, said probability member accounting for relationship between words, and wherein the probability member is a Hidden Markov Model combined with an Aspect model; and
a processing module coupled to receive the input text stream and using the probability member determining probability of certain words in the input text stream being of a same topic such that the processing module segments the input text stream into single topic groupings of words, where each grouping is of a respective single topic.
3 Assignments
0 Petitions
Accused Products
Abstract
Computer method and apparatus for segmenting text streams is disclosed. Given is an input text stream formed of a series of words. A probability member provides working probabilities that a group of words is of a topic selected from a plurality of predetermined topics. The probability member accounts for relationships between words. A processing module receives the input text stream and using the probability member determines probability of certain words in the input text stream being of a same topic. As such, the processing module segments the input text stream into single topic groupings of words, where each grouping is of a respective single topic.
-
Citations
4 Claims
-
1. Computer apparatus for segmenting text streams, comprising:
-
an input text stream formed of a series of words;
a probability member providing working probabilities of a group of words being of a topic selected from a plurality of predetermined topics, said probability member accounting for relationship between words, and wherein the probability member is a Hidden Markov Model combined with an Aspect model; and
a processing module coupled to receive the input text stream and using the probability member determining probability of certain words in the input text stream being of a same topic such that the processing module segments the input text stream into single topic groupings of words, where each grouping is of a respective single topic. - View Dependent Claims (2)
-
-
3. A method for segmenting text streams, comprising the computer implemented steps of:
-
receiving an input text stream formed of a series of words;
providing working probabilities of a group of words being of a topic selected from a plurality of predetermined topics, said working probabilities accounting for relationship between words, and wherein the step of providing working probabilities includes combining a Hidden Markov Model with an Aspect model such that use of the Aspect model is extended to text segmentation through the Hidden Markov Model; and
using the working probabilities, determining probability of certain words in the input text stream being of a same topic such that the input text stream is segmented into single topic groupings of words, where each grouping is of a respective single topic. - View Dependent Claims (4)
-
Specification