Method for statistical text analysis
Abstract
A method for retrieving relevant stories from a collection of stories. The method comprises the steps of identifying at least one query term, applying a cooccurrence matrix to the query term to provide a list of query terms, determining if a story in the collection contains any terms on the list of query terms, and then increasing a relevance measure if the story does contain words on the list of query words. If the relevance measure is higher than a threshold, the story is added to a list of relevant stories.
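The steps described above can be illustrated with a minimal Python sketch. Claim 1 calls the matrix asymmetrical, so the sketch stores it as a nested dictionary in which the strength from term a to term b may differ from the strength from b to a. The names `COOC`, `expand_query`, `retrieve`, and `min_strength`, the additive scoring rule, and all sample weights and stories are assumptions made for illustration; the source does not specify them.

```python
# Hypothetical asymmetric cooccurrence matrix: COOC[a][b] is the strength of
# the association from term a to term b, and need not equal COOC[b][a].
COOC = {
    "election": {"vote": 0.8, "senate": 0.5, "weather": 0.05},
    "vote": {"election": 0.3},
}


def expand_query(query_terms, cooc, min_strength=0.2):
    """Apply the matrix to the query terms to produce a weighted term list."""
    expanded = {term: 1.0 for term in query_terms}
    for term in query_terms:
        for assoc, strength in cooc.get(term, {}).items():
            # Assumed rule: keep only associations at or above min_strength.
            if strength >= min_strength:
                expanded[assoc] = max(expanded.get(assoc, 0.0), strength)
    return expanded


def retrieve(stories, query_terms, cooc, threshold=1.0):
    """Return ids of stories whose relevance measure exceeds the threshold."""
    expanded = expand_query(query_terms, cooc)
    relevant = []
    for story_id, text in stories.items():
        words = set(text.lower().split())
        relevance = 0.0
        for term, weight in expanded.items():
            if term in words:
                # Increase the relevance measure for each listed term present.
                relevance += weight
        if relevance > threshold:
            relevant.append(story_id)
    return relevant


STORIES = {
    "s1": "the senate vote follows the election",
    "s2": "weather stays mild this week",
    "s3": "a late vote count",
}
RESULT = retrieve(STORIES, ["election"], COOC, threshold=1.0)
```

In this sketch a story that contains only an associated term can still be retrieved if the threshold is low enough, which is the practical effect of expanding the query through the matrix.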
9 Claims
1. A method for retrieving relevant stories from a collection of stories, the method comprising:
identifying at least one query term;
applying an asymmetrical, cooccurrence matrix to the at least one query term to provide a list of query terms;
determining if a story in the collection contains any terms on the list of query terms;
increasing a relevance measure if the story does contain words on the list of query words; and
adding a story to a list of relevant stories if the relevance measure is higher than a threshold.
determining a frequency of occurrence of the query term in the story;
comparing the frequency to a threshold; and
increasing the relevance measure based upon the comparing step.
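The three frequency steps above could look roughly like the following Python sketch. The function name `frequency_boost`, the parameters `min_count` and `boost`, and the choice of a greater-than-or-equal comparison are assumptions; the source only says that the frequency is compared to a threshold and that the relevance measure is increased based on that comparison.

```python
from collections import Counter


def frequency_boost(text, query_term, min_count=2, boost=1.0):
    """Return the amount to add to the relevance measure for one query term."""
    # Determine how often the query term occurs in the story.
    counts = Counter(text.lower().split())
    frequency = counts[query_term]
    # Compare the frequency to the threshold (assumed: at least min_count).
    if frequency >= min_count:
        return boost
    return 0.0


relevance = 0.0
relevance += frequency_boost("vote early and vote often", "vote")
```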
7. The method of claim 1, wherein a directed graph is used to derive higher order associative information for each term in the matrix.
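One way to picture claim 7 is to read the matrix as a weighted directed graph and follow chains of associations. The sketch below is an assumption-heavy illustration: the function `higher_order`, the `max_depth` limit, and the rule that a chain's strength is the product of its edge weights (with weights assumed to lie between 0 and 1) are not taken from the source.

```python
def higher_order(cooc, term, max_depth=2):
    """Strongest product-of-weights path found from term within max_depth edges."""
    best = {}
    frontier = {term: 1.0}
    for _ in range(max_depth):
        next_frontier = {}
        for node, strength in frontier.items():
            # Follow each directed edge node -> neighbor in the matrix.
            for neighbor, weight in cooc.get(node, {}).items():
                if neighbor == term:
                    continue  # ignore chains that loop back to the start
                candidate = strength * weight
                if candidate > best.get(neighbor, 0.0):
                    best[neighbor] = candidate
                    next_frontier[neighbor] = candidate
        frontier = next_frontier
    return best


# Hypothetical graph: "election" reaches "ballot" only through "vote".
GRAPH = {
    "election": {"vote": 0.8},
    "vote": {"ballot": 0.5},
    "ballot": {"election": 0.9},
}
SECOND_ORDER = higher_order(GRAPH, "election", max_depth=2)
```

Here "ballot" has no direct entry in the row for "election" but is reached through "vote", which is the kind of higher order association the claim refers to.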
8. The method of claim 1, wherein the matrix is used to assist the user in formulating a query.
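As a rough illustration of claim 8, the matrix row for a term the user has typed could be ranked and offered as suggested additions to the query. The function `suggest_terms`, the parameter `k`, and the sample weights are hypothetical; the source does not say how the assistance is presented.

```python
def suggest_terms(cooc, term, k=3):
    """Return up to k terms most strongly associated with the given term."""
    row = cooc.get(term, {})
    # Sort by descending strength, breaking ties alphabetically.
    return sorted(row, key=lambda t: (-row[t], t))[:k]


# Hypothetical matrix row for the term "election".
SUGGEST_COOC = {
    "election": {"vote": 0.8, "senate": 0.5, "ballot": 0.5, "weather": 0.05},
}
SUGGESTIONS = suggest_terms(SUGGEST_COOC, "election", k=3)
```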
9. The method of claim 1, wherein the matrix is updated dynamically on the basis of new stories.
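A dynamic update as in claim 9 could be sketched by keeping running counts and deriving strengths on demand. In the Python sketch below, the class `CooccurrenceMatrix`, its methods, and the use of a conditional proportion (the share of stories containing a that also contain b) as the asymmetric strength are assumptions chosen for illustration; the source does not define how the matrix entries are computed.

```python
from collections import defaultdict
from itertools import permutations


class CooccurrenceMatrix:
    """Story-level cooccurrence counts that can be updated one story at a time."""

    def __init__(self):
        self.term_counts = defaultdict(int)
        self.pair_counts = defaultdict(lambda: defaultdict(int))

    def add_story(self, text):
        """Update the counts with the distinct terms of a new story."""
        terms = set(text.lower().split())
        for term in terms:
            self.term_counts[term] += 1
        # Count every ordered pair of distinct terms in the story.
        for a, b in permutations(terms, 2):
            self.pair_counts[a][b] += 1

    def strength(self, a, b):
        """Assumed strength: fraction of stories containing a that also contain b."""
        total = self.term_counts.get(a, 0)
        if total == 0:
            return 0.0
        return self.pair_counts.get(a, {}).get(b, 0) / total


matrix = CooccurrenceMatrix()
matrix.add_story("election vote")
matrix.add_story("vote count")
```

After these two stories the strength from "election" to "vote" is 1.0 while the strength from "vote" to "election" is 0.5, so the derived matrix is asymmetrical, and each further call to `add_story` shifts the values without recomputing from scratch.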
Specification