Method and system for identifying relationships between text documents and structured variables pertaining to the text documents
First Claim
Patent Images
1. A method for automatically identifying relationships between text documents and structured variables pertaining to said text documents, comprising:
- generating a dictionary of keywords in said text documents;
forming categories of said text documents using said dictionary and an automated algorithm;
counting occurrences of said structured variables, said categories and said structured variable/category combinations in said text documents; and
calculating probabilities of occurrences of said structured variable/category combinations.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and system for interesting relationships in text documents includes generating a dictionary of keywords in the text documents, forming categories of the text documents using the dictionary and an automated algorithm, counting occurrences of the structured variables, categories and structured variable/category combinations in the text documents, and calculating probabilities of occurrences of the structured variable/category combinations.
75 Citations
23 Claims
-
1. A method for automatically identifying relationships between text documents and structured variables pertaining to said text documents, comprising:
-
generating a dictionary of keywords in said text documents;
forming categories of said text documents using said dictionary and an automated algorithm;
counting occurrences of said structured variables, said categories and said structured variable/category combinations in said text documents; and
calculating probabilities of occurrences of said structured variable/category combinations. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 18, 19, 20, 21, 22)
-
-
17. A system for automatically identifying relationships between text documents and structured variables pertaining to said text documents, comprising:
-
an input device for inputting text documents;
a processor for forming categories of said text documents and counting occurrences of said structured variables, categories and structured variable/category combinations and calculating probabilities of occurrence of said structured variable/category combinations; and
a display, for displaying said probabilities.
-
-
23. A programmable storage medium tangibly embodying a program of machine-readable instructions executable by a digital processing apparatus to perform a method for automatically identifying relationships between text documents and structured variables pertaining to said text documents, said method comprising:
-
generating a dictionary of keywords in said text documents;
forming categories of said text documents using said dictionary and an automated algorithm;
counting occurrences of said structured variables, said categories and said structured variable/category combinations in said text documents; and
calculating probabilities of occurrences of said structured variable/category combinations.
-
Specification