Method and system for identifying relationships between text documents and structured variables pertaining to the text documents
First Claim
Patent Images
1. A computer-implemented method for identifying relationships between text documents and structured variables pertaining to said text documents, comprising:
- providing a dictionary of keywords in said text documents;
forming categories of said text documents using said dictionary and an automated algorithm;
counting occurrences of said structured variables, said categories, and combinations of said structured variables and said categories for said text documents;
calculating probabilities of occurrences of said combinations of structured variables and categories; and
identifying a relationship between a structured variable and text documents included in a category based on a probablility of occurrence of a combination of said structured variable and said category,wherein said text documents comprise problem tickets in a helpdesk log, and said period of time comprises dates of said problem tickets.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and system for interesting relationships in text documents includes generating a dictionary of keywords in the text documents, forming categories of the text documents using the dictionary and an automated algorithm, counting occurrences of the structured variables, categories and structured variable/category combinations in the text documents, and calculating probabilities of occurrences of the structured variable/category combinations.
-
Citations
1 Claim
-
1. A computer-implemented method for identifying relationships between text documents and structured variables pertaining to said text documents, comprising:
-
providing a dictionary of keywords in said text documents; forming categories of said text documents using said dictionary and an automated algorithm; counting occurrences of said structured variables, said categories, and combinations of said structured variables and said categories for said text documents; calculating probabilities of occurrences of said combinations of structured variables and categories; and identifying a relationship between a structured variable and text documents included in a category based on a probablility of occurrence of a combination of said structured variable and said category, wherein said text documents comprise problem tickets in a helpdesk log, and said period of time comprises dates of said problem tickets.
-
Specification