INFORMATION MINING USING DOMAIN SPECIFIC CONCEPTUAL STRUCTURES
First Claim
1. A method for use with 1) a first set of documents related to a first topic of interest and 2) a second set of documents related to a second topic of interest, the method comprising the steps of:
- using a first taxonomy to categorize the first set of documents into a set of categories;
categorizing the second set of documents according to the set of categories of the first set of documents; and
examining a category to identify a document of interest, the document of interest being a representative document within the category.
0 Assignments
0 Petitions
Accused Products
Abstract
A method and analytics tools for information mining incorporating domain specific knowledge and conceptual structures are disclosed, the method including: providing a first set of documents related to a first topic of interest; using a first taxonomy to categorize the first set of documents into a set of categories; providing a second set of documents related to a second topic of interest; categorizing the second set of documents according to the set of categories of the first set of documents; using an element of domain knowledge to re-categorize the first set of documents; and examining a category to identify a document of interest.
32 Citations
20 Claims
-
1. A method for use with 1) a first set of documents related to a first topic of interest and 2) a second set of documents related to a second topic of interest, the method comprising the steps of:
-
using a first taxonomy to categorize the first set of documents into a set of categories; categorizing the second set of documents according to the set of categories of the first set of documents; and examining a category to identify a document of interest, the document of interest being a representative document within the category. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method for use with a set of documents related to a first topic of interest, the method comprising:
-
creating a first set of categories of the set of documents according to an automatically generated taxonomy; creating a second set of categories of the set of documents according to at least one of unstructured data, structured data, and annotations derived from text in the set of documents; constructing a contingency table having the first set of categories along a first axis and the second set of categories along a second axis; and identifying a relationship between at least two different categories using the contingency table. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A method comprising:
-
extracting a set of documents related to a specified topic from a data warehouse; generating a taxonomy for the set of documents that provides a first partition of the set of documents according to the taxonomy; using domain-specific knowledge to re-partition the set of documents to provide a second partition of the set of documents; and creating a refined taxonomy for the set of documents according to the second partition so that the refined taxonomy incorporates the domain-specific knowledge. - View Dependent Claims (14, 15, 16, 17, 18)
-
-
19. A computer program product for use with 1) a first set of documents related to a first topic of interest and 2) a second set of documents related to a second topic of interest, the computer program product comprising a computer useable medium including a computer readable program, wherein the computer readable program when executed on a computer causes the computer to:
-
categorize the first set of documents into a set of categories using a first taxonomy; categorize the second set of documents according to the set of categories of the first set of documents; and examine a category to identify a document of interest, wherein the document of interest typifies the category by most nearly matching a mathematical definition of the category.
-
-
20. A computer program product comprising a computer useable medium including a computer readable program, wherein the computer readable program when executed on a computer causes the computer to:
-
extract a set of documents related to a specified topic from a data warehouse; generate a taxonomy for the set of documents that provides a first partition of the set of documents according to the taxonomy; use domain-specific knowledge to re-partition the set of documents to provide a second partition of the set of documents; and create a refined taxonomy for the set of documents according to the second partition so that the refined taxonomy incorporates the domain-specific knowledge.
-
Specification