Method, system, and computer program product for concept-based multi-dimensional analysis of unstructured information
Abstract
A method, operating model, system, data structure, computer program, and computer program product for analyzing, categorizing, and exploring or querying unstructured information, and for tracking trends and exceptions. Also described are a method for analytical processing of unstructured objects in a dimensional space, methods for tracking trends and exceptions in concepts, and tools and an interface for displaying concepts, query results, and tracked trends and exceptions.
56 Claims
1. A computer-based method to allow a user to analyze unstructured objects, comprising:
allowing a user to select a plurality of unstructured objects, at least a portion organized into categories, as a corpus;
allowing the user to select at least one concept to form a concept set; and
displaying a first dimension including at least one of the categories, wherein for each concept in the concept set, a presence or absence of the concept is indicated for each category in the first dimension.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 14, 15, 16, 17, 18

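As an illustration of the display step in claim 1, the following minimal Python sketch builds a category-by-concept presence/absence grid. The claim does not say how a concept is detected in an unstructured object; the keyword matching used here, the sample corpus, the concept set, and the function names `tokens` and `presence_by_category` are all assumptions made for illustration only.

```python
from collections import defaultdict

# Hypothetical corpus: each unstructured object is a (text, category) pair.
corpus = [
    ("The engine overheated after the coolant leak.", "complaints"),
    ("Brake pads wore out early.", "complaints"),
    ("Great mileage and a quiet engine.", "reviews"),
]

# Hypothetical concept set: each concept maps to keywords that signal it.
# (Assumption: the claim does not define how concepts are detected.)
concept_set = {
    "engine": {"engine", "motor"},
    "brakes": {"brake", "brakes"},
    "fuel economy": {"mileage", "mpg"},
}


def tokens(text):
    """Lowercase words with surrounding punctuation stripped."""
    return {w.strip(".,;:!?").lower() for w in text.split()}


def presence_by_category(corpus, concept_set):
    """Return {category: {concept: True/False}} for the first dimension."""
    words_by_category = defaultdict(set)
    for text, category in corpus:
        words_by_category[category] |= tokens(text)
    return {
        category: {
            concept: bool(words & keywords)
            for concept, keywords in concept_set.items()
        }
        for category, words in words_by_category.items()
    }


grid = presence_by_category(corpus, concept_set)
for category, row in grid.items():
    cells = ", ".join(
        f"{concept}: {'present' if present else 'absent'}"
        for concept, present in row.items()
    )
    print(f"{category} -> {cells}")
```

Each row of `grid` corresponds to one category in the first dimension, and each cell records only whether the concept occurs anywhere in that category.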
11. A computer-based method to allow a user to analyze unstructured objects, comprising:
allowing a user to select a plurality of unstructured objects as a corpus;
allowing the user to select at least one concept to form a concept set;
allowing the user to select a first dimension, wherein a dimension includes at least one unstructured object from the corpus and/or at least one category;
displaying the first dimension, wherein for each concept in the concept set, a presence or absence of the concept is indicated for each unstructured object and/or category in the first dimension; and
further comprising the following steps which support online analytical processing (OLAP) requirements of drill down, slice and dice, and graphical analysis:
allowing the user to display a second dimension as a result of selecting a first category from the first dimension, a first concept from the concept set, or a first indication of a presence or absence of a concept from the concept set in an unstructured object or category of the first dimension, thereby supporting drill down;
allowing the user to modify the first dimension, the second dimension or the concept set, thereby supporting slice and dice;
allowing the user to specify a second category for sharing with a second corpus, thereby further supporting slice and dice; and
allowing the user to graphically display information provided in the first dimension or the second dimension, thereby supporting graphical analysis.

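The drill-down and slice-and-dice steps of claim 11 can be sketched roughly as follows. This is a minimal Python illustration, not the patented implementation: keyword-based concept detection, the sample data, and the helper names `object_row`, `first_dimension`, `drill_down`, and `slice_concepts` are hypothetical. Sharing a category with a second corpus and graphical display are not covered.

```python
# Hypothetical corpus: object id -> (text, category).
corpus = {
    "doc1": ("The engine overheated after the coolant leak.", "complaints"),
    "doc2": ("Brake pads wore out early.", "complaints"),
    "doc3": ("Great mileage and a quiet engine.", "reviews"),
}

# Hypothetical concept set; keyword matching is an assumption.
concept_set = {
    "engine": {"engine", "motor"},
    "brakes": {"brake", "brakes"},
    "fuel economy": {"mileage", "mpg"},
}


def tokens(text):
    """Lowercase words with surrounding punctuation stripped."""
    return {w.strip(".,;:!?").lower() for w in text.split()}


def object_row(text, concepts):
    """Presence/absence of each concept in a single unstructured object."""
    words = tokens(text)
    return {c: bool(words & keywords) for c, keywords in concepts.items()}


def first_dimension(corpus, concepts):
    """Category-level grid: a concept is present if any object has it."""
    grid = {}
    for text, category in corpus.values():
        merged = grid.setdefault(category, {c: False for c in concepts})
        for concept, present in object_row(text, concepts).items():
            merged[concept] = merged[concept] or present
    return grid


def drill_down(corpus, concepts, category, concept=None):
    """Second dimension: the objects inside one selected category.

    If a concept is also selected, keep only objects that contain it.
    """
    rows = {
        oid: object_row(text, concepts)
        for oid, (text, cat) in corpus.items()
        if cat == category
    }
    if concept is not None:
        rows = {oid: row for oid, row in rows.items() if row[concept]}
    return rows


def slice_concepts(concepts, keep):
    """Slice and dice: restrict the concept set to the chosen concepts."""
    return {c: keywords for c, keywords in concepts.items() if c in keep}


overview = first_dimension(corpus, concept_set)
detail = drill_down(corpus, concept_set, "complaints", "engine")
engine_only = first_dimension(corpus, slice_concepts(concept_set, {"engine"}))
```

Here `overview` plays the role of the first dimension, `detail` is the second dimension reached by selecting the "complaints" category and the "engine" concept, and `engine_only` shows the grid after the concept set has been modified.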
13. A computer-based method to allow a user to analyze unstructured objects, comprising:
allowing a user to select a plurality of unstructured objects as a corpus;
allowing the user to select at least one concept to form a concept set;
allowing the user to select a first dimension, wherein a dimension includes at least one unstructured object from the corpus and/or at least one category;
displaying the first dimension, wherein for each concept in the concept set, a presence or absence of the concept is indicated for each unstructured object and/or category in the first dimension;
adding an additional unstructured object to the corpus; and
categorizing the additional unstructured object to be associated with a category, wherein the categorizing step comprises:
calculating an inverted category ratio and a concept ratio for each concept associated with the additional unstructured object;
calculating a category normalization ratio; and
associating the additional unstructured object with a category based on the inverted category ratio, the concept ratio, and the category normalization ratio.

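Claim 13 names three quantities (an inverted category ratio, a concept ratio, and a category normalization ratio) but the claim text does not define them. The Python sketch below is therefore only one plausible reading, loosely modeled on TF-IDF-style weighting, and every formula in it is an assumption rather than the patent's definition: the concept ratio is taken as the fraction of a category's objects containing the concept, the inverted category ratio as the number of categories divided by the number of categories containing the concept, and the category normalization ratio as one over the number of distinct concepts in the category. The function name `categorize` and the sample data are hypothetical.

```python
def categorize(new_concepts, categorized):
    """Pick a category for a new object described by its set of concepts.

    categorized maps category -> list of concept sets, one per object.
    All three ratios below use ASSUMED definitions (see the lead-in).
    """
    n_categories = len(categorized)
    scores = {}
    for category, objects in categorized.items():
        distinct = set().union(*objects)
        # ASSUMED category normalization ratio: damp concept-rich categories.
        normalization = 1.0 / len(distinct) if distinct else 0.0

        total = 0.0
        for concept in new_concepts:
            # Number of categories in which the concept occurs at all.
            containing = sum(
                1
                for objs in categorized.values()
                if any(concept in o for o in objs)
            )
            if containing == 0 or not objects:
                continue  # concept unseen so far, or empty category
            # ASSUMED inverted category ratio: rarer across categories = higher.
            inverted_category_ratio = n_categories / containing
            # ASSUMED concept ratio: share of this category's objects with it.
            concept_ratio = sum(1 for o in objects if concept in o) / len(objects)
            total += inverted_category_ratio * concept_ratio

        scores[category] = normalization * total

    best = max(scores, key=scores.get)
    return best, scores


# Hypothetical categorized corpus and a new, uncategorized object.
categorized = {
    "complaints": [{"engine", "overheating"}, {"brakes", "wear"}],
    "reviews": [{"engine", "fuel economy"}, {"comfort"}],
}
best, scores = categorize({"brakes", "overheating", "engine"}, categorized)
print(best, scores)
```

With this sample data the new object is associated with "complaints", because two of its three concepts occur only in that category and so receive the larger assumed inverted category ratio.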
19. A system to allow a user to analyze unstructured objects, comprising:
a processor; and
a memory in communication with the processor, wherein the memory stores a plurality of processing instructions for directing the processor to:
allow a user to select a plurality of unstructured objects, at least a portion organized into categories, as a corpus;
allow the user to select at least one concept to form a concept set; and
display a first dimension including at least one of the categories, wherein for each concept in the concept set, a presence or absence of the concept is indicated for each category in the first dimension.
Dependent claims: 20, 21, 22, 23, 24, 25, 26, 27, 28, 30, 32, 33, 34, 35, 36

29. A system to allow a user to analyze unstructured objects, comprising:
a processor; and
a memory in communication with the processor, wherein the memory stores a plurality of processing instructions for directing the processor to:
allow a user to select a plurality of unstructured objects as a corpus;
allow the user to select at least one concept to form a concept set;
allow the user to select a first dimension, wherein a dimension includes at least one unstructured object from the corpus and/or at least one category; and
display the first dimension, wherein for each concept in the concept set, a presence or absence of the concept is indicated for each unstructured object and/or category in the first dimension,
wherein the plurality of processing instructions supports online analytical processing (OLAP) requirements of drill down, slice and dice and graphical analysis by further including instructions for directing the processor to:
allow the user to display a second dimension as a result of selecting a first category from the first dimension, a first concept from the concept set, or a first indication of a presence or absence of a concept from the concept set in an unstructured object or category of the first dimension, thereby supporting drill down;
allow the user to modify the first dimension, the second dimension or the concept set, thereby supporting slice and dice;
allow the user to specify a second category for sharing with a second corpus, thereby further supporting slice and dice; and
allow the user to graphically display information provided in the first dimension or the second dimension, thereby supporting graphical analysis.

31. A system to allow a user to analyze unstructured objects, comprising:
a processor; and
a memory in communication with the processor, wherein the memory stores a plurality of processing instructions for directing the processor to:
allow a user to select a plurality of unstructured objects as a corpus;
allow the user to select at least one concept to form a concept set;
allow the user to select a first dimension, wherein a dimension includes at least one unstructured object from the corpus and/or at least one category;
display the first dimension, wherein for each concept in the concept set, a presence or absence of the concept is indicated for each unstructured object and/or category in the first dimension;
add an additional unstructured object to the corpus; and
categorize the additional unstructured object to be associated with a category, wherein the processing instructions for directing the processor to categorize the additional unstructured object include instructions for directing the processor to:
calculate an inverted category ratio and a concept ratio for each concept associated with the additional unstructured object;
calculate a category normalization ratio; and
associate the additional unstructured object with a category based on the inverted category ratio, the concept ratio, and the category normalization ratio.

37. A computer program product comprising a computer usable medium having control logic stored therein for causing a computer to allow a user to analyze unstructured objects, said control logic comprising:
first computer readable program code means for causing the computer to allow a user to select a plurality of unstructured objects, at least a portion organized into categories, as a corpus;
second computer readable program code means for causing the computer to allow the user to select at least one concept to form a concept set; and
third computer readable program code means for causing the computer to display a first dimension including at least one of the categories, wherein for each concept in the concept set, a presence or absence of the concept is indicated for each category in the first dimension.
Dependent claims: 38, 39, 40, 41, 42, 43, 44, 45, 46, 48, 50, 51, 52, 53, 54

47. A computer program product comprising a computer usable medium having control logic stored therein for causing a computer to allow a user to analyze unstructured objects, said control logic comprising:
first computer readable program code means for causing the computer to allow a user to select a plurality of unstructured objects as a corpus;
second computer readable program code means for causing the computer to allow the user to select at least one concept to form a concept set;
third computer readable program code means for causing the computer to allow the user to select a first dimension, wherein a dimension includes at least one unstructured object from the corpus and/or at least one category;
fourth computer readable program code means for causing the computer to display the first dimension, wherein for each concept in the concept set, a presence or absence of the concept is indicated for each unstructured object and/or category in the first dimension; and
fifth computer readable program code means supporting online analytical processing (OLAP) requirements of drill down, slice and dice and graphical analysis, the fifth computer readable program code means comprising:
sixth computer readable program code means for causing the computer to allow the user to display a second dimension as a result of selecting a first category from the first dimension, a first concept from the concept set, or a first indication of a presence or absence of a concept from the concept set in an unstructured object or category of the first dimension, thereby supporting drill down;
seventh computer readable program code means for causing the computer to allow the user to modify the first dimension, the second dimension or the concept set, thereby supporting slice and dice;
eighth computer readable program code means for causing the computer to allow the user to specify a second category for sharing with a second corpus, thereby further supporting slice and dice; and
ninth computer readable program code means for causing the computer to allow the user to graphically display information provided in the first dimension or the second dimension, thereby supporting graphical analysis.

49. A computer program product comprising a computer usable medium having control logic stored therein for causing a computer to allow a user to analyze unstructured objects, said control logic comprising:
first computer readable program code means for causing the computer to allow a user to select a plurality of unstructured objects as a corpus;
second computer readable program code means for causing the computer to allow the user to select at least one concept to form a concept set;
third computer readable program code means for causing the computer to allow the user to select a first dimension, wherein a dimension includes at least one unstructured object from the corpus and/or at least one category;
fourth computer readable program code means for causing the computer to display the first dimension, wherein for each concept in the concept set, a presence or absence of the concept is indicated for each unstructured object and/or category in the first dimension;
fifth computer readable program code means for causing the computer to add an additional unstructured object to the corpus; and
sixth computer readable program code means for causing the computer to categorize the additional unstructured object to be associated with a category, wherein the sixth computer readable program code means includes:
seventh computer readable program code means for causing the computer to calculate an inverted category ratio and a concept ratio for each concept associated with the additional unstructured object;
eighth computer readable program code means for causing the computer to calculate a category normalization ratio; and
ninth computer readable program code means for causing the computer to associate the additional unstructured object with a category based on the inverted category ratio, the concept ratio, and the category normalization ratio.

55. A computer-based method that supports online analytical processing (OLAP) requirements of drill down, slice and dice, and graphical analysis to allow a user to analyze a corpus of unstructured objects, comprising:
allowing the user to select at least one concept to form a concept set;
allowing the user to select a first dimension, wherein a dimension includes at least one unstructured object from the corpus and/or at least one category;
displaying the first dimension, wherein for each concept in the concept set, a presence or absence of the concept is indicated for each unstructured object and/or category in the first dimension;
allowing the user to display a second dimension as a result of selecting a first category from the first dimension, a first concept from the concept set, or a first indication of a presence or absence of a concept from the concept set in an unstructured object or category of the first dimension, thereby supporting drill down;
allowing the user to modify the first dimension, the second dimension or the concept set, thereby supporting slice and dice; and
allowing the user to graphically display information provided in the first dimension or the second dimension, thereby supporting graphical analysis.

56. A computer program product that supports online analytical processing (OLAP) to allow a user to analyze a corpus of unstructured objects, the computer program product comprising a computer usable medium having control logic stored therein, said control logic comprising:
first computer readable program code means for causing the computer to display a first dimension including at least one concept, at least one category, and at least one indication of a presence or absence of one of the at least one concept in one of the at least one category, wherein a category includes an unstructured object in the corpus;
second computer readable program code means for causing the computer to allow the user to display a second dimension as a result of the user selecting one of the at least one category, one of the at least one concept, or one of the at least one indication, thereby supporting drill down;
third computer readable program code means for causing the computer to allow the user to modify the at least one concept, thereby supporting slice and dice; and
fourth computer readable program code means for causing the computer to allow the user to graphically display information provided in the first dimension or the second dimension, thereby supporting graphical analysis.

Specification