Detection and creation of appropriate row concept during automated model generation
First Claim
1. A computer program product for comparative peer analysis for business intelligence (BI), the computer program product comprising a computer-readable storage medium having program code embodied therewith, the program code executable by at least one processing device to:
receive data, wherein the data is organized as a plurality of named fields, wherein each named field includes a set of values associated with the named field, wherein each named field is assigned to a category from a plurality of categories and wherein each set of values includes two or more entries;
determine, for at least one category, whether there is at least one identifier field for that category, wherein each identifier field is a named field that acts as an identifier for that category; and
select a concept, wherein selecting the concept includes:
determining whether one of the categories includes an identifier field that has a unique value for each entry in the identifier field set of values and, if so, selecting the identifier field as the concept;
if none of the categories include an identifier field that has a unique value for each entry in the identifier field set of values, determining whether one of the categories includes two or more identifier fields that, when combined, have a unique value for each entry in the combined identifier field set of values and, if so, selecting the combined identifier fields as the concept; and
if none of the categories include an identifier field that has a unique value for each entry in the identifier field set of values and if none of the categories include two or more identifier fields that, when combined, have a unique value for each entry in the combined identifier field set of values, adding a new identifier field, wherein adding the new identifier field includes providing a unique value for each entry in set of values included in the new identifier field, associating the new identifier field with one of the categories, and selecting the new identifier field as the concept.
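The three-tier selection recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation; it only mirrors the order of the checks. The names `is_unique`, `select_concept`, `columns`, `identifiers_by_category`, and the generated field name `row_id` are all hypothetical, and the sketch assumes the identifier fields of each category have already been determined and that the new field is attached to the first listed category.

```python
from itertools import combinations


def is_unique(columns, names):
    """True if the combined values of the fields in `names` differ for every entry."""
    entries = list(zip(*(columns[n] for n in names)))
    return len(set(entries)) == len(entries)


def select_concept(columns, identifiers_by_category):
    """Return the list of field names chosen as the concept (hypothetical helper)."""
    # Tier 1: a single identifier field with a unique value for each entry.
    for fields in identifiers_by_category.values():
        for name in fields:
            if is_unique(columns, [name]):
                return [name]

    # Tier 2: two or more identifier fields of one category, unique when combined.
    for fields in identifiers_by_category.values():
        for size in range(2, len(fields) + 1):
            for combo in combinations(fields, size):
                if is_unique(columns, combo):
                    return list(combo)

    # Tier 3: add a new identifier field with a unique value for each entry.
    entry_count = len(next(iter(columns.values())))
    new_name = "row_id"  # assumed name; extended below if it would collide
    while new_name in columns:
        new_name += "_"
    columns[new_name] = list(range(1, entry_count + 1))
    # Assumption: associate the new field with the first category, if any.
    category = next(iter(identifiers_by_category), "Row")
    identifiers_by_category.setdefault(category, []).append(new_name)
    return [new_name]


if __name__ == "__main__":
    data = {
        "first_name": ["Ann", "Bob", "Ann"],
        "last_name": ["Lee", "Lee", "Kim"],
        "salary": [50, 60, 50],
    }
    ids = {"Employee": ["first_name", "last_name"]}
    print(select_concept(data, ids))
```

In the sample data neither name field is unique on its own, so the sketch falls through to the combined-field tier. Values are assumed to be hashable so they can be compared through a set.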
Abstract
A system and method for assigning concepts to sets of values. Data is received, wherein the data is organized as a plurality of named fields and as two or more sets of values associated with the named fields, wherein each named field is assigned to a category. For each category, determine whether there is at least one identifier field for that category, wherein each identifier field is a named field that acts as an identifier for that category, and identify identifier fields, if any, for each category that have a unique value in the identifier field for that category for each set of values. Then select one of the categories as a concept representing the sets of values. In some embodiments, the data is organized as a table, wherein the named fields are columns and the sets of values are rows.
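For the table embodiment mentioned in the abstract, where named fields are columns and sets of values are rows, the per-category identification step might look like the following sketch. The function name `unique_identifier_fields`, the sample rows, and the category mapping are illustrative assumptions and do not come from the patent.

```python
def unique_identifier_fields(rows, identifiers_by_category):
    """For each category, list the identifier fields whose value differs in every row."""
    result = {}
    for category, fields in identifiers_by_category.items():
        result[category] = [
            field
            for field in fields
            # A field is unique when it has as many distinct values as there are rows.
            if len({row[field] for row in rows}) == len(rows)
        ]
    return result


if __name__ == "__main__":
    # Hypothetical table: each dict is one row, each key one named field (column).
    rows = [
        {"order_id": 10, "customer": "Ann", "region": "East"},
        {"order_id": 11, "customer": "Ann", "region": "West"},
        {"order_id": 12, "customer": "Bob", "region": "East"},
    ]
    identifiers = {
        "Order": ["order_id"],
        "Customer": ["customer"],
        "Region": ["region"],
    }
    print(unique_identifier_fields(rows, identifiers))
```

A category with a non-empty list is a candidate for the concept that represents the rows; in the sample data only the hypothetical `Order` category qualifies.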
9 Claims
1. (Set out in full under First Claim above.) Dependent claims: 2, 3, 4.
5. A system comprising:
a network;
a business intelligence system; and
one or more computing devices connected through the network to the business intelligence system;
wherein the business intelligence system receives data, wherein the data is organized as a plurality of named fields, wherein each named field includes a set of values associated with the named field, wherein each named field is assigned to a category from a plurality of categories and wherein each set of values includes two or more entries; and
wherein the business intelligence system includes a dataset analytics tool, wherein the dataset analytics tool:
receives the data;
determines, for at least one category, whether there is at least one identifier field for that category, wherein each identifier field is a named field that acts as an identifier for that category; and
selects a concept, wherein selecting the concept includes:
determining whether one of the categories includes an identifier field that has a unique value for each entry in the identifier field set of values and, if so, selecting the identifier field as the concept;
if none of the categories include an identifier field that has a unique value for each entry in the identifier field set of values, determining whether one of the categories includes two or more identifier fields that, when combined, have a unique value for each entry in the combined identifier field set of values and, if so, selecting the combined identifier fields as the concept; and
if none of the categories include an identifier field that has a unique value for each entry in the identifier field set of values and if none of the categories include two or more identifier fields that, when combined, have a unique value for each entry in the combined identifier field set of values, adding a new identifier field, wherein adding the new identifier field includes providing a unique value for each entry in set of values included in the new identifier field, associating the new identifier field with one of the categories, and selecting the new identifier field as the concept.
Dependent claims: 6, 7, 8, 9.
Specification