Concept-based analysis of structured and unstructured data using concept inheritance
First Claim
Patent Images
1. A method, comprising:
- defining a first concept based on a first set of structured and unstructured data objects, the first concept including a first seed concept and at least one first related concept and being defined by a first regular expression indicating (1) a presence of a text string in an unstructured data object from the first set and (2) a data code stored in a structured data object from the first set;
defining a second concept based on a second set of structured and unstructured data objects, the second concept including a second seed concept and at least one second related concept and being defined by a second regular expression indicating (1) a presence of a text string in an unstructured data object from the second set and (2) a data code stored in a structured data object from the second set, wherein the first concept is not positively correlated with the second concept;
defining a business rule to include a third regular expression indicating (1) the presence or absence of a text string indicative of at least one of the first concept or the second concept in an unstructured data object from the first set or from the second set and (2) the presence or absence of data code indicative of at least one of the first concept or the second concept in a structured data object from the first set or from the second set;
applying the business rule to a third set of structured and unstructured data objects to make a prediction indicative of whether the third set satisfies the business rule; and
outputting to a display information associated with the prediction.
6 Assignments
0 Petitions
Accused Products
Abstract
In one embodiment, a method comprises defining a set of concepts based on a first set of structured and unstructured data objects, defining a business rule based on the set of concepts, applying the business rule to a second set of structured and unstructured data objects to make a determination associated with that set, and outputting to a display information associated with the determination.
-
Citations
18 Claims
-
1. A method, comprising:
-
defining a first concept based on a first set of structured and unstructured data objects, the first concept including a first seed concept and at least one first related concept and being defined by a first regular expression indicating (1) a presence of a text string in an unstructured data object from the first set and (2) a data code stored in a structured data object from the first set; defining a second concept based on a second set of structured and unstructured data objects, the second concept including a second seed concept and at least one second related concept and being defined by a second regular expression indicating (1) a presence of a text string in an unstructured data object from the second set and (2) a data code stored in a structured data object from the second set, wherein the first concept is not positively correlated with the second concept; defining a business rule to include a third regular expression indicating (1) the presence or absence of a text string indicative of at least one of the first concept or the second concept in an unstructured data object from the first set or from the second set and (2) the presence or absence of data code indicative of at least one of the first concept or the second concept in a structured data object from the first set or from the second set; applying the business rule to a third set of structured and unstructured data objects to make a prediction indicative of whether the third set satisfies the business rule; and outputting to a display information associated with the prediction. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method, comprising:
-
defining a first concept based on a first set of structured and unstructured data objects; defining a second concept based on the first set, the definition of the second concept based at least in part on at least one structured data element from the first set and at least one unstructured data element from the first set; defining a third concept based on a second set of structured and unstructured data objects; defining a fourth concept; defining a relationship between the first concept and the second concept, the relationship specifying that the second concept is a subconcept of the first concept; defining a relationship between the second concept and the third concept, the relationship specifying that the second concept is a subconcept of the third concept; defining, at a first time, a business rule to include a regular expression indicative of a logical relationship between the first concept, the third concept, and the fourth concept; redefining, at a second time after the first time, the business rule to (1) include a regular expression indicative of a logical relationship between the first concept and the third concept and (2) exclude the fourth concept based on a positive correlation between the fourth concept and one of the first concept or the third concept; and executing the business rule on a third set of structured and unstructured data objects, the executing including (1) a first determination of a presence of the first concept in the third set, the first determination including a second determination of a presence of the second concept in the third set, and (2) a third determination of a presence of the third concept in the third set, the third determination including a fourth determination of a presence of the second concept in the third set; and making a prediction indicative of whether the third set satisfies the business rule, the prediction based on the first determination and the third determination; and outputting to a display information associated with the prediction. - View Dependent Claims (10, 11, 12, 13)
-
-
14. A method, comprising:
-
retrieving a concept hierarchy, the concept hierarchy including a first concept, a second concept, and a third concept, the first concept based at least in part on a first set of structured and unstructured data objects; receiving a first plurality of user input signals, each signal from the plurality of user input signals indicating a selection of one or more of the first concept, the second concept or the third concept; outputting information associated with the first concept, the second concept and the third concept to a display, wherein the first concept and the third concept are not positively correlated; receiving a second plurality of user input signals that (1) defines a first logical relationship that is between the first concept and a second concept, the first logical relationship based on a first regular expression indicating (a) a presence of a text string in an unstructured data object the first set or from a second set of structured and unstructured data objects and (b) a data code stored in a structured data object from the first set or from the second set, and (2) defines a second logical relationship that is between the first concept and the third concept, the second logical relationship based on a second regular expression indicating (a) a presence of a text string in an unstructured data object from the first set or from the second set and (b) a data code stored in a structured data object from the first set or from the second set, the second logical relationship defining a business rule; and executing the business rule on a third set of structured and unstructured data objects to make a prediction whether the second set of structured and unstructured data objects satisfies the business rule. - View Dependent Claims (15, 16, 17, 18)
-
Specification