Multi-entity normalization
First Claim
1. A computer-implemented method, comprising:
- retrieving data from a plurality of data providers, wherein the data represents entities and relationships between the entities;
defining a plurality of graph rules for a graph model, the graph model defining a plurality of data patterns and the relationships between the entities;
searching the data using the plurality of graph rules, the searching including comparing the plurality of data patterns in the graph rules to a structure associated with the data;
in response to determining that at least a portion of the structure associated with the data matches at least one of the plurality of data patterns in the graph rules, assessing whether the data includes missing relationships between the entities that result in a violation of at least one of the plurality of graph rules; and
in response to determining that the data includes missing relationships between at least two of the entities, correcting, using the at least one graph rule, the missing relationships in the structure to generate corrected data and corrected structure of the data,wherein the correcting includes automatic correction of the graph model and generation of at least one additional relationship linking the at least two entities based on the portion of the structure matching at least one of the plurality of data patterns.
9 Assignments
0 Petitions
Accused Products
Abstract
In accordance with aspects of the disclosure, systems and methods are provided for normalizing data representing entities and relationships linking the entities including defining one or more graph rules describing searchable characteristics for the data representing the entities and relationships linking the entities, applying the one or more graph rules to the data representing the entities and the relationships linking the entities, identifying one or more matching instances between the one or more graph rules and the data representing the entities and the relationships linking the entities, and performing one or more actions to update the one or more matching instances between the one or more graph rules and the data representing the entities and the relationships linking the entities.
14 Citations
20 Claims
-
1. A computer-implemented method, comprising:
-
retrieving data from a plurality of data providers, wherein the data represents entities and relationships between the entities; defining a plurality of graph rules for a graph model, the graph model defining a plurality of data patterns and the relationships between the entities; searching the data using the plurality of graph rules, the searching including comparing the plurality of data patterns in the graph rules to a structure associated with the data; in response to determining that at least a portion of the structure associated with the data matches at least one of the plurality of data patterns in the graph rules, assessing whether the data includes missing relationships between the entities that result in a violation of at least one of the plurality of graph rules; and in response to determining that the data includes missing relationships between at least two of the entities, correcting, using the at least one graph rule, the missing relationships in the structure to generate corrected data and corrected structure of the data, wherein the correcting includes automatic correction of the graph model and generation of at least one additional relationship linking the at least two entities based on the portion of the structure matching at least one of the plurality of data patterns. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer program product including instructions recorded on a non-transitory computer-readable storage medium and comprising instructions that, when executed by at least one processor, cause the at least one processor to:
-
retrieve data from a plurality of data providers, wherein the data represents entities and relationships between the entities; define a plurality of graph rules for a graph model, the graph model defining a plurality of data patterns and the relationships between the entities; search the data using the plurality of graph rules, the searching including comparing the plurality of data patterns in the graph rules to a structure associated with the data; in response to determining that at least a portion of the structure associated with the data matches at least one of the plurality of data patterns in the graph rules, assessing whether the data includes missing relationships between the entities that result in a violation of at least one of the plurality of graph rules; and in response to determining that the data includes missing relationships between at least two of the entities, correcting, using the at least one graph rule, the missing relationships in the structure to generate corrected data and corrected structure of the data, wherein the correcting includes automatic correction of the graph model and generation of at least one additional relationship linking the at least two entities based on the portion of the structure matching at least one of the plurality of data patterns. - View Dependent Claims (9, 10, 11, 12, 13)
-
-
14. A computer implemented system including instructions recorded on a non-transitory computer-readable medium and executable by at least one processor, the system comprising:
-
a normalization engine configured to cause the at least one processor to validate and clean data representing entities and relationships linking the entities, the normalization engine including; a rule definition module to define a plurality of graph rules based on a graph model describing searchable characteristics of a plurality of data patterns that specify how entities and relationships linking the entities are interrelated; a rule application module to search the data using the plurality of graph rules, the searching including comparing one or more of the plurality of graph rules to a structure associated with the data and responsive to determining that at least a portion of the structure associated with the data matches at least one data pattern in the plurality of data patterns, assessing whether the data includes missing relationships between the entities that result in a violation of at least one of the plurality of graph rules; and a rule action module to generate an indication to correct the data, the structure of the data, and the graph model in response to determining that the data violates one or more of the plurality of graph rules based on the determined missing relationships and to correct the missing relationships, wherein the graph model is automatically corrected based on the portion of the structure matching the at least one data pattern. - View Dependent Claims (15, 16, 17, 18, 19, 20)
-
Specification