Multi-entity normalization
First Claim
1. A computer-implemented method, comprising:
- retrieving, from a plurality of data providers, data representing entities and relationships linking the entities;
defining, a plurality of graph rules defined in a graph model, the graph rules including one or more graph patterns associated with the graph model, the graph model specifying how the data is expected to be patterned and interrelated;
searching the data representing entities and relationships linking the entities using the one or more graph patterns associated with the graph model, the searching including comparing the one or more graph patterns to a structure associated with the data;
andin response to determining that the structure associated with the data matches at least one of the one or more graph patterns associated with the graph model, applying, in a continuous mode, the one or more graph rules by selecting an anchor entity and performing a graph walk from the anchor entity to determine violations to one or more of the graph rules, and correcting the data and the structure of the data, the correcting including updating the data representing the entities and relationships linking the entities that violate one or more of the plurality of graph rules.
11 Assignments
0 Petitions
Accused Products
Abstract
In accordance with aspects of the disclosure, systems and methods are provided for normalizing data representing entities and relationships linking the entities including defining one or more graph rules describing searchable characteristics for the data representing the entities and relationships linking the entities, applying the one or more graph rules to the data representing the entities and the relationships linking the entities, identifying one or more matching instances between the one or more graph rules and the data representing the entities and the relationships linking the entities, and performing one or more actions to update the one or more matching instances between the one or more graph rules and the data representing the entities and the relationships linking the entities.
-
Citations
18 Claims
-
1. A computer-implemented method, comprising:
-
retrieving, from a plurality of data providers, data representing entities and relationships linking the entities; defining, a plurality of graph rules defined in a graph model, the graph rules including one or more graph patterns associated with the graph model, the graph model specifying how the data is expected to be patterned and interrelated; searching the data representing entities and relationships linking the entities using the one or more graph patterns associated with the graph model, the searching including comparing the one or more graph patterns to a structure associated with the data; and in response to determining that the structure associated with the data matches at least one of the one or more graph patterns associated with the graph model, applying, in a continuous mode, the one or more graph rules by selecting an anchor entity and performing a graph walk from the anchor entity to determine violations to one or more of the graph rules, and correcting the data and the structure of the data, the correcting including updating the data representing the entities and relationships linking the entities that violate one or more of the plurality of graph rules. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computer system including instructions recorded on a non-transitory computer-readable medium and executable by at least one processor, the system comprising, a normalization engine configured to cause the at least one processor to validate and clean data representing entities and relationships linking the entities, the normalization engine including:
a rule definition module configured to define a plurality of graph rules describing searchable characteristics for the data representing the entities and the relationships linking the entities, the searchable characteristics defining one or more patterns that specify how entities and relationships linking the entities are expected to be modeled and interrelated; a rule application module configured to apply, in a continuous mode, the plurality of graph rules to ensure one or more sets of entities and the relationships linking the entities satisfy the plurality of graph rules including comparing the one or more patterns to a structure associated with the data representing entities and relationships linking the entities to determine whether the one or more patterns match the structure associated with the data representing the entities and relationships linking the entities; and a rule action module configured to perform one or more actions on the data and on the structure associated with the one or more sets of entities and the relationships linking the entities, the actions being performed to update the data representing the entities and relationships linking the entities that violate one or more of the plurality of graph rules, the update including cleaning the data by performing at least one of modifying one or more relationships linking the entities, wherein the update is performed in response to determining that the structure matches at least one of the one or more patterns. - View Dependent Claims (8, 9, 10, 11, 12, 13, 14, 15)
-
16. A computer program product, the computer program product being tangibly embodied on a non-transitory computer-readable medium and comprising instructions that, when executed by at least one processor, are configured to:
-
retrieving, from a plurality of data providers, data representing entities and relationships linking the entities; defining, a plurality of graph rules defined in a graph model, the graph rules including one or more graph patterns associated with the graph model, the graph model specifying how the data is expected to be patterned and interrelated; searching the data representing entities and relationships linking the entities using the one or more graph patterns associated with the graph model, the searching including comparing the one or more graph patterns to a structure associated with the data; and in response to determining that the structure associated with the data matches at least one of the one or more graph patterns associated with the graph model, applying, in a continuous mode, the one or more graph rules by selecting an anchor entity and performing a graph walk from the anchor entity to determine violations to one or more of the graph rules and correcting the data and the structure of the data, the correcting including updating the data representing the entities and relationships linking the entities that violate one or more of the plurality of graph rules. - View Dependent Claims (17, 18)
-
Specification