System and method of detecting common patterns within unstructured data elements retrieved from big data sources
First Claim
Patent Images
1. A method for detection of common patterns within unstructured data elements, comprising:
- searching a plurality of unstructured data elements extracted from big data sources to identify a plurality of patches;
extracting the plurality of patches that were identified;
generating, by a signature generator system, at least one signature for each one patch of the plurality of patches to generate a plurality of signatures for the plurality of patches, wherein the signature generator system includes a plurality of computational cores configured to receive the plurality of patches, each one computational core of the plurality of computational cores having properties that are at least partly statistically independent of other ones of the plurality of computational cores, wherein the properties of the one computational core are set independently of each other computational core of the plurality of computational cores;
identifying common patterns among the plurality of signatures;
clustering the plurality of signatures having the common patterns that were identified to generate a plurality of clusters; and
correlating the plurality of clusters to identify associations between the respective common patterns that were identified.
3 Assignments
0 Petitions
Accused Products
Abstract
A method and system for detection of common patterns within unstructured data elements. The method includes searching a plurality of unstructured data elements extracted from big data sources to identify a plurality of patches; extracting the identified plurality of patches; generating, by a signature generator system, at least one signature for each patch; identifying common patterns among the at least one generated signature; clustering the signatures having the identified common patterns; and correlating the generated clusters to identify associations between the respective identified common patterns.
369 Citations
19 Claims
-
1. A method for detection of common patterns within unstructured data elements, comprising:
-
searching a plurality of unstructured data elements extracted from big data sources to identify a plurality of patches; extracting the plurality of patches that were identified; generating, by a signature generator system, at least one signature for each one patch of the plurality of patches to generate a plurality of signatures for the plurality of patches, wherein the signature generator system includes a plurality of computational cores configured to receive the plurality of patches, each one computational core of the plurality of computational cores having properties that are at least partly statistically independent of other ones of the plurality of computational cores, wherein the properties of the one computational core are set independently of each other computational core of the plurality of computational cores; identifying common patterns among the plurality of signatures; clustering the plurality of signatures having the common patterns that were identified to generate a plurality of clusters; and correlating the plurality of clusters to identify associations between the respective common patterns that were identified. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system for analyzing unstructured data, comprising:
-
a network interface for allowing connectivity to a plurality of big data sources; a processing unit; and a memory connected to the processing unit, the memory containing instructions that, when executed by the processing unit, configure the system to; search a plurality of unstructured data elements extracted from the big data sources to identify a plurality of patches; extract the plurality of patches that were identified; generate, by a signature generator system, at least one signature for each one patch of the plurality of patches to generate a plurality of signatures for the plurality of patches, wherein the signature generator system includes a plurality of computational cores configured to receive the plurality of patches, each one computational core of the plurality of computational cores having properties that are at least partly statistically independent of other ones of the plurality of computational cores, wherein the properties of the one computational core are set independently of each other computational core of the plurality of computational cores; identify common patterns among the plurality of signatures; cluster the plurality of signatures having the common patterns that were identified to generate a plurality of clusters; and correlate the plurality of clusters to identify associations between the respective common patterns that were identified. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19)
-
Specification