Signature representation of data with aliasing across synonyms
First Claim
Patent Images
1. A method for generating, in a computing device, an m-dimensional aliased signature vector comprising m vector elements, the method comprising:
- accessing a plurality of key-value pairs, each comprising a respective key, corresponding to one of n unique identifiers, and a non-zero value, at least one of the keys being associated with one or more synonymous unique identifiers from the n unique identifiers; and
calculating each vector element based on a summation of a plurality of terms, each term calculated from a respective key-value pair, when the key of the respective key-value pair is associated one or more synonymous unique identifiers, the term calculated by adding one or more sub-terms, each calculated from a respective synonymous unique identifier of the one or more synonymous unique identifiers associated with the key of the respective key-value pair by;
generating a synonym seed based on the respective synonymous unique identifier and the element identifier associated with the vector element being calculated;
generating a synonym pseudo-random number from the generated synonym seed; and
multiplying the synonym pseudo-random number by the value of the respective key-value pair,wherein m>
>
n.
4 Assignments
0 Petitions
Accused Products
Abstract
A system and method for generating an m-dimensional signature vector with aliasing across synonyms in a computing device is provided. Data representable by a plurality of key-value pairs, each comprising a unique identifier and an associated non-zero value of n-dimensional data is received at the computing device. Elements of the m-dimensional aliased signature vector are generated using synonym information of keys from the plurality of key-value pairs.
21 Citations
27 Claims
-
1. A method for generating, in a computing device, an m-dimensional aliased signature vector comprising m vector elements, the method comprising:
-
accessing a plurality of key-value pairs, each comprising a respective key, corresponding to one of n unique identifiers, and a non-zero value, at least one of the keys being associated with one or more synonymous unique identifiers from the n unique identifiers; and calculating each vector element based on a summation of a plurality of terms, each term calculated from a respective key-value pair, when the key of the respective key-value pair is associated one or more synonymous unique identifiers, the term calculated by adding one or more sub-terms, each calculated from a respective synonymous unique identifier of the one or more synonymous unique identifiers associated with the key of the respective key-value pair by; generating a synonym seed based on the respective synonymous unique identifier and the element identifier associated with the vector element being calculated; generating a synonym pseudo-random number from the generated synonym seed; and multiplying the synonym pseudo-random number by the value of the respective key-value pair, wherein m>
>
n. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A computing device for generating an m-dimensional aliased signature vector, the device comprising:
-
a memory containing instructions; and a processor for executing instructions, the instructions when executed by the processor configuring the device to provide functionality for; accessing a plurality of key-value pairs, each comprising a respective key, corresponding to one of n unique identifiers, and a non-zero value, at least one of the keys being associated with one or more synonymous unique identifiers from the n unique identifiers; and calculating each vector element based on a summation of a plurality of terms, each term calculated from a respective key-value pair, when the key of the respective key-value pair is associated one or more synonymous unique identifiers, the term calculated by adding one or more sub-terms, each calculated from a respective synonymous unique identifier of the one or more synonymous unique identifiers associated with the key of the respective key-value pair by; generating a synonym seed based on the respective synonymous unique identifier and the element identifier associated with the vector element being calculated; generating a synonym pseudo-random number from the generated synonym seed; and multiplying the synonym pseudo-random number by the value of the respective key-value pair, wherein m>
>
n. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26)
-
-
27. A non-transitory computer readable memory containing instructions for generating an m-dimensional signature vector comprising m vector elements, the instructions which when executed by a processor perform the method of:
-
accessing a plurality of key-value pairs, each comprising a respective key, corresponding to one of n unique identifiers, and a non-zero value, at least one of the keys being associated with one or more synonymous unique identifiers from the n unique identifiers; and calculating each vector element based on a summation of a plurality of terms, each term calculated from a respective key-value pair, when the key of the respective key-value pair is associated one or more synonymous unique identifiers, the term calculated by adding one or more sub-terms, each calculated from a respective synonymous unique identifier of the one or more synonymous unique identifiers associated with the key of the respective key-value pair by; generating a synonym seed based on the respective synonymous unique identifier and the element identifier associated with the vector element being calculated; generating a synonym pseudo-random number from the generated synonym seed; and multiplying the synonym pseudo-random number by the value of the respective key-value pair, wherein m>
>
n.
-
Specification