Computer-Implemented System And Method For Analyzing Data Quality
First Claim
1. A computer-implemented system for analyzing data quality, comprising:
- memory storing a dataset comprising attributes each associated with one or more elements;
a client comprising an interest vector module to receive from a user a request for determining data quality of at least one attribute of the dataset based on an interest vector comprising a listing of the elements of that attribute and a selection of one or more of the elements as elements of interest, wherein each element is encrypted; and
a server, comprising;
a condensed vector module to populate a condensed vector comprising the same listing of elements as the interest vector with occurrence frequencies for each of the listed elements;
an encryption module to encrypt the elements of the condensed vector by computing an encrypted product of each element in the condensed vector and the corresponding element of the interest vector;
an aggregate module to determine an aggregate based on the encrypted products of each element of the interest vector and the corresponding element of the condensed vector; and
providing the aggregate as results of the data quality.
5 Assignments
0 Petitions
Accused Products
Abstract
A computer-implemented system and method for analyzing data quality is provided. Attributes each associated with one or more elements are maintained. A request from a user is received for determining data quality of at least one attribute based on an interest vector having a listing of the elements of that attribute and a selection of elements of interest. Each element is encrypted. A condensed vector having the same listing of elements as the interest vector is populated with occurrence frequencies for each of the listed elements. The elements of the condensed vector are encrypted by computing an encrypted product of each element in the condensed vector and the corresponding element of the interest vector. An aggregate is determined based on the encrypted products of each element of the interest vector and the corresponding element of the condensed vector. The aggregate is provided as results of the data quality.
6 Citations
20 Claims
-
1. A computer-implemented system for analyzing data quality, comprising:
-
memory storing a dataset comprising attributes each associated with one or more elements; a client comprising an interest vector module to receive from a user a request for determining data quality of at least one attribute of the dataset based on an interest vector comprising a listing of the elements of that attribute and a selection of one or more of the elements as elements of interest, wherein each element is encrypted; and a server, comprising; a condensed vector module to populate a condensed vector comprising the same listing of elements as the interest vector with occurrence frequencies for each of the listed elements; an encryption module to encrypt the elements of the condensed vector by computing an encrypted product of each element in the condensed vector and the corresponding element of the interest vector; an aggregate module to determine an aggregate based on the encrypted products of each element of the interest vector and the corresponding element of the condensed vector; and providing the aggregate as results of the data quality. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer-implemented method for analyzing data quality, comprising:
-
maintaining a dataset comprising attributes each associated with one or more elements; receiving from a user a request for determining data quality of at least one attribute of the dataset based on an interest vector comprising a listing of the elements of that attribute and a selection of one or more of the elements as elements of interest, wherein each element is encrypted; populating a condensed vector comprising the same listing of elements as the interest vector with occurrence frequencies for each of the listed elements; encrypting the elements of the condensed vector by computing an encrypted product of each element in the condensed vector and the corresponding element of the interest vector; determining an aggregate based on the encrypted products of each element of the interest vector and the corresponding element of the condensed vector; and providing the aggregate as results of the data quality. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification