Vector space method for secure information sharing
First Claim
1. A computer-based method for securely sharing sensitive documents, wherein a non-sensitive vector space is embodied in a computer readable medium, and the non-sensitive vector space includes non-sensitive term vectors and non-sensitive document vectors corresponding to a collection of non-sensitive documents, the computer-based method comprising:
- (a) transmitting a subset of the non-sensitive term vectors to a plurality of entities that each manage a sensitive document database, each non-sensitive term vector corresponding to a non-sensitive term;
(b) receiving offset vectors from the plurality of entities responsive to sensitive documents in the sensitive document databases that contain the non-sensitive terms;
(c) updating the term vectors in the non-sensitive vector space responsive to the offset vectors from the sensitive documents, wherein step (c) comprises;
updating the term vectors in the non-sensitive vector space responsive to the offset vectors, such that a first term vector is updated based on a combination of offset vectors respectively received from each entity, andupdating the term vectors in the non-sensitive vector space responsive to the offset vectors, such that a new term vector is generated for each offset vector;
(d) generating a vector representation of a query in the non-sensitive vector space based on the updated term vectors; and
(e) identifying at least one non-sensitive document based on a comparison between the vector representation of the query and the non-sensitive document vectors.
3 Assignments
0 Petitions
Accused Products
Abstract
Presented are systems and methods for securely sharing confidential information. In such a method, term vectors corresponding to ones of a plurality of confidential terms included in a plurality of confidential documents is received. Each of the received term vectors is mapped into a vector space. Non-confidential documents are mapped into the vector space to generate a document vector corresponding to each non-confidential document, wherein the generation of each document vector is based on a subset of the received term vectors. At least one of the non-confidential documents is identified in response to a query mapped into the vector space.
62 Citations
8 Claims
-
1. A computer-based method for securely sharing sensitive documents, wherein a non-sensitive vector space is embodied in a computer readable medium, and the non-sensitive vector space includes non-sensitive term vectors and non-sensitive document vectors corresponding to a collection of non-sensitive documents, the computer-based method comprising:
-
(a) transmitting a subset of the non-sensitive term vectors to a plurality of entities that each manage a sensitive document database, each non-sensitive term vector corresponding to a non-sensitive term; (b) receiving offset vectors from the plurality of entities responsive to sensitive documents in the sensitive document databases that contain the non-sensitive terms; (c) updating the term vectors in the non-sensitive vector space responsive to the offset vectors from the sensitive documents, wherein step (c) comprises; updating the term vectors in the non-sensitive vector space responsive to the offset vectors, such that a first term vector is updated based on a combination of offset vectors respectively received from each entity, and updating the term vectors in the non-sensitive vector space responsive to the offset vectors, such that a new term vector is generated for each offset vector; (d) generating a vector representation of a query in the non-sensitive vector space based on the updated term vectors; and (e) identifying at least one non-sensitive document based on a comparison between the vector representation of the query and the non-sensitive document vectors. - View Dependent Claims (2, 3, 4)
-
-
5. A computer program product comprising a computer readable storage medium having computer readable program code stored therein for securely sharing sensitive information, wherein a non-sensitive vector space is embodied in the computer readable storage medium, and the non-sensitive vector space includes non-sensitive term vectors and non-sensitive document vectors corresponding to a collection of non-sensitive documents, the computer readable program code comprising:
-
computer readable first program code for causing a computer to transmit a subset of the non-sensitive term vectors to a plurality of entities that each manage a sensitive document database, each non-sensitive term vector corresponding to a non-sensitive term; computer readable second program code for causing a computer to receive offset vectors from the plurality of entities responsive to sensitive documents in the sensitive document databases that contain the non-sensitive terms; computer readable third program code for causing a computer to update the term vectors in the non-sensitive vector space responsive to the offset vectors from the sensitive documents, wherein the computer readable third program code comprises; code for causing the computer to update the term vectors in the non-sensitive vector space responsive to the offset vectors, such that a first term vector is updated based on a combination of offset vectors respectively received from each of the entities, and code for causing the computer to update the term vectors in the non-sensitive vector space responsive to the offset vectors, such that a new term vector is generated for each offset vector; computer readable fourth program code for causing a computer to generate a vector representation of a query in the non-sensitive vector space based on the updated term vectors; and computer readable fifth Program code for causing a computer to identify at least one non-sensitive document based on a comparison between the vector representation of the query and the non-sensitive document vectors. - View Dependent Claims (6, 7, 8)
-
Specification