Approximate hashing functions for finding similar content
First Claim
Patent Images
1. A system comprising:
- one or more computers; and
a computer-readable medium coupled to the one or more computers having instructions stored thereon which, when executed by the one or more computers, cause the one or more computers to perform operations comprising;
A) training a plurality of learning systems, each learning system implementing a learning function and having an input and producing an output where training includes;
identifying a training set including target output values associated therewith;
providing the training set to each learning system in a small number of plurality of cycles and adjusting parameters of the learning system to improve matching to the target output values;
adjusting the target output values based on the actual output provided by the respective learning system; and
continuing training the learning system; and
B) initializing one or more data structures including;
providing samples to each trained learning system;
combining outputs of the learning systems for each sample; and
mapping the combined outputs to one or more data structures, the combined outputs providing indices to respective samples in those data structures; and
C) evaluating a target sample including;
providing the target sample to each trained learning system;
combining the outputs of the trained learning systems; and
locating matching samples in the one or more data structures using the combined outputs of the trained learning system for the target sample.
2 Assignments
0 Petitions
Accused Products
Abstract
A method including training a plurality of learning systems, each learning system implementing a learning function and having an input and producing an output, initializing one or more data structures, and evaluating a target sample is described. Also described are methods that include initializing one or more data structures and evaluating a target sample for a best match.
68 Citations
24 Claims
-
1. A system comprising:
-
one or more computers; and a computer-readable medium coupled to the one or more computers having instructions stored thereon which, when executed by the one or more computers, cause the one or more computers to perform operations comprising; A) training a plurality of learning systems, each learning system implementing a learning function and having an input and producing an output where training includes; identifying a training set including target output values associated therewith; providing the training set to each learning system in a small number of plurality of cycles and adjusting parameters of the learning system to improve matching to the target output values; adjusting the target output values based on the actual output provided by the respective learning system; and continuing training the learning system; and B) initializing one or more data structures including; providing samples to each trained learning system; combining outputs of the learning systems for each sample; and mapping the combined outputs to one or more data structures, the combined outputs providing indices to respective samples in those data structures; and C) evaluating a target sample including; providing the target sample to each trained learning system; combining the outputs of the trained learning systems; and locating matching samples in the one or more data structures using the combined outputs of the trained learning system for the target sample. - View Dependent Claims (2, 3)
-
-
4. A system comprising:
-
one or more computers; and a computer-readable medium coupled to the one or more computers having instructions stored thereon which, when executed by the one or more computers, cause the one or more computers to perform operations comprising; initializing a data structure including mapping samples that are to be included as entries in the data structure to locations in the data structure using a plurality of learning systems; and evaluating a target sample for a best match to a sample in the data structure including using an index system created using the plurality of learning systems to locate a match in the data structure. - View Dependent Claims (5, 6)
-
-
7. A system comprising:
-
one or more computers; and a computer-readable medium coupled to the one or more computers having instructions stored thereon which, when executed by the one or more computers, cause the one or more computers to perform operations comprising; initializing a data structure including mapping samples that are to be included as entries in the data structure to locations in the data structure using a plurality of learning systems; and evaluating a target sample for a best match to the samples in the data structure including using the learning systems to locate a match in the data structure without directly comparing the target sample to the data structure sample. - View Dependent Claims (8, 9)
-
-
10. A computer storage medium encoded with a computer program, the program comprising instructions that when executed by one or more computers cause the one or more computers to perform operations comprising:
-
A) training a plurality of learning systems, each learning system implementing a learning function and having an input and producing an output where training includes; identifying a training set including target output values associated therewith; providing the training set to each learning system in a small number of plurality of cycles and adjusting parameters of the learning system to improve matching to the target output values; adjusting the target output values based on the actual output provided by the respective learning system; and continuing training the learning system; and B) initializing one or more data structures including; providing samples to each trained learning system; combining outputs of the learning systems for each sample; and mapping the combined outputs to one or more data structures, the combined outputs providing indices to respective samples in those data structures; and C) evaluating a target sample including; providing the target sample to each trained learning system; combining the outputs of the trained learning systems; and locating matching samples in the one or more data structures using the combined outputs of the trained learning system for the target sample. - View Dependent Claims (11, 12)
-
-
13. A computer storage medium encoded with a computer program, the program comprising instructions that when executed by one or more computers cause the one or more computers to perform operations comprising:
-
initializing a data structure including mapping samples that are to be included as entries in the data structure to locations in the data structure using a plurality of learning systems; and evaluating a target sample for a best match to a sample in the data structure including using an index system created using the plurality of learning systems to locate a match in the data structure. - View Dependent Claims (14, 15)
-
-
16. A computer storage medium encoded with a computer program, the program comprising instructions that when executed by one or more computers cause the one or more computers to perform operations comprising:
-
initializing a data structure including mapping samples that are to be included as entries in the data structure to locations in the data structure using a plurality of learning systems; and evaluating a target sample for a best match to the samples in the data structure including using the learning systems to locate a match in the data structure without directly comparing the target sample to the data structure sample. - View Dependent Claims (17, 18)
-
-
19. A computer-implemented method comprising:
-
providing, by a client machine, a target sample to one or more servers for evaluation as a best match to a sample in a data structure stored by the one or more servers, wherein evaluation as a best match comprises using an index system created using a plurality of learning systems to locate a match in the data structure, wherein samples that are included as entries in the data structure are mapped to locations in the data structure using the plurality of learning systems; and receiving, by the client machine, data that identifies the best match from the one or more servers. - View Dependent Claims (20, 21)
-
-
22. A computer-implemented method comprising:
providing, by a client machine, a target sample to one or more servers for evaluation as a best match to samples in a data structure stored by the one or more servers, wherein evaluation as a best match comprises using a plurality of learning systems to locate a match in the data structure without directly comparing the target sample to the data structure sample, wherein samples that are included as entries in the data structure are mapped to locations in the data structure using the plurality of learning systems. - View Dependent Claims (23, 24)
Specification