×

SYSTEM AND METHOD FOR MACHINE LEARNING AND CLASSIFYING DATA

  • US 20140344195A1
  • Filed: 05/20/2014
  • Published: 11/20/2014
  • Est. Priority Date: 05/20/2013
  • Status: Active Grant
First Claim
Patent Images

1. A computerized method for classifying data comprising:

  • (a) receiving the data;

    (b) dividing the received data into two or more chunks;

    (c) mapping each chunk into a token and storing the token in a token collection;

    (d) hashing each token using two or more local sensitivity hashing functions, wherein each local sensitivity hashing function contains two or more random hashing seed numbers, determining a minimum hash value for each local sensitivity hashing function, and storing the minimum hash value for each local sensitivity hashing function in a minimum hash set collection;

    (e) classifying the data using the minimum hash values for the tokens; and

    wherein the foregoing steps are performed by one or more processors.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×