Systems and methods for record linkage and paraphrase generation using surrogate learning

  • US 8,108,326 B2
  • Filed: 02/06/2009
  • Issued: 01/31/2012
  • Est. Priority Date: 02/06/2008
  • Status: Active Grant
First Claim

1. A method of using a processor and a memory for classifying data associated with a feature space X to a set of classes y={0,1}, wherein features defining the feature space X are partitioned into X=X1×X2, a random feature vector x∈X is denoted correspondingly as x=(x1, x2), and feature x1 is a binary random variable, the method comprising:

    estimating P(x1|x2) from a set of unlabeled data;

    estimating P(x1=0|x2) from a set of labeled data;

    determining whether to classify a portion of the data to y=0 or y=1 based on the estimated P(x1=0|x2); and

    logically associating the portion of the data in the memory with the class y=0 or the class y=1 based on the determination.
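
The estimation and determination steps recited above can be illustrated with a minimal sketch. It is not the patented implementation: it assumes x2 is a single discrete (hashable) feature, estimates P(x1=0|x2) by simple frequency counting, and applies a hypothetical threshold of 0.5 whose direction (a high probability maps to y=0) is also an assumption. The function names `estimate_p_x1_zero_given_x2` and `classify` and the sample data are invented for illustration.

```python
from collections import defaultdict


def estimate_p_x1_zero_given_x2(samples):
    """Frequency estimate of P(x1=0 | x2) from (x1, x2) pairs.

    Assumption: x2 is discrete and hashable; no smoothing is applied.
    """
    zeros = defaultdict(int)
    totals = defaultdict(int)
    for x1, x2 in samples:
        totals[x2] += 1
        if x1 == 0:
            zeros[x2] += 1
    return {x2: zeros[x2] / totals[x2] for x2 in totals}


def classify(x2, p_zero, threshold=0.5):
    """Assign y=0 when the estimated P(x1=0 | x2) reaches the threshold, else y=1.

    The threshold value and its direction are hypothetical choices.
    """
    return 0 if p_zero[x2] >= threshold else 1


# Invented data: each pair is (x1, x2), with x1 binary.
samples = [(0, "a"), (0, "a"), (1, "a"), (1, "b"), (1, "b"), (0, "b"), (1, "b")]

p_zero = estimate_p_x1_zero_given_x2(samples)

# "Logically associating" each portion of the data with a class is modeled
# here as storing the decision in an in-memory dictionary.
labels = {x2: classify(x2, p_zero) for x2 in p_zero}
```

The sketch covers only the conditional-probability estimate and the thresholded decision; how the labeled and unlabeled estimates are combined is left to the full claims and specification.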
