Heuristic method of classification

  • US 7,096,206 B2
  • Filed: 06/19/2001
  • Issued: 08/22/2006
  • Est. Priority Date: 06/19/2000
  • Status: Expired due to Term
First Claim

1. A computer implemented method of constructing a model configured to classify biological samples as being of one of at least a first state or a second state different than the first state, comprising:

    providing a plurality of data strings, each data string being derived from a biological sample known to be of the first state or the second state;

    using a genetic algorithm to select a first set of variables that identify data in each of the plurality of data strings;

    calculating a sample vector for each member of the set of data strings using the first set of variables;

    finding a location in a first vector space of each of at least two data clusters that best fit the sample vectors calculated using the first set of variables;

    determining a variability for the at least two data clusters that best fit the sample vectors calculated using the first set of variables;

    determining whether the variability of the at least two data clusters that best fit the sample vectors calculated using the first set of variables is within an acceptable tolerance;

    if it is determined that the variability of the at least two data clusters that best fit the sample vectors calculated using the first set of variables is within the acceptable tolerance, providing the locations in the first vector space of the at least two data clusters that best fit the sample vectors calculated using the first set of variables; and

    if it is determined that the variability of the at least two data clusters that best fit the sample vectors calculated using the first set of variables is not within the acceptable tolerance,

        using the genetic algorithm to select a second set of variables different than the first set of variables,

        calculating a sample vector for each member of the set of data strings using the second set of variables,

        finding a location in a second vector space of each of at least two data clusters that best fit the sample vectors calculated using the second set of variables,

        determining a variability for the at least two data clusters that best fit the sample vectors calculated using the second set of variables,

        determining whether the variability for the at least two data clusters that best fit the sample vectors calculated using the second set of variables is within the acceptable tolerance, and

        if it is determined that the variability of the at least two data clusters that best fit the sample vectors calculated using the second set of variables is within the acceptable tolerance, providing the locations in the second vector space of the at least two data clusters that best fit the sample vectors calculated using the second set of variables.
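
The loop recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation, and it makes several assumptions the claim does not state: each data string is a list of numbers, a "variable" is an index into that list, the clusters are fitted as one centroid per known state, variability is the mean distance of the sample vectors from their own centroid, and the genetic algorithm uses simple truncation selection with set-union crossover. All function and parameter names (`build_model`, `evaluate`, `tolerance`, and so on) are hypothetical.

```python
import random
from math import dist


def sample_vector(data_string, variables):
    """Pick the values of one data string identified by the selected variables."""
    return [data_string[i] for i in variables]


def fit_clusters(vectors, states):
    """Assumed cluster fit: one centroid per known state."""
    centroids = {}
    for state in set(states):
        members = [v for v, s in zip(vectors, states) if s == state]
        centroids[state] = [sum(col) / len(members) for col in zip(*members)]
    return centroids


def evaluate(variables, data, states):
    """Return (variability, cluster locations) for one set of variables."""
    vectors = [sample_vector(d, variables) for d in data]
    centroids = fit_clusters(vectors, states)
    spread = sum(dist(v, centroids[s]) for v, s in zip(vectors, states))
    return spread / len(vectors), centroids


def build_model(data, states, n_vars=2, tolerance=0.5,
                pop_size=20, max_gens=100, seed=0):
    """Search for variables whose clusters fall within the tolerance.

    Returns (variables, centroids, variability), or None if no candidate
    met the tolerance within max_gens generations.
    """
    rng = random.Random(seed)
    length = len(data[0])
    population = [tuple(sorted(rng.sample(range(length), n_vars)))
                  for _ in range(pop_size)]
    for _ in range(max_gens):
        # Rank candidate variable sets by cluster variability (lower is better).
        scored = sorted(population, key=lambda v: evaluate(v, data, states)[0])
        best = scored[0]
        var, centroids = evaluate(best, data, states)
        if var <= tolerance:
            return best, centroids, var  # provide the cluster locations
        # Otherwise let the genetic algorithm propose further sets of variables.
        parents = scored[: pop_size // 2]
        children = []
        while len(children) < pop_size - len(parents):
            a, b = rng.sample(parents, 2)
            pool = sorted(set(a) | set(b))          # crossover: mix parent indices
            child = set(rng.sample(pool, n_vars))
            if rng.random() < 0.3:                  # mutation: swap one index
                child.discard(rng.choice(sorted(child)))
            while len(child) < n_vars:
                child.add(rng.randrange(length))
            children.append(tuple(sorted(child)))
        population = parents + children
    return None
```

Note that this fitness measure looks only at how tight the clusters are, so a practical model would presumably also need to check that the clusters for the two states are well separated.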

  • 4 Assignments