×

Data mining technique with maintenance of ancestry counts

  • US 10,268,953 B1
  • Filed: 01/13/2015
  • Issued: 04/23/2019
  • Est. Priority Date: 01/28/2014
  • Status: Active Grant
First Claim
Patent Images

1. A data mining system, for use with a data mining training database containing a plurality of data samples, comprising:

  • a computer system having a memory having a candidate gene database identifying a pool of candidate individuals, each of the candidate individuals identifying a plurality of conditions and at least one corresponding proposed output in dependence upon the conditions, anda gene pool processor which;

    performs a procreation step of forming new individuals in the pool of candidate individuals at least in part by copying into each subject new individual at least one member of the group consisting of;

    a condition in an individual in a set of at least one parent individual corresponding to the subject new individual, and an output in an individual in the set of parent individuals corresponding to the subject new individual;

    tests each individual in a testing subset of at least one of the candidate individuals, each of the tests applying the conditions of the respective individual to a respective subset of the data samples in the training database to propose a result, each individual in the testing subset being tested on at least one data sample and at least one of the individuals in the testing subset being tested on more than one data sample;

    calculates an overall fitness estimate for each of the individuals in the testing subset, in dependence upon the results proposed by the respective individual when the conditions of the respective individual were applied to the respective subset of the data samples; and

    stores, in association with each of the candidate individuals in the testing subset, a respective ancestry count indicating a respective number of procreation events in the ancestry of the individual,the gene pool processor further including a competition module which (i) adjusts respective overall fitness estimates of the individuals in dependence upon their respective ancestry counts and (ii) selects individuals for discarding in dependence upon comparisons among their respective overall fitness estimates,the computer system further having a gene harvesting module providing for deployment selected ones of the remaining individuals from the pool of candidate individuals,wherein the computer system comprises a server and a collection of at least one client device, andwherein in testing each individual in a testing subset of at least one of the candidate individuals;

    the server delegates to the at least one client device the testing of the individuals in the testing subset; and

    the server receives tested individuals from the at least one client device, a first subset of at least one of the received tested individuals being different from all of the individuals previously delegated by the server, each tested individual being received in association with an indication of its performance during testing by the at least one client device and at least the tested individuals in the first subset also being received in association with an indication of its ancestry count.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×