×

Parallel classification for data mining in a shared-memory multiprocessor system

  • US 6,230,151 B1
  • Filed: 04/16/1998
  • Issued: 05/08/2001
  • Est. Priority Date: 04/16/1998
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for generating a decision-tree classifier in a shared-memory multiprocessor system from a set of records, the tree having a plurality of nodes, the method comprising the steps of:

  • (a) generating cooperatively by the processors, in the shared memory, an attribute list for each attribute of the records, the attribute lists corresponding a current node and including tuples each having information on a record class;

    (b) assigning each attribute list of the current node to one of the processors;

    (c) each processor accessing the attribute lists assigned to the processor, in the shared memory, to determine a best split for each attribute list;

    (d) the processors cooperatively determining, through the shared memory, a global best split for all the attribute lists associated with the current node;

    (e) reassigning each attribute list of the current node to one of the processors;

    (f) each processor splitting the attribute lists reassigned to the processor according to the global best split into new attribute lists, the new lists corresponding to child nodes of the current node and residing in the shared memory; and

    (g) repeating steps (b)-(f) with each newly created child node as the current node, until each attribute list for the newly created child nodes includes only tuples of the same record class.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×