Spread kernel support vector machine
Abstract
Disclosed is a parallel support vector machine technique for solving problems with a large set of training data where the kernel computation, as well as the kernel cache and the training data, are spread over a number of distributed machines or processors. A plurality of processing nodes are used to train a support vector machine based on a set of training data. Each of the processing nodes selects a local working set of training data based on data local to the processing node, for example a local subset of gradients. Each node transmits selected data related to the working set (e.g., gradients having a maximum value) and receives an identification of a global working set of training data. The processing node optimizes the global working set of training data and updates a portion of the gradients of the global working set of training data. The updating of a portion of the gradients may include generating a portion of a kernel matrix. These steps are repeated until a convergence condition is met. Each of the local processing nodes may store all, or only a portion of, the training data. While the steps of optimizing the global working set of training data, and updating a portion of the gradients of the global working set, are performed in each of the local processing nodes, the function of generating a global working set of training data is performed in a centralized fashion based on the selected data (e.g., gradients of the local working set) received from the individual processing nodes.
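The last two sentences of the abstract describe the key division of labor: each node only ever materializes the block of the kernel matrix that pairs its locally stored samples with the current working set, and uses that block to refresh its local gradients. Below is a minimal NumPy sketch of that partial-kernel gradient update. The RBF kernel, the helper names, and the parameter gamma are illustrative assumptions, not taken from the patent; the update follows the standard SVM dual gradient, g_i += y_i * sum over j in W of (delta_alpha_j * y_j * K(x_i, x_j)).

```python
import numpy as np

def rbf_kernel_block(X_rows, X_cols, gamma=0.1):
    """Kernel values between this node's local rows and the working-set
    columns. Only this block of the full kernel matrix is generated: the
    'portion of a kernel matrix' the abstract refers to."""
    d2 = ((X_rows[:, None, :] - X_cols[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)

def update_local_gradients(grad, X_local, y_local, X_ws, y_ws,
                           delta_alpha, gamma=0.1):
    """Refresh the gradients of the locally stored training data after the
    working-set alphas changed by delta_alpha (hypothetical helper names)."""
    K_block = rbf_kernel_block(X_local, X_ws, gamma)   # local rows x |W| cols
    grad += y_local * (K_block @ (delta_alpha * y_ws))
    return grad
```

Because the kernel block is recomputed (or served from a local kernel cache) on each node, no node ever needs the full kernel matrix in memory, which is what lets the technique scale to large training sets.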
Claims (18)
1. A method for training a support vector machine, comprising the steps of:
a) selecting, via a processor of a first processing node, a local working set of training data based on local training data stored in a memory of the first processing node;
b) transmitting, via a network interface of the first processing node, certain gradients to a second processing node, the certain gradients selected from gradients of the working set of training data;
c) receiving at the network interface of the first processing node an identification of a global working set of training data;
d) executing, via the processor of the first processing node, a quadratic function stored in a storage device of the first processing node to optimize said global working set of training data;
e) updating gradients of the training data stored in the memory of the first processing node; and
f) repeating said steps a) through e) until a convergence condition is met.

Dependent claims: 2, 3, 4, 5, 6, 7, 8, and 18.
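For readers tracing steps a) through f), the following sketch shows what one node's iteration might look like. It is illustrative only: the messaging object `comm`, the stand-in optimizer `solve_qp_subproblem`, and the working-set size `q` are hypothetical names, not claim elements; the node is assumed to hold a full copy of the training data, which the abstract notes is one permitted configuration; and the gradient-update helper from the sketch under the abstract is reused for step e).

```python
import numpy as np

def solve_qp_subproblem(K_ww, y_ws, alpha_ws, grad_ws, C=1.0,
                        steps=50, lr=0.1):
    """Stand-in for the stored quadratic function of step d): a few
    projected-gradient steps on the working-set alphas. The equality
    constraint of the SVM dual is omitted to keep the sketch short."""
    a = alpha_ws.copy()
    for _ in range(steps):
        # gradient restricted to W as the working-set alphas move
        g = grad_ws + y_ws * (K_ww @ (y_ws * (a - alpha_ws)))
        a = np.clip(a - lr * g, 0.0, C)     # project onto the box [0, C]
    return a

def train_on_node(X, y, alpha, grad, comm, q=2, gamma=0.1):
    """One node's view of the claim 1 loop (illustrative sketch only)."""
    while True:
        # a) select a local working set from the gradients in local memory:
        #    here simply the q samples with the most negative gradients
        local_ws = np.argsort(grad)[:q]

        # b) transmit the selected gradients (with their indices) to the
        #    coordinating (second) processing node
        comm.send(local_ws, grad[local_ws])

        # c) receive the identification of the global working set
        ws = comm.receive_global_working_set()

        # d) run the quadratic optimizer on the global working set
        K_ww = rbf_kernel_block(X[ws], X[ws], gamma)
        new_alpha_ws = solve_qp_subproblem(K_ww, y[ws], alpha[ws], grad[ws])

        # e) update the gradients of the locally stored training data,
        #    generating only the needed block of the kernel matrix
        delta = new_alpha_ws - alpha[ws]
        grad = update_local_gradients(grad, X, y, X[ws], y[ws], delta, gamma)
        alpha[ws] = new_alpha_ws

        # f) repeat until the globally agreed convergence condition is met
        if comm.converged(grad):
            return alpha
```

Note that only steps b) and c) touch the network, and the messages are tiny (a few indices and gradient values), which is why the expensive kernel and gradient work parallelizes well.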
9. A method for training a support vector machine, comprising the steps of:
a) selecting, at each of a plurality of processing nodes, via a processor of each of the processing nodes, a local working set of training data based on local training data stored in a memory of each of the processing nodes;
b) generating, via a processor of a network machine, a global working set of training data using certain gradients selected from gradients of each of the working sets of training data;
c) executing, at each of said plurality of processing nodes, via the processor of each of the processing nodes, a quadratic function stored in a storage device of each of the processing nodes to optimize said global working set of training data;
d) updating, at each of said plurality of processing nodes, gradients of the training data stored in the memory of each of the processing nodes; and
e) repeating steps a) through d) until a convergence condition is met.

Dependent claims: 10, 11, 12, 13, 14, 15, 16, and 17.
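Step b) of claim 9 is the one centralized operation: a network machine merges each node's locally selected candidates into a single global working set. Below is a sketch of one plausible merge, assuming each node sends (index, gradient) pairs and that "most violating" means smallest gradient value, consistent with the per-node selection sketched above; the size `q` and the ordering rule are assumptions for illustration, not claim language.

```python
def form_global_working_set(per_node_candidates, q=2):
    """Merge the candidates sent by every node (step b) of claim 9).

    per_node_candidates: one list of (index, gradient) pairs per node.
    Returns the indices of the q most violating samples overall."""
    merged = sorted(
        (pair for node in per_node_candidates for pair in node),
        key=lambda pair: pair[1],    # smallest gradient = most violating here
    )
    ws, seen = [], set()
    for idx, _g in merged:
        if idx not in seen:          # the same sample may be proposed twice
            seen.add(idx)
            ws.append(idx)
        if len(ws) == q:
            break
    return ws
```

The resulting indices are then broadcast back to every node; this broadcast is the "identification of a global working set" each node receives before running its local quadratic optimization.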
Specification