System and Method for Parallelizing and Accelerating Learning Machine Training and Classification Using a Massively Parallel Accelerator
First Claim
1. A method for training an apparatus to recognize a pattern, the method comprising the steps of:
- providing the apparatus with a host processor executing steps of a machine learning process;
providing the apparatus with an accelerator including at least two processors;
inputting training pattern data into the host processor;
transferring the training data to the accelerator;
determining coefficient changes in the machine learning process with the host processor using the training pattern data;
transferring indices pertaining to one or more training vectors to the acceleratordetermining kernel dot-products with the at least two processors of the accelerator using the training data; and
transferring the dot-products back to the host processor.
3 Assignments
0 Petitions
Accused Products
Abstract
A method system for training an apparatus to recognize a pattern includes providing the apparatus with a host processor executing steps of a machine learning process; providing the apparatus with an accelerator including at least two processors; inputting training pattern data into the host processor; determining coefficient changes in the machine learning process with the host processor using the training pattern data; transferring the training data to the accelerator; determining kernel dot-products with the at least two processors of the accelerator using the training data; and transferring the dot-products back to the host processor.
-
Citations
21 Claims
-
1. A method for training an apparatus to recognize a pattern, the method comprising the steps of:
-
providing the apparatus with a host processor executing steps of a machine learning process; providing the apparatus with an accelerator including at least two processors; inputting training pattern data into the host processor; transferring the training data to the accelerator; determining coefficient changes in the machine learning process with the host processor using the training pattern data; transferring indices pertaining to one or more training vectors to the accelerator determining kernel dot-products with the at least two processors of the accelerator using the training data; and transferring the dot-products back to the host processor. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system for training an apparatus to recognize a pattern, the system comprising:
-
a host processor of the apparatus for determining coefficient changes of a machine learning process from input training data; an accelerator including at least two processors for determining kernel dot-products using the training data; and at least one conduit for transferring the training data from the host processor to the accelerator and for transferring the kernel dot-products from the accelerator to the host processor. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A method for recognizing patterns, the method comprising the steps of:
-
providing host processor executing steps of a support vector machine learning process; providing an accelerator including at least two processors and a memory bank associated with each of the at least two processors; storing support vectors in the memory banks of the accelerator; reducing the precision of unlabeled pattern data with the host processor; transferring unlabeled pattern data from the host processor to the accelerator; calculating labels for the unlabeled pattern data with the at least two processors of the accelerator using the support vectors stored in the memory banks of the accelerator; and transferring the labeled pattern data back to the host processor.
-
Specification