System and method facilitating pattern recognition
First Claim
Patent Images
1. A pattern recognition system, comprising:
- a preprocessing component that receives an input and provides an output pattern;
at least one convolutional layer that receives the output pattern from the preprocessing component, the convolutional layer comprising a plurality of feature maps, the feature map including a weighted set of trainable parameters, the feature map sub-sampling every other position of at least a portion of the output pattern with the weighted set of trainable parameters utilized for the position sub-sampled and a next two adjacent positions in all dimensions, the at least one convolutional layer providing outputs associated with features extracted from the output pattern; and
,at least one fully connected layer that receives outputs from the at least one convolutional layer, the at least one fully connected layer classifying the features extracted by the at least one convolutional layer, the at least one fully connected layer providing a plurality of outputs, the output comprising a probability associated with a class, the pattern recognition system trained utilizing cross entropy error minimization based at least in part, upon the equation where E is the energy to be minimized, n indexes a pattern, t is the target value, ykn is the pattern recognition output on unit k for pattern n, and k indexes the classes.
2 Assignments
0 Petitions
Accused Products
Abstract
A system and method facilitating pattern recognition is provided. The invention includes a pattern recognition system having a convolutional neural network employing feature extraction layer(s) and classifier layer(s). The feature extraction layer(s) comprises convolutional layers and the classifier layer(s) comprises fully connected layers. The pattern recognition system can be trained utilizing a calculated cross entropy error. The calculated cross entropy error is utilized to update trainable parameters of the pattern recognition system.
-
Citations
31 Claims
-
1. A pattern recognition system, comprising:
-
a preprocessing component that receives an input and provides an output pattern; at least one convolutional layer that receives the output pattern from the preprocessing component, the convolutional layer comprising a plurality of feature maps, the feature map including a weighted set of trainable parameters, the feature map sub-sampling every other position of at least a portion of the output pattern with the weighted set of trainable parameters utilized for the position sub-sampled and a next two adjacent positions in all dimensions, the at least one convolutional layer providing outputs associated with features extracted from the output pattern; and
,at least one fully connected layer that receives outputs from the at least one convolutional layer, the at least one fully connected layer classifying the features extracted by the at least one convolutional layer, the at least one fully connected layer providing a plurality of outputs, the output comprising a probability associated with a class, the pattern recognition system trained utilizing cross entropy error minimization based at least in part, upon the equation where E is the energy to be minimized, n indexes a pattern, t is the target value, ykn is the pattern recognition output on unit k for pattern n, and k indexes the classes. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A pattern recognition system, comprising:
-
a preprocessing component that receives an input and provides a bitmap output pattern; a first convolutional layer that receives the bitmap output pattern from the preprocessing component, the first convolutional layer comprising a plurality of first feature maps, the first feature map including a first weighted set of trainable parameters, the first feature map sub-sampling every other position of at least a portion of the bitmap output pattern with the first weighted set of trainable parameters utilized for the position sub-sampled and a next two adjacent positions in all dimensions, the first feature maps providing outputs associated with first features extracted from the bitmap output pattern; a second convolutional layer that receives the outputs of the first feature maps, the second convolutional layer comprising a plurality of second feature maps, the second feature map including a second weighted set of trainable parameters, the second feature map sub-sampling every other position of at least a portion of the outputs of the first feature map with the second weighted set of trainable parameters utilized for the position sub-sampled and a next two adjacent positions in all dimensions, the second feature maps providing outputs associated with second features; a first fully connected layer that classifies the outputs of the second feature maps, the first fully connected layer providing outputs; and a second fully connected layer that classifies the outputs of the first fully connected layer, the second fully connected layer providing a plurality of outputs, the output comprising a probability associated with a class, the pattern recognition system trained utilizing cross entropy error minimization based at least in part, upon the equation where E is the energy to be minimized, n indexes a pattern, t is the target value, ykn is the pattern recognition output on unit k for pattern n, and k indexes the classes. - View Dependent Claims (10, 11, 12, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25)
-
-
13. The pattern recognition system of claim 13, the second feature map receiving 25 inputs.
-
26. A system for training a pattern recognition system, comprising:
-
a pattern recognition system comprising a preprocessing component that receives an input and provides an output pattern and a convolutional neural network that receives the output pattern from the preprocessing component sub-samples every other position of at least a portion of the output pattern with a weighted set of trainable parameters utilized for the position sub-sampled and a next two adjacent positions in all dimensions, and provides a plurality of class probability outputs; a cross entropy error calculator that calculates a cross entropy error based, at least in part, upon the plurality of class probability outputs and training class information utilizing the equation where E is the energy to be minimized, n indexes a pattern, t is the target value, ykn is the pattern recognition output on unit k for pattern n, and k indexes the classes; a back propagation gradient descent component that utilizes a stochastic descent algorithm to update the weighted set of trainable parameters of the pattern recognition system based, at least in part, upon the cross entropy error; and a trainable parameter update component that updates the weighted set of trainable parameters of the pattern recognition system. - View Dependent Claims (27)
-
-
28. A method for training a pattern recognition system, comprising:
-
preprocessing a training pattern utilizing a preprocessing component; performing pattern recognition on an output pattern from the preprocessing component utilizing a pattern recognition system based, at least in part, upon a convolutional neural network having a weighted set of trainable parameters; providing a plurality of class probability outputs based on the training pattern; calculating a cross entropy error based, at least in part, upon the plurality of class probability outputs and information associated with the training pattern utilizing the equation where E is the energy to be minimized, n indexes a pattern, t is the target value, ykn is the pattern recognition output on unit k for pattern n, and k indexes the classes; and
,updating the weighted set of trainable parameters based, at least in part, upon a gradient descent algorithm utilizing the calculated cross entropy error.
-
-
29. A data packet, transmitted between two or more computer components on a computer readable medium, that facilitates training a pattern recognition system, the data packet comprising:
-
a data field comprising a set of trainable parameters for a pattern recognition system based, at least in part, upon a convolutional neural network, the set of trainable parameters updated based, at least in part, upon a gradient descent algorithm utilizing a calculated cross entropy error utilizing the equation where E is the energy to be minimized, n indexes a pattern, t is the target value, ykn is the pattern recognition output on unit k for pattern n, and k indexes the classes.
-
-
30. A computer readable medium storing computer executable components of a system facilitating training of a pattern recognition, comprising:
-
a preprocessing component that receives an input and provides a bitmap output pattern; a pattern recognition component comprising a convolutional neural network that receives an output pattern from the preprocessing component, sub-samples every other position of at least a portion of the output pattern with a weighted set of trainable parameters utilized for the position sub-sampled and a next two adjacent positions in all dimensions, and provides a plurality of class probability outputs; a cross entropy error calculator component that calculates a cross entropy error based, at least in part, upon the plurality of class probability outputs and training class information utilizing the equation where E is the energy to be minimized, n indexes a pattern, t is the target value, ykn is the pattern recognition output on unit k for pattern n, and k indexes the classes; a back propagation gradient descent component that utilizes a stochastic gradient descent algorithm to update the weighted set of trainable parameters of the pattern recognition system based, at least in part, upon the cross entropy error; and a trainable parameter update component that updates the weighted set of trainable parameters of the pattern recognition system.
-
-
31. A training system for a pattern recognition system, comprising:
-
means for accepting an input; means for preprocessing the input; means for outputting a preprocessed pattern; means for performing pattern recognition utilizing a convolutional neural network that receives the preprocessed pattern, sub-samples every other position of at least a portion of the preprocessed pattern with a weighted set of trainable parameters utilized for the position sub-sampled and a next two adjacent positions in all dimensions, and provides a plurality of class probability outputs; means for calculating a cross entropy error based, at least in part, upon the plurality of class probability outputs and training class information utilizing the equation where E is the energy to be minimized, n indexes a pattern, t is the target value, ykn is the pattern recognition output on unit k for pattern n, and k indexes the classes; and
,means for updating the weighted set of trainable parameters of the means for performing pattern recognition, the means for updating utilizing a stochastic gradient descent algorithm to update the trainable parameters of the pattern recognition system based, at least in part, upon the cross entropy error.
-
Specification