×

Compact models for object recognition

  • US 10,706,267 B2
  • Filed: 01/12/2018
  • Issued: 07/07/2020
  • Est. Priority Date: 01/12/2018
  • Status: Active Grant
First Claim
Patent Images

1. An apparatus for object recognition, comprising:

  • a processor;

    memory in electronic communication with the processor; and

    instructions stored in the memory and executable by the processor to cause the apparatus to;

    obtain a two-dimensional array of pixels representing an image;

    apply a first convolutional operation to the two-dimensional array of pixels to generate a plurality of input channels;

    perform a set of processing operations on the plurality of input channels, the set of processing operations comprising;

    applying a second convolutional operation to the plurality of input channels to generate a second plurality of input channels;

    dividing the second plurality of input channels into first channel groups, wherein each input channel of the second plurality of input channels is associated with a single first channel group;

    performing a feature selection operation for each first channel group to generate a first plurality of intermediate channels, wherein each intermediate channel is associated with a respective channel group of the first channel groups; and

    applying a third convolutional operation to the first plurality of intermediate channels, wherein the third convolutional operation comprises a first operation applied to each intermediate channel to generate a plurality of feature maps followed by a second operation applied across the plurality of feature maps to generate a first plurality of output channels;

    applying the second convolutional operation to the plurality of output channels to generate a third plurality of input channels;

    dividing the third plurality of input channels into second channel groups, wherein each input channel of the third plurality of input channels is associated with a single second channel group;

    performing the feature selection operation for each second channel group to generate a second plurality of intermediate channels, wherein each intermediate channel of the second plurality of intermediate channels is associated with a respective channel group of the second channel groups; and

    applying a pooling function to the second plurality of intermediate channels to generate a second plurality of output channels; and

    recognize an object in the image based at least in part on the first plurality of output channels and the second plurality of output channels.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×