Image processing using multiple aspect ratios
First Claim
Patent Images
1. A method, comprising:
- obtaining an image;
identifying a candidate location in the image that likely contains a text character;
determining candidate features for the candidate location;
downscaling the image into a first downscaled image having a first resolution and a first aspect ratio;
downscaling the image into a second downscaled image having a second resolution and a second aspect ratio, wherein the first resolution is different from the second resolution, and the first aspect ratio is different than the second aspect ratio;
generating first contextual image gradient features for the first downscaled image comprising first magnitude values and first angles;
generating second contextual image gradient features for the second downscaled image comprising second magnitude values and second angles;
normalizing the first magnitude values features to a uniform scale, producing normalized first contextual image gradient features;
normalizing the second magnitude values to the uniform scale, producing normalized second contextual image gradient features;
combining the normalized first contextual image gradient features, the normalized second contextual image gradient features, and the candidate features; and
determining that the candidate location contains at least one text character, using the combined features and at least one classifier model,wherein the first and second magnitude values are approximated, and the first and second angles are quantized.
1 Assignment
0 Petitions
Accused Products
Abstract
A system to recognize text, objects, or symbols in a captured image using machine learning models reduces computational overhead by generating a plurality of thumbnail versions of the image at different downscaled resolutions and aspect ratios, and then processing the downscaled images instead of the entire image, or sections of the entire image. The downscaled images are processed to produce a combine feature vector characterizing the overall image. The combined feature vector is processed using the machine learning model.
-
Citations
17 Claims
-
1. A method, comprising:
-
obtaining an image; identifying a candidate location in the image that likely contains a text character; determining candidate features for the candidate location; downscaling the image into a first downscaled image having a first resolution and a first aspect ratio; downscaling the image into a second downscaled image having a second resolution and a second aspect ratio, wherein the first resolution is different from the second resolution, and the first aspect ratio is different than the second aspect ratio; generating first contextual image gradient features for the first downscaled image comprising first magnitude values and first angles; generating second contextual image gradient features for the second downscaled image comprising second magnitude values and second angles; normalizing the first magnitude values features to a uniform scale, producing normalized first contextual image gradient features; normalizing the second magnitude values to the uniform scale, producing normalized second contextual image gradient features; combining the normalized first contextual image gradient features, the normalized second contextual image gradient features, and the candidate features; and determining that the candidate location contains at least one text character, using the combined features and at least one classifier model, wherein the first and second magnitude values are approximated, and the first and second angles are quantized.
-
-
2. A computing device comprising:
-
at least one processor; a memory including instruction operable to be executed by the at least one processor to perform a set of actions to configure the at least one processor to; downscale an image into a first downscaled image having a first resolution and a first aspect ratio; downscale the image into a second downscaled image having a second resolution that is different than the first resolution and a second aspect ratio that is different than the first aspect ratio; generate first image gradient features for the first downscaled image, wherein the instructions to generate the first image gradient features include instructions to; determine an X gradient for a pixel of the first downscaled image, determine a Y gradient for the pixel; approximate a magnitude of a gradient vector associated with the pixel within the first downscaled image based on the X gradient and the Y gradient, and assign the gradient vector a quantized angle value; generate second image gradient features for the second downscaled image; concatenate the first image gradient features and the second image gradient features; and process the concatenated image gradient features to identify an object in the image or determine a characteristic of the image. - View Dependent Claims (3, 4, 5, 6, 7, 8, 9)
-
-
10. A non-transitory computer-readable storage medium storing processor-executable instructions for controlling a computing device, comprising program code to configure the computing device to:
-
downscale an image into a first downscaled image having a first resolution and a first aspect ratio; downscale the image into a second downscaled image having a second resolution that is different than the first resolution and a second aspect ratio that is different than the first aspect ratio; generate first image gradient features for the first downscaled image, wherein the program code to generate the first image gradient features further configuring the computing device to; determine an X gradient for a pixel of the first downscaled image, determine a Y gradient for the pixel; approximate a magnitude of a gradient vector associated with the pixel within the first downscaled image based on the X gradient and the Y gradient, and assign the gradient vector a quantized angle value; generate second image gradient features for the second downscaled image; concatenate the first image gradient features and the second image gradient features; and process the concatenated image gradient features to identify an object in the image or determine a characteristic of the image. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17)
-
Specification