Font recognition using text localization
First Claim
1. In a digital medium environment to improve image font recognition through use of text localization, a method implemented by one or more computing devices comprising:
- obtaining a model, by the one or more computing devices, that is trained using machine learning as applied to a plurality of training images having text rendered using a corresponding font;
predicting a bounding box, automatically and without user intervention by the one or more computing devices, for text in an image received using the obtained model by forming a plurality of cropped portions of the image and processing each of the plurality of cropped portions of the image by the model independently, one to another, the text overlapping a first and second cropped portion of the plurality of cropped portions; and
generating an indication of the predicted bounding box by the one or more computing devices based on a result of the processing of each of the plurality of cropped portions of the image by calculating an average or a median of a top and bottom line of the predicted bounding box, the indication usable to specify a region of the image that includes the text having a font to be recognized.
0 Assignments
0 Petitions
Accused Products
Abstract
Font recognition and similarity determination techniques and systems are described. In a first example, localization techniques are described to train a model using machine learning (e.g., a convolutional neural network) using training images. The model is then used to localize text in a subsequently received image, and may do so automatically and without user intervention, e.g., without specifying any of the edges of a bounding box. In a second example, a deep neural network is directly learned as an embedding function of a model that is usable to determine font similarity. In a third example, techniques are described that leverage attributes described in metadata associated with fonts as part of font recognition and similarity determinations.
66 Citations
20 Claims
-
1. In a digital medium environment to improve image font recognition through use of text localization, a method implemented by one or more computing devices comprising:
-
obtaining a model, by the one or more computing devices, that is trained using machine learning as applied to a plurality of training images having text rendered using a corresponding font; predicting a bounding box, automatically and without user intervention by the one or more computing devices, for text in an image received using the obtained model by forming a plurality of cropped portions of the image and processing each of the plurality of cropped portions of the image by the model independently, one to another, the text overlapping a first and second cropped portion of the plurality of cropped portions; and generating an indication of the predicted bounding box by the one or more computing devices based on a result of the processing of each of the plurality of cropped portions of the image by calculating an average or a median of a top and bottom line of the predicted bounding box, the indication usable to specify a region of the image that includes the text having a font to be recognized. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. In a digital medium environment to improve image font recognition through use of text localization, a system comprising:
-
a text localization module implemented at least partially in hardware of at least one computing device to obtain a model that is trained using machine learning as applied to a plurality of training images having text rendered using a corresponding font; a machine learning module implemented at least partially in the hardware of the at least one computing device to predict a bounding box, automatically and without user intervention, for text in an image by forming a plurality of cropped portions of the image and processing each of the plurality of cropped portions of the image independently, one to another, the text overlapping a first and second cropped portion of the plurality of cropped portions; and the text localization module further implemented at least partially in the hardware of the at least one computing device to generate an indication of the predicted bounding box based on a result of the processing of each of the plurality of cropped portions of the image by calculating an average or a median of a top and bottom line of the predicted bounding box, the indication is usable to specify a region of the image that includes the text having a font to be recognized. - View Dependent Claims (11, 12, 13, 14)
-
-
15. In a digital medium environment to improve image font recognition through use of text localization, a system comprising:
-
means for obtaining a model that is trained using machine learning as applied to a plurality of training images having text rendered using a corresponding font; means for predicting a bounding box, automatically and without user intervention, for text in an image received using the obtained model by forming a plurality of cropped portions of the image and processing each of the plurality of cropped portions of the image by the model independently, one to another, the text overlapping a first and second cropped portion of the plurality of cropped portions; and means for generating an indication of the predicted bounding box based on a result of the processing of each of the plurality of cropped portions of the image by calculating an average or a median of a top and bottom line of the predicted bounding box, the indication is usable to specify a region of the image that includes the text having a font to be recognized. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification