Font attributes for font recognition and similarity
First Claim
1. In a digital medium environment to recognize a font in rendered text in an image or determine similarity of the font in the rendered text in the image to other fonts, a method implemented by a computing device, the method comprising:
- predicting, automatically and without user intervention by the computing device, a bounding box for the rendered text in the image using a model that is trained using machine learning as applied to a plurality of training images having text rendered using the font;
normalizing coordinates of boundaries of the rendered text using the font in the plurality of training images; and
generating, by the computing device, an indication of the predicted bounding box, the indication specifying a region of the image that includes the rendered text having the font to be recognized.
0 Assignments
0 Petitions
Accused Products
Abstract
Font recognition and similarity determination techniques and systems are described. In a first example, localization techniques are described to train a model using machine learning (e.g., a convolutional neural network) using training images. The model is then used to localize text in a subsequently received image, and may do so automatically and without user intervention, e.g., without specifying any of the edges of a bounding box. In a second example, a deep neural network is directly learned as an embedding function of a model that is usable to determine font similarity. In a third example, techniques are described that leverage attributes described in metadata associated with fonts as part of font recognition and similarity determinations.
-
Citations
20 Claims
-
1. In a digital medium environment to recognize a font in rendered text in an image or determine similarity of the font in the rendered text in the image to other fonts, a method implemented by a computing device, the method comprising:
-
predicting, automatically and without user intervention by the computing device, a bounding box for the rendered text in the image using a model that is trained using machine learning as applied to a plurality of training images having text rendered using the font; normalizing coordinates of boundaries of the rendered text using the font in the plurality of training images; and generating, by the computing device, an indication of the predicted bounding box, the indication specifying a region of the image that includes the rendered text having the font to be recognized. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. In a digital medium environment to recognize a font in rendered text in an image or determine similarity of the font in the rendered text in the image to other fonts, a system implemented by at least one computing device, the system comprising:
-
a training set generation module implemented at least partially in hardware of a computing device to; generate a training image and a font collection that includes additional training images, and render text in the training image and the additional training images in the font collection using a selection of fonts, the rendered text including one or more perturbations; and a machine learning module implemented at least partially in hardware of the computing device to obtain the training image and the font collection and train a model to predict, automatically and without user intervention, bounding boxes using machine learning applied to the training image and the font collection. - View Dependent Claims (10, 11, 12, 13, 14, 15)
-
-
16. In a digital medium environment to recognize a font in rendered text in an image or determine similarity of the font in the rendered text in the image to other fonts, a system implemented by at least one computing device, the system comprising:
-
means for obtaining a plurality of training images having a text rendered using a font; means for training a model to; predict, automatically and without user intervention, bounding boxes for text in images, the model trained using machine learning as applied to the plurality of training images having text rendered using the font, and normalize coordinates of boundaries of the text rendered using the font in the plurality of training images; means for predicting, automatically and without user intervention, a bounding box for the rendered text in the image using the model; and means for generating an indication of the predicted bounding box, the indication specifying a region of the image that includes the rendered text having a font to be recognized. - View Dependent Claims (17, 18, 19, 20)
-
Specification