Database assisted OCR for street scenes and other images
First Claim
1. A method for constructing a database storing one or more street scene images, each street scene image associated with GPS location data identifying an actual location of the street scene image, the method executed by a computer system and comprising:
- storing in the database of the computer system, a plurality business names, each business name associated with a street address and one or more keywords;
estimating a street address for a street scene image of the street scene images using the GPS location data associated with that street scene image;
identifying a target address range based on the estimated street address;
querying the database to identify a business name associated with a street address in the target address range and to identify the one or more keywords associated with the identified business name;
performing optical character recognition of the street scene image to recognize text contained in the street scene image, wherein the optical character recognition of the street scene is constrained by at least one of the identified keywords resulting from querying the database;
determining if the recognized text contained in the street scene image corresponds to at least one of the identified keywords;
updating the estimated street address in the database including the GPS location data in response to a determination that the recognized text corresponds to at least one of the identified keywords.
2 Assignments
0 Petitions
Accused Products
Abstract
Optical character recognition (OCR) for images such as a street scene image is generally a difficult problem because of the variety of fonts, styles, colors, sizes, orientations, occlusions and partial occlusions that can be observed in the textual content of such scenes. However, a database query can provide useful information that can assist the OCR process. For instance, a query to a digital mapping database can provide information such as one or more businesses in a vicinity, the street name, and a range of possible addresses. In accordance with an embodiment of the present invention, this mapping information is used as prior information or constraints for an OCR engine that is interpreting the corresponding street scene image, resulting in much greater accuracy of the digital map data provided to the user.
-
Citations
36 Claims
-
1. A method for constructing a database storing one or more street scene images, each street scene image associated with GPS location data identifying an actual location of the street scene image, the method executed by a computer system and comprising:
-
storing in the database of the computer system, a plurality business names, each business name associated with a street address and one or more keywords; estimating a street address for a street scene image of the street scene images using the GPS location data associated with that street scene image; identifying a target address range based on the estimated street address; querying the database to identify a business name associated with a street address in the target address range and to identify the one or more keywords associated with the identified business name; performing optical character recognition of the street scene image to recognize text contained in the street scene image, wherein the optical character recognition of the street scene is constrained by at least one of the identified keywords resulting from querying the database; determining if the recognized text contained in the street scene image corresponds to at least one of the identified keywords; updating the estimated street address in the database including the GPS location data in response to a determination that the recognized text corresponds to at least one of the identified keywords. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method for performing image analysis of a street scene image, the street scene image associated with GPS location data identifying an actual location of the street scene image, the method executed by a computer system, and comprising:
-
querying a database comprising a plurality of street addresses, each street address associated with image features, to identify an image feature that is present in a street scene image of a street address, the street scene image comprising a plurality of address locations and at least two of the plurality of address locations having known GPS data; performing image analysis of the street scene image to recognize features in the street scene image, wherein the image analysis of the street scene is constrained by the identified image feature from querying the database, and the identified image feature is associated with a street address from the database; determining if the recognized features in the street scene image correspond to the identified image feature associated with the street address from the database; and
in response to determining that the recognized features correspond to the identified image feature associated with the street address from querying the database, updating the street address in the database including the GPS location data. - View Dependent Claims (9, 10, 11, 12, 13)
-
-
14. A method for performing optical character recognition (OCR) of an image, the image associated with GPS location data identifying an actual location of the image, the method executed by a computer and comprising:
-
storing in a database a plurality of images, each image associated with one or more keywords; querying the database to select at least one image; performing optical character recognition of the selected image to recognize text that may be contained in the image, wherein the optical character recognition of the image is constrained by at least one of the keywords associated with the image in the database; determining if the recognized text contained in the image corresponds to the at least one of the keywords associated with the image; and in response to determining that the recognized text contained in the image corresponds to at least one of the keywords associated with the image, updating a street address to be associated with the GPS location data, the street address being associated with the image. - View Dependent Claims (15, 16, 17, 18, 19)
-
-
20. A computer program product, comprising a computer-readable storage medium containing computer executable instructions for controlling a computer system to perform the operations of:
-
storing in a database of the computer system a plurality of business names, each business name associated with a street address and one or more keywords; estimating a street address for a street scene image using data associated with that street scene image, the street scene image associated with GPS location data identifying an actual location of the street scene image; identifying a target address range based on the estimated street address of the target GPS location; querying the database with the target address range to identify a business name associated with a street address in the target address range and to identify the one or more keywords associated with the identified business name; performing optical character recognition of the street scene image to recognize text contained in the street scene image, wherein the optical character recognition of the street scene is constrained by at least one of the identified keywords resulting from querying the database; determining if the recognized text contained in the street scene image corresponds to at least one of the identified keywords; updating the estimated street address including the GPS location data in response to a determination that the recognized text corresponds to at least one of the identified keywords. - View Dependent Claims (21, 22, 23, 24, 25)
-
-
26. A computer program product, comprising a computer-readable storage medium containing computer executable instructions for controlling a computer system to perform the operations of:
-
querying a database comprising a plurality of street addresses, each street address associated with image features, to identify an image feature that is present in a street scene image of a street address of the street addresses, the street scene image comprising a plurality of address locations and at least two of the plurality of address locations having known GPS data; performing image analysis of the street scene image to recognize features in the street scene image, wherein the image analysis of the street scene is constrained by the identified image feature from querying the database, and the identified image feature is associated with a street address from the database; determining if the recognized features in the street scene image correspond to the identified image feature associated with the street address from the database; and in response to determining that the recognized features correspond to the identified image feature associated with the street address from querying the database, updating the street address in the database including GPS location data associated with GPS location data, the GPS location data identifying an actual location of the street stress. - View Dependent Claims (27, 28)
-
-
29. A computer program product, comprising a computer-readable storage medium containing computer executable instructions for controlling a computer system to perform the operations of:
-
storing in a database a plurality of images, each image associated with one or more keywords and GPS location data identifying an actual location of the image; querying the database to select at least one image; performing optical character recognition of the selected image to recognize text that may be contained in the image, wherein the optical character recognition of the image is constrained by at least one of the keywords associated with the image in the database; determining if the recognized text contained in the image corresponds to the at least one of the keywords associated with the image; and in response to determining that the recognized text contained in the image corresponds to at least one of the keywords associated with the image, updating a street address in the database to be associated with the GPS data, the street address being associated with image. - View Dependent Claims (30)
-
-
31. A computer system for constructing a database storing one or more street scenes, each street scene image associated with GPS location data identifying an actual location of the street scene image, comprising:
-
a database storing a plurality of business names and street addresses, each business name associated with a street address of the street addresses and one or more keywords; a target address range estimator module coupled to the database and adapted to; estimate a street address of the street addresses for the street scene image using data associated with that street scene image; a target GPS calculator module coupled to the target address range estimator module and adapted to identify a target address range based on the estimated street address of the target GPS location; an image registration module coupled to the database and the target address range estimator module and adapted to query the database to identify a business name associated with a street address in the target address range and to identify the one or more keywords associated with the identified business name; an optical character recognition module coupled to the target address range estimator module and the image registration module and adapted to; perform optical character recognition of the street scene image to recognize text contained in the street scene image, wherein the optical character recognition of the street scene is constrained by at least one of the identified keywords resulting from querying the database; and determining if the recognized text contained in the street scene image corresponds to at least one of the identified keywords; and the image registration module adapted to update the estimated street address in the database including the GPS location data in response to a determination that the recognized text corresponds to at least one of the identified keywords. - View Dependent Claims (32)
-
-
33. A computer system for performing image analysis of a street scene image using a database, the street scene image associated with GPS location data identifying an actual location of the street scene image, comprising:
-
an image registration module adapted to query the database comprising a plurality of street addresses, each street address associated with image features, to identify an image feature that is present in the street scene image of the street address, the street scene image comprising a plurality of address locations and at least two of the plurality of address locations having known GPS data; an optical character recognition module coupled to the image registration module and adapted to; perform image analysis of the street scene image to recognize features in the street scene image, wherein the image analysis of the street scene is constrained by the identified image feature from querying the database, and the identified image feature is associated with the street address from the database; and determine if the recognized features in the street scene image correspond to the identified image feature associated with the street address from the database; and the image registration module, in response to determining that the recognized features correspond to the identified image feature associated with the street address from querying the database, adapted to update the street address in the database including the GPS location data, the street address associated with the identified image feature.
-
-
34. A computer system for performing optical character recognition (OCR) of an image using a database, comprising:
-
the database storing a plurality of images, each image associated with one or more keywords, and GPS location data to identifying an actual location of the images; an image registration module coupled to the database and adapted to query the database to select at least one image; and an OCR module coupled to the image registration module and adapted to; perform OCR of the selected image to recognize text that may be contained in the image, wherein the OCR of the image is constrained by at least one of the keywords associated with the image in the database; determine if the recognized text contained in the image corresponds to the at least one of the keywords associated with the image; and in response to determining that the recognized text contained in the image corresponds to at least one of the keywords associated with the image, assigning an actual GPS location to a street address associated with an image of the plurality of images; and
updating the street address in the database to be associated with the GPS location data.
-
-
35. A method for constructing a database storing one or more street scene images, each street scene image associated with GPS location data identifying an actual location of the one or more street scene images, the method executed by a computer system, comprising:
-
storing in the database of the computer system, a plurality business names, each business name associated with a street address and one or more keywords; estimating a street address for a street scene image of the one or more street scene images using the GPS location data associated with the one or more street scene images; identifying a target address range based on the estimated street address; querying the database to identify a business name of the plurality of business names associated with a street address in the target address range, and to identify the one or more keywords associated with the identified business name; performing optical character recognition of the one or more street scene images to recognize text contained in the one or more street scene images, wherein the optical character recognition of the one or more street scene images is constrained by at least one of the identified keywords resulting from querying the database; determining if the recognized text contained in the one or more street scene images corresponds to at least one of the identified keywords; and updating the business name associated with the street address in the database including the GPS location data of the street scene image.
-
-
36. A method for constructing a database storing one or more street scene images, each street scene image associated with GPS location data identifying an actual location of the one or more street scene images, the method executed by a computer system, comprising:
-
storing in the database of the computer system, a plurality business names, each business name associated with a street address and one or more keywords; estimating a street address for a street scene image of the one or more street scene images using the GPS location data associated with the one or more street scene images; identifying a target address range based on the estimated street address; querying the database to identify a business name of the plurality of business names associated with a street address in the target address range, and to identify the one or more keywords associated with the identified business name; performing optical character recognition of the one or more street scene images to recognize text contained in the one or more street scene images, wherein the optical character recognition of the one or more street scene images is constrained by at least one of the identified keywords resulting from querying the database; determining if the recognized text contained in the one or more street scene images corresponds to at least one of the identified keywords; and updating the one or more street scene images in database including the business name associated with the street address in response to determination that the recognized text corresponds to at least on of the identified keywords.
-
Specification