Method and system for searching for information on a network in response to an image query sent by a user from a mobile communications device
First Claim
1. A computer-implemented method for searching for information on a network in response to an image query sent by a user, comprising:
- under control of one or more computer systems configured with executable instructions,receiving an image query sent from a mobile communications device, the image query including at least one image taken using a camera of the mobile communications device;
processing the at least one image to detect any text present in the at least one image and to determine geometry information pertaining to the text, the geometry information including at least one of position and geometry information for the text in the at least one image;
automatically recognizing the text from at least one portion of the at least one image corresponding to the geometry information of the image;
determining matches for the detected text in at least one domain database on the network, the at least one domain database selected based at least in part upon the image query by;
removing words in the text that are not present in a pre-identified dictionary;
performing an N-gram match by counting a number of N-grams in common between the text and each field of every entry in the at least one domain database, the at least one domain database being selected from various domain databases;
normalizing a count of the matching N-grams to return a score for each field of every entry in the at least one domain database;
determining weighted combinations of the scores from multiple fields to compute a final score for every entry in the at least one domain database;
ranking entries in the at least one domain database based on the final scores; and
retrieving relevant information for one or more top-ranked entries in the at least one domain database; and
sending one or more of the identified matches to the mobile device for display to the user.
2 Assignments
0 Petitions
Accused Products
Abstract
Image-based searching for information on a network is provided in response to an image query sent by a user. The image query includes an image captured using a mobile communications device with a camera. The image is processed to detect any text present in the image, and any detected text can be analyzed using a process such as optical character recognition (OCR). The analyzed text is used to search for matches in at least one corresponding domain database, selected from various domain databases present in the network. Thereafter, one or more selected matches and any additional related information can be sent to the user as one or more results for the submitted image query.
116 Citations
15 Claims
-
1. A computer-implemented method for searching for information on a network in response to an image query sent by a user, comprising:
under control of one or more computer systems configured with executable instructions, receiving an image query sent from a mobile communications device, the image query including at least one image taken using a camera of the mobile communications device; processing the at least one image to detect any text present in the at least one image and to determine geometry information pertaining to the text, the geometry information including at least one of position and geometry information for the text in the at least one image; automatically recognizing the text from at least one portion of the at least one image corresponding to the geometry information of the image; determining matches for the detected text in at least one domain database on the network, the at least one domain database selected based at least in part upon the image query by; removing words in the text that are not present in a pre-identified dictionary; performing an N-gram match by counting a number of N-grams in common between the text and each field of every entry in the at least one domain database, the at least one domain database being selected from various domain databases; normalizing a count of the matching N-grams to return a score for each field of every entry in the at least one domain database; determining weighted combinations of the scores from multiple fields to compute a final score for every entry in the at least one domain database; ranking entries in the at least one domain database based on the final scores; and retrieving relevant information for one or more top-ranked entries in the at least one domain database; and sending one or more of the identified matches to the mobile device for display to the user. - View Dependent Claims (2, 3, 4, 5)
-
6. A computer-implemented method for searching for information on a network in response to an image query sent by a user, comprising:
under control of one or more computer systems configured with executable instructions, receiving an image query from a user, the image query including at least one image captured by the user; detecting a presence of text in the at least one image and extracting geometry information corresponding to the text, the geometry information including at least one of a position of the detected text in the at least one image and a size of the detected text; transforming the image to improve a quality of the image; recognizing the detected text in at least one portion of the image using an optical character recognition (OCR) process, the at least one portion determined at least in part using the geometry information; matching the recognized text to a plurality of words in a domain database based at least in part on an N-gram based matching algorithm by; removing words in the text that are not present in a pre-identified dictionary; performing an N-gram match by counting a number of N-grams in common between the text and each field of every entry in at least one domain database, the at least one domain database being selected from various domain databases; normalizing a count of the matching N-grams to return a score for each field of every entry in the at least one domain database; determining weighted combinations of the scores from multiple fields to compute a final score for every entry in the at least one domain database; ranking entries in the at least one domain database based on the final scores; and retrieving relevant information for one or more top-ranked entries in the at least one domain database; and sending one or more of the words matching the recognized text to the user as identified matches.
-
7. A system for searching for information on a network in response to an image query sent by a user, the system comprising:
-
a communication server operable to receive an image query from a user, the image query including at least one image captured by a user and transmitted using a mobile communications device; an image processing module operable to process the at least one image to determine text geometry information of the image corresponding to text detected in the image, the image processing module further operable to recognize the text in at least one portion of the at least one image determined based at least in part upon the text geometry information; and a matching engine operable to determine matches for the recognized text on at least one domain database for a network by; removing words in the text that are not present in a pre-identified dictionary; performing an N-gram match by counting a number of N-grams in common between the text and each field of every entry in at least one domain database, the at least one domain database being selected from various domain databases; normalizing a count of the matching N-grams to return a score for each field of every entry in the at least one domain database; determining weighted combinations of the scores from multiple fields to compute a final score for every entry in the at least one domain database; ranking entries in the at least one domain database based on the final scores; and retrieving relevant information for one or more top-ranked entries in the at least one domain database, wherein the communication server is further operable to send one or more of the determined matches to the user in response to the image query. - View Dependent Claims (8, 9, 10, 11)
-
-
12. A non-transitory computer-readable storage medium having a computer readable program code embodied therein for searching for information on a network in response to an image query sent by a user, the computer readable program code when executed by at least one processor causing the at least one processor to:
-
receive an image query sent from a mobile communications device, the image query including at least one image; process the at least one image to detect any text present in the at least one image and to determine geometry information pertaining to the text, the geometry information including at least one of position and geometry information for the detected text; automatically recognize the text from at least one portion of the at least one image corresponding to the geometry information of the image; determine matches for the detected text in at least one domain database on the network, the at least one domain database selected based at least in part upon the image query by; removing words in the text that are not present in a pre-identified dictionary; performing an N-gram match by counting a number of N-grams in common between the text and each field of every entry in the at least one domain database, the at least one domain database selected from various domain databases; normalizing a count of the matching N-grams to return a score for each field of every entry in the at least one domain database; determining weighted combinations of the scores from multiple fields to compute a final score for every entry in the at least one domain database; ranking entries in the domain database based on the final scores; and retrieving relevant information for one or more top-ranked entries in the at least one domain database; and send one or more of the identified matches to the mobile device for display to the user. - View Dependent Claims (13, 14, 15)
-
Specification