Web mining to build a landmark database and applications thereof
First Claim
Patent Images
1. A computer-implemented method for building a landmark database, comprising:
- receiving web data including a web page from one or more websites via one or more networks;
interpreting the web data using at least one processor to extract landmark data describing a landmark from the web data, the landmark data including a name that identifies the landmark and a portion of a street address where the landmark is located;
applying one or more error checking rules to remove inaccurate landmark data from the landmark data;
sending a search query to an image search service based on the landmark data;
receiving a set of image search results from the image search service;
filtering using one or more image checkers to remove non-photographic images from the set of image search results;
generating a visual model using the filtered image search results;
sending, to a geocoding service, a request with the name of the landmark and the portion of the street address where the landmark is located;
receiving, in response to the request, a location of the landmark from the geocoding service, the location determined based on the name of the landmark and the portion of the street address where the landmark is located; and
recognizing the landmark in an image different from the images in the filtered image search results by correlating the image with the visual model, and the location.
2 Assignments
0 Petitions
Accused Products
Abstract
This invention relates to building a landmark database from web data. In one embodiment, a computer-implemented method builds a landmark database. Web data including a web page is received from one or more websites via one or more networks. The web data is interpreted using at least one processor to determine landmark data describing a landmark. At least a portion of the landmark data identifies a landmark. Finally, a visual model is generated using the landmark data. A computing device is able to recognize the landmark in an image based on the visual model.
-
Citations
20 Claims
-
1. A computer-implemented method for building a landmark database, comprising:
-
receiving web data including a web page from one or more websites via one or more networks; interpreting the web data using at least one processor to extract landmark data describing a landmark from the web data, the landmark data including a name that identifies the landmark and a portion of a street address where the landmark is located; applying one or more error checking rules to remove inaccurate landmark data from the landmark data; sending a search query to an image search service based on the landmark data; receiving a set of image search results from the image search service; filtering using one or more image checkers to remove non-photographic images from the set of image search results; generating a visual model using the filtered image search results; sending, to a geocoding service, a request with the name of the landmark and the portion of the street address where the landmark is located; receiving, in response to the request, a location of the landmark from the geocoding service, the location determined based on the name of the landmark and the portion of the street address where the landmark is located; and recognizing the landmark in an image different from the images in the filtered image search results by correlating the image with the visual model, and the location.
-
-
2. A computer-implemented method for building a landmark database, comprising:
-
receiving web data including a web page comprising a first image of a landmark and text from one or more websites via one or more networks; interpreting the web data using at least one processor to extract landmark data describing the landmark in the first image from the text in the web data, wherein the landmark data comprises landmark name data that identifies the landmark; applying one or more error checking rules to the landmark name data to remove inaccurate landmark name data, wherein the applying comprises checking name length of the landmark name data and capitalization of the landmark name data; geocoding a location of the landmark in the first image by sending a request to a geocoding service, the request based on the landmark name data; sending a search query to an image search service, the search query based on the landmark name data; receiving a set of image search results from the image search service; filtering using one or more image checkers to remove images of landmarks obstructed by one or more faces, non-photographic images, and images of maps from the set of image search results; generating a visual model for the landmark using the filtered image search results; and recognizing the landmark in a target image different from images in the filtered image search results by correlating the target image with the visual model and the location of the landmark. - View Dependent Claims (3, 4, 5, 6, 7, 8, 9, 10, 19)
-
-
11. A system for building a landmark database, comprising:
-
one or more processors; a web data retriever module implemented on at least one computing device configured to receive web data including a web page comprising a first image of a landmark from one or more websites via one or more networks; a web data interpreter module implemented on the at least one computing device configured to interpret the web data to extract landmark data describing the landmark in the first image from text in the web data, wherein the landmark data comprises landmark name data that identifies a landmark; an error checker module implemented on the at least one computing device configured to apply one or more error checking rules to the landmark name data to remove inaccurate landmark name data, wherein the error checking rules comprise checking name length of landmark name data and checking capitalization of landmark name data; a geocoding module implemented on the at least one computing device configured to geocode a location of the landmark in the first image by sending a request to a geocoding service, the request based on the landmark name data; a visual model generator implemented on the at least one computing device configured to send a search query to an image search service based on the landmark name data, receive a set of image search results from the image search service, filter the received image search results using one or more image checkers to remove images of landmarks obstructed by one or more faces, non-photographic images, and images of maps, and generate a visual model using the filtered image search results; and a landmark recognition engine implemented on the at least one computing device configured to recognize the landmark in a target image different from the images in the filtered image search results by correlating the target image with the visual model and the location of the landmark. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 20)
-
Specification