Classification of ambiguous geographic references
Abstract
A location classifier generates location information based on textual strings in input text. The location information defines potential geographical relevance of the input text. In determining the location information, the location classifier may receive at least one geo-relevance profile associated with at least one string in the input text, obtain a combined geo-relevance profile for the document from the at least one geo-relevance profile, and determine geographical relevance of the input text based on the combined geo-relevance profile.
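The abstract does not say how the per-string profiles are combined. As a minimal sketch, assume each geo-relevance profile is a mapping from area name to a weight, and that profiles are combined by multiplying weights area by area and then normalizing. The function name `combine_profiles`, the `floor` parameter, and all sample weights are hypothetical and chosen only for illustration.

```python
def combine_profiles(profiles, floor=0.01):
    """Combine per-string geo-relevance profiles into one profile.

    Assumption (not stated in the source): profiles are combined by
    multiplying weights per area. An area missing from a profile gets a
    small `floor` weight so one silent string does not zero it out.
    """
    if not profiles:
        raise ValueError("at least one profile is required")
    areas = set().union(*profiles)  # every area named by any profile
    combined = {}
    for area in areas:
        score = 1.0
        for profile in profiles:
            score *= profile.get(area, floor)
        combined[area] = score
    total = sum(combined.values())
    # Normalize so the combined weights sum to 1.
    return {area: score / total for area, score in combined.items()}


# Invented example profiles for two strings found in the same input text.
bay_bridge = {"San Francisco": 0.6, "Annapolis": 0.4}
castro = {"San Francisco": 0.7, "Havana": 0.3}

combined = combine_profiles([bay_bridge, castro])
best_area = max(combined, key=combined.get)
```

Under these assumptions, an area supported by both strings dominates the combined profile, while areas supported by only one string are strongly discounted.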
17 Claims
1. A method comprising:
identifying, by a device, a first phrase in a document as being geographically significant, the first phrase being identified as geographically significant based on previous occurrences of the first phrase being determined to be statistically significant to first geographic information;
identifying, by the device, a second phrase in the document as being geographically significant, the second phrase being identified as geographically significant based on previous occurrences of the second phrase being determined to be statistically significant to second geographic information;
determining, by the device, that the first phrase is associated with a first plurality of geographic areas;
determining, by the device, that the second phrase is associated with a second plurality of geographic areas;
determining, by the device, that a geographic area of the first plurality of geographic areas matches a geographic area of the second plurality of geographic areas;
associating, by the device and based on determining that the geographic area of the first plurality of geographic areas matches the geographic area of the second plurality of geographic areas, the document with a particular geographic area, the particular geographic area corresponding to the geographic area of the first plurality of geographic areas and the geographic area of the second plurality of geographic areas;
storing, by the device, information indicating the association of the document with the particular geographic area;
generating, based on located geographic information associated with a phrase in a respective document, a histogram for the phrase; and
storing the generated histogram.
Dependent claims: 2, 3, 4, 5, 6
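The matching and associating steps of claim 1 can be illustrated with a set intersection. This is a sketch under stated assumptions, not the claimed implementation: the lookup table `PHRASE_AREAS`, the helper names, and the sample areas are hypothetical, and the table is assumed to have been built beforehand from earlier occurrences of each phrase. Phrase detection is reduced to a case-insensitive substring search.

```python
from itertools import combinations

# Hypothetical lookup: phrase -> candidate geographic areas.
# Assumed to be precomputed from previous occurrences; values are invented.
PHRASE_AREAS = {
    "springfield": {"Illinois", "Massachusetts", "Missouri"},
    "lincoln home": {"Illinois", "Kentucky"},
}


def find_significant_phrases(text, phrase_areas):
    """Return the known phrases that appear in the text (substring match)."""
    lowered = text.lower()
    return [phrase for phrase in phrase_areas if phrase in lowered]


def associate_document(doc_id, text, phrase_areas, store):
    """Associate a document with an area shared by two ambiguous phrases.

    Returns the chosen area, or None when no pair of phrases shares one.
    """
    phrases = find_significant_phrases(text, phrase_areas)
    for first, second in combinations(phrases, 2):
        shared = phrase_areas[first] & phrase_areas[second]
        if shared:
            area = sorted(shared)[0]  # deterministic pick if several match
            store[doc_id] = area      # record the association
            return area
    return None


associations = {}
chosen = associate_document(
    "doc1", "Visit the Lincoln Home in Springfield.", PHRASE_AREAS, associations
)
```

Each phrase alone is ambiguous across several areas; the intersection keeps only the area both phrases agree on, which is then stored against the document.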
7. A device comprising:
a memory to store instructions; and
a processor to execute the instructions to:
identify a first phrase in a document as being geographically significant, the first phrase being identified as geographically significant based on previous occurrences of the first phrase being determined to be statistically significant to first geographic information;
identify a second phrase in the document as being geographically significant, the second phrase being identified as geographically significant based on previous occurrences of the second phrase being determined to be statistically significant to second geographic information;
receive information indicating that the first phrase is associated with a first plurality of geographic areas;
receive information indicating that the second phrase is associated with a second plurality of geographic areas;
determine that a geographic area of the first plurality of geographic areas matches a geographic area of the second plurality of geographic areas;
associate, based on determining that the geographic area of the first plurality of geographic areas matches the geographic area of the second plurality of geographic areas, the document with a particular geographic area, the particular geographic area corresponding to the geographic area of the first plurality of geographic areas and the geographic area of the second plurality of geographic areas;
store information indicating the association of the document with the particular geographic area, the stored information permitting a determination to be made as to whether the document is relevant to the particular geographic area;
generate, based on located geographic information associated with a phrase in a respective document, a histogram for the phrase; and
store the generated histogram.
Dependent claims: 8, 9, 10, 11, 12
13. A non-transitory computer-readable medium storing instructions, the instructions comprising:
one or more instructions which, when executed by a processor of a device, cause the processor to:
receive information identifying a first phrase in a document as being geographically significant, the first phrase being identified as geographically significant based on previous occurrences of the first phrase being determined to be statistically significant to first geographic information;
receive information identifying a second phrase in the document as being geographically significant, the second phrase being identified as geographically significant based on previous occurrences of the second phrase being determined to be statistically significant to second geographic information;
determine that the first phrase is associated with a first plurality of geographic areas;
determine that the second phrase is associated with a second plurality of geographic areas;
determine that a geographic area of the first plurality of geographic areas matches a geographic area of the second plurality of geographic areas;
associate, based on determining that the geographic area of the first plurality of geographic areas matches the geographic area of the second plurality of geographic areas, the document with a particular geographic area, the particular geographic area corresponding to the geographic area of the first plurality of geographic areas and the geographic area of the second plurality of geographic areas;
store information indicating the association of the document with the particular geographic area, the stored information permitting a determination to be made that the document is related to the particular geographic area;
generate, based on located geographic information associated with a phrase in a respective document, a histogram for the phrase; and
store the generated histogram.
Dependent claims: 14, 15, 16, 17
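The final two steps shared by claims 1, 7, and 13 generate and store a histogram for a phrase from documents whose geographic information has already been located. A minimal sketch follows, assuming each located document is a `(text, area)` pair and that the histogram counts, per area, the documents containing the phrase. The function name `build_histogram`, the in-memory `histogram_store`, and the sample documents are hypothetical; the claims do not specify the bin type or the storage format.

```python
from collections import Counter


def build_histogram(phrase, located_docs):
    """Count, per geographic area, the located documents that mention phrase.

    located_docs: iterable of (text, area) pairs, where `area` is the
    geographic information already located for that document (assumed input).
    """
    histogram = Counter()
    needle = phrase.lower()
    for text, area in located_docs:
        if needle in text.lower():
            histogram[area] += 1
    return histogram


# Invented sample documents with already-located areas.
docs = [
    ("Bay Bridge traffic update", "San Francisco"),
    ("The bay bridge is closed tonight", "San Francisco"),
    ("Bay Bridge charity run", "Annapolis"),
    ("Downtown parking changes", "Denver"),
]

histogram_store = {}  # hypothetical stand-in for persistent storage
histogram_store["bay bridge"] = build_histogram("bay bridge", docs)
```

A histogram concentrated in a few areas suggests that the phrase is geographically significant, while a flat one suggests it is not; the threshold for that decision is left open here.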
Specification