Snippet extraction and ranking
First Claim
1. A computer-readable storage medium having computer executable instructions encoded thereon, the computer executable instructions for execution by a processor to perform location-related mining operations on a travelogue, the operations comprising:
- receiving a query comprising a particular location and a context term which describes the particular location;
identifying at least one snippet, the at least one snippet comprising a segment of user-generated content;
filtering out one or more biased context terms from the at least one snippet, the one or more biased context terms describing a plurality of locations;
extracting the particular location from the at least one snippet;
mining the context term which describes the particular location from the at least one snippet;
calculating a geographic relevance score for the at least one snippet based at least in part on the particular location;
calculating a semantic relevance score for the at least one snippet based at least in part on the context term which describes the particular location;
computing a snippet score based at least in part on the geographic relevance score and the semantic relevance score for the at least one snippet; and
ranking the at least one snippet and at least one other snippet based at least in part on the snippet score of the at least one snippet.
2 Assignments
0 Petitions
Accused Products
Abstract
Described herein is a technology that facilitates efficient automated mining of topic-related aspects of user-generated content based on automated analysis of the user-generated content. Locations are automatically learned based on dividing documents into document segments, and decomposing the segments into local topics and global topics. Techniques are described that facilitate automatically extracting snippets. These techniques include, for example, computer annotating travelogues with learned tags and images, performing topic learning to obtain an interest model, performing location matching based on the interest model, calculating geographic and semantic relevance scores, ranking snippets based on the geographic and semantic relevance scores, and searching snippets with a “location+context term” query.
-
Citations
20 Claims
-
1. A computer-readable storage medium having computer executable instructions encoded thereon, the computer executable instructions for execution by a processor to perform location-related mining operations on a travelogue, the operations comprising:
-
receiving a query comprising a particular location and a context term which describes the particular location; identifying at least one snippet, the at least one snippet comprising a segment of user-generated content; filtering out one or more biased context terms from the at least one snippet, the one or more biased context terms describing a plurality of locations; extracting the particular location from the at least one snippet; mining the context term which describes the particular location from the at least one snippet; calculating a geographic relevance score for the at least one snippet based at least in part on the particular location; calculating a semantic relevance score for the at least one snippet based at least in part on the context term which describes the particular location; computing a snippet score based at least in part on the geographic relevance score and the semantic relevance score for the at least one snippet; and ranking the at least one snippet and at least one other snippet based at least in part on the snippet score of the at least one snippet. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 17, 18, 19, 20)
-
-
9. A computer-implemented method comprising:
-
identifying a travelogue for location-related mining; decomposing the travelogue into a plurality of portions of the travelogue to identify a particular location and a context term, the context term characterizing the particular location without co-occurring with a plurality of locations; filtering out one or more noisy context terms that do not characterize the particular location and co-occur with a plurality of locations from at least one portion of the travelogue; extracting the particular location from the portion of the travelogue as decomposed; mining the context term which characterizes the location from the portion of the travelogue as decomposed; creating a geo-snippet comprising the particular location and the context term which characterizes the particular location; obtaining a geographic relevance score corresponding to the geo-snippet; obtaining a semantic relevance score corresponding to the geo-snippet; and computing, by a processor, a snippet score corresponding to the geo-snippet based at least in part on the geographic relevance score and the semantic relevance score. - View Dependent Claims (10, 11)
-
-
12. A computer-implemented method performed on one or more travelogues comprising:
-
identifying at least one snippet from the one or more travelogues; filtering out one or more noisy context terms from the at least one snippet, the noisy context terms co-occurring with a plurality of locations; extracting a particular location from the at least one snippet; mining a context term which characterizes the particular location from the at least one snippet; calculating a geographic relevance score for the at least one snippet based at least in part on the particular location; calculating a semantic relevance score for the at least one snippet based at least in part on the context term which characterizes the particular location; computing a snippet score based at least in part on the geographic relevance score and the semantic relevance score for the at least one snippet; ranking the at least one snippet based at least in part on the snippet score; and assembling the at least one snippet into a snippet collection. - View Dependent Claims (13, 14, 15, 16)
-
Specification