Segment sensitive query matching
First Claim
Patent Images
1. A method comprising:
- with one or more special purpose computing devices;
processing one or more search query terms submitted to a search engine via a user interface;
processing labeled portions indicative of a plurality of content quality scores for a plurality of segmented portions of a web page,wherein at least one of the plurality of content quality scores is based, at least in part, upon a classification of a corresponding segmented portion of the plurality of segmented portions according to a type of content of the corresponding segmented portion and without regard to subject matter topic of the content of the corresponding segmented portion;
calculating at least one weighted content quality score for the at least one of the plurality of content quality scores based, at least in part, on at least one measure of frequency of at least one term in the corresponding segmented portion matching the one or more search query terms and at least one measure of a length in words of the corresponding segmented portion;
determining whether a query match exists between the web page and the one or more search query terms based, at least in part, on the labeled portions including the at least one weighted content quality score, and the one or more search query terms; and
initiating transmission to the user interface of at least a portion of a result of the determination.
9 Assignments
0 Petitions
Accused Products
Abstract
Exemplary techniques are provided which may be implemented using various methods, apparatuses, and/or articles of manufacture to provide or otherwise support segment sensitive query matching based on segmented portions of web pages and/or providing related information for use in information extraction and/or information retrieval systems. In certain example implementations techniques may be provided for determining whether a query match exists between a document and obtained query terms based, at least in part, on labeled portion information associated with a plurality of segmented portions of a document.
33 Citations
16 Claims
-
1. A method comprising:
with one or more special purpose computing devices; processing one or more search query terms submitted to a search engine via a user interface; processing labeled portions indicative of a plurality of content quality scores for a plurality of segmented portions of a web page, wherein at least one of the plurality of content quality scores is based, at least in part, upon a classification of a corresponding segmented portion of the plurality of segmented portions according to a type of content of the corresponding segmented portion and without regard to subject matter topic of the content of the corresponding segmented portion; calculating at least one weighted content quality score for the at least one of the plurality of content quality scores based, at least in part, on at least one measure of frequency of at least one term in the corresponding segmented portion matching the one or more search query terms and at least one measure of a length in words of the corresponding segmented portion; determining whether a query match exists between the web page and the one or more search query terms based, at least in part, on the labeled portions including the at least one weighted content quality score, and the one or more search query terms; and initiating transmission to the user interface of at least a portion of a result of the determination. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
9. An apparatus comprising:
-
memory; and at least one processing unit implemented, at least in part, by hardware to; process one or more search query terms to be submitted to a search engine via a user interface; process labeled portions to be indicative of a plurality of content quality scores for a plurality of segmented portions of a web page, wherein at least one of the plurality of content quality scores is to be based, at least in part, upon a classification of a corresponding segmented portion of the plurality of segmented portions according to a type of content of the corresponding segmented portion and without regard to subject matter topic of the content of the corresponding segmented portion; calculate at least one weighted content quality score for the at least one of the plurality of content quality scores based, at least in part, on at least one measure of frequency of at least one term in the corresponding segmented portion matching the one or more search query terms and at least one measure of a length in words of the corresponding segmented portion; determine whether a query match exists between the web page and the one or more search query terms based, at least in part, on the labeled portions including the at least one weighted content quality score, and the one or more search query terms; and initiate transmission to the user interface of at least a portion of a result of the determination. - View Dependent Claims (10, 11, 12)
-
-
13. An article comprising a non-transitory computer readable medium having computer implementable instructions stored thereon which are executable by one or more processing units in a computing device to:
-
process one or more search query terms to be submitted to a search engine via a user interface; process labeled portions to be indicative of a plurality of content quality scores for a plurality of segmented portions of a web page, wherein at least one of the plurality of content quality scores is to be based, at least in part, upon a classification of a corresponding segmented portion of the plurality of segmented portions according to a type of content of the corresponding segmented portion and without regard to subject matter topic of the content of the corresponding segmented portion; calculate at least one weighted content quality score for the at least one of the plurality of content quality scores based, at least in part, on at least one measure of frequency of at least one term in the corresponding segmented portion matching the one or more search query terms and at least one measure of a length in words of the corresponding segmented portion; determine whether a query match exists between the web page and the one or more search query terms based, at least in part, on the labeled portions including the at least one weighted content quality score, and the one or more search query terms; and initiate transmission to the user interface of at least a portion of a result of the determination. - View Dependent Claims (14, 15, 16)
-
Specification