Methods and apparatuses for searching content
Abstract
Embodiments of methods and apparatuses for searching content, including structured search, are described herein. Embodiments of the present invention use tree structures (or, more generally, graph structures), layout structures, and/or content category information to capture within search results relevant content that would otherwise be missed, to reduce the incidence of false positives within search results, and to improve the accuracy of rankings within search results. Embodiments of the present invention further use tree structures (or, more generally, graph structures), layout structures, and/or content category information to extend search results to include sub-document constituents. Embodiments of the present invention also support the use of distribution properties as criteria for ranking search results. Embodiments of the present invention additionally support search based on structural proximity; search expressions with recursively embedded operators, predicates, and/or quantifiers; and applications to the selection of advertisements.
56 Claims
1. A machine implemented method comprising:
receiving by a search engine, from a content searching or consuming application, an atomic search term, the search engine and the content searching or consuming application being operated on one or more different or same computing devices;
receiving a content page nominally associated with the atomic search term, or access information of the content page, by the search engine;
generating in response, by the search engine, one or more scores for one or more structures of the content page indicative of relative relevance of the content page or one or more portions of the content page to the atomic search term, wherein a structure includes substructures, wherein the generating of a score for a structure is based at least in part on a distance function and a scoring function, wherein the distance function measures distances between sub-structures within the structure being scored to facilitate determining of mutual relevance of occurrence positions, and wherein the scoring function is positionally sensitive, yielding different scores for at least some different occurrence positions of the atomic search term in substructures of the structure being scored, irrespective of substructure category memberships; and
conditionally providing or not providing the content page or one or more portions of the content page, or access information of the content page or one or more portions of the content page, to the content searching or consuming application, by the search engine, based at least in part on the generated one or more scores.
Dependent claims: 2-16.
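By way of illustration only, and not as a limitation of claim 1, the scoring step might be sketched as follows. Everything named here is an assumption made for the example: the `Node` tree, the sibling-index `distance`, the `1 / (1 + index)` positional weight, and the `threshold` used for conditional provision do not come from the claim, which leaves the distance and scoring functions open.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Node:
    """A structure of a content page; its children are its substructures."""
    text: str = ""
    children: List["Node"] = field(default_factory=list)


def distance(i: int, j: int) -> int:
    """Hypothetical distance function: gap between sibling indices."""
    return abs(i - j)


def position_weight(index: int) -> float:
    """Hypothetical positional weight: earlier substructures count more."""
    return 1.0 / (1 + index)


def score(node: Node, term: str) -> float:
    """Score a structure for an atomic search term, recursing into substructures."""
    own = float(term.lower() in node.text.lower().split())
    hits = [(i, score(child, term)) for i, child in enumerate(node.children)]
    hits = [(i, s) for i, s in hits if s > 0]
    if not hits:
        return own
    # Anchor on the best-scoring occurrence; other occurrences are damped
    # by their distance from it (mutual relevance of occurrence positions).
    anchor = max(hits, key=lambda h: h[1])[0]
    total = 0.0
    for i, s in hits:
        total += s * position_weight(i) / (1 + distance(anchor, i))
    return own + total


def should_provide(page: Node, term: str, threshold: float = 0.5) -> bool:
    """Conditionally provide the page based on its score (threshold is assumed)."""
    return score(page, term) >= threshold
```

In this sketch no substructure carries a category, so two pages that contain the term once each still score differently when the occurrence sits in a different substructure, which is the positional sensitivity the claim recites.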
17. A machine implemented method comprising:
receiving by a search engine, from a content searching or consuming application, a search expression having a first and a second proximally associated atomic sub-expression, the search engine and the content searching or consuming application being operated on one or more different or same computing devices;
receiving a content page nominally associated with the search expression, or access information of the content page, by the search engine;
generating in response, by the search engine, one or more scores for one or more structures of the content page indicative of relative relevance of the content page or one or more portions of the content page to the search expression, wherein a structure includes substructures, wherein the generating of a score for a structure is based at least in part on a distance function and a scoring function, wherein the distance function measures distances between sub-structures within the structure being scored to facilitate determining of mutual relevance of occurrence positions, and wherein the scoring function is positionally sensitive, yielding different scores for at least a first and a second situation where the distance between the occurrence positions of the first and second proximally associated atomic sub-expressions is same in both the first and second situations, when the occurrence positions of the first proximally associated atomic sub-expression have no category membership or same category membership in the first and second situations, and the occurrence positions of the second proximally associated atomic sub-expression have no category membership or same category membership in the first and second situations; and
conditionally providing or not providing the content page or one or more portions of the content page, or access information of the content page or one or more portions of the content page, to the content searching or consuming application, by the search engine, based at least in part on the generated one or more scores.
Dependent claims: 18-41.
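As a non-limiting illustration of the situation recited in claim 17, the following sketch scores two proximally associated terms over a flat list of paragraphs. The paragraph-index distance, the `1 / (1 + position)` weight, and the function name `proximity_score` are all assumptions chosen for the example rather than anything the claim specifies.

```python
from typing import List


def proximity_score(paragraphs: List[str], first: str, second: str) -> float:
    """Best score over all pairs of occurrence positions of the two terms."""
    pos_first = [i for i, p in enumerate(paragraphs) if first.lower() in p.lower().split()]
    pos_second = [i for i, p in enumerate(paragraphs) if second.lower() in p.lower().split()]
    best = 0.0
    for i in pos_first:
        for j in pos_second:
            closeness = 1.0 / (1 + abs(i - j))   # hypothetical distance function
            position = 1.0 / (1 + min(i, j))     # hypothetical positional weight
            best = max(best, closeness * position)
    return best


early = ["red apple", "green pear", "filler", "filler"]
late = ["filler", "filler", "red apple", "green pear"]
early_score = proximity_score(early, "apple", "pear")
late_score = proximity_score(late, "apple", "pear")
```

In both lists the two terms are one paragraph apart and no paragraph has a category, yet `early_score` exceeds `late_score` because the pair occurs nearer the start of the structure.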
42. A machine implemented method comprising:
receiving by a search engine, from a content searching or consuming application, a search expression having a plurality of recursively embedded sub-expressions, the search engine and the content searching or consuming application being operated on one or more different or same computing devices;
receiving a content page nominally associated with the search expression, or access information of the content page, by the search engine;
generating in response, by the search engine, one or more scores indicative of relative relevance of a content page or one or more portions of the content page to the search expression, wherein the generating by the search engine comprising recursively generating one or more scores for one or more structures indicative of relative relevance of the content page or one or more portions of the content page to each of the recursively embedded sub-expressions, wherein a structure includes substructures, wherein at least one of the recursively generating is based at least in part on a distance function, and a scoring function, wherein the distance function measures distances between sub-structures within the structure being scored to facilitate determining of mutual relevance of occurrence positions, and wherein the scoring function is positionally sensitive, yielding different scores for at least some different occurrence positions of a search sub-expression in substructures of the structure being scored, irrespective of substructure category memberships; and
conditionally providing or not providing the content page or one or more portions of the content page, or access information of the content page or one or more portions of the content page, to the content searching or consuming application, by the search engine, based at least in part on the generated one or more scores.
Dependent claims: 43-49.
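For illustration only, recursive scoring of embedded sub-expressions as in claim 42 might look like the sketch below. The tuple encoding of expressions, the `term`, `or`, and `near` operators, and the rule for combining operand scores are hypothetical choices made for the example; the claim does not prescribe an expression syntax or a combination rule.

```python
from typing import Dict, List, Tuple

Expr = Tuple  # ("term", word) | ("or", Expr, Expr) | ("near", Expr, Expr)


def weight(index: int) -> float:
    """Hypothetical positional weight: earlier paragraphs count more."""
    return 1.0 / (1 + index)


def occurrences(expr: Expr, paragraphs: List[str]) -> Dict[int, float]:
    """Recursively map occurrence positions of a sub-expression to scores."""
    op = expr[0]
    if op == "term":
        word = expr[1].lower()
        return {i: weight(i) for i, p in enumerate(paragraphs)
                if word in p.lower().split()}
    left = occurrences(expr[1], paragraphs)
    right = occurrences(expr[2], paragraphs)
    if op == "or":
        merged = dict(left)
        for i, s in right.items():
            merged[i] = max(s, merged.get(i, 0.0))
        return merged
    if op == "near":
        out: Dict[int, float] = {}
        for i, si in left.items():
            for j, sj in right.items():
                # Distance between occurrence positions damps the pair's score.
                s = min(si, sj) / (1 + abs(i - j))
                k = min(i, j)
                out[k] = max(s, out.get(k, 0.0))
        return out
    raise ValueError(f"unknown operator: {op!r}")


def score(expr: Expr, paragraphs: List[str]) -> float:
    """Score of the whole structure for the top-level expression."""
    return max(occurrences(expr, paragraphs).values(), default=0.0)
```

Each operator consumes the position-to-score maps produced for its operands, so an expression such as `("near", ("or", ("term", "flights"), ("term", "airfare")), ("term", "hotel"))` is scored from the innermost sub-expressions outward.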
50. A machine implemented method comprising:
receiving a content page and one or more advertisements or access information of the one or more advertisements, by an advertising engine operated by one or more computing devices;
determining by the advertising engine relative relevance of one or more advertisements to a content page, the determining including scoring an advertisement for relative relevance to the content page by scoring relative relevance of one or more structures of the content page to a composite search expression, wherein a structure includes substructures, and the composite search expression is formed by correspondingly associating a user search expression with a search expression associated with the advertisement being scored, wherein the generating of a score for a structure is based at least in part on a distance function and a scoring function, wherein the distance function measures distances between sub-structures within the structure being scored to facilitate determining of mutual relevance of occurrence positions, and wherein the scoring function is positionally sensitive, yielding different scores for at least some different occurrence positions of the composite search expression in substructures of the structure being scored, irrespective of substructure category memberships; and
conditionally selecting or not selecting one or more of the advertisements for rendering with the content page or one or more portions of the content page by a content searching or consuming application, based at least in part on the scoring of the one or more advertisements, the content searching or consuming application being operated on one or more different or same computing devices as the advertising engine.
Dependent claims: 51-56.
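As a non-limiting sketch of the advertisement selection in claim 50, the example below treats the composite search expression as a simple pairing of one user term with one term associated with each advertisement. That pairing, the paragraph-index distance, the positional weight, the `threshold`, and the names `pair_score` and `select_ads` are all assumptions introduced for illustration.

```python
from typing import Dict, List


def pair_score(paragraphs: List[str], user_term: str, ad_term: str) -> float:
    """Score the page against a composite (user_term, ad_term) expression."""
    pos_user = [i for i, p in enumerate(paragraphs) if user_term.lower() in p.lower().split()]
    pos_ad = [i for i, p in enumerate(paragraphs) if ad_term.lower() in p.lower().split()]
    best = 0.0
    for i in pos_user:
        for j in pos_ad:
            closeness = 1.0 / (1 + abs(i - j))   # hypothetical distance function
            position = 1.0 / (1 + min(i, j))     # hypothetical positional weight
            best = max(best, closeness * position)
    return best


def select_ads(paragraphs: List[str], user_term: str,
               ads: Dict[str, str], threshold: float = 0.2) -> List[str]:
    """Return names of ads whose composite score meets the assumed threshold,
    highest score first."""
    scored = {name: pair_score(paragraphs, user_term, term)
              for name, term in ads.items()}
    ranked = sorted(scored.items(), key=lambda kv: kv[1], reverse=True)
    return [name for name, s in ranked if s >= threshold]
```

An advertisement whose associated term never occurs near the user's term on the page scores zero and is not selected for rendering.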
Specification