Presentation of search results based on document structure
First Claim
Patent Images
1. A method performed by one or more server devices, the method comprising:
- identifying, by a processor of the one or more server devices, a document relating to a search term,the document comprising structural elements;
locating, by a processor of the one or more server devices, occurrences of the search term in the document;
grouping, by a processor of the one or more server devices, the occurrences of the search term into at least one cluster,the grouping being based on relative locations of the occurrences of the search term, andthe grouping favoring a cluster that lies within a particular structural element, of the structural elements, over a cluster that lies within multiple structural elements of the structural elements;
identifying, by a processor of the one or more server devices, a structural element, of the structural elements, encompassing the at least one cluster; and
providing, by a processor of the one or more server devices and for presentation, information associated with the identified structural element.
2 Assignments
0 Petitions
Accused Products
Abstract
A system identifies a document relating to a search term, where the document includes a set of structural elements. The system determines a distribution of occurrences of the search term in the document, identifies one of the structural elements based on the distribution of occurrences of the search term in the document, and presents information associated with the identified structural element.
33 Citations
46 Claims
-
1. A method performed by one or more server devices, the method comprising:
-
identifying, by a processor of the one or more server devices, a document relating to a search term, the document comprising structural elements; locating, by a processor of the one or more server devices, occurrences of the search term in the document; grouping, by a processor of the one or more server devices, the occurrences of the search term into at least one cluster, the grouping being based on relative locations of the occurrences of the search term, and the grouping favoring a cluster that lies within a particular structural element, of the structural elements, over a cluster that lies within multiple structural elements of the structural elements; identifying, by a processor of the one or more server devices, a structural element, of the structural elements, encompassing the at least one cluster; and providing, by a processor of the one or more server devices and for presentation, information associated with the identified structural element. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A computer-implemented system comprising:
one or more devices to; obtain a document relating to a search term, the document comprising structural elements; identify occurrences of the search term in the document with regard to the structural elements; group occurrences of the search term into clusters based on a physical proximity of the occurrences of the search term, a first cluster, of the clusters, that lies within a particular structural element, of the structural elements, being favored over a second cluster, of the clusters, that lies within multiple structural elements of the structural elements; identify a structural element, of the structural elements, encompassing each of the clusters; and provide, for presentation, information relating to the identified structural element. - View Dependent Claims (18, 19, 20, 21)
-
22. A device comprising:
-
a memory to store a corpus of documents, at least one document, of the corpus of documents, comprising structural elements; and a processor to; receive a search query that includes a search term; identify a document in the corpus of documents that is relevant to the search term, the at least one document including the document; determine occurrences of the search term in the document based on the structural elements; group the occurrences of the search term into clusters based on a physical proximity of the occurrences of the search term, a first cluster, of the clusters, that lies within a particular structural element, of the structural elements, being favored over a second cluster, of the clusters, that lies within multiple structural elements of the structural elements; identify a structural element, of the structural elements, encompassing each of the clusters; and provide, for presentation, information associated with the identified structural element. - View Dependent Claims (23, 24, 25, 26, 27)
-
-
28. A method performed by one or more server devices, the method comprising:
-
identifying, by a processor of the one or more server devices, a document that is relevant to a search term, the document comprising structural elements; determining, by a processor of the one or more server devices, occurrences of the search term in the document; grouping, by a processor of the one or more server devices, the occurrences of the search term into clusters based on physical proximity of the occurrences of the search term, a first cluster, of the clusters, that lies within a particular structural element, of the structural elements, being favored over a second cluster, of the clusters, that lies within multiple structural elements of the structural elements; determining, by a processor of the one or more server devices, a smallest structure element, of the structural elements, that encompasses each of the clusters; and providing, by a processor of the one or more server devices, information related to the determined structural element. - View Dependent Claims (29, 30)
-
-
31. A computer-implemented system comprising:
one or more devices to; identify a document that is relevant to a search term, the document comprising structural elements; identify occurrences of the search term in the document; group the occurrences of the search term into clusters based on a threshold that specifies how close in physical proximity two occurrences of the search term have to be to be clustered, the threshold being proportional to a number of pages in the document; determine a structural element, of the structural elements, encompassing each of the clusters; and provide, for presentation, information related to the structural element. - View Dependent Claims (32)
-
33. A method performed by one or more server devices, the method comprising:
-
identifying, by a processor of the one or more server devices, a document that is relevant to a search term, the document comprising structural elements, where the structural elements comprise the document, a set of parts of the document, and a set of pages of the document; identifying, by a processor of the one or more server devices, a tree representation of the document, where the pages of the document correspond to leaf nodes, the parts of the document correspond to higher level nodes, and the document corresponds to a root node; assigning, by a processor of the one or more server devices, scores to the leaf nodes based on whether the leaf nodes contain occurrences of the search term; determining, by a processor of the one or more server devices, scores for the higher level nodes based on the scores of associated ones of the leaf nodes; determining, by a processor of the one or more server devices, a score for the root node based on the scores of the higher level nodes; providing, by a processor of the one or more server devices, a threshold, where the threshold is based on at least one of; a number of pages associated with one of the leaf nodes, a number of pages associated with one of the higher level nodes, or a number of pages associated with the root node; selecting, by a processor of the one or more server devices, one of the leaf nodes, one of the higher level nodes, or the root node, as a selected node, based on the scores and the threshold; and providing, by a processor of the one or more server devices, information relating to the selected node. - View Dependent Claims (34, 35, 36, 37, 38, 39, 40)
-
-
41. A computer-implemented system comprising:
one or more devices to; identify a document that is relevant to a search term; identify a tree representation of the document, where pages of the document correspond to leaf nodes, parts of the document correspond to higher level nodes, and the document corresponds to a root node; assign scores to the leaf nodes based on whether the leaf nodes contain occurrences of the search term; determine scores for the higher level nodes based on the scores of associated ones of the leaf nodes; determine a score for the root node based on the scores of the higher level nodes; provide a threshold, where the threshold is threshold is based on at least one of; a number of pages associated with one of the leaf nodes, a number of pages associated with one of the higher level nodes, or a number of pages associated with the root node; select one of the leaf nodes, one of the higher level nodes, or the root node, as a selected node, based on the scores and the threshold; and provide information relating to the selected node. - View Dependent Claims (42, 43)
-
44. A non-transitory computer-readable storage medium containing computer-executable instructions, the instructions comprising:
one or more instruction which, when executed by a processor of a device, cause the processor to; determine that a document is relevant to a search term, the document comprising structural elements; determine occurrences of the search term in the document; group the occurrences of the search term into clusters based on physical proximity of the occurrences of the search term, a first cluster, of the clusters, that lies within a particular structural element, of the structural elements, being favored over a second cluster, of the clusters, that lies within multiple structural elements of the structural elements; determine a structural element, of the structural elements, encompassing each of the clusters; and provide, for presentation, information relating to the determined structural element. - View Dependent Claims (45, 46)
Specification