Inference indexing
First Claim
1. A computer-implemented method of facilitating generation of an inference index using a computing system having processor, memory, and data storage subsystems, the computer-implemented method comprising:
- referencing a canonical entity that is associated with one or more web documents;
identifying a plurality of queries that, when input, result in a selection of at least one web document of the one or more web documents associated with the canonical entity;
generating, via the processor, an entity document for the canonical entity, the entity document including the plurality of identified queries and a representation of the at least one web document, wherein the plurality of identified queries resulted in the selection of the at least one web document corresponding with the canonical entity comprising a unique representation of an entity;
generating an inference index using the canonical entity and the entity document along with other related canonical entities and corresponding entity documents, the inference index corresponding with a knowledge domain of related canonical entities; and
utilizing the inference index in response to a real-time user query provided by a user after generation of the inference index to select a particular canonical entity that is most related to the real-time user query, the particular canonical entity comprising a unique representation of an entity that indicates a person or place and that is selected based on a cumulative score associated with the selected canonical entity being greater than cumulative scores associated with one or more other canonical entities within the inference index, wherein each of the cumulative scores for the canonical entities comprises an aggregate of entity document scores within the corresponding canonical entity, each entity document score calculated based on a frequency of at least a portion of the real-time user query occurring within queries of the corresponding entity document.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and media are provided for facilitating generation of an inference index. In embodiments, a canonical entity is referenced. The canonical entity is associated with web documents. One or more queries that, when input, result in a selection of at least one of the web documents are identified. An entity document is generated for the canonical entity. The entity document includes the identified queries and/or associated text from the content of a document or from an entity title that result in the selection of the at least one of the web documents. The entity document and corresponding canonical entity can be combined with additional related entity documents and canonical entities to generate an inference index.
-
Citations
10 Claims
-
1. A computer-implemented method of facilitating generation of an inference index using a computing system having processor, memory, and data storage subsystems, the computer-implemented method comprising:
-
referencing a canonical entity that is associated with one or more web documents; identifying a plurality of queries that, when input, result in a selection of at least one web document of the one or more web documents associated with the canonical entity; generating, via the processor, an entity document for the canonical entity, the entity document including the plurality of identified queries and a representation of the at least one web document, wherein the plurality of identified queries resulted in the selection of the at least one web document corresponding with the canonical entity comprising a unique representation of an entity; generating an inference index using the canonical entity and the entity document along with other related canonical entities and corresponding entity documents, the inference index corresponding with a knowledge domain of related canonical entities; and utilizing the inference index in response to a real-time user query provided by a user after generation of the inference index to select a particular canonical entity that is most related to the real-time user query, the particular canonical entity comprising a unique representation of an entity that indicates a person or place and that is selected based on a cumulative score associated with the selected canonical entity being greater than cumulative scores associated with one or more other canonical entities within the inference index, wherein each of the cumulative scores for the canonical entities comprises an aggregate of entity document scores within the corresponding canonical entity, each entity document score calculated based on a frequency of at least a portion of the real-time user query occurring within queries of the corresponding entity document. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A computer-readable media, wherein the computer-readable media is one or more computer storage devices having computer-executable instructions embodied thereon that when executed by a computing device, perform a method, the method comprising:
-
referencing a canonical entity that is associated with one or more web documents; identifying a plurality of queries that, when input, result in a selection of at least one web document of the one or more web documents associated with the canonical entity; generating, via the processor, an entity document for the canonical entity, the entity document including the plurality of identified queries and a representation of the at least one web document, wherein the plurality of identified queries resulted in the selection of the at least one web document corresponding with the canonical entity comprising a unique representation of an entity; generating an inference index using the canonical entity and the entity document along with other related canonical entities and corresponding entity documents, the inference index corresponding with a knowledge domain of related canonical entities; and utilizing the inference index in response to a real-time user query provided by a user after generation of the inference index to select a particular canonical entity that is most related to the real-time user query, the particular canonical entity comprising a unique representation of an entity that indicates a person or place and that is selected based on a cumulative score associated with the selected canonical entity being greater than cumulative scores associated with one or more other canonical entities within the inference index, wherein each of the cumulative scores for the canonical entities comprises an aggregate of entity document scores within the corresponding canonical entity, each entity document score calculated based on a frequency of at least a portion of the real-time user query occurring within queries of the corresponding entity document. - View Dependent Claims (7, 8, 9, 10)
-
Specification