Finding Related Entities For Search Queries
First Claim
1. A computer-implemented search system, comprising:
- an indexing component for storing and indexing document entities of a documents, the documents associated with corresponding document identifiers;
a document search component for processing a keyword query and returning document identifiers of documents associated with results of the query; and
a retrieval component for retrieving the document entities based on the document identifiers.
2 Assignments
0 Petitions
Accused Products
Abstract
Architecture for finding related entities for web search queries. An extraction component takes a document as input and outputs all the mentions (or occurrences) of named entities such as names of people, organizations, locations, and products in the document, as well as entity metadata. An indexing component takes a document identifier (docID) and the set of mentions of named entities and, stores and indexes the information for retrieval. A document-based search component takes a keyword query and returns the docIDs of the top documents matching with the query. A retrieval component takes a docID as input, accesses the information stored by the indexing component and returns the set of mentions of named entities in the document. This information is then passed to an entity scoring and thresholding component that computes an aggregate score of each entity and selects the entities to return to the user.
111 Citations
20 Claims
-
1. A computer-implemented search system, comprising:
-
an indexing component for storing and indexing document entities of a documents, the documents associated with corresponding document identifiers; a document search component for processing a keyword query and returning document identifiers of documents associated with results of the query; and a retrieval component for retrieving the document entities based on the document identifiers. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A computer-implemented method of searching, comprising:
-
extracting occurrences of the entities and entity metadata from web documents; storing and indexing the entities and entity metadata in association with corresponding document identifiers; processing a query and returning document identifiers associated with document results of the query; retrieving a set of the entity occurrences based on the document identifiers; generating scores for the entities; and selecting the entities with the highest scores. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19)
-
-
20. A computer-implemented system, comprising:
-
computer-implemented means for extracting occurrences of the entities and entity metadata in a web documents; computer-implemented means for storing and indexing the entities and entity metadata in association with corresponding document identifiers; computer-implemented means for processing a query and returning document identifiers associated with document results of the query; computer-implemented means for retrieving a set of the occurrences based on the document identifiers; computer-implemented means for generating scores for the entities; and computer-implemented means for selecting the entities with the highest scores.
-
Specification