INDEXING ROLE HIERARCHIES FOR WORDS IN A SEARCH INDEX
First Claim
1. One or more computer-readable media having computer-executable instructions embodied thereon for performing a method of expanding and indexing a semantic relationships between words within a sentence, the method comprising:
- receiving a first electronic document having content that includes text;
breaking the text into one or more sentences;
determining a first role for a word in the sentence;
determining that the word has more than one potential role;
assigning a second role to the word; and
storing the word with the first role and the second role in an index that associates the word and the sentence with the first electronic document, thereby allowing a search engine to match the first electronic document with a search query that includes similar words having similar roles.
4 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems and computer readable media for finding documents in a data store that match a natural language query submitted by a user are provided. The documents and queries are matched by determining that words within the query have the same relationship to each other as the same words in the document. Documents are semantically analyzed and words in the document are indexed along with the role the word plays in a sentence. The initial semantic role may be generalized using a role hierarchy and stored in the index along with the original role. A similar analysis may be used with the search query to find words used in the same role in both the query and the document.
-
Citations
20 Claims
-
1. One or more computer-readable media having computer-executable instructions embodied thereon for performing a method of expanding and indexing a semantic relationships between words within a sentence, the method comprising:
-
receiving a first electronic document having content that includes text; breaking the text into one or more sentences; determining a first role for a word in the sentence; determining that the word has more than one potential role; assigning a second role to the word; and storing the word with the first role and the second role in an index that associates the word and the sentence with the first electronic document, thereby allowing a search engine to match the first electronic document with a search query that includes similar words having similar roles. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computerized system for matching a search query to text within a one or more documents comprising:
-
a first semantic interpretation component for (1) assigning a primary role to words in the search query; (2) assigning an additional role to at least one word from the search query having the primary role using a role hierarchy; a matching component for determining that at least one document referenced in an index includes a word from the search query, wherein the word in the at least one document and the word in the search query are assigned at least one of the same roles. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. One or more computer-storage media having computer-executable instructions embodied thereon for performing a method of indexing document contents using a role hierarchy, the method comprising:
-
receiving a sentence that is part of a document; assigning a role to words in the sentence; assigning at least one additional role to the words using the role hierarchy, wherein the role hierarchy includes a hierarchy of roles including a root role that is general and at least one level of sub roles that are more specific than the root role, thereby assigning at least one word at least two roles; and storing the words with and one or more roles assigned to individual words in an index that associates the words and the sentence with the document. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification