Indexing role hierarchies for words in a search index
Abstract
Methods, systems and computer-readable media are provided for finding documents in a data store that match a natural language query submitted by a user. Documents and queries are matched by determining that words within the query have the same relationship to each other as the same words in the document. Documents are semantically analyzed, and each word in a document is indexed along with the role it plays in its sentence. The initial semantic role may be generalized using a role hierarchy and stored in the index along with the original role. A similar analysis may be applied to the search query to find words used in the same role in both the query and the document.
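The generalization step described in the abstract can be illustrated with a minimal sketch. The role names, the `ROLE_PARENT` table, and the functions `generalize` and `index_sentence` are all hypothetical and chosen only for illustration; the abstract does not specify a hierarchy or an index layout.

```python
# Hypothetical role hierarchy: each role maps to its more general parent.
# None marks the root. These role names are assumptions, not from the patent.
ROLE_PARENT = {
    "role": None,
    "participant": "role",
    "agent": "participant",
    "theme": "participant",
}


def generalize(role):
    """Return the role followed by each of its ancestors, most specific first."""
    chain = []
    while role is not None:
        chain.append(role)
        role = ROLE_PARENT[role]
    return chain


def index_sentence(index, doc_id, sentence_id, word_roles):
    """Store each word under its original role and every generalized role.

    `index` maps (word, role) to a set of (doc_id, sentence_id) postings.
    `word_roles` is a list of (word, role) pairs assumed to come from an
    upstream semantic analysis that is not shown here.
    """
    for word, role in word_roles:
        for r in generalize(role):
            index.setdefault((word, r), set()).add((doc_id, sentence_id))
    return index
```

Under these assumptions, indexing the pair `("dog", "agent")` also creates postings for `("dog", "participant")` and `("dog", "role")`, so a query that only knows the more general role can still reach the sentence.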
20 Claims
1. One or more computer-readable media having computer-executable instructions embodied thereon for performing a method of expanding and indexing relationships between words within a sentence, the method comprising:
receiving a first electronic document having content that includes text;
breaking the text into one or more sentences;
determining a first grammatical role for a word in the sentence;
determining that the word has more than one potential grammatical role;
assigning a second grammatical role to the word, wherein the second grammatical role is a subservient grammatical role to the first grammatical role, which is a dominant grammatical role, wherein the subservient grammatical role fits within a definition of the dominant grammatical role and has a more specific definition; and
storing the word with the first grammatical role and the second grammatical role in an index that associates the word and the sentence with the first electronic document, thereby allowing a search engine to match the first electronic document with a search query that includes similar words having similar roles.
Dependent claims: 2, 3, 4, 5, 6, 7
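The steps of claim 1 can be walked through in a minimal sketch. Everything named here is an assumption made for illustration: the `SUBSERVIENT` table, the role names, and especially `toy_potential_roles`, which is a crude stand-in for a real grammatical parser (it simply treats the first word of a sentence as a subject/agent and the last as an object/theme).

```python
import re

# Hypothetical table: dominant role -> subservient (more specific) roles.
SUBSERVIENT = {"subject": ["agent", "experiencer"], "object": ["theme"]}


def split_sentences(text):
    """Break text into sentences at whitespace following ., ! or ?."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def toy_potential_roles(sentence):
    """Stand-in for a parser: map a word to its potential roles, dominant first."""
    words = re.findall(r"[A-Za-z]+", sentence.lower())
    if len(words) < 2:
        return {}
    return {words[0]: ["subject", "agent"], words[-1]: ["object", "theme"]}


def index_document(index, doc_id, text, potential_roles=toy_potential_roles):
    """Store each word with its first role and, if available, a subservient role."""
    for sent_no, sentence in enumerate(split_sentences(text)):
        for word, roles in potential_roles(sentence).items():
            first = roles[0]
            stored = [first]
            # Only look for a second role when the word has more than one
            # potential role, and accept it only if it is subservient to the first.
            if len(roles) > 1:
                allowed = SUBSERVIENT.get(first, ())
                second = next((r for r in roles[1:] if r in allowed), None)
                if second is not None:
                    stored.append(second)
            for role in stored:
                index.setdefault((word, role), []).append((doc_id, sent_no))
    return index
```

In this sketch the index key is a `(word, role)` pair and each posting is a `(doc_id, sentence_number)` pair, which is one simple way to associate the word and the sentence with the document; the claim does not prescribe this layout.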
8. A method for matching a search query to text within one or more documents comprising:
receiving a natural language search query containing a plurality of words;
assigning a primary grammatical role to a word in the search query;
assigning an additional grammatical role to the word from the search query using a role hierarchy that defines a grammatical relationship between the primary grammatical role and the additional grammatical role, wherein grammatical roles within the role hierarchy get more specific the further grammatical roles are from a root grammatical role, and wherein the additional grammatical role fits within a definition of the primary grammatical role and has a more specific definition, wherein the additional grammatical role is a subservient grammatical role to the primary grammatical role, which is a dominant grammatical role; and
determining, at a computing device, that at least one document referenced in an index, which associates the word and a sentence within the at least one document with one or more grammatical roles, includes the word from the search query assigned to one or more of the primary grammatical role and the additional grammatical role.
Dependent claims: 9, 10, 11, 12, 13, 14
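The query-side matching of claim 8 can be sketched as follows. The `ROLE_CHILDREN` hierarchy, the role names, and the functions are hypothetical, and the index layout (a mapping from `(word, role)` to a list of `(doc_id, sentence_number)` postings) is an assumption. Requiring every query word to match is a design choice of this sketch; the claim itself only speaks of a word from the query.

```python
# Hypothetical hierarchy: role -> more specific child roles.
# Roles become more specific the further they are from the root "role".
ROLE_CHILDREN = {
    "role": ["subject", "object"],
    "subject": ["agent"],
    "object": ["theme"],
}


def roles_for_query_word(primary):
    """Return the primary role plus its subservient (child) roles."""
    return [primary] + ROLE_CHILDREN.get(primary, [])


def match_documents(index, query_word_roles):
    """Return ids of documents containing every query word in an accepted role.

    `query_word_roles` is a list of (word, primary_role) pairs, assumed to
    come from a grammatical analysis of the query that is not shown here.
    """
    result = None
    for word, primary in query_word_roles:
        docs = set()
        for role in roles_for_query_word(primary):
            docs |= {doc_id for doc_id, _ in index.get((word, role), [])}
        # Intersect so that only documents matching all words survive.
        result = docs if result is None else result & docs
    return result or set()


# Example index, also hypothetical.
EXAMPLE_INDEX = {
    ("dogs", "agent"): [("d1", 0)],
    ("dogs", "theme"): [("d2", 3)],
    ("cats", "object"): [("d1", 0)],
}
```

With the example index, a query word `dogs` tagged with the primary role `subject` reaches document `d1` through the subservient role `agent`, while the same word tagged `object` reaches `d2` through `theme`.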
15. One or more computer-storage media having computer-executable instructions embodied thereon for performing a method of indexing document contents using a role hierarchy, the method comprising:
receiving a first electronic document having content that includes text;
identifying a sentence that is part of the first electronic document;
assigning a first grammatical role to a word in the sentence;
determining that the word has more than one potential grammatical role;
assigning at least a second grammatical role to the word using the role hierarchy, which includes a hierarchy of grammatical roles including a root role that is general and at least one level of sub roles that are more specific than the root role, wherein the second grammatical role is a subservient grammatical role to the first grammatical role, which is a dominant grammatical role, and fits within a definition of the dominant grammatical role and has a more specific definition; and
storing the word with the first grammatical role and the second grammatical role in an index that associates the words and the sentence with the electronic document, thereby allowing a search engine to match the first electronic document with a search query that includes similar words having similar roles.
Dependent claims: 16, 17, 18, 19, 20
Specification