Method and system for semantic searching
First Claim
Patent Images
1. A computer-implemented method for performing a semantic search in a natural language text, the method comprising:
- performing, by a processor, a syntactico-semantic analysis of a natural language text to produce a plurality of semantic structures representing a plurality of sentences of the natural language text, wherein the syntactico-semantic analysis of a sentence of the natural language text comprises;
generating a plurality of relationships between lexical units of the sentence,producing a graph of generalized constituents of the sentence,generating a plurality of syntactic trees based on the graph of generalized constituents,selecting an optimal syntactic tree in view of determined non-tree links,producing a semantic structure based on the selected syntactic tree, andstoring the semantic structure in a memory;
producing, using the plurality of semantic structures, an index indicative of positions, within the natural language text, of certain values of at least one of morphological parameters, syntactic parameters, lexical parameters, or semantic parameters, and wherein the index further comprises weight values associated with the parameter values;
searching, using the index, the natural language text for text fragments satisfying a search query;
estimating relevance of search results based on weight values specified by the index; and
ranking search results according to estimated relevance.
5 Assignments
0 Petitions
Accused Products
Abstract
A method and system for facilitating a semantic search based on one or more corpuses of natural language texts are provided. One or more corpuses of natural language texts are received including indexed linguistic parameters and semantic structures of lexical units. The linguistic parameters and semantic structures are generated during a preliminary syntactico-semantic analysis. Searching for text fragments satisfying a query in the one or more corpuses is performed. Relevance of the search results is estimated.
156 Citations
30 Claims
-
1. A computer-implemented method for performing a semantic search in a natural language text, the method comprising:
-
performing, by a processor, a syntactico-semantic analysis of a natural language text to produce a plurality of semantic structures representing a plurality of sentences of the natural language text, wherein the syntactico-semantic analysis of a sentence of the natural language text comprises; generating a plurality of relationships between lexical units of the sentence, producing a graph of generalized constituents of the sentence, generating a plurality of syntactic trees based on the graph of generalized constituents, selecting an optimal syntactic tree in view of determined non-tree links, producing a semantic structure based on the selected syntactic tree, and storing the semantic structure in a memory; producing, using the plurality of semantic structures, an index indicative of positions, within the natural language text, of certain values of at least one of morphological parameters, syntactic parameters, lexical parameters, or semantic parameters, and wherein the index further comprises weight values associated with the parameter values; searching, using the index, the natural language text for text fragments satisfying a search query; estimating relevance of search results based on weight values specified by the index; and ranking search results according to estimated relevance. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27)
-
-
28. A system for performing a semantic search in a natural language text, the system comprising one or more processors coupled to a memory, the processors configured to:
-
perform a syntactico-semantic analysis of a natural language text to produce a plurality of semantic structures representing a plurality of sentences of the natural language text, wherein the syntactico-semantic analysis of a sentence of the natural language text comprises; generating a plurality of relationships between lexical units of the sentence, producing a graph of generalized constituents of the sentence, generating a plurality of syntactic trees based on the graph of generalized constituents, selecting an optimal syntactic tree in view of determined non-tree links, producing a semantic structure based on the selected syntactic tree, and storing the semantic structure in the memory; produce, using the plurality of semantic structures, an index indicative of positions, within the natural language text, of certain values of at least one of morphological parameters, syntactic parameters, lexical parameters, or semantic parameters, and wherein the index further comprises weight values associated with the parameter values; search, using the index, the natural language text for text fragments satisfying a search query; estimate relevance of search results based on weight values specified by the index; and rank search results according to estimated relevance. - View Dependent Claims (29)
-
-
30. A non-transitory computer-readable storage medium having machine instructions stored therein, that when executed by a processor, cause the processor to:
-
performing a syntactico-semantic analysis of a natural language text to produce a plurality of semantic structures representing a plurality of sentences of the natural language text, wherein the syntactico-semantic analysis of a sentence of the natural language text comprises; generating a plurality of relationships between lexical units of the sentence, producing a graph of generalized constituents of the sentence, generating a plurality of syntactic trees based on the graph of generalized constituents, selecting an optimal syntactic tree in view of determined non-tree links, producing a semantic structure based on the selected syntactic tree, and storing the semantic structure in the memory; producing, using the plurality of semantic structures, an index indicative of positions, within the natural language text, of certain values of at least one of morphological parameters, syntactic parameters, lexical parameters, or semantic parameters, and wherein the index further comprises weight values associated with the parameter values; searching, using the index, the natural language text for text fragments satisfying a search query; estimating relevance of search results based on weight values specified by the index; and ranking search results according to estimated relevance.
-
Specification