Method and system for semantic searching of natural language texts
First Claim
1. A computer-implemented method for facilitating a semantic search based on one or more previously analyzed and indexed corpuses of natural language texts, the method comprising:
- identifying the one or more previously analyzed and indexed corpuses of natural language texts, each corpus including a plurality of sentences and indexes where indexes include indexes of meanings of linguistic parameters of the sentences and of their lexical units and their relations;
wherein the linguistic parameters comprise parameters determined based on syntactico-semantic analysis of at least one sentence in the corpuses; and
the syntactico-semantic analysis of the at least one sentence comprisesperforming a rough syntactic analysis of the at least one sentence generating a graph of generalized constituents of the at least one sentence;
performing a precise syntactic analysis on the graph of generalized constituents by generating one or more syntactic trees and determining a syntactic structure of the at least one sentence,wherein the performing of the precise syntactic analysis comprises assessing parts of the one or more syntactic trees using prior andstatistical ratings, andgenerating the one or more syntactic trees in an order of decreasing assessment;
semantically analyzing the syntactic structure of the sentence generating a language-independent semantic structure of the at least one sentence; and
generating one or more of the parameters based on the language-independent semantic structure of the at least one sentence;
creating a query for searching sentences satisfying the query;
searching for sentences of the previously analyzed and indexed corpuses satisfying the query; and
displaying the sentences satisfying the query.
3 Assignments
0 Petitions
Accused Products
Abstract
A method and system comprising an automated analysis of at least one corpus of natural language text is disclosed. For each sentence of a corpus, the analysis includes performing a syntactic analysis using linguistic descriptions to generate at least one syntactic structure for the sentence, building a semantic structure for the sentence, associating each generated syntactic and semantic structure with the sentence, and saving each generated syntactic and semantic structure. For each corpus text that was preliminary analyzed, performing an indexing operation to index lexical meanings and values of linguistic parameters of each syntactic structure and each semantic structure associated with sentences in the corpus text. A semantic search as disclosed herein includes at least one automatic preliminary analyzed corpus of sentences comprising searched values of linguistic, syntactic and semantic parameters. Due to deep semantic analysis of one or more corpora, the search may be executed in various languages, in resources of various languages, and in text corpora of various languages, regardless of the language of the query.
143 Citations
28 Claims
-
1. A computer-implemented method for facilitating a semantic search based on one or more previously analyzed and indexed corpuses of natural language texts, the method comprising:
-
identifying the one or more previously analyzed and indexed corpuses of natural language texts, each corpus including a plurality of sentences and indexes where indexes include indexes of meanings of linguistic parameters of the sentences and of their lexical units and their relations; wherein the linguistic parameters comprise parameters determined based on syntactico-semantic analysis of at least one sentence in the corpuses; and
the syntactico-semantic analysis of the at least one sentence comprisesperforming a rough syntactic analysis of the at least one sentence generating a graph of generalized constituents of the at least one sentence; performing a precise syntactic analysis on the graph of generalized constituents by generating one or more syntactic trees and determining a syntactic structure of the at least one sentence, wherein the performing of the precise syntactic analysis comprises assessing parts of the one or more syntactic trees using prior and statistical ratings, and generating the one or more syntactic trees in an order of decreasing assessment; semantically analyzing the syntactic structure of the sentence generating a language-independent semantic structure of the at least one sentence; and generating one or more of the parameters based on the language-independent semantic structure of the at least one sentence; creating a query for searching sentences satisfying the query; searching for sentences of the previously analyzed and indexed corpuses satisfying the query; and displaying the sentences satisfying the query. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. One or more non-transitory computer-readable media having computer-executable instructions, which when executed perform steps, comprising:
-
identifying one or more previously analyzed and indexed corpuses of natural language texts, each corpus including a plurality of sentences and indexes where indexes include indexes of meanings of linguistic parameters of the sentences and of their lexical units and their relations; wherein the linguistic parameters comprise semantic parameters determined based on syntactico-semantic analysis of at least one sentence in the corpuses; and the syntactico-semantic analysis of the at least one sentence comprises performing a rough syntactic analysis of the at least one sentence generating a graph of generalized constituents of the at least one sentence; performing a precise syntactic analysis on the graph of generalized constituents by generating one or more syntactic trees and determining a syntactic structure of the at least one sentence, wherein the performing of the precise syntactic analysis comprises assessing parts of the one or more syntactic trees using prior and statistical ratings, and generating the one or more syntactic trees in an order of decreasing assessment; semantically analyzing the syntactic structure of the sentence generating a language-independent semantic structure of the at least one sentence; generating one or more of the parameters based on the language-independent semantic structure of the at least one sentence; creating a query for searching sentences satisfying the query; searching for sentences of the previously analyzed and indexed corpuses satisfying the query; and displaying the sentences satisfying the query. - View Dependent Claims (16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28)
-
Specification