Method and system for semantic searching using syntactic and semantic analysis
First Claim
1. A computer-implemented method, comprising:
- preliminarily analyzing at least one corpus of natural language text comprising for each sentence of each natural language text of the corpusperforming syntactic analysis using linguistic descriptions to generate at least one syntactic structure for the sentence, byperforming a rough syntactic analysis on each sentence of each natural language text of the corpus to generate a graph of generalized constituents for each sentence,performing a precise syntactic analysis on the graph of generalized constituents to generate one or more syntactic trees to represent a corresponding sentence of a corresponding natural language text of the corpus, andselecting a best syntactic tree from the generated one or more syntactic trees to represent the at least one syntactic structure of the sentence; and
building from the at least one syntactic structure of the sentence a language-independent semantic structure for the sentence thereby associating each generated syntactic structure and language-independent semantic structure with the sentence, wherein the semantic structure comprises semantic classes, semantemes, and deep slots; and
performing additionally for each corpus preliminarily analyzed, an indexing operation to index lexical meanings and values of linguistic parameters of each syntactic structure and each language-independent semantic structure associated with sentences of the corpus.
4 Assignments
0 Petitions
Accused Products
Abstract
In one embodiment, there is provided a computer-implemented method and system for implementing the method. The method comprises: preliminarily analyzing at least one corpus of natural language text comprising for each sentence of each natural language text of the corpus, performing syntactic analysis using linguistic descriptions to generate at least one syntactic structure for the sentence; building a semantic structure for the sentence; associating each generated syntactic and semantic structure with the sentence; and saving each generated syntactic and semantic structure; for each corpus of natural language text that was preliminarily analyzed, performing an indexing operation to index lexical meanings and values of linguistic parameters of each syntactic structure and each semantic structure associated with sentences in the corpus; and searching in at least one preliminarily analyzed corpora for sentences comprising searched values for the linguistic parameters.
139 Citations
19 Claims
-
1. A computer-implemented method, comprising:
-
preliminarily analyzing at least one corpus of natural language text comprising for each sentence of each natural language text of the corpus performing syntactic analysis using linguistic descriptions to generate at least one syntactic structure for the sentence, by performing a rough syntactic analysis on each sentence of each natural language text of the corpus to generate a graph of generalized constituents for each sentence, performing a precise syntactic analysis on the graph of generalized constituents to generate one or more syntactic trees to represent a corresponding sentence of a corresponding natural language text of the corpus, and selecting a best syntactic tree from the generated one or more syntactic trees to represent the at least one syntactic structure of the sentence; and building from the at least one syntactic structure of the sentence a language-independent semantic structure for the sentence thereby associating each generated syntactic structure and language-independent semantic structure with the sentence, wherein the semantic structure comprises semantic classes, semantemes, and deep slots; and performing additionally for each corpus preliminarily analyzed, an indexing operation to index lexical meanings and values of linguistic parameters of each syntactic structure and each language-independent semantic structure associated with sentences of the corpus. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system, comprising:
-
a processor; and a memory coupled to the processor, the memory storing instructions which when executed by the processor cause system to perform a method, comprising preliminarily analyzing at least one corpus of natural language text comprising for each sentence of each natural language text of the corpus performing syntactic analysis using linguistic descriptions to generate at least one syntactic structure for the sentence, by performing a rough syntactic analysis on each sentence of each natural language text of the corpus to generate a graph of generalized constituents for each sentence, performing a precise syntactic analysis on the graph of generalized constituents to generate one or more syntactic trees to represent a corresponding sentence of a corresponding natural language text of the corpus, and selecting a best syntactic tree from the generated one or more syntactic trees to represent the at least one syntactic structure of the sentence; building from the at least one syntactic structure of the sentence a language-independent semantic structure for the sentence thereby associating each generated syntactic structure and language-independent semantic structure with the sentence, wherein the semantic structure comprises semantic classes, semantemes, and deep slots; and performing additionally for each corpus preliminarily analyzed, an indexing operation to index lexical meanings and values of linguistic parameters of each syntactic structure and each language-independent semantic structure associated with sentences of the corpus. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A non-transitory computer-readable medium having stored thereon a sequence of instructions which when executed by a system, cause the system to perform a method, comprising:
-
preliminarily analyzing at least one corpus of natural language text comprising for each sentence of each natural language text of the corpus performing syntactic analysis using linguistic descriptions to generate at least one syntactic structure for the sentence, by performing a rough syntactic analysis on each sentence of each natural language text of the corpus to generate a graph of generalized constituents for each sentence, performing a precise syntactic analysis on the graph of generalized constituents to generate one or more syntactic trees to represent a corresponding sentence of a corresponding natural language text of the corpus, and selecting a best syntactic tree from the generated one or more syntactic trees to represent the at least one syntactic structure of the sentence; building from the at least one syntactic structure of the sentence a language-independent a language-independent semantic structure for the sentence thereby associating each generated syntactic structure and language-independent semantic structure with the sentence, wherein the semantic structure comprises semantic classes, semantemes, and deep slots; and performing additionally for each corpus preliminarily analyzed an indexing operation to index lexical meanings and values of linguistic parameters of each syntactic structure and each language-independent semantic structure associated with sentences of the corpus. - View Dependent Claims (18, 19)
-
Specification