METHOD AND SYSTEM FOR SEMANTIC SEARCHING
First Claim
1. A computer-implemented method for facilitating a semantic search, the method comprising:
- identifying a corpora of natural language texts including a plurality of sentences;
performing a syntactic analysis on each sentence using a linguistic description associated with a language of the sentence;
generating, based at least in part on the syntactic analysis, at least one syntactic structure for each sentence;
generating a semantic structure for each sentence of the corpora of natural language texts;
associating each generated syntactic structure and the semantic structure with a respective sentence;
creating an index for each meaning of a linguistic parameter of each of the generated syntactic structures;
creating an index for each meaning of a parameter of each of the semantic structures; and
storing each generated syntactic structure and the semantic structure for each sentence of the corpora of natural language text.
4 Assignments
0 Petitions
Accused Products
Abstract
A method comprising a preliminary automated analysis of at least one corpus of natural language text is disclosed. For each sentence of a corpus, the method includes performing a syntactic analysis using linguistic descriptions to generate at least one syntactic structure for the sentence, building a semantic structure for the sentence, associating each generated syntactic and semantic structure with the sentence, and saving each structure. For each corpus text that was preliminary analyzed, performing an indexing operation to index lexical meanings and values of linguistic parameters of each syntactic structure and each semantic structure associated with sentences in the corpus text. A semantic search includes at least one automatic preliminary analyzed corpus of sentences comprising searched values of linguistic, syntactic and semantic parameters. Due to a deep semantic analysis of a corpus, the search may be executed in various languages, in resources of various languages, and in the text of corpora of various languages regardless of the language of the query.
40 Citations
55 Claims
-
1. A computer-implemented method for facilitating a semantic search, the method comprising:
-
identifying a corpora of natural language texts including a plurality of sentences; performing a syntactic analysis on each sentence using a linguistic description associated with a language of the sentence; generating, based at least in part on the syntactic analysis, at least one syntactic structure for each sentence; generating a semantic structure for each sentence of the corpora of natural language texts; associating each generated syntactic structure and the semantic structure with a respective sentence; creating an index for each meaning of a linguistic parameter of each of the generated syntactic structures; creating an index for each meaning of a parameter of each of the semantic structures; and storing each generated syntactic structure and the semantic structure for each sentence of the corpora of natural language text. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A method for providing a result of a search, the method comprising:
-
for each sentence of a corpus of texts, the corpus of texts including a plurality of sentences, generating at least one syntactic structure for each sentence using at least one linguistic description associated with a source natural language; building a language-independent semantic structure for each said sentence; associating each generated syntactic structure and each semantic structure with a respective sentence; indexing at least one meaning of linguistic parameters associated with each sentence; indexing at least one lexical meaning associated with each lexical unit of each sentence; indexing at least one value associated with linguistic parameters related to a syntactic structure of each sentence; indexing at least one value associated with semantic parameters related to the language-independent semantic structure of each sentence; and performing a syntactic search using the indexing of the linguistic parameters, the indexing of the lexical meaning, the indexing of the value associated with the linguistic parameter related to each syntactic structure, and the indexing of the value associated with a semantic parameter related to each semantic structure, wherein the performing the syntactic search includes storing at least one of the indexing of meaning of linguistic parameters, the indexing of the at least one lexical meaning, the indexing of the at least one value associated with linguistic parameters related to a syntactic structure of each sentence, and the indexing of the at least one value associated with semantic parameters related to the language-independent semantic structure of each sentence. - View Dependent Claims (18, 19, 20, 21)
-
-
22. A system for facilitating a semantic search, the system comprising:
-
a first processor and a computer readable memory; a corpus of natural language texts including a plurality of sentences; a syntactic analyzer in the computer readable memory capable of; performing a syntactic analysis on each sentence using a linguistic description associated with a language of the sentence; generating, based at least in part on the syntactic analysis, at least one syntactic structure for each sentence; generating a semantic structure for each sentence of the corpus of natural language texts; associating each generated syntactic structure and the semantic structure with a respective sentence; and storing each generated syntactic structure and the semantic structure for each sentence of the corpus of natural language text; and an index generation component capable of; creating an index for each meaning of a linguistic parameter of each of the generated syntactic structures; creating an index for each meaning of a parameter of each of the semantic structures; and storing the indexes. - View Dependent Claims (23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38)
-
-
39. One or more computer-readable media having computer-executable instructions, which when executed perform steps, comprising:
-
identifying a corpus of natural language texts including a plurality of sentences; performing a syntactic analysis on each sentence using a linguistic description associated with a language of the sentence; generating, based at least in part on the syntactic analysis, at least one syntactic structure for each sentence; generating a semantic structure for each sentence of the corpus of natural language texts; associating each generated syntactic structure and the semantic structure with a respective sentence; creating an index for each meaning of a linguistic parameter of each of the generated syntactic structures; creating an index for each meaning of a parameter of each of the semantic structures; and storing each generated syntactic structure and the semantic structure for each sentence of the corpus of natural language text. - View Dependent Claims (40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55)
-
Specification