SEARCH TECHNOLOGY USING SYNONIMS AND PARAPHRASING
First Claim
1. A method of organizing a search in electronic text corpora for computer system, with the following actions carried out at least once:
- performing a semantic-syntactic analysis of a search query, comprising building a ranked list of possible lexical meanings for at least one word of the search query;
compiling a list of synonyms for at least one lexical meaning from the ranked list of possible lexical meanings of the at least one word of the search query;
ranking synonyms from the list of synonyms for the at least one lexical meaning;
generating query versions based on the ranked synonyms;
calculating a rating of correspondence of the query versions to the search query;
searching for text fragments in the electronic text corpora satisfying the query based on at least one of the query versions;
ranking the found text fragments based on the ratings of correspondence of the query versions to the search query.
3 Assignments
0 Petitions
Accused Products
Abstract
The present invention is a method and a system of organizing information searches in electronic text corpora and displaying the search results in the user interface. The system and the method enable searches not just for words or word combinations, but also for specific lexical meanings of words, where a lexical meaning is a realization of a word'"'"'s semantic meaning in a particular language. The completeness of search results is bases on incorporation synonyms and paraphrases in the search. The method also includes searching for fragments matching the query in electronic text corpora, estimating the results and the displaying the results ranked to the user.
38 Citations
20 Claims
-
1. A method of organizing a search in electronic text corpora for computer system, with the following actions carried out at least once:
-
performing a semantic-syntactic analysis of a search query, comprising building a ranked list of possible lexical meanings for at least one word of the search query; compiling a list of synonyms for at least one lexical meaning from the ranked list of possible lexical meanings of the at least one word of the search query; ranking synonyms from the list of synonyms for the at least one lexical meaning; generating query versions based on the ranked synonyms; calculating a rating of correspondence of the query versions to the search query; searching for text fragments in the electronic text corpora satisfying the query based on at least one of the query versions; ranking the found text fragments based on the ratings of correspondence of the query versions to the search query. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system for organizing a search in electronic text corpora of natural language texts, the system comprising:
-
one or more data processors; and one or more storage devices storing instructions that, when executed by the one or more data processors, cause the one or more data processors to perform operations comprising; performing a semantic-syntactic analysis of a search query, comprising building a ranked list of possible lexical meanings for at least one word of the search query; compiling a list of synonyms for at least one lexical meaning from the ranked list of possible lexical meanings of the at least one word of the search query; ranking synonyms from the list of synonyms for the at least one lexical meaning; generating query versions based on the ranked synonyms; calculating a rating of correspondence of the query versions to the search query; searching for text fragments in the electronic text corpora satisfying the query based on at least one of the query versions; ranking the found text fragments based on the ratings of correspondence of the query versions to the search query. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification