Information retrieval through identification of prominent notions
First Claim
1. A method for information retrieval, performed by a computer processor (115), the method comprising the steps of:
a. processing a corpus of sentences, wherein said processing comprises the steps of:
(i) providing a ranking criteria module (135), containing predefined ranking criteria, and coupled to operate with the computer processor;
(ii) retrieving textual documents from a corpus of text documents (120), wherein said corpus is in operative communication flow with the computer processor;
(iii) extracting prominent sentences from said textual documents, using said ranking criteria module; and
(iv) updating a database of prominent sentences (130), coupled to operate with the computer processor; and
b. retrieving prominent sentences from said database of prominent sentences, prioritized by said predefined ranking criteria and by user search keywords,
wherein said extracting of prominent sentences comprises the step of scoring sentences according to at least one extracted rule; and
wherein said extracted rule comprises the steps of:
A. retrieving multiple sentences from said corpus of text documents, wherein for each of said retrieved sentences perform the following steps:
1) identifying the structure of said retrieved sentence;
2) identifying a verb pattern in said retrieved sentence, using said identified structure; and
3) scoring said identified verb pattern, using said ranking criteria module;
B. summing the score of each uniquely identified verb pattern over all the occurrences of said uniquely identified verb pattern in said corpus, to thereby generate aggregated scores for all of said uniquely identified verb patterns; and
C. matching said aggregated scores of said uniquely identified verb patterns with a pre-defined threshold, to thereby generate prominence assigning rules.
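Steps A to C of the extracted rule can be illustrated with a minimal Python sketch. It is not the patented implementation: the input format (sentences pre-tagged as token and part-of-speech pairs), the definition of a verb pattern (the first verb plus the tag that follows it), the `criteria` weight table, and the reading of "matching with a threshold" as "greater than or equal to" are all assumptions made for illustration, and every function name is hypothetical.

```python
from collections import defaultdict

# Hypothetical input format: a sentence is a list of (token, POS tag) pairs.

def identify_structure(sentence):
    """Step A.1 (assumed form): the structure is the sequence of POS tags."""
    return [tag for _, tag in sentence]

def identify_verb_pattern(sentence, structure):
    """Step A.2 (assumed form): the first verb plus the tag that follows it."""
    for i, tag in enumerate(structure):
        if tag == "VERB":
            following = structure[i + 1] if i + 1 < len(structure) else "END"
            return (sentence[i][0].lower(), following)
    return None  # no verb found in this sentence

def score_pattern(pattern, criteria):
    """Step A.3: look up the verb's weight in a hypothetical criteria table."""
    verb, _ = pattern
    return criteria.get(verb, 0.0)

def generate_rules(corpus, criteria, threshold):
    """Steps B and C: sum scores per unique pattern, then apply the threshold."""
    totals = defaultdict(float)
    for sentence in corpus:
        structure = identify_structure(sentence)
        pattern = identify_verb_pattern(sentence, structure)
        if pattern is not None:
            totals[pattern] += score_pattern(pattern, criteria)
    # Assumption: a pattern becomes a prominence-assigning rule when its
    # aggregated score reaches the threshold.
    return {pattern for pattern, total in totals.items() if total >= threshold}
```

Because the scores are summed over every occurrence, a pattern that recurs across the corpus can cross the threshold even when each single occurrence scores modestly.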
Abstract
A system and method for information retrieval from a corpus of text, based on offline extraction of prominent sentences, online retrieval of prominent sentences ordered by predefined criteria, and online recommendation of cross-interest prominent sentences.
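The offline and online split described in the abstract can be sketched as two small functions. This is an illustrative sketch only: the in-memory list standing in for the database of prominent sentences, the `score_sentence` callback, the whitespace tokenization, and the ordering (keyword matches first, then prominence score) are assumptions, and the cross-interest recommendation step is left out because the abstract does not say how it works.

```python
def build_database(documents, score_sentence, min_score):
    """Offline phase: keep each sentence whose prominence score reaches min_score.

    `documents` is assumed to be a list of documents, each already split
    into a list of sentence strings.
    """
    database = []
    for sentences in documents:
        for sentence in sentences:
            score = score_sentence(sentence)
            if score >= min_score:
                database.append((sentence, score))
    return database

def retrieve(database, keywords):
    """Online phase: return stored sentences that contain at least one keyword.

    Assumed ordering: more keyword matches first, then higher prominence score.
    """
    wanted = {keyword.lower() for keyword in keywords}
    hits = []
    for sentence, score in database:
        words = set(sentence.lower().split())  # naive whitespace tokenization
        matches = len(wanted & words)
        if matches:
            hits.append((matches, score, sentence))
    hits.sort(key=lambda hit: (-hit[0], -hit[1]))
    return [sentence for _, _, sentence in hits]
```

Only the offline phase scores sentences; the online phase reads stored scores, which is what lets retrieval stay cheap at query time.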
17 Claims
1. A method for information retrieval, performed by a computer processor (115), as set out in full under First Claim above. Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. A computer software product for searching and retrieving prominent sentences from textual documents, the computer software product embodied in a non-transitory computer-readable medium in which program instructions are stored, which instructions, when read by a computer, perform a method for information retrieval comprising the steps of:
a. providing a ranking criteria module, containing predefined ranking criteria, and coupled to operate with the computer processor;
b. extracting prominent sentences from a textual document, using said ranking criteria module; and
c. retrieving prominent sentences from a textual document, prioritized by said predefined ranking criteria, and by one or more user provided search keywords,
wherein said extracting of prominent sentences comprises the step of scoring sentences according to at least one extracted rule; and
wherein said extracted rule comprises the steps of:
A. retrieving multiple sentences from said corpus of text documents, wherein for each of said retrieved sentences perform the following steps:
1) identifying the structure of said retrieved sentence;
2) identifying a verb pattern in said retrieved sentence, using said identified structure; and
3) scoring said identified verb pattern, using said ranking criteria module;
B. summing the score of each uniquely identified verb pattern over all the occurrences of said uniquely identified verb pattern in said corpus, to thereby generate aggregated scores for all of said uniquely identified verb patterns; and
C. matching said aggregated scores of said uniquely identified verb patterns with a pre-defined threshold, to thereby generate prominence assigning rules.
Dependent claims: 12, 13, 14, 15, 16, 17
Specification