Code, system, and method for retrieving text material from a library of documents
Abstract
Disclosed are a computer-readable code, system and method for retrieving one or more selected texts from a library of documents. The system processes a user-input search query representing the content of the text to be retrieved, and accesses a word index for the documents to identify those texts in the database having the highest word-match scores with the search query. The weights of words in the query may be adjusted to optimize the search.
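The word index and word-match scoring summarized in the abstract can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration: the `GENERIC_WORDS` stop list, the helper names `tokenize`, `build_word_index` and `search`, the sample texts, and the use of a uniform match value of 1.0 per word. The source does not say how generic words are chosen or how match values are pre-assigned.

```python
from collections import defaultdict

# Hypothetical stop list; the source does not specify which words are "generic".
GENERIC_WORDS = {"a", "an", "and", "the", "of", "for", "from", "in", "is", "to", "with"}

def tokenize(text):
    """Lowercase, split on whitespace, strip punctuation, drop generic words."""
    words = (w.strip(".,;:()\"'").lower() for w in text.split())
    return [w for w in words if w and w not in GENERIC_WORDS]

def build_word_index(texts):
    """Map each non-generic word to the set of text identifiers containing it."""
    index = defaultdict(set)
    for text_id, text in texts.items():
        for word in tokenize(text):
            index[word].add(text_id)
    return index

def search(query, index, top_n=3):
    """Rank texts by summed match values (assumed uniform: 1.0 per query word)."""
    scores = defaultdict(float)
    for word in set(tokenize(query)):
        for text_id in index.get(word, ()):
            scores[text_id] += 1.0
    # Highest score first; ties broken by text identifier.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return ranked[:top_n]

# Illustrative sample library (not from the source).
texts = {
    1: "A method for retrieving text from a library of documents.",
    2: "A polymer coating for a medical device.",
    3: "A system for searching a library with a word index.",
}
index = build_word_index(texts)
results = search("retrieving documents from a library", index)
```

In this sample, text 1 shares three non-generic words with the query and text 3 shares one, so text 1 ranks first; text 2 shares none and is not returned.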
14 Claims
-
1. A computer-assisted method for retrieving one or more selected texts from a library of documents, comprising
(a) processing a user-input search query composed of a sentence, sentence fragment or word list containing non-generic words representing the content of the text to be retrieved,
(b) accessing a database containing (1) a word-records table composed of (1a) non-generic words contained in said documents and (1b) for each word in the word-records table, a list of identifiers of texts in said documents containing that word, and (2) a document text table containing texts in said documents and associated text identifiers, to identify those texts in the document library having the highest word-match scores with said search query, based on pre-assigned word-match values for the non-generic words in said query,
(c) displaying to the user, (i) the non-generic words in said query, and (ii) for each of said non-generic words, (iia) an occurrence value related to the occurrence of that word relative to other words in the query among texts having the highest word-match scores with the search query, and (iib) user choices for adjusting the word-match values of each of the non-generic words in the search query, relative to other words in the query,
(d) processing user choices made in response to the information displayed in step (c)(ii),
(e) accessing said table of word records to identify texts in the document library having the highest word-match scores based on the user-adjusted word-match values processed in step (d),
(f) accessing said document text table to retrieve those texts identified in (e), and
(g) displaying to the user one or more of the texts in (e).
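The feedback loop of steps (b) through (e) can be sketched roughly as follows. This is one possible reading, not the claimed implementation: the `WORD_RECORDS` table, the function names, the equal starting values, and the interpretation of the "occurrence value" as a count of top-scoring texts containing each word are all assumptions made for illustration.

```python
from collections import Counter

# Hypothetical word-records table: word -> identifiers of texts containing it.
WORD_RECORDS = {
    "laser": {1, 2, 3, 4},
    "catheter": {2, 5},
    "balloon": {2, 3, 5},
}

def top_texts(weights, n=3):
    """Return the n text ids with the highest summed word-match values."""
    scores = Counter()
    for word, value in weights.items():
        for text_id in WORD_RECORDS.get(word, ()):
            scores[text_id] += value
    # Highest score first; ties broken by text identifier.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [text_id for text_id, _ in ranked[:n]]

def occurrence_values(weights, hits):
    """For each query word, count how many of the top-scoring texts contain it."""
    return {w: sum(1 for t in hits if t in WORD_RECORDS.get(w, ())) for w in weights}

# Steps (a)-(b): pre-assigned word-match values (all equal here, an assumption).
weights = {"laser": 1.0, "catheter": 1.0, "balloon": 1.0}
first_hits = top_texts(weights)

# Step (c): what would be shown to the user alongside each query word.
shown = occurrence_values(weights, first_hits)

# Steps (d)-(e): the user emphasizes "catheter" and de-emphasizes "laser".
weights.update({"catheter": 3.0, "laser": 0.5})
second_hits = top_texts(weights)
```

With these sample records, raising the value of "catheter" moves text 5 ahead of text 3 in the second ranking, which is the kind of reordering the user-adjusted values are meant to produce.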
-
5. The method of claim 1, wherein step (c) further includes displaying to the user, texts having the highest word-match scores based on pre-assigned word-match values for the non-generic words in said query.
-
11. A computer-assisted method for retrieving one or more selected texts from a library of documents, comprising
(a) processing a user-input search query composed of a sentence, sentence fragment or word list containing non-generic words representing the content of the text to be retrieved, and a specified document title and length value which specifies a given length of text following said title in a document,
(b) accessing a database containing (1) a word records table composed of (1a) non-generic words contained in said documents and (1b) for each word in the table, a list of identifiers of texts in said documents containing that word, and (2) a document text table containing texts in said documents and associated text identifiers, to identify those texts in the database having the highest word-match scores with said search query, based on pre-assigned word-match values for the non-generic words in said query, and which are within the specified length value following the specified title in said documents, and
(c) displaying to the user one or more of the texts identified in (b).
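The title-and-length restriction in step (b) might look like the sketch below. Several choices here are assumptions, not taken from the source: the `TEXT_TABLE` layout with an `is_title` flag, measuring the length value as a number of texts (it could equally be words or characters), and scoring by a simple count of matching query words.

```python
# Hypothetical document text table: (text_id, is_title, text), in document order.
TEXT_TABLE = [
    (1, True, "Background"),
    (2, False, "Prior search systems rank documents by keyword frequency."),
    (3, False, "Such systems ignore word weights."),
    (4, True, "Summary"),
    (5, False, "The invention ranks texts by weighted keyword matches."),
    (6, False, "Keyword weights may be adjusted by the user."),
    (7, False, "Results are displayed with keyword statistics."),
]

def texts_after_title(title, length):
    """Return ids of the `length` texts that immediately follow the given title."""
    for pos, (text_id, is_title, text) in enumerate(TEXT_TABLE):
        if is_title and text.lower() == title.lower():
            window = TEXT_TABLE[pos + 1 : pos + 1 + length]
            return {tid for tid, _, _ in window}
    return set()  # title not found: nothing qualifies

def search_within(query_words, title, length):
    """Rank texts inside the title window by number of matching query words."""
    allowed = texts_after_title(title, length)
    scored = []
    for text_id, is_title, text in TEXT_TABLE:
        if text_id not in allowed:
            continue
        words = {w.strip(".,").lower() for w in text.split()}
        score = len(words & query_words)
        if score:
            scored.append((score, text_id))
    # Highest score first; ties broken by text identifier.
    return [tid for score, tid in sorted(scored, key=lambda s: (-s[0], s[1]))]

hits = search_within({"keyword", "weights"}, "Summary", 2)
```

Texts 2, 3 and 7 also contain query words, but they fall outside the two texts following the "Summary" title and are therefore excluded from `hits`.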
-
13. A computer-assisted method for retrieving one or more selected texts from a library of documents, where some of said texts may include titles, comprising
(a) processing a user-input search query composed of a sentence, sentence fragment or word list containing non-generic words representing the content of the text to be retrieved, where said query includes a specified title in a document and a length value which specifies a given length of document text following said title in a document,
(b) accessing a database containing (1) a word-records table composed of (1a) non-generic words contained in said documents and (1b) for each word in the word-records table, a list of identifiers of texts in said documents containing that word, and (2) a document text table containing texts in said documents and associated text identifiers, wherein some of the texts in a document are document titles,
(c) by said accessing, identifying those texts in the database having the highest word-match scores with said search query which are also within the specified document length following the specified document title,
(d) accessing said document text table to retrieve those texts identified in (c), and
(e) displaying to the user one or more of the texts in (e).
Specification