Text-processing database
First Claim
1. A computer-accessible database comprising a list of generic words and associated selectivity values, where the selectivity value(s) associated with a word are related to the frequency of occurrence of that word in at least one library of texts in a field, relative to the frequency of occurrence of the same word in one or more libraries of texts in one or more other fields, respectively, and the words in the database are non-generic words in the texts in said libraries of texts.
1 Assignment
0 Petitions
Accused Products
Abstract
Disclosed is a computer-accessible database composed of a list of non-generic words contained in a plurality of digitally encoded texts. Associated with each term is a selectivity value or values that are related to the frequency of occurrence of that word in at least one library of texts in a field, relative to the frequency of occurrence of the same word in one or more libraries of texts in one or more other fields, respectively. Also associated with each term are one or more text identifiers identifying one or more of the digitally processed texts containing that word. Each text identifier may be further associated with sentence and word-number identifiers that identify the sentence and word number(s) of a given database word.
75 Citations
10 Claims
-
1. A computer-accessible database comprising
a list of generic words and associated selectivity values, where the selectivity value(s) associated with a word are related to the frequency of occurrence of that word in at least one library of texts in a field, relative to the frequency of occurrence of the same word in one or more libraries of texts in one or more other fields, respectively, and the words in the database are non-generic words in the texts in said libraries of texts.
Specification