Text-processing code, system and method
First Claim
1. Computer-readable code which is operable, when used to control an electronic computer, to identify descriptive words contained in a digitally encoded input text, by the steps of
(i) processing the input text to generate a list of text words,
(ii) selecting a text word from (i) as a descriptive word if that word has an above-threshold selectivity value in at least one library of texts in a field, where the selectivity value of a word in a library of texts in a field is related to the frequency of occurrence of that word in said library, relative to the frequency of occurrence of the same word in one or more other libraries of texts in one or more other fields, respectively, and
(iii) storing or displaying the words selected in (ii) as descriptive words.
Abstract
Disclosed is an automated system, machine-readable code, and method for generating descriptive words and, optionally, multi-word groups derived from a digitally encoded, natural-language input text that describes a concept, invention, or event in a selected field. The system includes (a) an electronic digital computer, (b) a database of words and, optionally, word groups derived from a plurality of texts, and (c) computer-readable code for accessing the database. The database provides, or can be used to calculate, a selectivity value for each of the words and, optionally, word groups contained in or derived from the input text. Words and, optionally, word groups having an above-threshold selectivity value are selected as descriptive terms from the input text.
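The selection of descriptive words by selectivity value can be illustrated with a minimal Python sketch. The record states only that a word's selectivity value is "related to" its frequency in one library relative to its frequency in other libraries; the plain frequency ratio, the `floor` constant that avoids division by zero, the default threshold of 2.0, and all function names below are assumptions made for illustration, not the patented formula.

```python
import re
from collections import Counter


def tokenize(text):
    """Step (i): turn the input text into a list of lowercase words."""
    return re.findall(r"[a-z]+", text.lower())


def word_frequencies(library):
    """Relative frequency of each word across all texts in one library."""
    counts = Counter(w for doc in library for w in tokenize(doc))
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()}


def selectivity(word, field, freqs, floor=1e-6):
    """Assumed measure: frequency in `field` divided by the mean
    frequency of the same word in the other fields' libraries."""
    own = freqs[field].get(word, 0.0)
    others = [f.get(word, 0.0) for name, f in freqs.items() if name != field]
    mean_other = sum(others) / len(others) if others else 0.0
    return own / max(mean_other, floor)


def descriptive_words(text, libraries, threshold=2.0):
    """Steps (ii) and (iii): keep words whose selectivity exceeds the
    threshold in at least one library, and return them."""
    freqs = {name: word_frequencies(docs) for name, docs in libraries.items()}
    selected = []
    for word in dict.fromkeys(tokenize(text)):  # unique words, input order
        if any(selectivity(word, field, freqs) > threshold for field in freqs):
            selected.append(word)
    return selected


# Hypothetical two-field example; the libraries are toy data.
libraries = {
    "surgery": ["the scalpel cuts the tissue", "the surgeon holds the scalpel"],
    "computing": ["the processor runs the code", "the compiler reads the code"],
}
print(descriptive_words("the scalpel and the code", libraries))
# prints ['scalpel', 'code']
```

In this toy example "the" is equally frequent in both libraries, so its ratio is 1.0 and it is rejected, while "scalpel" and "code" each occur in only one field and are kept. A practical system would presumably precompute the per-library frequencies in a database, as the abstract describes, rather than recount them for every input text.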
14 Claims
Specification