Text-searching system and method
Abstract
Disclosed are computer-readable code, a system, and a method for comparing a target concept, invention, or event with each of a plurality of texts. Each of a plurality of non-generic words and, optionally, word groups characterizing the target concept, invention, or event is selected as a descriptive term if the term has an above-threshold selectivity value in at least one library of texts in a field, where the selectivity value of a term is a measure of the field-specificity of that term. A match score is then determined for each of the plurality of texts, related to the number of descriptive terms present in or derived from that text that match those in the target concept, invention, or event. Texts having the highest match scores are selected.
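The term-selection step can be illustrated with a minimal Python sketch. The abstract says only that a selectivity value measures the field-specificity of a term; the ratio of document frequencies used here, the `floor` guard against division by zero, the default `threshold`, and all function names are assumptions made for illustration, not the formula of the disclosure.

```python
def term_frequency(library, term):
    """Fraction of texts in `library` (a list of token lists) containing `term`."""
    if not library:
        return 0.0
    return sum(1 for text in library if term in text) / len(library)


def selectivity(term, field_library, other_libraries, floor=1e-6):
    """Assumed measure: frequency in one field divided by the mean frequency
    in the other fields. `floor` avoids division by zero."""
    in_field = term_frequency(field_library, term)
    elsewhere = [term_frequency(lib, term) for lib in other_libraries]
    mean_elsewhere = sum(elsewhere) / len(elsewhere) if elsewhere else 0.0
    return in_field / max(mean_elsewhere, floor)


def descriptive_terms(target_terms, libraries, threshold=2.0):
    """Keep each term whose selectivity exceeds `threshold` in at least one field.

    `libraries` maps a field name to its library of tokenized texts.
    """
    selected = set()
    for term in target_terms:
        for field, library in libraries.items():
            others = [lib for name, lib in libraries.items() if name != field]
            if selectivity(term, library, others) > threshold:
                selected.add(term)
                break  # one qualifying field is enough
    return selected


# Toy libraries (hypothetical data): two fields, two texts each.
libraries = {
    "surgery": [["scalpel", "device"], ["scalpel", "suture"]],
    "computing": [["device", "cache"], ["cache", "memory"]],
}
chosen = descriptive_terms(["scalpel", "device", "cache"], libraries)
```

In this toy data, "device" occurs equally often in both fields, so its ratio is 1.0 and it is not selected, while "scalpel" and "cache" are each concentrated in a single field and pass the threshold.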
19 Claims
1. Computer-readable code that is operable, when read by an electronic computer, to compare a target concept, invention, or event with each of a plurality of texts, by the steps of:
(a) for each of a plurality of terms composed of non-generic words and, optionally, proximately arranged word groups characterizing the target concept, invention, or event, selecting that term as a descriptive term if the term has an above-threshold selectivity value in at least one library of texts in a field, where the selectivity value of a term in a library of texts in a field is related to the frequency of occurrence of that term in said library, relative to the frequency of occurrence of the same word in one or more other libraries of texts in one or more other fields, respectively,
(b) determining for each of the plurality of texts, a match score related to the number of descriptive terms present in or derived from that text that match those in the target concept, invention, or event, and
(c) using the match score to compare the texts with the target concept, invention, or event.
Dependent claims: 2 to 18.
19. An automated method of comparing a target concept, invention, or event in a given field with each of a plurality of natural-language texts, by the steps of:
(a) for each of a plurality of terms composed of non-generic words and, optionally, proximately arranged word groups characterizing the target concept, invention, or event, selecting that term as a descriptive term if the term has an above-threshold selectivity value in at least one library of texts in a field, where the selectivity value of a term in a library of texts in a field is related to the frequency of occurrence of that term in said library, relative to the frequency of occurrence of the same word in one or more other libraries of texts in one or more other fields, respectively,
(b) determining for each of the plurality of texts, a match score related to the number of descriptive terms present in or derived from that text that match those in the target concept, invention, or event, and
(c) identifying from among the plurality of texts, one or more texts which have the highest match score or scores.
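Steps (b) and (c) can be sketched in Python as follows. The claims say only that the match score is related to the number of matching descriptive terms; taking the score to be a plain count of shared terms, breaking ties by text identifier, and the names `match_score` and `rank_texts` are assumptions made for illustration.

```python
def match_score(text_terms, target_descriptive_terms):
    """Assumed score: the number of distinct descriptive terms of the target
    that also appear among the terms of the text."""
    return len(set(text_terms) & set(target_descriptive_terms))


def rank_texts(texts, target_descriptive_terms, top_n=1):
    """Return the `top_n` (text_id, score) pairs with the highest match scores.

    `texts` maps a text identifier to the list of terms found in that text.
    Ties are broken by identifier so the ordering is deterministic.
    """
    scored = [
        (text_id, match_score(terms, target_descriptive_terms))
        for text_id, terms in texts.items()
    ]
    scored.sort(key=lambda pair: (-pair[1], pair[0]))
    return scored[:top_n]


# Hypothetical descriptive terms and candidate texts.
target = {"scalpel", "cache"}
texts = {
    "A": ["scalpel", "cache", "blade"],
    "B": ["cache"],
    "C": ["memory"],
}
best = rank_texts(texts, target, top_n=2)
```

A weighted score (for example, one that counts each matching term in proportion to its selectivity value) would fit the same interface; the unweighted count is used here only because it is the simplest reading of "related to the number of descriptive terms".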
Specification