Code, method, and system for manipulating texts
First Claim
1. A computer-assisted method for combining texts to form novel combinations of texts related to a desired target concept that is represented in the form of a natural-language text or a list of descriptive terms that include words and, optionally, word groups, said method comprising (A) if the target concept is represented in the form of a natural-language text, extracting descriptive word and, optionally, word-group terms from the text, to form a list of descriptive terms, (B) searching a database of target-related texts, to identify a primary group of texts having highest term match scores with a first subset of said terms, (C) searching a database of target-related texts, to identify a secondary group of texts having the highest term match scores with a second subset of said terms, where said first and second subsets are at least partially complementary with respect to the terms in said list, (D) generating pairs of texts containing a text from the primary group of texts and a different text from the secondary group of texts, and (E) selecting for presentation to the user, those pairs of texts that have highest overlap scores as determined from one or more of:
- (E1) overlap between descriptive terms in one text in the pair with descriptive terms in the other text in the pair;
(E2) overlap between descriptive terms present in both texts in the pair and said list of descriptive terms;
(E3) for one or more terms in one of the pairs of texts identified as feature terms, the presence in the other pair of texts of one or more feature-specific terms defined as having a substantially higher rate of occurrence in a feature library composed in texts containing that feature term, (E4) for one or more attributes associated with the target invention, the presence in at least one text in the pair of attribute-specific terms defined as having a substantially higher rate of occurrence in an attribute library composed in texts containing a word-and/or word-group term that is descriptive of that attribute, and (E5) a citation score related to the extent to which one or both texts in the pair are cited by later texts.
1 Assignment
0 Petitions
Accused Products
Abstract
Disclosed are a computer-readable code, system and method for combining texts to form novel combinations of texts related to a desired target concept, where the concept is represented in the form of a natural-language text or a list of descriptive word and/or word-group terms. The system operates to find primary and secondary groups of texts having highest term match scores with a first and second subset of terms in the concept, respectively. It then generates pairs of texts containing a text from each of the primary and secondary groups of database texts, and selects for presentation to the user, those pairs of texts having highest overlap scores as determined from one or more of (i) term overlap, (ii) term coverage, (iii) feature-specific cross-correlation, (iv) attribute-specific correlation, and (v) citation score of one or both texts in the pair.
114 Citations
18 Claims
-
1. A computer-assisted method for combining texts to form novel combinations of texts related to a desired target concept that is represented in the form of a natural-language text or a list of descriptive terms that include words and, optionally, word groups, said method comprising
(A) if the target concept is represented in the form of a natural-language text, extracting descriptive word and, optionally, word-group terms from the text, to form a list of descriptive terms, (B) searching a database of target-related texts, to identify a primary group of texts having highest term match scores with a first subset of said terms, (C) searching a database of target-related texts, to identify a secondary group of texts having the highest term match scores with a second subset of said terms, where said first and second subsets are at least partially complementary with respect to the terms in said list, (D) generating pairs of texts containing a text from the primary group of texts and a different text from the secondary group of texts, and (E) selecting for presentation to the user, those pairs of texts that have highest overlap scores as determined from one or more of: -
(E1) overlap between descriptive terms in one text in the pair with descriptive terms in the other text in the pair;
(E2) overlap between descriptive terms present in both texts in the pair and said list of descriptive terms;
(E3) for one or more terms in one of the pairs of texts identified as feature terms, the presence in the other pair of texts of one or more feature-specific terms defined as having a substantially higher rate of occurrence in a feature library composed in texts containing that feature term, (E4) for one or more attributes associated with the target invention, the presence in at least one text in the pair of attribute-specific terms defined as having a substantially higher rate of occurrence in an attribute library composed in texts containing a word-and/or word-group term that is descriptive of that attribute, and (E5) a citation score related to the extent to which one or both texts in the pair are cited by later texts. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A feature or attribute descriptor dictionary comprising
a list of feature and/or attribute descriptors, and for each descriptor, a list of word and/or word-group terms that are that are descriptor specific for that descriptor, where a term is descriptor specific for a given descriptor if the term has a substantially higher rate of occurrence in a descriptor library composed in texts containing a word-and/or word-group term that is the same as or descriptive of that descriptor than the same term has in a library of texts unrelated to that descriptor.
Specification