Automatic text skimming using lexical chains
First Claim
Patent Images
1. A method for generating characteristic lexical chain, and for synthesizing the update of a lexical chain, the method comprising:
- receiving a string of text;
generating at least one lexical chain from the string of text, wherein the at least one lexical chain comprises at least one noun and at least one adjective;
generating at least one of the following;
a gradability score and a predication score for the at least one adjective;
determining whether the at least one adjective is one of the following;
characteristic and non-characteristic, based on, at least in part, at least one of the following;
the gradability score and the predication score;
updating the at least one lexical chain by removing a non-characteristic adjective from the at least one lexical chain when the non-characteristic adjective appears before at least one of;
a characteristic adjective and a noun; and
providing the updated at least one lexical chain via a computing device capable of at least one of the following;
audibly broadcasting synthesized speech associated with the updated at least one lexical chain, and transmitting over a network data to a user device for at least one of the following;
audible broadcast of the synthesized speech and the visual display of text.
1 Assignment
0 Petitions
Accused Products
Abstract
Automatic text skimming using lexical chains may be provided. First, at least one lexical chain may be created from an electronic document. Next, a list of positions within the electronic document may be created. The positions may include where at least one concept represented by one of the at least one lexical chain is mentioned. In addition, a list of the position where the at least one concept is mentioned may be assembled. A selection of at least one concept may be received from the list.
31 Citations
10 Claims
-
1. A method for generating characteristic lexical chain, and for synthesizing the update of a lexical chain, the method comprising:
-
receiving a string of text; generating at least one lexical chain from the string of text, wherein the at least one lexical chain comprises at least one noun and at least one adjective; generating at least one of the following;
a gradability score and a predication score for the at least one adjective;determining whether the at least one adjective is one of the following;
characteristic and non-characteristic, based on, at least in part, at least one of the following;
the gradability score and the predication score;updating the at least one lexical chain by removing a non-characteristic adjective from the at least one lexical chain when the non-characteristic adjective appears before at least one of;
a characteristic adjective and a noun; andproviding the updated at least one lexical chain via a computing device capable of at least one of the following;
audibly broadcasting synthesized speech associated with the updated at least one lexical chain, and transmitting over a network data to a user device for at least one of the following;
audible broadcast of the synthesized speech and the visual display of text. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method for generating characteristic multiword expressions and for synthesizing the update of a multiword expression the method comprising:
-
receiving a string of text; generating a multiword expression from the string of text, wherein the multiword expression comprises at least one noun and at least one adjective; determining whether the at least one adjective wherein the determining includes calculating a measure of non-characteristic-ness; generating a gradability score and a predication score for the at least one adjective; determining the at least one adjective'"'"'s initial non-characteristic-ness based on at least a combination of the gradability score and the predication test score; performing a conjunction test based on at least a combination of the gradability score and the predication test score for the at least one adjective; determining the at least one adjective'"'"'s final non-characteristic-ness based on, at least in part, the at least one adjective'"'"'s initial non-characteristic-ness and the results of the conjunction test; updating the multiword expression by removing the at least one adjective determined finally to be non-characteristic when the at least one adjective appears before at least one of;
a characteristic adjective, and a noun; andproviding the updated multiword expression via a computing device capable of at least one of the following;
audibly broadcasting synthesized speech associated with the updated multiword expression, and transmitting over a network data to a user device for at least one of the following;
audible broadcast of the synthesized speech and the visual display of text.
-
Specification