×

Text summarization using part-of-speech

  • US 6,289,304 B1
  • Filed: 03/17/1999
  • Issued: 09/11/2001
  • Est. Priority Date: 03/23/1998
  • Status: Expired due to Term
First Claim
Patent Images

1. A method for automatically summarizing text, comprising:

  • (a) obtaining input text data defining a text that includes two or more tokens;

    (b1) using the input text data to tokenize the text, the tokenized text including one or more tokenized sentences;

    (b2) obtaining part-of-speech (POS) data indicating parts of speech for tokens in the text of each of the tokenized sentences from (b1);

    (c) using the POS data for each tokenized sentence to obtain group data for the sentence indicating one or more groups of consecutive tokens of the text and indicating, within each group, any tokens that meet a POS-based removal criterion; and

    (d) using the group data for each sentence to obtain summarized text data defining a summarized version of the text for the sentence in which tokens in each group that are indicated as meeting the removal criterion are removed so that the number of tokens in the summarized version of the text for the sentence is less than the number of tokens in the text.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×