Linguistically intelligent text compression
First Claim
Patent Images
1. A method of processing a body of text to generate compression options, comprising:
- performing a linguistic analysis on the body of text to obtain a linguistic output indicative of linguistic components of the body of text;
after performing the linguistic analysis, automatically generating a plurality of correct compression options for each of a plurality of different portions of the body of text to compress the body of text based on the linguistic output, each of the correct compression options comprising a different, correct compressed form of an instance of the portion in the body of text; and
selecting one of the plurality of correct compression options for each of the plurality of different portions of the body of text to output a compressed form of the body of text.
1 Assignment
0 Petitions
Accused Products
Abstract
A text processor processes text in a message. The text processor generates a plurality of compressed forms of components of the message. The processor performs a linguistic analysis on the body of text to obtain a linguistic output indicative of linguistic components of the body of text. The processor then generates the plurality of compressed forms that can be used to compress the body of text. The plurality of compressed forms are generated based on the linguistic output. The invention can be implemented as a method of generating the compressed forms and as an apparatus.
49 Citations
20 Claims
-
1. A method of processing a body of text to generate compression options, comprising:
-
performing a linguistic analysis on the body of text to obtain a linguistic output indicative of linguistic components of the body of text;
after performing the linguistic analysis, automatically generating a plurality of correct compression options for each of a plurality of different portions of the body of text to compress the body of text based on the linguistic output, each of the correct compression options comprising a different, correct compressed form of an instance of the portion in the body of text; and
selecting one of the plurality of correct compression options for each of the plurality of different portions of the body of text to output a compressed form of the body of text. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A computer readable data structure formed from a linguistic analysis of a body of text to be compressed indicative of a plurality of correct compressed forms of the body of text, the data structure comprising:
a plurality of different sections, each section corresponding to a textual term in the body of text, each section further comprising a plurality of selectable data fields, selectable to represent in subsequent operations one of a plurality of different, correct compressed forms of the corresponding textual term in the body of text, the different compressed forms representing different levels of compression of the corresponding textual term. - View Dependent Claims (19)
-
20. A message handler receiving a message and generating compression options indicative of different forms of a portion of a body of text in the message, the message handler comprising:
-
a linguistic analyzer linguistically configured to analyze the body of text and provide a linguistic analysis;
a compression form generator configured to automatically generate a plurality of different compressed forms of a plurality of individual textual segments in the body of text based on the linguistic analysis, the plurality of different compressed forms each representing a correct compressed form of a corresponding individual text segment; and
a compressor configured to generate an output indicative of selected ones of the plurality of different compressed forms for the individual textual segments in the body of text.
-
Specification