Method and apparatus for creating a customized summary of text by selection of sub-sections thereof ranked by comparison to target data items

US 6,334,132 B1
Filed: 06/02/1998
Issued: 12/25/2001
Est. Priority Date: 04/16/1997
Status: Expired due to Term

5 Assignments

0 Petitions

Abstract

A system for summarizing data sets stores target data items and divides the data set into sections. Each section is compared against the target data items and a ranking value is calculated for each section dependent on the outcome of the comparisons. A summary of the data set is then compiled from sections having a ranking value past a pre-determined threshold value.
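The flow described in the abstract can be sketched in a few lines of Python. This is only an illustrative sketch, not the patented implementation: the function names are hypothetical, sections are assumed to be paragraphs separated by blank lines, the ranking value is assumed to be a simple count of words matching a target item, and "past a threshold" is read as "greater than or equal to".

```python
import re


def split_sections(text):
    """Divide the data set into sections.

    Assumption: a section is a paragraph delimited by blank lines.
    """
    return [p.strip() for p in re.split(r"\n\s*\n", text) if p.strip()]


def rank_section(section, targets):
    """Compare a section against the target data items.

    Assumption: the ranking value is the number of word tokens in the
    section that equal one of the target items (case-insensitive).
    """
    words = re.findall(r"[a-z0-9]+", section.lower())
    wanted = {t.lower() for t in targets}
    return sum(1 for w in words if w in wanted)


def summarize(text, targets, threshold):
    """Compile a summary from sections whose ranking value meets the threshold.

    Sections are returned in their original order.
    """
    sections = split_sections(text)
    return [s for s in sections if rank_section(s, targets) >= threshold]


if __name__ == "__main__":
    sample = (
        "Cats sleep a lot.\n\n"
        "Dogs chase cats and cats run.\n\n"
        "Rain fell all day."
    )
    for line in summarize(sample, ["cats"], 1):
        print(line)
```

Changing the target items changes which sections survive, which is what makes the summary "customized" in the sense used here.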

13 Claims

1. Apparatus for summarizing data sets, the apparatus comprising:
- an input for receiving a data set to be summarized;
  
  sectioning means for dividing said received data set into plural sections according to pre-determined criteria;
  
  ranking means operable for each said section to compare data within the said section with one or more target data items and for calculating a ranking value for the said section, said ranking value being dependent on the outcome of said comparisons for the said section; and
  
  compiling means for compiling a customized summary of the data set by selecting one or more of said one or more sections according to their respective ranking values.
2. Apparatus as in claim 1, including a user input for entering target data items.
3. Apparatus as in claim 1 further including:
4. Apparatus as in claim 3, wherein:
- said calculating means are operable to calculate a first distribution value for each said section, said first distribution value representing a measure of the number of sections of said data set, other than the said section, containing key data items of the said section, said first distribution value, as calculated for the said section, being proportional to the sum of the values of said measure of the number of sections determined for each key data item of the said section.
5. Apparatus as in claim 4 wherein:
- said calculating means are operable to calculate a second distribution value for each said section, said second distribution value representing a measure of the separation between the first occurrence within said data set of each key data item of the said section and the respective last occurrence, said second distribution value, as calculated for the said section, being proportional to the sum of the values of said measure of separation determined for each key data item of the said section.
6. Apparatus as in claim 5, wherein:
- said selecting means are arranged to compile a summary having a pre-defined length by selecting, in order of decreasing rank, as determined by the corresponding ranking value, one or more of said one or more sections, beginning with the highest ranked section, and adding each selected section to the summary until the summary has attained said pre-defined length.
7. Apparatus as in claim 3, wherein:
- said calculating means are operable to calculate a second distribution value for each said section, said second distribution value representing a measure of the separation between the first occurrence within said data set of each key data item of the said section and the respective last occurrence, said second distribution value, as calculated for the said section, being proportional to the sum of the values of said measure of separation determined for each key data item of the said section.
8. Apparatus as in claim 1, wherein:
- said selecting means are arranged to compile a summary having a pre-defined length by selecting, in order of decreasing rank, as determined by the corresponding ranking value, one or more of said one or more sections, beginning with the highest ranked section, and adding each selected section to the summary until the summary has attained said pre-defined length.
13. A method as in claim 8 wherein:
- at step b), said one or more pre-determined measures of distribution include a measure of the separation between the first occurrence within said data set of each key data item of the said section and the respective last occurrence; and
  
  the corresponding distribution value, as calculated for the said section, is proportional to the sum of the values of said measure of separation determined for each key data item of the said section.
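
The length-limited compilation recited in claims 6 and 8 (select sections in order of decreasing rank and add them until the summary attains a pre-defined length) might look like the following Python sketch. The function name and parameters are hypothetical; length is assumed to be measured in whitespace-separated words, and the selected sections are assumed to be emitted in their original document order, neither of which the claims specify.

```python
def compile_by_length(sections, ranking_values, max_words):
    """Build a summary of roughly max_words words from the top-ranked sections.

    Sections are taken in order of decreasing ranking value, starting with
    the highest ranked, until the running length reaches max_words. The
    last section added may push the summary past max_words.
    """
    # Indices sorted by ranking value, highest first (ties keep input order).
    order = sorted(range(len(sections)),
                   key=lambda i: ranking_values[i],
                   reverse=True)

    chosen = []
    length = 0
    for i in order:
        if length >= max_words:
            break  # the summary has attained the pre-defined length
        chosen.append(i)
        length += len(sections[i].split())

    # Assumption: present the selected sections in original document order.
    return [sections[i] for i in sorted(chosen)]


if __name__ == "__main__":
    secs = ["a b c", "d e", "f g h i"]
    ranks = [1, 5, 3]
    print(compile_by_length(secs, ranks, 5))
```

Stopping only after the target length is reached keeps whole sections intact; a stricter variant could skip any section that would overshoot the limit.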

9. A method for generating a customised summary of a data set, the method comprising:
- i) receiving, as input, a data set to be summarized;
  
  ii) dividing said data set into sections according to predetermined criteria;
  
  iii) comparing data items in each said section against one or more target data items;
  
  iv) calculating a ranking value for each said section in dependence upon the outcome of the respective said comparisons; and
  
  v) compiling a customized summary of said data set by selecting one or more of said one or more sections according to their respective ranking values.
10. A method as in claim 9 further comprising:
11. A method as in claim 10, wherein:
- at step b), said one or more pre-determined measures of distribution include a measure of the number of sections of said data set, other than the said section, containing key data items of the said section; and
  
  the corresponding distribution value, as calculated for the said section, is proportional to the sum of the values of said measure of the number of sections determined for each key data item of the said section.
12. A method as in claim 11 wherein:
- at step b), said one or more pre-determined measures of distribution include a measure of the separation between the first occurrence within said data set of each key data item of the said section and the respective last occurrence; and
  
  the corresponding distribution value, as calculated for the said section, is proportional to the sum of the values of said measure of separation determined for each key data item of the said section.
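
The two distribution measures recited in claims 11 and 12 (and, for the apparatus, claims 4, 5 and 7) can be illustrated with a short Python sketch. How key data items are chosen for a section is not shown in the claim text above, so the sketch simply assumes they are supplied per section. The function names are hypothetical, separation is assumed to be measured in token positions across the whole data set, and the constant of proportionality is taken to be 1.

```python
def first_distribution_value(index, key_items, sections_tokens):
    """Sum, over the section's key items, of the number of OTHER sections
    of the data set that contain the item."""
    total = 0
    for item in key_items[index]:
        total += sum(
            1
            for j, tokens in enumerate(sections_tokens)
            if j != index and item in tokens
        )
    return total


def second_distribution_value(index, key_items, sections_tokens):
    """Sum, over the section's key items, of the separation between the
    item's first and last occurrence in the whole data set.

    Assumption: separation is the difference in token position.
    """
    flat = [tok for tokens in sections_tokens for tok in tokens]
    total = 0
    for item in key_items[index]:
        positions = [p for p, tok in enumerate(flat) if tok == item]
        if positions:
            total += positions[-1] - positions[0]
    return total


if __name__ == "__main__":
    # Three tokenized sections and (assumed) key data items for each.
    sections_tokens = [["cat", "sat"], ["dog", "ran"], ["cat", "dog", "met"]]
    key_items = [["cat"], ["dog"], ["cat", "dog"]]
    for i in range(len(sections_tokens)):
        print(i,
              first_distribution_value(i, key_items, sections_tokens),
              second_distribution_value(i, key_items, sections_tokens))
```

Both values grow when a section's key items recur widely through the data set, which is one plausible reading of why they would feed into a section's ranking.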

Current Assignee: Suffolk Technologies, LLC (Vector Capital Corporation)
Original Assignee: British Telecommunications PLC (BT Group PLC)
Inventors: Weeks, Richard
Primary Examiner(s): Amsbury, Wayne
Assistant Examiner(s): PARDO, THUY N
Application Number: US09/077,603
Time in Patent Office: 1,302 Days
Field of Search: 707/3, 707/4, 707/101, 707/102, 707/104, 707/2, 707/6, 345/327
US Class Current: 707/723
CPC Class Codes:
- G06F 16/345   Summarisation for human users
- Y10S 707/917   Text
- Y10S 707/99932   Access augmentation or opti...
- Y10S 707/99942   Manipulating data structure...
- Y10S 707/99943   Generating database or data...
