Method and system for minimizing attribute naming errors in set oriented duplicate detection

US 5,799,302 A
Filed: 03/30/1995
Issued: 08/25/1998
Est. Priority Date: 03/30/1995
Status: Expired due to Term

First Claim

Patent Images

1. A method of detecting duplicate entries in an address file, comprising the steps of:

(a) entering an address list to an addressing system, wherein said address list is comprised of one or more address records and said address records are comprised of one or more address fields;

(b) applying a nickname lookup table to said address records, wherein said nickname lookup table comprises one or more nicknames corresponding to a common first name, said one or more nicknames located in one of said address fields; and

further comprising the step of selecting the degree of precision to which a match sequence can be subjected;

(c) performing said match sequence by matching a first record from said address list with a second record and subsequent records, if any, from said address list by comparing said one or more address fields of said first record with said one or more address fields of said second or subsequent records;

(d) repeating said match sequence for each of said subsequent records;

(e) determining a duplicate set, wherein said duplicate set is comprised of all address records with address fields that match as determined by a set of pre-selected criteria;

(f) listing said duplicate set so that each address record follows sequentially;

(g) determining an address record to be retained within said address list; and

(h) retaining said address record within said address list; and

placing said duplicate set on a second list.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

The invention is a method for detecting duplicate records on a list or in a file and comprises a number of steps. The steps include entering a list, comprised of one or more records, to a data processing system; then, applying a nickname lookup table to the records to determine a common first name. Once a common name has been determined, the method matches a first record from the list with a second record from the list by comparing the fields of the first record with the fields of at least one other record; the comparison is based on a set of pre-determined criteria. The matching sequence determines a duplicate set, wherein the duplicate set is comprised of at least two records with fields that match. The method then lists matching records sequentially so that the system can create a new record by filling each empty field with a next available corresponding field from a subsequent record within the duplicate set. The newly created record is then retained on the original list; and the duplicate records are placed on a second list. Pre-sorting of the list can occur just prior to the matching sequence as well as just prior to outputting the final list. Additionally, the system operator can be given a number of options to provide flexibility. These options can include: manually correcting a record on the duplicate records list; deleting an address record from the list of duplicates; or, outputting the record.

82 Citations

View as Search Results

16 Claims

1. A method of detecting duplicate entries in an address file, comprising the steps of:
- (a) entering an address list to an addressing system, wherein said address list is comprised of one or more address records and said address records are comprised of one or more address fields;
  
  (b) applying a nickname lookup table to said address records, wherein said nickname lookup table comprises one or more nicknames corresponding to a common first name, said one or more nicknames located in one of said address fields; and
  
  further comprising the step of selecting the degree of precision to which a match sequence can be subjected;
  
  (c) performing said match sequence by matching a first record from said address list with a second record and subsequent records, if any, from said address list by comparing said one or more address fields of said first record with said one or more address fields of said second or subsequent records;
  
  (d) repeating said match sequence for each of said subsequent records;
  
  (e) determining a duplicate set, wherein said duplicate set is comprised of all address records with address fields that match as determined by a set of pre-selected criteria;
  
  (f) listing said duplicate set so that each address record follows sequentially;
  
  (g) determining an address record to be retained within said address list; and
  
  (h) retaining said address record within said address list; and
  
  placing said duplicate set on a second list.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
- - 2. The method of claim 1, wherein a new address record is formed by filling each empty address field within said address record with a next available corresponding field from said subsequent address record within said listing of said duplicate set.
  - 3. The method of claim 2, wherein if said next available corresponding field would produce an incorrect address, then deleting said next available corresponding field and continuing in sequence to a second next available corresponding field from a second subsequent address record;
    - said sequence to continue until said empty address field is filled or until there are no more corresponding fields available.
  - 4. The method of claim 1, wherein said addressing system comprises:
    - (a) a data processing device with a memory operatively connected thereto;
      
      (b) an addressing printer with a media source;
      
      (c) a page printer with a media source;
      
      (d) a display; and
      
      (e) a keyboard.
  - 5. The method of claim 4, wherein said media source is a bin feeder or a cassette.
  - 6. The method of claim 1, further comprising the step of presorting said address list prior to outputting said address list to a language interpreter of a printer.
  - 7. The method of claim 1, further comprising the steps of:
    - (a) outputting said address list to a language interpreter of a printer;
      
      (b) retaining said second list in a memory of said data processing device; and
      
      (c) generating a report in respect of said first list.
  - 8. The method of claim 1, further comprising the step of determining a corresponding bar code in respect of said retained new address record.
  - 9. The method of claim 1, wherein said nickname lookup table comprises a first field comprised of nicknames and a second field comprised of common names in respect of said nicknames.
  - 10. The method of claim 1, wherein said pre-selected set of criteria comprises a choice to be made from among one or more choices derived from said address fields.
  - 11. The method of claim 1, wherein said address fields comprise data identifying an addressee by a plurality of characteristics, said characteristics comprising:
    - (a) name fields;
      
      (b) location fields; and
      
      (c) code fields.
  - 12. The method of claim 4, wherein said data processing device is resident within said addressing printer.
  - 13. The method of claim 4, wherein said data processing device is resident in a host computer exclusive of said addressing printer.
  - 14. The method of claim 1, wherein said duplicate set is displayed to a system operator as a table of duplicate records and said system operator can scroll through said table to view said duplicate records.
  - 15. The method of claim 1, or of claim 14, wherein a system operator is given an option to:
    - (a) manually correct an address record on said second list to;
      
      (i) include a corresponding bar code;
      
      (ii) transfer said corrected address record to said address list; and
      
      , (iii) retain said address records that are not corrected;
      
      (b) delete said address record from said second list;
      
      or(c) output said address record.

16. An addressing system for detecting duplicate entries in an address file, comprising:
- a. means for entering an address list, wherein said address list is comprised of one or more address records and said address records are comprised of one or more address fields;
  
  b. means for applying a nickname lookup table to said address records, wherein said nickname lookup table comprises one or more nicknames corresponding to a common first name, said one or more nicknames located in one of said address fields;
  
  said applying means further comprising means for selecting the degree of precision to which a match sequence can be subjected;
  
  c. means for performing said match sequence for each of said address records by matching a first record from said address list with a second record and subsequent records, if any, from said address list by comparing said one or more address fields of said first record with said one or more address fields of said second or subsequent records, and repeating said match sequence for each of said subsequent records;
  
  d. means for determining a duplicate set, wherein said duplicate set is comprised of all address records with address fields that match as determined by a set of pre-selected criteria;
  
  e. means for listing said duplicate set so that each address record follows sequentially;
  
  f. means for determining an address record to be retained within said address list; and
  
  g. means for retaining said address record within said address list; and
  
  placing said duplicate set on a second list.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Pitney Bowes Incorporated (Böwe Systec AG)
Original Assignee
Pitney Bowes Incorporated (Böwe Systec AG)
Inventors
Szturma, Shawn W., Johnson, Robert J.
Primary Examiner(s)
Lintz, Paul R.

Application Number

US08/413,579
Time in Patent Office

1,244 Days
Field of Search

395/600, 364/401 R, 364/408, 364/478, 364/478.07, 707/1, 707/7, 705/10, 705/45
US Class Current

1/1
CPC Class Codes

G06Q 99/00 Subject matter not provided...

Method and system for minimizing attribute naming errors in set oriented duplicate detection

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

82 Citations

16 Claims

Specification

Solutions

Use Cases

Quick Links

Method and system for minimizing attribute naming errors in set oriented duplicate detection

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

82 Citations

16 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links