×

CONFIDENCE LINKS BETWEEN NAME ENTITIES IN DISPARATE DOCUMENTS

  • US 20100076972A1
  • Filed: 12/29/2008
  • Published: 03/25/2010
  • Est. Priority Date: 09/05/2008
  • Status: Active Grant
First Claim
Patent Images

1. A system that detects similarities between name strings in a document set, comprising:

  • a preprocessing module configured to;

    extract a plurality of name strings from the document set;

    a matching module configured to;

    detect possible matching pairs from the plurality of name strings, andassign a plurality of similarity scores to each of the possible matching pairs using a plurality of algorithms; and

    a generation module configured to;

    generate a set of equivalent names by accumulating name strings from the possible matching pairs based on a comparison between the similarity scores and a threshold.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×