
Automatic disambiguation based on a reference resource

  • US 9,772,992 B2
  • Filed: 01/03/2012
  • Issued: 09/26/2017
  • Est. Priority Date: 02/26/2007
  • Status: Active Grant
First Claim

1. A computing system comprising:

  • a processor; and

    memory storing instructions which, when executed by the processor, configure the computing system to:

    identify a source text having a plurality of words;

    analyze the source text to identify a surface form in the source text, the surface form being an ambiguous orthographic representation of a proper name for an entity;

    based on the identification of the surface form in the source text, access a surface form record representing the surface form, the surface form record identifying at least a first named entity and a second named entity that are different from one another, and are each associated with the surface form and denoted by a proper name,

      wherein the surface form record comprises a first pointer to a first named entity record that is separate from the surface form record, the first named entity record corresponding to the first named entity and including a first set of context indicators that represents a context of the first named entity, and

      wherein the surface form record comprises a second pointer to a second named entity record that is separate from the surface form record, the second named entity record corresponding to the second named entity and including a second set of context indicators that represents a context of the second named entity;

    use the first pointer to retrieve the first set of context indicators from the first named entity record;

    generate a first correlation measure based on a number of occurrences in the source text of the first set of context indicators;

    use the second pointer to retrieve the second set of context indicators from the second named entity record;

    generate a second correlation measure based on a number of occurrences in the source text of the second set of context indicators;

    based on a comparison of the first and second correlation measures, select one of the first or second named entities as corresponding to the surface form in the source text; and

    generate a representation of a user interface display that displays the source text and visually associates the surface form and the selected named entity.
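
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. The record classes, the `ENTITIES` and `SURFACE_FORMS` tables, the example indicator words, the word-level tokenizer, the tie-breaking rule (first listed entity wins), and the bracketed-text "display" are all assumptions made for illustration. The claim itself only requires separate surface form and named entity records linked by pointers, correlation measures based on occurrence counts, a comparison, and a display that visually associates the surface form with the selected entity.

```python
import re
from dataclasses import dataclass


@dataclass
class NamedEntityRecord:
    """Hypothetical named entity record holding its context indicators."""
    name: str
    context_indicators: frozenset


@dataclass
class SurfaceFormRecord:
    """Hypothetical surface form record; entity_keys act as the pointers."""
    surface_form: str
    entity_keys: tuple


# Illustrative data only; the indicator words are made up for this example.
ENTITIES = {
    "space_shuttle_columbia": NamedEntityRecord(
        "Space Shuttle Columbia",
        frozenset({"nasa", "shuttle", "orbit", "launch"}),
    ),
    "columbia_university": NamedEntityRecord(
        "Columbia University",
        frozenset({"university", "campus", "students", "professor"}),
    ),
}
SURFACE_FORMS = {
    "columbia": SurfaceFormRecord(
        "Columbia", ("space_shuttle_columbia", "columbia_university")
    ),
}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def disambiguate(source_text, surface_form):
    """Return (selected entity key, correlation measures per entity key)."""
    words = tokenize(source_text)
    record = SURFACE_FORMS.get(surface_form.lower())
    if record is None or surface_form.lower() not in words:
        return None, {}
    scores = {}
    for key in record.entity_keys:
        # Follow the pointer to the separate named entity record.
        indicators = ENTITIES[key].context_indicators
        # Correlation measure: number of indicator occurrences in the text.
        scores[key] = sum(1 for w in words if w in indicators)
    # Compare the measures; on a tie the first listed entity is kept.
    selected = max(scores, key=scores.get)
    return selected, scores


def annotate(source_text, surface_form, entity_key):
    """Plain-text stand-in for the display: tag the surface form in place."""
    label = ENTITIES[entity_key].name
    pattern = re.compile(r"\b" + re.escape(surface_form) + r"\b", re.IGNORECASE)
    return pattern.sub(lambda m: f"{m.group(0)} [{label}]", source_text)


text = "NASA delayed the launch of Columbia after the shuttle failed a test."
selected, scores = disambiguate(text, "Columbia")
print(scores)
print(annotate(text, "Columbia", selected))
```

In this example the words "nasa", "launch", and "shuttle" each occur once in the source text, so the first correlation measure is 3 and the second is 0, and the surface form "Columbia" is associated with the space shuttle entity. A real system would likely weight indicators and handle multi-word surface forms, which this sketch does not attempt.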
