×

Generating context-based spell corrections of entity names

  • US 8,402,032 B1
  • Filed: 03/24/2011
  • Issued: 03/19/2013
  • Est. Priority Date: 03/25/2010
  • Status: Active Grant
First Claim
Patent Images

1. A system comprising:

  • one or more computers including one or more storage devices storing instructions that when executed by the one or more computers cause the one or more computers to perform operations comprising;

    receiving texts from each of a plurality of text sources, wherein each text source provides a text;

    deriving a plurality of name-context pairs from the texts, wherein each name-context pair comprises an entity name included in the text from a text source and a context term included in the text from the text source, wherein each entity name is one or more terms used to refer to a respective entity and each context term is a term that appears in text associated with the entity name;

    calculating a context consistency measure for each distinct name-context pair, wherein the context consistency measure for a particular name-context pair is an estimate of a probability that, if the entity name of the particular name-context pair appears in text, the context term of the particular name-context pair will also appear in the text; and

    storing context-entity name data, wherein the context-entity name data is searchable data that represents one or more of the distinct name-context pairs and the context consistency measure for each of the one or more name-context pair.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×