×

Automatic annotation for training and evaluation of semantic analysis engines

  • US 9,224,103 B1
  • Filed: 03/13/2013
  • Issued: 12/29/2015
  • Est. Priority Date: 03/13/2013
  • Status: Active Grant
First Claim
Patent Images

1. A computer system comprising:

  • at least one processor; and

    memory storing instructions that, when executed by the at least one processor, causes the computer system to perform operations comprising;

    receiving documents from a corpus, the corpus comprising;

    an authoritative set of documents from an authoritative source, each document in the authoritative set being associated with an entity, anda second set of documents, the second set being documents that are not in the authoritative set and that are not copies of documents in the authoritative set but that each include at least one link to a document in the authoritative set, the at least one link being associated with anchor text,identifying, for each document in the second set, entity mentions in the document based on the anchor text, each entity mention including the anchor text and an identifier of the linked-to authoritative document,associating the identified entity mentions with respective entity types based on content in the linked-to authoritative document, andtraining an entity tagging engine using the identified entity mentions and the entity types associated with the entity mentions.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×