Ambiguous entity disambiguation method

  • US 20080065621A1
  • Filed: 09/13/2006
  • Published: 03/13/2008
  • Est. Priority Date: 09/13/2006
  • Status: Abandoned Application
First Claim

1. An ambiguous entity disambiguation method, wherein an article comprises entities and each entity is a single-word or a multi-word entity, wherein at least one entity has an ambiguous meaning, the method comprising the steps of:

    providing a disambiguation database which references a digital encyclopedia database, the disambiguation database comprising links to redirect pages of the digital encyclopedia database, links to disambiguation pages of the digital encyclopedia database, and for each redirect page and disambiguation page, the popularity of the page and the type of page;

    extracting entities from the article;

    combining multi-word entities;

    creating entity aliases for combined multi-word entities;

    searching the disambiguation database for pages in the digital encyclopedia database matching each extracted entity and entity alias;

    for each matching page, creating a list of links to other encyclopedia pages;

    scoring each extracted entity and entity alias according to the list of links and disambiguation database;

    adjusting each of the scores; and

    for each entity, selecting the highest scoring entity alias;

    whereby the entity type for each entity is the type of matching page for the highest scoring entity alias in the disambiguation database.
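
The steps of the claim can be read as a small pipeline. The following is only an illustrative sketch in Python, not the applicant's implementation. The claim does not say how entities are extracted, how aliases are formed, how scores are computed, or how they are adjusted, so every one of those choices here is an assumption: capitalized tokens stand in for entities, the last word of a multi-word entity is its only extra alias, the score is page popularity plus the number of links pointing at other entities in the article, and the adjustment scales the score by the share of the entity's words the alias covers. The names `PageRecord`, `extract_entities`, `combine_multiword`, `make_aliases`, and `disambiguate` are hypothetical, and the database is a toy in-memory dictionary.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class PageRecord:
    """One row of the (hypothetical) disambiguation database."""
    title: str
    popularity: float             # popularity of the page
    page_type: str                # type of the page, e.g. "person"
    links: Tuple[str, ...] = ()   # titles of other encyclopedia pages it links to


def extract_entities(article: str) -> List[Tuple[int, str]]:
    # Assumption: an entity word is any capitalized token; keep its position.
    tokens = [t.strip(".,;:!?()") for t in article.split()]
    return [(i, t) for i, t in enumerate(tokens) if t[:1].isupper()]


def combine_multiword(extracted: List[Tuple[int, str]]) -> List[str]:
    # Assumption: adjacent entity words form one multi-word entity.
    combined: List[str] = []
    run: List[str] = []
    prev = None
    for i, tok in extracted:
        if prev is not None and i != prev + 1:
            combined.append(" ".join(run))
            run = []
        run.append(tok)
        prev = i
    if run:
        combined.append(" ".join(run))
    return combined


def make_aliases(entity: str) -> List[str]:
    # Assumption: the full name plus, for multi-word entities, the last word.
    words = entity.split()
    return [entity] + ([words[-1]] if len(words) > 1 else [])


def disambiguate(article: str, db: Dict[str, List[PageRecord]]) -> Dict[str, Tuple[str, str]]:
    entities = combine_multiword(extract_entities(article))
    mentioned = {e.lower() for e in entities}
    result: Dict[str, Tuple[str, str]] = {}
    for entity in entities:
        candidates = []
        for alias in make_aliases(entity):
            for page in db.get(alias.lower(), []):     # search the database
                # List of links: count those that point at other article entities.
                context = sum(1 for link in page.links if link.lower() in mentioned)
                score = page.popularity + context      # assumed scoring rule
                # Assumed adjustment: favor aliases covering more of the entity.
                score *= len(alias.split()) / len(entity.split())
                candidates.append((score, alias, page))
        if candidates:
            _, alias, page = max(candidates, key=lambda c: c[0])
            result[entity] = (alias, page.page_type)   # entity type = page type
    return result


# Toy data, invented for illustration only.
db = {
    "michael jordan": [PageRecord("Michael Jordan", 0.9, "person", ("Chicago", "Basketball"))],
    "jordan": [PageRecord("Jordan", 0.8, "place", ("Amman",))],
    "chicago": [PageRecord("Chicago", 0.7, "place", ("Illinois",))],
}
result = disambiguate("yesterday Michael Jordan visited Chicago.", db)
print(result)
```

In this toy run the alias "Jordan" matches a page of a different type than the full name does; the link to another entity in the article and the assumed adjustment both push the full-name alias ahead, so the entity takes the type of that page.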
