
Method to improve the named entity classification

  • US 10,108,705 B2
  • Filed: 11/09/2011
  • Issued: 10/23/2018
  • Est. Priority Date: 09/29/2010
  • Status: Active Grant
First Claim

1. A method of providing a named entity classification in a computing system, the method comprising:

  • reading, from an LOD (Linking Open Data) set, an LOD node corresponding to a to-be-classified named entity, wherein the LOD node is associated with a uniform resource identifier and corresponds to a web page, the LOD node further comprising at least a plurality of property entries;

    determining a type attribute of the LOD node corresponding to the to-be-classified named entity as a tagged type of the to-be-classified named entity;

    reading a candidate type;

    computing a possibility of the to-be-classified named entity belonging to the candidate type, comprising:

    mapping the candidate type to a node of an intermediate ontology, wherein the intermediate ontology includes at least a structured correlation of generic-specific relationships, the generic-specific relationships including at least an identical relationship, a homologous relationship and a conflicting relationship;

    computing an attribute matching score between the candidate type and the tagged type based on a relationship between the mapped node of the intermediate ontology and the candidate type, wherein computing the attribute matching score of a mapped intermediate ontology having an identical type is based on a predetermined value, the attribute matching score of a mapped intermediate ontology having a generic-specific relationship is based on the predetermined value and a first count of nodes between two mapped nodes of the intermediate ontology, and the attribute matching score of a mapped intermediate ontology having a homologous relationship is based on the predetermined value and a second count of nodes between two mapped nodes of the intermediate ontology to a common source node and a third count of nodes between two mapped nodes of the intermediate ontology to the common source node;

    performing statistical processing to attribute matching scores corresponding to the candidate type to obtain a possibility of the to-be-classified named entity belonging to the candidate type, wherein performing statistical processing to each attribute matching score corresponding to a same candidate type further comprises:

    converting the attribute matching scores to a node matching score based on the correspondence relationship between the attribute matching scores and the LOD node in order to reduce type attribute entry noise;

    performing statistical processing to each node matching score corresponding to a same candidate type, thereby obtaining a possibility of the to-be-classified named entity belonging to the candidate type; and

    selecting one or more tagged types based on satisfaction of an attribute matching score threshold.
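
The scoring steps recited in the claim can be illustrated with a minimal Python sketch. The claim states only that each score is "based on" a predetermined value and node counts, so the concrete formulas here are assumptions: the predetermined value is divided by one plus the number of hops, conflicting types score zero, a node matching score is the maximum attribute score of that node, and the possibility is the mean over nodes. The ontology, the conflict set, the URIs, and every function name are hypothetical, and the final selection step is approximated by thresholding the possibility of each candidate type.

```python
from statistics import mean

# Hypothetical intermediate ontology, stored as child -> parent (None marks the root).
PARENT = {
    "Thing": None,
    "Agent": "Thing",
    "Person": "Agent",
    "Athlete": "Person",
    "Artist": "Person",
    "Place": "Thing",
}
# Hypothetical pairs of types declared to be in a conflicting relationship.
CONFLICTS = {frozenset(("Person", "Place"))}

BASE = 1.0  # the "predetermined value" of the claim (assumed to be 1.0)


def ancestors(node):
    """Return the path from node up to the root, with node itself first."""
    path = []
    while node is not None:
        path.append(node)
        node = PARENT[node]
    return path


def attribute_score(candidate, tagged):
    """Score one candidate type against one tagged type (assumed formulas)."""
    if candidate == tagged:                      # identical relationship
        return BASE
    if frozenset((candidate, tagged)) in CONFLICTS:  # conflicting relationship
        return 0.0
    up_c, up_t = ancestors(candidate), ancestors(tagged)
    if tagged in up_c:                           # generic-specific relationship
        return BASE / (1 + up_c.index(tagged))
    if candidate in up_t:
        return BASE / (1 + up_t.index(candidate))
    # Homologous relationship: hops from each type to the nearest common source node.
    common = next(n for n in up_c if n in up_t)
    return BASE / (1 + up_c.index(common) + up_t.index(common))


def node_score(candidate, tagged_types):
    """Collapse the attribute scores of one LOD node; max is an assumed noise filter."""
    return max(attribute_score(candidate, t) for t in tagged_types)


def possibility(candidate, lod_nodes):
    """Average the node scores over all LOD nodes of the entity (assumed statistic)."""
    return mean(node_score(candidate, types) for types in lod_nodes.values())


def classify(candidates, lod_nodes, threshold=0.5):
    """Keep the candidate types whose possibility meets the threshold."""
    scores = {c: possibility(c, lod_nodes) for c in candidates}
    return {c: s for c, s in scores.items() if s >= threshold}


# Hypothetical entity with two LOD nodes, each carrying its own type attributes.
LOD_NODES = {
    "http://example.org/a": ["Athlete", "Person"],
    "http://example.org/b": ["Artist"],
}
RESULT = classify(["Person", "Athlete", "Place"], LOD_NODES)
print(RESULT)
```

With these assumed formulas, "Person" and "Athlete" pass the 0.5 threshold while "Place" does not, because its only links to the tagged types are a declared conflict or a distant common source node.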
