×

Method for named-entity recognition and verification

  • US 7,171,350 B2
  • Filed: 08/26/2002
  • Issued: 01/30/2007
  • Est. Priority Date: 05/03/2002
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for named-entity recognition and verification, comprising the steps of:

  • (A) segmenting text data from an article into at least one to-be-tested segments according to a text window;

    (B) parsing the to-be-tested segments to remove ill-formed segments from the to-be-tested segments according to a predefined grammar;

    (C) using a hypothesis test to assess a confidence measure of each to-be-tested segment, wherein the confidence measure is determined from dividing a probability P

    ( o



    L , x L , 1
    , o

    C , y C , 1
    , o

    R , z R , 1
    | H 0
    )
    of assuming that the to-be-teated tested segment has a named-entity by a probability P

    ( o



    L , x L , 1
    , o

    C , y C , 1
    , o

    R , z R , 1
    | H 1
    )
    of assuming that the to-be-tested segment doesn'"'"'t have a named-entity, where O

    C , y C , 1
    is a candidate, O

    L , x L , 1
    is the left context of the candidate, and O

    R , z R , 1
    is the right context of the candidate; and

    (D) determining that the to-be-tested segment has a named-entity if the confidence measure is greater than a predefined threshold, wherein the confidence measure is expressed by a log likelihood ratio, LLR

    ( O

    L , x L , 1
    , O

    C , y C , 1
    , O

    R , z R , 1
    )
    = log

    P

    ( O

    L , x L , 1
    , O

    C , y C , 1
    , O

    R , z R , 1
    | H 0
    )
    P

    ( O

    L , x L , 1
    , O

    C , y C , 1
    , O

    R , z R , 1
    | H 1
    )
    .

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×