Structured term recognition

  • US 10,339,214 B2
  • Filed: 11/02/2012
  • Issued: 07/02/2019
  • Est. Priority Date: 11/04/2011
  • Status: Active Grant
First Claim

1. A computer implemented method of recognizing types of terms in a specified corpus, comprising:

  • providing a set of known terms t ∈ T, each of the known terms t belonging to a given set of types Γ(t) = {γ1, γ2, . . . }, wherein each of the terms is comprised of a list of words, t = w1, w2, . . . , wn, and the union of all the words w for all the terms t is a word set W;

  • forming, by a clustering component of a computer system, a multitude of clusters of words from the words in W;

  • using, by a mapping determining component of the computer system, the set of known terms T and the given set of types Γ to determine a set of pattern-to-type mappings {p1→γ1, p2→γ2, . . . }, each of the pattern-to-type mappings p→γ mapping an associated sequence of the clusters of words to one or more of the given set of types {γ1, γ2, . . . };

  • using, by a term recognition component of the computer system, the determined set of pattern-to-type mappings to recognize corpus terms in the specified corpus, each of the corpus terms being comprised of two or more words; and

  • for each of the recognized corpus terms in the specified corpus, using, by a word mapping component of the computer system, a specified context in the corpus of at least a plurality of the words of said each corpus term to map said plurality of words to one of the sequences of the clusters of words, and using, by a type recognition component of the computer system, one of the determined pattern-to-type mappings to map said one of the sequences of the clusters of words to one or more of the types γ of the given set of types {γ1, γ2, . . . } to recognize said one or more of the types γ of the given set of types for said each recognized corpus term to boost performance of term recognition systems based on dictionary lookup while extending coverage of ontologies.
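
The steps recited in the claim can be illustrated with a minimal Python sketch. It is not the patented implementation. The toy terms, the type names, the cluster labels, and the helper functions `to_pattern`, `learn_mappings`, and `recognize` are all hypothetical. Two simplifications are assumed: the word clusters are supplied as a fixed lookup table rather than formed by a clustering component, and each word is mapped to its cluster by direct lookup rather than by using its context in the corpus.

```python
from collections import defaultdict

# Hypothetical known terms T, each with its set of types Γ(t).
KNOWN_TERMS = {
    "left lung": {"BodyPart"},
    "right kidney": {"BodyPart"},
    "lung cancer": {"Disease"},
    "kidney failure": {"Disease"},
}

# Assumed result of the clustering step: word -> cluster label.
# "liver" appears in no known term; it stands in for a word that a
# real clustering step might group with "lung" and "kidney".
WORD_CLUSTER = {
    "left": "SIDE", "right": "SIDE",
    "lung": "ORGAN", "kidney": "ORGAN", "liver": "ORGAN",
    "cancer": "CONDITION", "failure": "CONDITION",
}

def to_pattern(words):
    """Map words to a sequence of clusters; None if any word has no cluster."""
    pattern = tuple(WORD_CLUSTER.get(w) for w in words)
    return None if None in pattern else pattern

def learn_mappings(known_terms):
    """Derive pattern-to-type mappings p -> γ from the known terms."""
    mappings = defaultdict(set)
    for term, types in known_terms.items():
        pattern = to_pattern(term.split())
        if pattern is not None:
            mappings[pattern] |= types
    return dict(mappings)

def recognize(tokens, mappings, max_len=3):
    """Find spans of two or more words whose cluster sequence is a known pattern."""
    found = []
    for start in range(len(tokens)):
        last = min(start + max_len, len(tokens))
        for end in range(start + 2, last + 1):
            span = tokens[start:end]
            pattern = to_pattern(span)
            if pattern in mappings:
                found.append((" ".join(span), mappings[pattern]))
    return found

mappings = learn_mappings(KNOWN_TERMS)
result = recognize("the patient has left liver failure".split(), mappings)
print(result)
```

In this sketch "left liver" and "liver failure" are typed as `BodyPart` and `Disease` even though neither is in the known-term dictionary, which is the kind of coverage extension beyond dictionary lookup that the claim describes.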
