×

SEMI-SUPERVISED PART-OF-SPEECH TAGGING

  • US 20090157384A1
  • Filed: 12/12/2007
  • Published: 06/18/2009
  • Est. Priority Date: 12/12/2007
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • receiving a text comprising a sequence of words;

    selecting a word from the text;

    identifying features of the selected word, the features comprising a suffix of the selected word;

    applying the features of the selected word to a model to identify probabilities for sets of part-of-speech tags, at least one set of part-of-speech tags comprising at least two part-of-speech tags, each part-of-speech tag representing a part-of-speech;

    using the probabilities for sets of part-of-speech tags to weight scores for possible part-of-speech tags for the selected word to form weighted scores;

    using the weighted scores to select a part-of-speech tag for the selected word; and

    storing the selected part-of-speech tag for the selected word.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×