×

LEMMATIZING, STEMMING, AND QUERY EXPANSION METHOD AND SYSTEM

  • US 20100082333A1
  • Filed: 06/01/2009
  • Published: 04/01/2010
  • Est. Priority Date: 05/30/2008
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method of stemming text comprising:

  • removing stop words from a document based on at least one stop word entry in an array of stop words;

    flagging as nouns words determined to be attached to definite articles and preceded by a noun array entry in an array of stop words preceding at least one noun;

    adding flagged nouns to a noun dictionary;

    flagging as verbs words determined to be preceded by an verb array entry in an array of stop words preceding at least one verb;

    adding flagged verbs to a verb dictionary;

    searching the document for nouns and verbs based on the flagged nouns and the flagged verbs;

    removing remaining stop words subsequent to searching the document;

    applying light stemming on the flagged nouns;

    applying a root-based stemming on the flagged verbs; and

    storing the stemmed document.

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×