×

Method and system for classifying text

  • US 20100094875A1
  • Filed: 08/11/2009
  • Published: 04/15/2010
  • Est. Priority Date: 08/11/2008
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • creating a data structure using a data structure generation engine by identifying a plurality of words and mapping each word to one or more categories and storing the data structure in one or more databases;

    indexing the data structure using an index generation engine;

    identifying an item of content; and

    classifying the item of content using a classification engine based on the data structure, the classifying comprising;

    identifying all one—

    or more—

    word combinations in the item of content;

    for each word of at least a pre-determined number of characters in length in each of the word combinations, identifying each of the categories to which it is mapped; and

    determining a weight for each of the words based on an inverse proportion to the number of categories to which it is mapped.

View all claims
  • 11 Assignments
Timeline View
Assignment View
    ×
    ×