System and method for utilizing distance measures to perform text classification
First Claim
1. A system for performing text classification, comprising:
- text classification categories that each include reference models of reference N-grams;
input text that includes input N-grams upon which said text classification is performed; and
a text classifier that calculates distance measures between said input N-grams and said reference N-grams, said text classifier utilizing said distance measures to identify a matching category for said input text.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method for utilizing distance measures to perform text classification includes text classification categories that each have reference models of reference N-grams. Input text that includes input N-grams is accessed for performing the text classification. A text classifier calculates distance measures between the input N-grams and the reference N-grams. The text classifier then utilizes the distance measures to identify a matching category for the input text. In certain embodiments, a verification module performs a verification procedure to determine whether the initially-selected matching category is a valid classification result for the text classification.
88 Citations
41 Claims
-
1. A system for performing text classification, comprising:
-
text classification categories that each include reference models of reference N-grams;
input text that includes input N-grams upon which said text classification is performed; and
a text classifier that calculates distance measures between said input N-grams and said reference N-grams, said text classifier utilizing said distance measures to identify a matching category for said input text. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A method for performing text classification, comprising:
-
providing text classification categories that each include reference models of reference N-grams;
accessing input text that includes input N-grams upon which said text classification is performed;
calculating distance measures between said input N-grams and said reference N-grams; and
utilizing said distance measures to identify a matching category for said input text. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40)
-
-
41. A system for performing text classification, comprising:
-
means for providing text classification categories that each include reference models of reference N-grams;
means for accessing input text that includes input N-grams upon which said text classification is performed;
means for calculating distance measures between said input N-grams and said reference N-grams; and
means for utilizing said distance measures to identify a matching category for said input text.
-
Specification