Sub-model generation to improve classification accuracy
First Claim
1. A method of classifying text input for use with a natural language understanding system, the method comprising:
- via a processor, determining classification information comprising a primary classification and at least one secondary classification for a received text input using a statistical classification model (statistical model);
via the processor, selectively building a statistical classification sub-model (statistical sub-model) according to whether the classification information conforms to an accuracy requirement;
via the processor, selecting the primary classification or the at least one secondary classification for the text input as a final classification according to the statistical sub-model; and
via the processor, outputting the final classification for the text input.
1 Assignment
0 Petitions
Accused Products
Abstract
A method of classifying text input for use with a natural language understanding system can include determining classification information including a primary classification and one or more secondary classifications for a received text input using a statistical classification model (statistical model). A statistical classification sub-model (statistical sub-model) can be selectively built according to a model generation criterion applied to the classification information. The method further can include selecting the primary classification or the secondary classification for the text input as a final classification according to the statistical sub-model and outputting the final classification for the text input.
-
Citations
20 Claims
-
1. A method of classifying text input for use with a natural language understanding system, the method comprising:
-
via a processor, determining classification information comprising a primary classification and at least one secondary classification for a received text input using a statistical classification model (statistical model); via the processor, selectively building a statistical classification sub-model (statistical sub-model) according to whether the classification information conforms to an accuracy requirement; via the processor, selecting the primary classification or the at least one secondary classification for the text input as a final classification according to the statistical sub-model; and via the processor, outputting the final classification for the text input. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method of improving classification accuracy of text input for use with a natural language understanding system, the method comprising:
-
via a processor, processing a plurality of text inputs using a statistical classification model (statistical model) and a statistical classification sub-model (statistical sub-model), wherein the statistical model comprises a plurality of classes and the statistical sub-model comprises a subset of the plurality of classes; via the processor, determining a usage frequency of the statistical sub-model; via the processor, comparing the usage frequency with a minimum frequency threshold level; via the processor, merging the subset of the plurality of classes into a single, merged class within the statistical model when the usage frequency exceeds the minimum frequency threshold level; and via the processor, outputting an updated statistical model specifying the merged class. - View Dependent Claims (11, 12)
-
-
13. A computer program product comprising:
-
a computer-readable storage comprising computer-usable program code stored thereon that classifies text input for use with a natural language understanding system, the computer program product comprising; computer-usable program code that determines classification information comprising a primary classification and at least one secondary classification for a received text input using a statistical classification model (statistical model); computer-usable program code that selectively builds a statistical classification sub-model (statistical sub-model) according to whether the classification information conforms to an accuracy requirement; computer-usable program code that selects the primary classification or the at least one secondary classification for the text input as a final classification according to the statistical sub-model; and computer-usable program code that outputs the final classification for the text input, wherein the computer-readable storage is not a transitory, propagating signal per se. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
Specification