Method and system for using rule-based knowledge to build a class-based domain specific statistical language model
First Claim
1. A speech processing method comprising:
- manually defining a set of classes for words/phrases, each class being associated with a class-tag;
replacing words of domain-related rules with corresponding class-tags to generate class-based domain-related rules;
generating a class-based rule net using the class-based domain-related rules;
obtaining a class-based statistical representation from the class-based rule net; and
combining the class-based statistical representation with a class-based statistical representation from a statistical language model to generate a final language model.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and system for providing a class-based statistical language model representation from rule-based knowledge is disclosed. The class-based language model is generated from a statistical representation of a class-based rule net. A class-based rule net is generated using the domain-related rules with words replaced with their corresponding class-tags that are manually defined. The class-based statistical representation from the class-based rule net is combined with a class-based statistical representation from a statistical language model to generate a language model. The language model is enhanced by smoothing/adapting with general-purpose and/or domain-related corpus for use as the final language model. A two-pass search algorithm is applied for speech decoding.
-
Citations
21 Claims
-
1. A speech processing method comprising:
-
manually defining a set of classes for words/phrases, each class being associated with a class-tag; replacing words of domain-related rules with corresponding class-tags to generate class-based domain-related rules; generating a class-based rule net using the class-based domain-related rules; obtaining a class-based statistical representation from the class-based rule net; and combining the class-based statistical representation with a class-based statistical representation from a statistical language model to generate a final language model. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system comprising:
-
a memory to store a statistical language model representation of a class-tag based rule net; and a processor to enable a user to manually define a set of classes for words/phrases, each class being associated with a class-tag, to replace words of domain-related rules with corresponding class-tags to generate class-based domain-related rules, to generate a class-based rule net using the class-based domain-related rules, to obtain a class-based statistical representation from the class-based rule net, and to combine the class-based statistical representation with a class-based statistical representation from a statistical language model to generate a final language model. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A machine readable storage medium having stored therein a plurality of machine executable instructions, wherein when executed, the instructions perform operations comprising:
-
manually defining a set of classes for words/phrases, each class being associated with a class-tag; replacing words of domain-related rules with corresponding class-tags to generate class-based domain-related rules; generating a class-based rule net using the class-based domain-related rules; obtaining a class-based statistical representation from the class-based rule net; and combining the class-based statistical representation with a class-based statistical representation from a statistical language model to generate a final language model. - View Dependent Claims (16, 17, 18, 19, 20, 21)
-
Specification