Search engine for phrase recognition based on prefix/body/suffix architecture
First Claim
1. A method implemented in a digital processing system of constructing a language model in a speech recognition system, comprising:
- receiving speech signals into a processor;
storing a plurality of phrases into a plurality of words in a prefix word, body word, and suffix word structure, wherein each of the phrases has a body word and optionally a prefix word and optionally a suffix word;
grouping the words into a plurality of prefix word classes, a plurality of body word classes, and a plurality of suffix word classes in accordance with a set of predetermined linguistic rules, wherein each of the respective prefix, body, and suffix word classes includes a number of prefix words of a first category, a number of body words of a second category, and a number of suffix words of a third category, respectively;
storing data elements representing interconnections among the prefix, body, and suffix word classes together according to the predetermined linguistic rules, wherein the language model generates signals representative of the received speech signals during a phrase-based search.
3 Assignments
0 Petitions
Accused Products
Abstract
A method of constructing a language model for a phrase-based search in a speech recognition system and an apparatus for constructing and/or searching through the language model. The method includes the step of separating a plurality of phrases into a plurality of words in a prefix word, body word, and suffix word structure. Each of the phrases has a body word and optionally a prefix word and a suffix word. The words are grouped into a plurality of prefix word classes, a plurality of body word classes, and a plurality of suffix word classes in accordance with a set of predetermined linguistic rules. Each of the respective prefix, body, and suffix word classes includes a number of prefix words of same category, a number of body words of same category, and a number of suffix words of same category, respectively. The prefix, body, and suffix word classes are then interconnected together according to the predetermined linguistic rules. A method of organizing a phrase search based on the above-described prefix/body/suffix language model is also described. The words in each of the prefix, body, and suffix classes are organized into a lexical tree structure. A phrase start lexical tree structure is then created for the words of all the prefix classes and the body classes having a word which can start one of the plurality of phrases while still maintaining connections of these prefix and body classes within the language model.
-
Citations
32 Claims
-
1. A method implemented in a digital processing system of constructing a language model in a speech recognition system, comprising:
-
receiving speech signals into a processor; storing a plurality of phrases into a plurality of words in a prefix word, body word, and suffix word structure, wherein each of the phrases has a body word and optionally a prefix word and optionally a suffix word; grouping the words into a plurality of prefix word classes, a plurality of body word classes, and a plurality of suffix word classes in accordance with a set of predetermined linguistic rules, wherein each of the respective prefix, body, and suffix word classes includes a number of prefix words of a first category, a number of body words of a second category, and a number of suffix words of a third category, respectively; storing data elements representing interconnections among the prefix, body, and suffix word classes together according to the predetermined linguistic rules, wherein the language model generates signals representative of the received speech signals during a phrase-based search. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method of organizing a phrase search in a speech recognition system having a language model comprising a plurality of prefix classes each having a number of prefix words of a first category, a plurality of body classes each having a number of body words of a second category, and a plurality of suffix classes each having a number of suffix words of a third category, wherein the classes are formed and interconnected together in accordance with a set of predetermined linguistic rules, wherein the method comprises the steps of:
-
receiving speech signals into a processor; organizing a plurality of phrases into a prefix, body and suffix structure, wherein each of the phrases has a body word and optionally a prefix word and a suffix word; organizing the words in each of the prefix, body, and suffix classes into a lexical tree structure; creating a phrase start lexical tree structure for the words of all the prefix classes and the body classes having a word from which can start one of the plurality of phrases while still maintaining connections of these prefix and body classes within the language model, wherein the language model generates signals representative of the received speech signals. - View Dependent Claims (8, 9, 10)
-
-
11. An apparatus for constructing a search engine for a phrase-based search in a speech recognition system, comprising:
-
means for receiving speech signals into a processor; means for providing a language model for a plurality of phrases that are arranged in a prefix, body, and suffix structure, wherein each of the phrases has a body word and optionally a prefix word and a suffix word, wherein the language model includes a plurality of prefix classes each having a number of prefix words, a plurality of body classes each having a number of body words, and a plurality of suffix classes each having a number of suffix words, wherein the classes are formed and interconnected together according to a set of predetermined linguistic rules; means for organizing the words in each of the prefix, body, and suffix classes into a lexical tree structure; means for creating a phrase start lexical tree structure for the words of all the prefix classes and the body classes having a word which can start one of the phrases while maintaining interconnections of these prefix and body classes within the language model, wherein signals representative of the received speech signals are generated in response to the phrase-based search. - View Dependent Claims (12, 13, 14)
-
-
15. A method of recognizing speech in a speech recognition system having a processor and a memory, said method comprising:
-
storing at least a portion of a language model in said memory, said language model for a plurality of phrases that are arranged in a prefix, body, and suffix structure in accordance with a set of predetermined linguistic rules, wherein each of the phrases has a body word and optionally a prefix word and optionally a suffix word, and wherein the language model includes a plurality of prefix classes each having a number of prefix words of a first category, a plurality of body classes each having a number of body words of a second category, and a plurality of suffix classes each having a number of suffix words of a third category, and wherein said classes are interconnected with interconnections in accordance with said set of predetermined linguistic rules; receiving a speech utterance which includes at least one particular phrase; performing an acoustic matching process on a representation of said speech utterance by using said language model. - View Dependent Claims (16, 17, 18, 19, 20)
-
-
21. A speech recognition apparatus for recognizing speech, said apparatus comprising:
-
a transducer which receives a speech utterance which includes at least one phrase; a processor coupled to said transducer to receive a digitized representation of said speech utterance; a memory coupled to said processor, said memory storing at least a portion of a language model for a plurality of phrases that are arranged in a prefix, body, and suffix structure in accordance with a set of predetermined linguistic rules, wherein each of the phrases has a body word and optionally a prefix word and optionally a suffix word, and wherein the language model includes a plurality of prefix classes each having a number of prefix words of a first category, a plurality of body classes each having a number of body words of a second category, and a plurality of suffix classes each having a number of suffix words of a third category, and wherein said classes are interconnected with interconnections in accordance with said set of predetermined linguistic rules, and wherein said processor performs an acoustic matching process on said digitized representation of said speech utterance by using said language model. - View Dependent Claims (22, 23, 24, 25, 26)
-
-
27. A computer readable medium containing executable instructions which, when executed in a processing system, causes the processing system to perform a method for recognizing speech comprising:
-
receiving speech signals into a processor; processing the received speech signals using a phrase-based language model wherein; a plurality of phrases are stored into a plurality of words in a prefix word, body word, and suffix word structure, wherein each of the phrases has a body word and optionally a prefix word and optionally a suffix word; the plurality of words are grouped into a plurality of prefix word classes, a plurality of body word classes, and a plurality of suffix word classes in accordance with a set of predetermined linguistic rules, wherein each of the respective prefix, body, and suffix word classes includes a number of prefix words of a first category, a number of body words of a second category, and a number of suffix words of a third category, respectively; data elements representing interconnections among the prefix, body, and suffix word classes are stored together according to the predetermined linguistic rules; and providing output signals representative of the received speech signals. - View Dependent Claims (28, 29, 30, 31, 32)
-
Specification