Multi-phoneme streamer and knowledge representation speech recognition system and method
First Claim
1. A method of processing speech, comprising:
- generating a list of candidate words for at least one set of phonemes, each candidate word having a pronunciation boundary, from a phoneme analysis of a received speech input;
permuting at least one member of the list of candidate words for the at least one set of phonemes to generate a plurality of potential syntactic structures which are valid in accordance with a set of syntactic rules, while respecting pronunciation boundaries of the candidate words;
generating a plurality of valid syntactic sequences of words from the permuted candidate words and potential syntactic structures;
processing a speech input to identify a plurality of syntactic sequences of words, the syntactic sequences of words comprising the candidate words, the candidate words and the syntactic sequences of words each having at least one associated part of speech;
deriving one or more conceptual representations lion at least one of the syntactic sequences of words; and
formulating one or more responses to the speech input based on at least one conceptual representation.
3 Assignments
0 Petitions
Accused Products
Abstract
A system and method related to a new approach to speech recognition that reacts to concepts conveyed through speech. In its fullest implementation, the system and method shifts the balance of power in speech recognition from straight sound recognition and statistical models to a more powerful and complete approach determining and addressing conveyed concepts. This is done by using a probabilistically unbiased multi-phoneme recognition process, followed by a phoneme stream analysis process that builds the list of candidate words derived from recognized phonemes, followed by a permutation analysis process that produces sequences of candidate words with high potential of being syntactically valid, and finally, by processing targeted syntactic sequences in a conceptual analysis process to generate the utterance'"'"'s conceptual representation that can be used to produce an adequate response. The invention can be employed for a myriad of applications, such as improving accuracy or automatically generating punctuation for transcription and dictation, word or concept spotting in audio streams, concept spotting in electronic text, customer support, call routing and other command/response scenarios.
75 Citations
290 Claims
-
1. A method of processing speech, comprising:
-
generating a list of candidate words for at least one set of phonemes, each candidate word having a pronunciation boundary, from a phoneme analysis of a received speech input; permuting at least one member of the list of candidate words for the at least one set of phonemes to generate a plurality of potential syntactic structures which are valid in accordance with a set of syntactic rules, while respecting pronunciation boundaries of the candidate words; generating a plurality of valid syntactic sequences of words from the permuted candidate words and potential syntactic structures; processing a speech input to identify a plurality of syntactic sequences of words, the syntactic sequences of words comprising the candidate words, the candidate words and the syntactic sequences of words each having at least one associated part of speech; deriving one or more conceptual representations lion at least one of the syntactic sequences of words; and formulating one or more responses to the speech input based on at least one conceptual representation. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 195, 196)
-
-
158. A system for processing speech, comprising:
-
a phoneme analyzer, receiving a speech input, generating a list of candidate words for at least one set of phonemes, each candidate word having a pronunciation boundary, the candidate words being permuted to generate a plurality of potential syntactic structures which are valid in accordance with a set of syntactic rules, the candidate words and plurality of potential syntactic structures each having an associated part of speech; means for identifying a plurality of syntactic sequences of words from the potential syntactic structures and candidate words; means for deriving one or more conceptual representations from at least one of the syntactic sequences of words; and means for formulating one or more responses to the speech input based on one or more of the conceptual representations. - View Dependent Claims (159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255, 256, 257, 258, 259, 260, 261, 262, 263, 264, 265, 266, 267, 268, 269, 270, 271, 272, 273, 274, 275, 276, 277, 278, 279)
-
-
280. A system for processing speech, comprising:
-
an input for receiving a speech input; a processor, receiving a set of phonemes derived from a speech input, generating a set of candidate words having respective pronunciation boundaries from the set of phonemes, permuting the candidate words to produce a plurality of syntactically valid potential syntactic structures, the candidate words and plurality of potential syntactic structures each having an associated part of speech identifying a plurality of syntactic sequences of words from the potential syntactic structures and candidate words, deriving at least one conceptual representation from at least one of the syntactic sequences of words, and formulating at least one response to the speech input based on one or more of the conceptual representations; and an output, for communicating a signal responsive to the at least one response. - View Dependent Claims (281)
-
-
282. A method for processing speech, comprising:
-
receiving a speech input; deriving a set of phonemes from the speech input; generating a set of candidate words having respective pronunciation boundaries from the set of phonemes; permuting the candidate words to produce a plurality of syntactically valid potential syntactic structures, the candidate words and plurality of potential syntactic structures each having an associated part of speech; identifying a plurality of syntactic sequences of words from the potential syntactic structures and candidate words; deriving at least one conceptual representation from at least one of the syntactic sequences of words; formulating at least one response to the speech input based on one or more of the conceptual representations; and communicating a signal responsive to the at least one response. - View Dependent Claims (283)
-
-
284. A method for processing speech, comprising;
-
receiving an input comprising speech; identifying a list of candidate words constructed from a sequence of phonemes, wherein at least some of the candidate words are alternative candidate words corresponding to the same portion or an overlapping portion of the input, each respective candidate word having a pronunciation boundary and a respective part of speech; permuting the candidate words to create a plurality of potential syntactic structures, wherein at least some of the plurality of potential syntactic structures is selected as corresponding to the input and having a respective, part or parts of speech; syntactically validating the potential syntactic structures to render syntactically valid sequences of words; generating a plurality of valid syntactic sequences of words from the permuted candidate words and potential syntactic structures; deriving conceptual representations of syntactically valid sequences of words; and formulating at least one response to the input based on the conceptual representations. - View Dependent Claims (285, 286, 287, 288, 289, 290)
-
Specification