System and method for user-specific speech recognition
Abstract
The systems and methods described herein may recognize natural language utterances that include queries and/or commands and execute the queries and/or commands based on user-specific profiles. The systems and methods described herein may include a complete speech-based information query, retrieval, presentation and command environment that makes significant use of context, prior information, domain knowledge, and the user-specific profiles to achieve a natural environment for one or more users making queries or commands in multiple domains. Through this integrated approach, a complete speech-based natural language query and response environment can be created and tailored to specific users. For example, the systems and methods described herein may create, store, and use extensive personal profile information for different users, thereby improving the reliability of determining the context and presenting the results that the specific users may expect for a particular question or command.
22 Claims
1. A method for user-specific speech recognition, comprising:
receiving, at an input device coupled to a computer, a natural language utterance in a current dialog;
comparing, at a speech recognition engine that executes on the computer, voice characteristics associated with the utterance to unique speech characteristics contained in one or more user profiles to determine an identity associated with a user that spoke the utterance;
seeding the speech recognition engine with data in one or more dictionary and phrase tables, wherein the data seeding the speech recognition engine includes prior probabilities or fuzzy possibilities that are dynamically updated based on the determined user identity and a history associated with the current dialog;
determining, at the speech recognition engine, that the utterance contains one or more words that were unrecognized or incorrectly recognized in response to a recognition associated with the utterance having a confidence level that does not meet or exceed a predetermined value;
requesting, by the speech recognition engine, the user to spell the one or more unrecognized or incorrectly recognized words using a phonetic alphabet, wherein the user provides the phonetic alphabet spelling in one or more subsequent natural language utterances; and
looking up, at the speech recognition engine, one or more words in the one or more dictionary and phrase tables that match the phonetic alphabet spelling to learn a pronunciation associated with the one or more unrecognized or incorrectly recognized words.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
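The last three steps of claim 1 (confidence check, spelling request, dictionary lookup) can be illustrated with a minimal Python sketch. This is not the patented implementation. The claim says only "a phonetic alphabet" and "a predetermined value", so the NATO/ICAO code words, the 0.6 threshold, the function names, and the toy dictionary entry below are all assumptions made for illustration.

```python
# Illustrative sketch only: every name and value here is hypothetical.

# Assumption: the "phonetic alphabet" is the NATO/ICAO spelling alphabet.
_CODE_WORDS = (
    "alfa alpha bravo charlie delta echo foxtrot golf hotel india "
    "juliett juliet kilo lima mike november oscar papa quebec romeo "
    "sierra tango uniform victor whiskey xray yankee zulu"
).split()
NATO = {word: word[0] for word in _CODE_WORDS}  # each code word maps to its first letter

CONFIDENCE_THRESHOLD = 0.6  # assumed stand-in for the claim's "predetermined value"


def needs_spelling(confidence, threshold=CONFIDENCE_THRESHOLD):
    """Return True when confidence does not meet or exceed the threshold."""
    return confidence < threshold


def decode_spelling(utterance):
    """Turn a spoken spelling such as 'tango alfa ...' into a letter string.

    Tokens that are not code words (e.g. 'as', 'in') are ignored.
    """
    tokens = utterance.lower().replace("-", "").split()
    return "".join(NATO[t] for t in tokens if t in NATO)


def learn_pronunciation(spelled_utterance, dictionary):
    """Look up the spelled word; return (word, pronunciation or None)."""
    word = decode_spelling(spelled_utterance)
    return word, dictionary.get(word)


# Toy dictionary table: spelling -> phoneme string (made-up entry).
dictionary = {"tacoma": "T AH K OW M AH"}

if needs_spelling(0.42):
    word, phones = learn_pronunciation(
        "Tango alfa charlie oscar mike alfa", dictionary
    )
    print(word, phones)
```

In this sketch a failed lookup returns `None` for the pronunciation, leaving the caller to decide whether to ask the user again; the claim itself does not say what happens in that case.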
12. A system for user-specific speech recognition, wherein the system comprises a computer device having a speech recognition engine configured to:
compare voice characteristics associated with a natural language utterance received in a current dialog to unique speech characteristics contained in one or more user profiles to determine an identity associated with a user that spoke the utterance;
receive seeding data contained in one or more dictionary and phrase tables, wherein the seeding data includes prior probabilities or fuzzy possibilities that are dynamically updated based on the determined user identity and a history associated with the current dialog;
determine that the utterance contains one or more words that were unrecognized or incorrectly recognized in response to a recognition associated with the utterance having a confidence level that does not meet or exceed a predetermined value;
request the user to spell the one or more unrecognized or incorrectly recognized words using a phonetic alphabet, wherein the user provides the phonetic alphabet spelling in one or more subsequent natural language utterances; and
look up one or more words in the one or more dictionary and phrase tables that match the phonetic alphabet spelling to learn a pronunciation associated with the one or more unrecognized or incorrectly recognized words.
Dependent claims: 13, 14, 15, 16, 17, 18, 19, 20, 21, 22
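The first two operations of claim 12 (identifying the speaker from stored profiles, then seeding the recognizer with updated priors) might be sketched as follows. The claim does not specify a comparison metric or an update rule, so cosine similarity, the 0.8 acceptance threshold, the multiplicative boost, and the profile layout are all assumptions chosen to keep the example small.

```python
import math

# Illustrative sketch only: metric, thresholds, and data layout are hypothetical.


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def identify_speaker(features, profiles, min_similarity=0.8):
    """Return the best-matching user, or None if no profile is close enough."""
    best_user, best_score = None, min_similarity
    for user, profile in profiles.items():
        score = cosine_similarity(features, profile["voice"])
        if score >= best_score:
            best_user, best_score = user, score
    return best_user


def seed_priors(base_priors, profile, dialog_history, boost=2.0):
    """Boost words tied to the user or the current dialog, then renormalize.

    A word that appears in both sources is boosted twice.
    """
    weights = dict(base_priors)
    favored = list(profile.get("frequent_words", [])) + list(dialog_history)
    for word in favored:
        if word in weights:
            weights[word] *= boost
    total = sum(weights.values())
    if total == 0:
        return weights
    return {word: value / total for word, value in weights.items()}


# Toy profiles: a 2-dimensional "voice" vector stands in for real voice features.
profiles = {
    "alice": {"voice": [1.0, 0.0], "frequent_words": ["jazz"]},
    "bob": {"voice": [0.0, 1.0]},
}

user = identify_speaker([0.9, 0.1], profiles)
priors = seed_priors({"jazz": 0.5, "chess": 0.5}, profiles[user], dialog_history=[])
print(user, priors)
```

Renormalizing after the boost keeps the seeded values a valid probability distribution; a fuzzy-possibility variant, which the claim also allows, would not need to sum to one.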
Specification