Speech recognition using selectable recognition modes
First Claim
1. A computerized method of performing speech recognition comprising:
- using speech recognition programming for;
providing a user interface which allows a user to select between generating a first and a second user input;
responding to the generation of the first user input by performing large vocabulary speech recognition on one or more utterances in a prior-language-context-dependent mode, which recognizes at least the first word of an utterance depending in part on a language model context created by a previously recognized word from the previous utterance, if any; and
responding to the generation of the second user input by performing large vocabulary speech recognition on one or more utterances in a prior-language-context-independent mode, which recognizes at least the first word of an utterance substantially independently of any language model context created by a previously recognized word from the previous utterance, if any;
wherein;
as words are recognized by said speech recognition programming in both of said recognition modes such words are output to programming external to said speech recognition programming for use by said external programming; and
the response by said speech recognition programming to said first and second inputs by switching recognition modes is independent of the state of said external programming.
8 Assignments
0 Petitions
Accused Products
Abstract
The present invention relates to speech recognition using selectable recognition modes. This includes innovations such as: large vocabulary speech recognition programming that supplies recognized words to external program as they are recognized, and allows a user to select between large vocabulary recognition of an utterance with and without language context from the prior utterance independently of state of the external program; allowing a user to select between continuous and discrete speech recognition that use substantially the same vocabulary; allowing a user to select between continuous and discrete large-vocabulary speech recognition modes; allowing a user to select between at least two different alphabetic entry speech recognition modes; and allowing a user to select from among four or more of the following recognitions modes when creating text: a large-vocabulary mode, an alphabetic entry mode, a number entry mode, and a punctuation entry mode.
-
Citations
24 Claims
-
1. A computerized method of performing speech recognition comprising:
-
using speech recognition programming for;
providing a user interface which allows a user to select between generating a first and a second user input;
responding to the generation of the first user input by performing large vocabulary speech recognition on one or more utterances in a prior-language-context-dependent mode, which recognizes at least the first word of an utterance depending in part on a language model context created by a previously recognized word from the previous utterance, if any; and
responding to the generation of the second user input by performing large vocabulary speech recognition on one or more utterances in a prior-language-context-independent mode, which recognizes at least the first word of an utterance substantially independently of any language model context created by a previously recognized word from the previous utterance, if any;
wherein;
as words are recognized by said speech recognition programming in both of said recognition modes such words are output to programming external to said speech recognition programming for use by said external programming; and
the response by said speech recognition programming to said first and second inputs by switching recognition modes is independent of the state of said external programming. - View Dependent Claims (2, 3, 4)
-
-
5. A computerized method of performing speech recognition comprising:
-
providing a user interface which allows a user to select between generating a first and a second user input;
responding to the generation of the first user input by selecting a continuous speech recognition mode which performs continuous speech recognition on speech sounds using a given vocabulary;
responding to the generation of the second user input by selecting a discrete recognition mode which performs discrete recognition on speech sounds using substantially the same given vocabulary; and
responding to speech sounds by performing recognition upon them using the currently selected speech recognition mode;
wherein the user can switch between the use of continuous and discrete recognition by selecting one of said user inputs. - View Dependent Claims (6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A computerized method of performing speech recognition comprising:
-
providing a user interface which allows a user to select between generating a first and a second user input;
responding to the generation of the first user input by selecting a continuous speech recognition mode which performs large vocabulary continuous speech recognition on speech sounds;
responding to the generation of the second user input by selecting a discrete recognition mode which performs large vocabulary discrete recognition on speech sounds; and
responding to speech sounds by performing large vocabulary recognition upon them using the currently selected speech recognition mode;
wherein the user can switch between the use of continuous and discrete large vocabulary recognition by selecting one of said user inputs.
-
-
15. A computerized method of performing speech recognition comprising:
-
providing a user interface which allows a user to select between generating a first and a second user input;
responding to the generation of the first user input by switching to a first recognition mode that recognizes one or more utterances as one or more words in a first alphabetic entry vocabulary; and
responding to the generation of the second user input by switching to a second recognition mode that recognizes one or more utterances as one or more words in a second, different, alphabetic entry vocabulary;
wherein the first and second alphabetic entry vocabularies associated different letter-identifying words with individual letters of the alphabet. - View Dependent Claims (16, 17, 18, 19)
-
-
20. A computerized method of performing speech recognition comprising:
-
providing a user interface which allows a user to select between generating a first, a second, a third, or a fourth user input;
responding to the generation of the first user input by switching to performing speech recognition using a first, general purpose large vocabulary; and
responding to the generation of the second user input by switching to performing speech recognition using a second, alphabetic entry vocabulary;
responding to the generation of the third user input by switching to performing speech recognition using a third, numerical entry vocabulary;
responding to the generation of the fourth user input by switching to performing speech recognition using a fourth, punctuation entry, vocabulary; and
sequentially receiving output in the form of words produced by speech recognizing using different user selected ones of said four vocabularies and placing that output into a common text. - View Dependent Claims (21, 22, 23, 24)
-
Specification