Dictation with incremental recognition of speech
Abstract
A dictation module is described herein which receives and interprets a complete utterance of the user in incremental fashion, that is, one incremental portion at a time. The dictation module also provides rendered text in incremental fashion. The rendered text corresponds to the dictation module's interpretation of each incremental portion. The dictation module also allows the user to modify any part of the rendered text, as it becomes available. In one case, for instance, the dictation module provides a marking menu which includes multiple options by which a user can modify a selected part of the rendered text. The dictation module also uses the rendered text (as modified or unmodified by the user using the marking menu) to adjust one or more models used by the dictation module to interpret the user's utterance.
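The last sentence of the abstract, about adjusting models from the rendered text, can be illustrated with a minimal sketch. It assumes a toy bigram language model with add-one smoothing; the class `BigramModel`, its methods, and the sample vocabulary are hypothetical and are not taken from the patent, which does not specify how the models are adjusted.

```python
from collections import Counter, defaultdict


class BigramModel:
    """Toy language model (an assumption for illustration): word-pair counts."""

    def __init__(self, vocabulary):
        self.vocabulary = set(vocabulary)
        self.counts = defaultdict(Counter)

    def adapt(self, rendered_text):
        """Update counts from text the user accepted or corrected."""
        words = rendered_text.lower().split()
        self.vocabulary.update(words)
        for previous, current in zip(words, words[1:]):
            self.counts[previous][current] += 1

    def probability(self, previous, current):
        """Estimate P(current | previous) with add-one smoothing."""
        row = self.counts[previous]
        return (row[current] + 1) / (sum(row.values()) + len(self.vocabulary))


model = BigramModel(["recognize", "speech", "wreck", "a", "nice", "beach"])
before = model.probability("recognize", "speech")
model.adapt("recognize speech")  # rendered text as confirmed by the user
after = model.probability("recognize", "speech")
```

After adaptation, the confirmed word pair is scored higher than before, which is the effect the abstract describes in general terms.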
20 Claims
1. A method, performed by a computing system, for providing a dictating service, comprising:
receiving a speech signal in response to vocalization, by a user, of an incremental portion of a complete utterance, the speech signal being from a microphone;
interpreting the incremental portion based on the speech signal, to provide recognized speech, prior to the user finishing the complete utterance; and
providing rendered text associated with the recognized speech on an output presentation displayed on a display screen prior to the user finishing the complete utterance, wherein providing the rendered text on the output presentation further comprises modifying a rate at which the rendered text is presented on the output presentation, the rate being modified based on a level of uncertainty associated with each part of the rendered text.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16
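The rate limitation in claim 1 can be pictured with a short sketch. The claim says only that the presentation rate is modified based on uncertainty; the direction chosen here (more uncertain parts are shown after a longer delay), the function name `presentation_schedule`, and the delay parameters are all assumptions made for illustration.

```python
def presentation_schedule(parts, base_delay=0.05, max_extra_delay=0.5):
    """Pair each text part with a display delay in seconds.

    parts: iterable of (text, uncertainty), with uncertainty in [0, 1].
    Assumption: higher uncertainty means a longer delay (a slower rate).
    """
    schedule = []
    for text, uncertainty in parts:
        # Guard against scores that fall outside the expected range.
        clamped = min(max(uncertainty, 0.0), 1.0)
        schedule.append((text, base_delay + clamped * max_extra_delay))
    return schedule


# Hypothetical recognized parts with made-up uncertainty levels.
schedule = presentation_schedule([("hello", 0.0), ("whirled", 1.0), ("peas", 0.5)])
```

A display loop could then wait for each delay before drawing the corresponding part, so that confident text appears quickly and doubtful text appears more slowly.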
17. A computing system, comprising:
at least one processing device; and
memory that comprises computer readable instructions that, when executed by the at least one processing device, cause the at least one processing device to perform acts including:
extracting features from a speech signal, the speech signal being received in response to vocalization, by a user, of an incremental portion of a complete utterance, the speech signal being from a microphone;
interpreting the incremental portion based on the features extracted from the speech signal, prior to the user finishing the complete utterance, to provide recognized speech, the speech signal being acoustically interpreted using an acoustic model and linguistically interpreted using a language model; and
providing rendered text associated with the recognized speech on an output presentation displayed on a display screen prior to the user finishing the complete utterance, wherein providing the rendered text on the output presentation further comprises modifying a rate at which the rendered text is presented on the output presentation, the rate being modified based on a level of uncertainty associated with each part of the rendered text.
Dependent claims: 18
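Claim 17 recites interpretation with both an acoustic model and a language model. One common way to combine two such models is to add their log scores and keep the best candidate; the sketch below assumes that approach, which the claim itself does not prescribe. The function `best_hypothesis`, the `lm_weight` parameter, and all numeric scores are hypothetical.

```python
import math


def best_hypothesis(acoustic_log_likelihoods, language_log_probs, lm_weight=1.0):
    """Pick the candidate text with the highest combined log score.

    Assumption: the combined score is acoustic + lm_weight * linguistic.
    Candidates unknown to the language model get a small floor probability.
    """
    floor = math.log(1e-6)
    scores = {}
    for text, acoustic in acoustic_log_likelihoods.items():
        linguistic = language_log_probs.get(text, floor)
        scores[text] = acoustic + lm_weight * linguistic
    return max(scores, key=scores.get), scores


# Made-up scores: the acoustics slightly favor the second candidate,
# but the language model strongly favors the first.
acoustic = {"recognize speech": math.log(0.4), "wreck a nice beach": math.log(0.6)}
language = {"recognize speech": math.log(0.05), "wreck a nice beach": math.log(0.001)}
best, scores = best_hypothesis(acoustic, language)
```

With these illustrative numbers the language model outweighs the small acoustic preference, so the linguistically likelier candidate is selected.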
19. A computer readable storage device for storing computer readable instructions, the computer readable instructions providing a dictation module when executed by one or more processing devices, the computer readable instructions comprising:
logic configured to present rendered text associated with a vocalization, by a user, of an incremental portion of a complete utterance on a display screen prior to the user finishing the complete utterance, said logic configured to present the rendered text further being configured to modify a rate at which the rendered text is presented, the rate being modified based on a level of uncertainty associated with the rendered text; and
logic configured to present a marking menu on the display screen to the user that provides a plurality of options, the plurality of options giving the user an opportunity to modify any part of the rendered text in different respective ways.
Dependent claims: 20
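The marking menu of claim 19 can be sketched as a set of options, each of which modifies a selected part of the rendered text in a different way. The claim does not list the options, so the four shown here (capitalize, delete, lowercase, uppercase), their assignment to stroke directions, and the function `apply_option` are assumptions for illustration only.

```python
# Hypothetical menu: each stroke direction maps to (label, edit function).
# An edit function returns the new word, or "" to remove the word.
MARKING_MENU = {
    "up": ("Capitalize", str.capitalize),
    "down": ("Delete", lambda word: ""),
    "left": ("Lowercase", str.lower),
    "right": ("Uppercase", str.upper),
}


def apply_option(words, index, direction):
    """Return a new word list with the chosen option applied at index."""
    _label, action = MARKING_MENU[direction]
    edited = list(words)  # leave the caller's list untouched
    replacement = action(edited[index])
    if replacement:
        edited[index] = replacement
    else:
        del edited[index]
    return edited


rendered = ["send", "it", "to", "to", "paris"]
capitalized = apply_option(rendered, 4, "up")
deduplicated = apply_option(capitalized, 3, "down")
```

Returning a new list rather than editing in place keeps earlier versions of the rendered text available, for example to undo a selection.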
Specification