FREE TEXT VOICE TRAINING
First Claim
Patent Images
1. A method for acoustically training a speech recognition engine of a speech recognition software application, the method comprising:
- receiving audio data representing a user'"'"'s voice speaking at least one phrase, the at least one phrase being unknown to the speech recognition engine in both spoken audio and text forms;
the speech recognition engine, using a process performed by a processor, translating the at least one phrase into text form for display to the user; and
receiving a reviewed version of the text form and the speech recognition software application, using a process performed by a processor, converting the reviewed version of the text form into a context free grammar based on text indicated as validated text.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method provide acoustic training of a voice or speech recognition engine and/or voice or speech recognition software application. Instead of requiring a user to read from a prepared or predetermined script, the system and method described herein enable acoustic training using any free text spoken phrases provided by the user directly, or by a previously recorded speech, presentation, or the like, performed by the user.
-
Citations
20 Claims
-
1. A method for acoustically training a speech recognition engine of a speech recognition software application, the method comprising:
-
receiving audio data representing a user'"'"'s voice speaking at least one phrase, the at least one phrase being unknown to the speech recognition engine in both spoken audio and text forms; the speech recognition engine, using a process performed by a processor, translating the at least one phrase into text form for display to the user; and receiving a reviewed version of the text form and the speech recognition software application, using a process performed by a processor, converting the reviewed version of the text form into a context free grammar based on text indicated as validated text. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer-readable storage medium, which is not a signal, with an executable program stored thereon, wherein the executable program instructs a processor to perform a method, the method comprising:
-
receive audio data at a the speech recognition engine, the audio data representing a user'"'"'s voice speaking at least one phrase, the at least one phrase being unknown to the speech recognition engine in both spoken audio and text forms; translate the at least one phrase into text form for display to the user; and receive a reviewed version of the text form and convert the reviewed version of the text form into a context free grammar based on text indicated as validated text. - View Dependent Claims (9, 10, 11, 12, 13)
-
-
14. A speech recognition system that can be acoustically trained with free text audio, the system comprising:
-
a speech recognition software application operating on a computing device having a processor, the speech recognition software application comprising; a speech recognition engine; a comparison module configured to receive an indication of validated text and associate the validated text with at least one word from the free text audio; and a plurality of voice models; wherein upon receipt of a plurality of instances in which validated text is associated with the at least one word from the free text audio, the speech recognition software application selects a subset of voice models of the plurality of voice models in such a way that the subset of voice models shares a plurality of characteristics with the free text audio associated with the validated text. - View Dependent Claims (15, 16, 17, 18, 19, 20)
-
Specification