Two-way speech recognition and dialect system
First Claim
1. A system for transcribing spoken words to text, the system comprising:
- an audio transducer that receives the spoken words and generates speech signals corresponding thereto;
a user interface through which a user can receive and send signals;
at least one data structure containing word data correlating text representations of words to speech signals wherein the at least one data structure includes dialect parameter data that can be used to recognize selected dialects corresponding to selected users;
a processor that receives the speech signals from the audio transducer wherein the processor compares the speech signals to the word data in the at least one data structure to produce text representations corresponding to the spoken words and wherein the processor initially sends signals to the user via the user interface asking the user questions indicative of the user'"'"'s dialect to thereby induce the user to provide answer signals to the processor that are indicative of dialect parameters and wherein the processor compares the dialect parameters from the user to the dialect parameter data in the at least one data structure to determine a dialect corresponding to the user prior to comparing the speech signals to the word data in the at least one data structure to produce text representations corresponding to the spoken words.
4 Assignments
0 Petitions
Accused Products
Abstract
A speech-to-text conversion system. The two-way speech recognition and dialect system comprises a computer system, an attached microphone assembly, and speech-to-text conversion software. The two-way speech recognition and dialect system includes a database of dialectal characteristics and queries a user to determine their likely dialect. The system uses this determination to reduce the time for the system to reliably transcribe a user'"'"'s speech into text and to anticipate dialectal word usage. In another embodiment of the invention, the two-way speech recognition and dialect system is capable of transcribing the speech of multiple speakers while distinguishing between the different speakers and identifying the text belonging to each speaker.
-
Citations
30 Claims
-
1. A system for transcribing spoken words to text, the system comprising:
-
an audio transducer that receives the spoken words and generates speech signals corresponding thereto;
a user interface through which a user can receive and send signals;
at least one data structure containing word data correlating text representations of words to speech signals wherein the at least one data structure includes dialect parameter data that can be used to recognize selected dialects corresponding to selected users;
a processor that receives the speech signals from the audio transducer wherein the processor compares the speech signals to the word data in the at least one data structure to produce text representations corresponding to the spoken words and wherein the processor initially sends signals to the user via the user interface asking the user questions indicative of the user'"'"'s dialect to thereby induce the user to provide answer signals to the processor that are indicative of dialect parameters and wherein the processor compares the dialect parameters from the user to the dialect parameter data in the at least one data structure to determine a dialect corresponding to the user prior to comparing the speech signals to the word data in the at least one data structure to produce text representations corresponding to the spoken words. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
- 14. A machine loadable software program adapted to control the operation of a processor such that the processor translates audio word signals indicative of at least one word spoken by a speaker into corresponding text signals in response to receiving the audio word signals from an audio transducer wherein the software program further induces the processor to obtain dialect parameters from the speaker prior to receiving audio word signals from the speaker by inducing the processor to ask questions of the speaker indicative of the speaker'"'"'s dialect such that the speaker provides the dialect parameters via a user input device associated with the processor and wherein the software program induces the processor to determine the speaker'"'"'s dialect from the dialect parameters prior to translating the audio word signals into corresponding text signals so as to improve the efficiency of the processor in correlating text words to received audio word signals.
-
23. A method of translating spoken words into corresponding text words, the method comprising:
-
obtaining dialect parameter data from a speaker by asking the speaker questions indicative of dialect and then evaluating the answers;
determining a dialect of the speaker from the dialect parameter data;
receiving spoken words from the speaker after having determined the dialect of the speaker; and
using the dialect of the speaker to facilitate correlation of the spoken word to corresponding text words. - View Dependent Claims (24, 25, 26, 27, 28)
-
-
29. A method of translating spoken words into corresponding text words, the method comprising:
-
obtaining dialect parameter data from a plurality of speakers by asking the plurality of speakers questions indicative of dialect and evaluating the answers;
determining the dialects of the speakers from the dialect parameter data;
receiving spoken words from the speakers after having determined the dialects of the speakers;
using the dialects of the speakers to correlate the spoken words to corresponding text words; and
using the dialects of the speakers to identify the text words corresponding to each speaker. - View Dependent Claims (30)
-
Specification