Systems and methods for speech recognition and separate dialect identification
First Claim
1. A system for transcribing spoken words to text, the system comprising:
- at least one data structure containing word data correlating words to speech signals, wherein the at least one data structure includes dialect parameter data that is different than the word data and that can be used to recognize user dialects; and
a processor configured to receive speech signals and to produce text representations corresponding to spoken words, wherein the processor initially sends signals to a user asking the user at least one question indicative of the user'"'"'s dialect to thereby induce the user to provide non-verbal answer signals to the processor that are indicative of dialect parameters, wherein the dialect parameters comprise at least one of an age of the user, a gender of the user, an educational level of the user, a native language of the user, a geographic origin of the user, and a current geographic residence of the user,and wherein the processor compares the dialect parameters from the user to the dialect parameter data in the at least one data structure to determine at least one dialect that the user is likely to have prior to comparing the speech signals to the word data.
1 Assignment
0 Petitions
Accused Products
Abstract
A speech-to-text conversion system. The two-way speech recognition and dialect system comprises a computer system, an attached microphone assembly, and speech-to-text conversion software. The two-way speech recognition and dialect system includes a database of dialectal characteristics and queries a user to determine their likely dialect. The system uses this determination to reduce the time for the system to reliably transcribe a user'"'"'s speech into text and to anticipate dialectal word usage. In another embodiment of the invention, the two-way speech recognition and dialect system is capable of transcribing the speech of multiple speakers while distinguishing between the different speakers and identifying the text belonging to each speaker.
-
Citations
12 Claims
-
1. A system for transcribing spoken words to text, the system comprising:
-
at least one data structure containing word data correlating words to speech signals, wherein the at least one data structure includes dialect parameter data that is different than the word data and that can be used to recognize user dialects; and a processor configured to receive speech signals and to produce text representations corresponding to spoken words, wherein the processor initially sends signals to a user asking the user at least one question indicative of the user'"'"'s dialect to thereby induce the user to provide non-verbal answer signals to the processor that are indicative of dialect parameters, wherein the dialect parameters comprise at least one of an age of the user, a gender of the user, an educational level of the user, a native language of the user, a geographic origin of the user, and a current geographic residence of the user, and wherein the processor compares the dialect parameters from the user to the dialect parameter data in the at least one data structure to determine at least one dialect that the user is likely to have prior to comparing the speech signals to the word data. - View Dependent Claims (2)
-
-
3. A speech recognition system comprising:
-
a means for storing dialect data; a means for processing speech signals into text representations, wherein the processing means receives at least one dialect parameter indicative of at least one of a user'"'"'s age, gender, educational level, native language, geographic origin, and current geographic residence, wherein the processing means compares the at least one dialect parameter to the dialect data to determine at least one dialect that the user is likely to have, and wherein the processing means further compares speech signals to word data to produce text representations corresponding to spoken words of the user, said comparison of the speech signals following said determination of the at least one dialect; and a means for obtaining input from the user, wherein the processing means receives the at least one dialect parameter via the obtaining means, and wherein the means for obtaining input comprises a keyboard.
-
-
4. A speech recognition system comprising:
-
a means for storing dialect data; a means for processing speech signals into text representations, wherein the processing means receives at least one dialect parameter indicative of at least one of a user'"'"'s age, gender, educational level, native language, geographic origin, and current geographic residence, wherein the processing means compares the at least one dialect parameter to the dialect data to determine at least one dialect that the user is likely to have, and wherein the processing means further compares speech signals to word data to produce text representations corresponding to spoken words of the user, said comparison of the speech signals following said determination of the at least one dialect; and an interface means for sending signals to the user, wherein the processing means sends a first signal to the user via the interface means to thereby induce the user to provide a non-verbal answer signal that is indicative of the at least one dialect parameter.
-
-
5. A method of processing speech for a speech recognition system, the method comprising:
-
determining if one of a plurality of dialects has been assigned to a user; if one of the plurality of dialects has not been assigned to the user; prior to receiving speech signals from the user, receiving at least one dialect parameter from the user; comparing the at least one dialect parameter to dialect parameter data to determine at least one of the plurality of dialects that the user is likely to have; receiving speech signals corresponding to spoken words of the user; and using the at least one of the plurality of dialects that the user is likely to have to produce text representations corresponding to the spoken words of the user. - View Dependent Claims (6, 7, 8, 9, 10, 11, 12)
-
Specification