Method and device for speech recognition
First Claim
1. A method for recognizing spoken language comprising the steps of:
- identifying a number of phonemes from a segment of input speech;
interpreting the phonemes as possible word combinations to establish a model of the speech with word and sentence accents according to a standardized pattern;
determining the fundamental tone curve of the input speech;
determining the maximum and minimum values of the fundamental tone curve of the input speech and their respective positions;
determining the maximum and minimum values of the fundamental tone curve of the speech model;
comparing the fundamental tone curve of the input speech and the fundamental tone curve of the speech model to identify a time difference between the maximum and minimum values of the fundamental tone curve of the incoming speech in relation to the maximum and minimum values of the fundamental tone curve of the speech model;
adjusting the intonation pattern of the speech model utilizing the identified time difference to modify the speech model to conform with the dialectal characteristics of the input speech.
5 Assignments
0 Petitions
Accused Products
Abstract
A method and device for recognizing dialectal variations in a language. From an incoming speech is on one hand a speech recognition procedure being performed, and on the other hand the fundamental tone curve being extracted. Out of the speech recognition is created an allophone string which together with the fundamental tone curve is used for the detecting of the maximun and minimum values of the fundamental tone. The recognized speech is compared with a lexicon with orthography and transcription for the finding of suitable word candidates. The found word candidates are further analyzed regarding syntax. This in mentioned way found syntactical and lexical information is used for creating a model of the speech. The fundamental tone outline of the model and the fundamental tone of the speech are compared, at which the maximun and minimum values of the fundamental tones are appointed and a difference between the model and the speech are obtained. The difference is after that influencing the model which is brought to correspond to the given speech. The in mentioned way modelled model is then used for the speech recognition, at which an increased possibility to understand the different dialects of a language in an artificial way is achieved.
-
Citations
20 Claims
-
1. A method for recognizing spoken language comprising the steps of:
-
identifying a number of phonemes from a segment of input speech; interpreting the phonemes as possible word combinations to establish a model of the speech with word and sentence accents according to a standardized pattern; determining the fundamental tone curve of the input speech; determining the maximum and minimum values of the fundamental tone curve of the input speech and their respective positions; determining the maximum and minimum values of the fundamental tone curve of the speech model; comparing the fundamental tone curve of the input speech and the fundamental tone curve of the speech model to identify a time difference between the maximum and minimum values of the fundamental tone curve of the incoming speech in relation to the maximum and minimum values of the fundamental tone curve of the speech model;
adjusting the intonation pattern of the speech model utilizing the identified time difference to modify the speech model to conform with the dialectal characteristics of the input speech. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A device for recognizing spoken language comprising:
-
speech recognition means for identifying a number of phonemes from a segment of input speech; interpretation means for interpreting the phonemes as possible word combinations to establish a model of speech having word and sentence accents according to a standardized pattern; extraction means for extracting a fundamental tone curve of the input speech; first analyzing means for determining the maximum and minimum values of the fundamental tone curve and their respective positions; second analyzing means for determining the maximum and minimum values of the fundamental tone curve of the speech model and their respective positions; comparison means for comparing the input speech with the speech model to identify a time difference between the occurrence of the maximum and minimum values of the fundamental tone curve of the incoming speech in relation to the maximum and minimum values of the fundamental tone curve of the speech model; correction means for adjusting the intonation pattern of the speech model utilizing the identified time difference, to modify the speech model to conform with the dialectal characteristics of the input speech. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
Specification