Speech recognition method on sentences in all languages
First Claim
1. A speech recognition method on sentences in all languages comprising:
- (1) a sentence can be a syllable, a word, a name or a sentence, and M=1000 different voices are prepared;
(2) a pre-processor to delete noise and all time intervals without real signal sampled points, before and after a voice (sentence), between two syllables and two words;
(3) a method to normalize the whole waveform of real signal sampled points of a voice (sentence), using E equal elastic frames (windows) without filter and without overlap over each other, and to transform the whole waveform of real signal sampled points into an equal-sized E×
P matrix of the linear predict coding cepstra (LPCC);
(4) M=1000 different voices are transformed into 1000 different E×
P matrices of linear predict coding cepstra (LPCC) to represent 1000 different databases;
(5) a user pronounces a known sentence, delete noise and all time intervals without real language signal points, before and after the known sentence, between two syllables and two words, and E=12 equal elastic frames normalize the whole waveform of real language signal points into an E×
P matrix of LPCC;
(6) use the distance or weighted distance between the E×
P matrix of LPCC of the known sentence and 1000 different E×
P matrices of LPCC of 1000 different voices representing 1000 different databases to find its closest database, the E×
P matrix of the known sentence is put into its closest database, and similarly, the E×
P matrices of LPCC of all known sentences are put into their closest databases individually;
(7) to classify an unknown sentence, after deletion of noise and time intervals without language signal points, before and after the unknown sentence, between two syllables and two words, the unknown sentence with real language sampled points is transformed into an E×
P matrix of LPCC, the invention uses the distance or weighted distance between the E×
P matrix of LPCC of the unknown sentence and 1000 different E×
P matrices of LPCC of 1000 different voices representing 1000 different databases to find its F closest databases and again uses the distance or weighted distance between the E×
P matrix of LPCC of the unknown sentence and the E×
P matrices of LPCC of the similar known sentences in its F closest databases to find a known sentence to be the unknown sentence; and
(8) if an unknown sentence is not identified, the unknown sentence is pronounced again, its E×
P matrix of LPCC is put into the new closest database, and then it will be identified correctly.
0 Assignments
0 Petitions
Accused Products
Abstract
A speech recognition method on all sentences in all languages is provided. A sentence can be a word, name or sentence. All sentences are represented by E×P=12×12 matrices of linear predict coding cepstra (LPCC) 1000 different voices are transformed into 1000 matrices of LPCC to represent 1000 databases. E×P matrices of known sentences after deletion of time intervals between two words are put into their closest databases. To classify an unknown sentence, use the distance to find its F closest databases and then from known sentences in its F databases, find a known sentence to be the unknown one. The invention needs no samples and can find a sentence in one second using Visual Basic. Any person without training can immediately and freely communicate with computer in any language. It can recognize up to 7200 English words, 500 sentences of any language and 500 Chinese words.
38 Citations
3 Claims
-
1. A speech recognition method on sentences in all languages comprising:
-
(1) a sentence can be a syllable, a word, a name or a sentence, and M=1000 different voices are prepared; (2) a pre-processor to delete noise and all time intervals without real signal sampled points, before and after a voice (sentence), between two syllables and two words; (3) a method to normalize the whole waveform of real signal sampled points of a voice (sentence), using E equal elastic frames (windows) without filter and without overlap over each other, and to transform the whole waveform of real signal sampled points into an equal-sized E×
P matrix of the linear predict coding cepstra (LPCC);(4) M=1000 different voices are transformed into 1000 different E×
P matrices of linear predict coding cepstra (LPCC) to represent 1000 different databases;(5) a user pronounces a known sentence, delete noise and all time intervals without real language signal points, before and after the known sentence, between two syllables and two words, and E=12 equal elastic frames normalize the whole waveform of real language signal points into an E×
P matrix of LPCC;(6) use the distance or weighted distance between the E×
P matrix of LPCC of the known sentence and 1000 different E×
P matrices of LPCC of 1000 different voices representing 1000 different databases to find its closest database, the E×
P matrix of the known sentence is put into its closest database, and similarly, the E×
P matrices of LPCC of all known sentences are put into their closest databases individually;(7) to classify an unknown sentence, after deletion of noise and time intervals without language signal points, before and after the unknown sentence, between two syllables and two words, the unknown sentence with real language sampled points is transformed into an E×
P matrix of LPCC, the invention uses the distance or weighted distance between the E×
P matrix of LPCC of the unknown sentence and 1000 different E×
P matrices of LPCC of 1000 different voices representing 1000 different databases to find its F closest databases and again uses the distance or weighted distance between the E×
P matrix of LPCC of the unknown sentence and the E×
P matrices of LPCC of the similar known sentences in its F closest databases to find a known sentence to be the unknown sentence; and(8) if an unknown sentence is not identified, the unknown sentence is pronounced again, its E×
P matrix of LPCC is put into the new closest database, and then it will be identified correctly. - View Dependent Claims (2, 3)
-
Specification