×

Speech recognition method on sentences in all languages

  • US 20120116764A1
  • Filed: 11/09/2010
  • Published: 05/10/2012
  • Est. Priority Date: 11/09/2010
  • Status: Abandoned Application
First Claim
Patent Images

1. A speech recognition method on sentences in all languages comprising:

  • (1) a sentence can be a syllable, a word, a name or a sentence, and M=1000 different voices are prepared;

    (2) a pre-processor to delete noise and all time intervals without real signal sampled points, before and after a voice (sentence), between two syllables and two words;

    (3) a method to normalize the whole waveform of real signal sampled points of a voice (sentence), using E equal elastic frames (windows) without filter and without overlap over each other, and to transform the whole waveform of real signal sampled points into an equal-sized E×

    P matrix of the linear predict coding cepstra (LPCC);

    (4) M=1000 different voices are transformed into 1000 different E×

    P matrices of linear predict coding cepstra (LPCC) to represent 1000 different databases;

    (5) a user pronounces a known sentence, delete noise and all time intervals without real language signal points, before and after the known sentence, between two syllables and two words, and E=12 equal elastic frames normalize the whole waveform of real language signal points into an E×

    P matrix of LPCC;

    (6) use the distance or weighted distance between the E×

    P matrix of LPCC of the known sentence and 1000 different E×

    P matrices of LPCC of 1000 different voices representing 1000 different databases to find its closest database, the E×

    P matrix of the known sentence is put into its closest database, and similarly, the E×

    P matrices of LPCC of all known sentences are put into their closest databases individually;

    (7) to classify an unknown sentence, after deletion of noise and time intervals without language signal points, before and after the unknown sentence, between two syllables and two words, the unknown sentence with real language sampled points is transformed into an E×

    P matrix of LPCC, the invention uses the distance or weighted distance between the E×

    P matrix of LPCC of the unknown sentence and 1000 different E×

    P matrices of LPCC of 1000 different voices representing 1000 different databases to find its F closest databases and again uses the distance or weighted distance between the E×

    P matrix of LPCC of the unknown sentence and the E×

    P matrices of LPCC of the similar known sentences in its F closest databases to find a known sentence to be the unknown sentence; and

    (8) if an unknown sentence is not identified, the unknown sentence is pronounced again, its E×

    P matrix of LPCC is put into the new closest database, and then it will be identified correctly.

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×