×

Speech recognition for recognizing speaker-independent, continuous speech

  • US 7,089,184 B2
  • Filed: 03/22/2001
  • Issued: 08/08/2006
  • Est. Priority Date: 03/22/2001
  • Status: Active Grant
First Claim
Patent Images

1. A speech recognition device, comprising:

  • an I/O device for accepting a voice stream;

    a frequency domain converter communicating with said I/O device, said frequency domain converter converting said voice stream from a time domain to a frequency domain and generating a plurality of frequency domain outputs;

    a frequency domain output storage communicating with said frequency domain converter, said frequency domain output storage comprising at least two frequency spectrum frame storages for storing at least a current frequency spectrum frame and a previous frequency spectrum frame, with a frequency spectrum frame storage of said at least two frequency spectrum frame storages comprising a plurality of frequency bins storing said plurality of frequency domain outputs;

    a processor communicating with said plurality of frequency bins;

    a memory communicating with said processor;

    a frequency spectrum difference storage in said memory, with said frequency spectrum difference storage storing one or more frequency spectrum differences calculated as a difference between said current frequency spectrum frame and said previous frequency spectrum frame;

    at least one feature storage in said memory for storing at least one feature extracted from said voice stream;

    at least one transneme table in said memory, with said at least one transneme table including a plurality of transneme table entries and with a transneme table entry of said plurality of transneme table entries mapping a predetermined frequency spectrum difference to at least one predetermined transneme of a predetermined verbal language;

    at least one mappings storage in said memory, with said at least one mappings storage storing one or more found transnemes;

    at least one transneme-to-vocabulary database in said memory, with said at least one transneme-to-vocabulary database mapping a set of one or more found transnemes to at least one speech unit of said predetermined verbal language; and

    at least one voice stream representation storage in said memory, with said at least one voice stream representation storage storing a voice stream representation created from said one or more found transnemes;

    wherein said speech recognition device calculates a frequency spectrum difference between a current frequency spectrum frame and a previous frequency spectrum frame, maps said frequency spectrum difference to a transneme table, and converts said frequency spectrum difference to a transneme if said frequency spectrum difference is greater than a predetermined difference threshold, and creates a digital voice stream representation of said voice stream from one or more transnemes thus produced.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×