Arrangement for speech recognition
First Claim
1. A speech recognizer comprising:
- a random access memory;
a downloader for loading decision trees from a set of decision trees into said random access memory;
a vocabulary comprising one or more words of a language;
a divider for dividing at least one word of said vocabulary into subwords;
a transcription generator adapted to process at least one subword, wherein the downloader is adapted to download a subset of the set of decision trees at a time into said random access memory, and the transcription generator is further adapted to generate at least one phoneme transcription for said subword using said subset of the decision trees; and
a combiner for combining generated phoneme transcriptions of the subwords to obtain phoneme transcriptions of said one or more words.
2 Assignments
0 Petitions
Accused Products
Abstract
A speech recognizer comprises a random access memory, a downloader for loading decision trees from a set of decision trees into said random access memory, a vocabulary comprising one or more words of a language, a divider for dividing at least one word of the vocabulary into subwords, and a transcription generator adapted to process at least one subword. The downloader is adapted to download a subset of the set of decision trees at a time into said random access memory. The transcription generator is further adapted to generate at least one phoneme transcription for the subword using the subset of decision trees. The speech recognizer also comprises a combiner for combining the generated phoneme transcriptions of the subwords to obtain phoneme transcriptions of said one or more words. The invention also relates to a device, a system, a module, a method, a computer program product and a data structure.
22 Citations
28 Claims
-
1. A speech recognizer comprising:
-
a random access memory;
a downloader for loading decision trees from a set of decision trees into said random access memory;
a vocabulary comprising one or more words of a language;
a divider for dividing at least one word of said vocabulary into subwords;
a transcription generator adapted to process at least one subword, wherein the downloader is adapted to download a subset of the set of decision trees at a time into said random access memory, and the transcription generator is further adapted to generate at least one phoneme transcription for said subword using said subset of the decision trees; and
a combiner for combining generated phoneme transcriptions of the subwords to obtain phoneme transcriptions of said one or more words. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A device comprising:
-
a random access memory;
a downloader for loading decision trees from a set of decision trees into said random access memory;
a vocabulary comprising one or more words of a language;
a divider for dividing at least one word of said vocabulary into subwords;
a transcription generator adapted to process at least one subword, wherein the downloader is adapted to download a subset of the set of decision trees at a time into said random access memory, and the transcription generator is further adapted to generate at least one phoneme transcription for said subword using said subset of the decision trees; and
a combiner for combining the generated phoneme transcriptions of the subwords to obtain phoneme transcriptions of said one or more words. - View Dependent Claims (8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A wireless communication device comprising:
-
a random access memory;
a downloader for loading decision trees from a set of decision trees into said random access memory;
a vocabulary comprising one or more words of a language;
a divider for dividing at least one word of said vocabulary into subwords;
a transcription generator adapted to process at least one subword, wherein the downloader is adapted to download a subset of the set of decision trees at a time into said random access memory, and the transcription generator is further adapted to generate at least one phoneme transcription for said subword using said subset of the decision trees; and
a combiner for combining the generated phoneme transcriptions of the subwords to obtain phoneme transcriptions of said one or more words.
-
-
21. A system comprising
a server comprising a mass memory for storing a set of decision trees, and a transmitter for transmitting information from the server; a device comprising a receiver for receiving information from the server;
a random access memory;
a downloader for loading decision trees from the set of decision trees from said server into said random access memory;
a vocabulary comprising one or more words of a language;
a divider for dividing at least one word of said vocabulary into subwords;
a transcription generator adapted to process at least one subword, wherein the downloader is adapted to download a subset of the set of decision trees at a time into said random access memory, and the transcription generator is further adapted to generate at least one phoneme transcription for said subword using said subset of the decision trees; and
a combiner for combining the generated phoneme transcriptions of the subwords to obtain phoneme transcriptions of said one or more words.
-
22. A module comprising:
-
a downloader for loading decision trees from a set of decision trees into a random access memory;
a divider for dividing at least one word of said vocabulary into subwords;
a transcription generator adapted to process at least one subword of a vocabulary, said vocabulary comprising one or more words of a language, wherein the downloader is adapted to download a subset of the set of decision trees at a time into said random access memory, and the transcription generator is further adapted to generate at least one phoneme transcription for said subword using said subset of the decision trees; and
a combiner for combining the generated phoneme transcriptions of the subwords to obtain phoneme transcriptions of said one or more words.
-
-
23. A method for generating the phoneme transcriptions of words of a vocabulary of a language comprising:
-
loading decision trees into a random access memory;
processing at least one subword of a vocabulary, wherein the processing comprising downloading a subset of the set of decision trees at a time into said random access memory, and generating at least one phoneme transcription for said subword using said subset of the decision trees; and
combining the generated phoneme transcriptions of the subwords to obtain phoneme transcriptions of said one or more words.
-
-
24. A computer program product for generating the phoneme transcriptions of words of a vocabulary of a language when executed on a processor, the computer program product comprising machine executable steps stored in an addressable memory, the machine executable steps for:
-
loading decision trees into a random access memory;
processing the words of the vocabulary subword-by-subword, wherein the processing comprising downloading a subset of the set of decision trees at a time into said random access memory, and generating at least one phoneme transciption for said subword using said subset of the decision trees; and
combining the generated phoneme transcriptions of the subwords to obtain phoneme transcriptions of said one or more words.
-
-
25. A data structure including words of at least one vocabulary of at least one language for processing subwords of the words of the vocabulary, the data structure comprising:
-
subword and phoneme definitions;
decision trees for single subwords arranged for random access of the decision trees;
the data of the decision trees comprising information for obtaining phoneme transcriptions from subwords. - View Dependent Claims (26)
-
-
27. A method for producing a data structure including words of at least one vocabulary of at least one language for processing subwords of the words of the vocabulary, the method comprising
obtaining subword and phoneme definitions; -
forming decision trees for single subwords on the basis of the phoneme definitions; and
arranging said decision trees for single subwords for random access.
-
-
28. A computer program product for producing a data structure including words of at least one vocabulary of at least one language for processing subwords of the words of the vocabulary when executed on a processor, the computer program product, the computer program product comprising machine executable steps stored in an addressable memory, the machine executable steps for:
-
obtaining subword and phoneme definitions;
forming decision trees for single subwords on the basis of the phoneme definitions; and
arranging said decision trees for single subwords for random access.
-
Specification