Speech processing apparatus and method
First Claim
1. Apparatus for generating and testing speech models, said apparatus comprising:
- a data collection unit operable to collect and store utterance data indicative of the pronunciation of one or more words by one or more speakers;
a speech model generation unit operable to generate speech models of words, utterances of which have been collected by said data collection unit; and
a testing unit operable to test the accuracy of the matching of utterances collected by said data collection unit to speech models generated by said speech model generation unit and to generate a visual display of the results of said testing by said testing unit.
2 Assignments
0 Petitions
Accused Products
Abstract
A computer system is provided including a control module 20 and data collection module 22 which generate user interfaces enabling a user to identify a vocabulary and a number of speakers from whom utterances are to be obtained. The data collection module 22 then co-ordinates the collection of utterance data for the words in the vocabulary from these speakers and stores the data in a speaker database 24. When a satisfactory set of utterances have been collected the utterances are passed to a modal generation module 25 which generates a speech model using the utterances. The speech model is stored by the model generation module 25 in a model database 26. The generated model stored within the model database 26 can then be tested using a testing module 27 and other utterances stored within the speaker database 24. If the performance of the model is unsatisfactory further or different utterances can be used to generate new models for storage within the model database 26. When a speech model is determined to be satisfactory the control module 20 can invoke the output module 28 to output a copy of the model.
-
Citations
25 Claims
-
1. Apparatus for generating and testing speech models, said apparatus comprising:
-
a data collection unit operable to collect and store utterance data indicative of the pronunciation of one or more words by one or more speakers;
a speech model generation unit operable to generate speech models of words, utterances of which have been collected by said data collection unit; and
a testing unit operable to test the accuracy of the matching of utterances collected by said data collection unit to speech models generated by said speech model generation unit and to generate a visual display of the results of said testing by said testing unit. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A method of collecting utterance data comprising the steps of:
-
displaying a first user interface to enable user input of speaker identifiers and storing said speaker identifiers in a speaker database;
displaying a second user interface to enable user input of word identifiers and storing said word identifiers in a vocabulary database;
displaying a series of prompts to prompt the utterance of words corresponding to word identifiers stored in said vocabulary database by speakers identified by speaker identifiers stored in said speaker database; and
synchronising the collection of utterance data indicative of the pronunciation of words with said series of prompts.
-
-
24. Apparatus for collecting utterance data indicative of the pronunciation of one or more words by one or more speakers, the apparatus comprising:
-
a data collection unit operable to collect and store utterance data indicative of the pronunciation of one or more words by one or more speakers;
a vocabulary database operable to store word identifiers indicative of one or more words;
a speaker database operable to store speaker identifiers indicative of speaker from whom utterance data is to be collected; and
a co-ordination unit, said co-ordination unit being operable;
to generate a first user interface to enable user input of speaker identifiers for storage in said speaker database;
to generate a second user interface to enable user input of word identifiers for storage in said vocabulary database; and
to generate a third user interface operable to generate a series of prompts to prompt the utterance of words corresponding to word identifiers stored in said vocabulary database by speakers identified by speaker identifiers stored said speaker database and to synchronise said series of prompts with the collection of utterance data indicative of pronunciation of words.
-
-
25. A method of generating speech models comprising the steps of:
-
providing a computer system operable to collect utterance data, to generate speech models utilising said collected utterance data and to test the accuracy of matching utterances to said generated speech models;
collecting data indicative of the pronunciation of one or more words by one or more speakers utilising said apparatus;
generating speech models utilizing said collected utterances;
determining whether said accuracy of said generated models is satisfactory by testing said models utilizing said apparatus; and
outputting speech models determined to be satisfactory in said determination step.
-
Specification