Method and apparatus for automatically generating a speech recognition vocabulary from a white pages listing
Abstract
The invention relates to a method and apparatus for automatically generating a speech recognition vocabulary for a speech recognition system from a listing that contains a number of entries, each entry containing multi-word identification data that distinguishes that entry from other entries in the list. The method comprises creating, for each entry in the listing, a plurality of orthographies in the speech recognition vocabulary that are formed by combining selected words from the entry. The word combination is effected by applying a heuristic model that mimics the way users formulate requests to an automated directory assistance system. The method is particularly useful for generating speech recognition vocabularies for automated directory assistance systems.
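The idea of forming several orthographies from the words of one listing entry can be illustrated with a minimal Python sketch. The abstract does not spell out the heuristic model, so the rule used here (every in-order combination of up to `max_words` words from the entry) is only an assumed stand-in, and the function name `generate_orthographies` and the sample entry are hypothetical.

```python
from itertools import combinations


def generate_orthographies(entry: str, max_words: int = 3) -> list[str]:
    """Return an orthography set for one listing entry.

    Assumed heuristic (not taken from the patent): a caller may say any
    subset of the entry's words, in their original order, using at most
    `max_words` words.
    """
    words = entry.split()
    seen = set()
    orthographies = []
    for size in range(1, min(max_words, len(words)) + 1):
        # combinations() preserves the original word order of the entry
        for combo in combinations(words, size):
            orthography = " ".join(combo).lower()
            if orthography not in seen:
                seen.add(orthography)
                orthographies.append(orthography)
    return orthographies


if __name__ == "__main__":
    # Hypothetical white pages entry
    for orthography in generate_orthographies("ABC Pizza Restaurant"):
        print(orthography)
```

A real heuristic model would likely be more selective, for example by dropping combinations that callers never use, but the sketch shows how a single multi-word entry expands into a set of candidate utterances.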
57 Claims
1. A method for generating a speech recognition dictionary for use in a speech recognition system, the method comprising the steps of:
- providing a machine readable medium containing a listing of a plurality of entity identifiers, each entity identifier including at least one word that symbolizes a particular meaning, said plurality of entity identifiers being distinguishable from one another based on either one of individual words and combinations of individual words, at least some of said entity identifiers including at least two separate words;
- processing said machine readable medium by a computing device for generating for at least some of said entity identifiers an orthography set including a plurality of orthographies, each orthography being a representation of a spoken utterance, each orthography in a given orthography set being a composition of different words and at least one of said different words being selected from a respective entity identifier;
- transcribing said orthography set in the form of data elements forming a data structure capable of being processed by a speech recognition system characterized by an input for receiving a signal derived from a spoken utterance, said speech recognition system being capable of processing the signal and the data structure to select a data element corresponding to an orthography likely to match the spoken utterance and performing a determined action on the basis of the data element likely to match the spoken utterance selected by the speech recognition system;
- storing said data structure on a computer readable medium.
Dependent claims: 2-17, 42, 43

18. An apparatus for generating a speech recognition vocabulary for use in a speech recognition system, said apparatus comprising:
- first memory means for holding a listing of a plurality of entity identifiers, each entity identifier including at least one word that symbolizes a particular meaning, said plurality of entity identifiers being distinguishable from one another based on either one of individual words and combinations of individual words, at least some of said entity identifiers including at least two separate words;
- a processor in operative relationship with said first memory means;
- a program element providing means for:
a) generating for at least some of said entity identifiers an orthography set including a plurality of orthographies, each said orthography in a given orthography set being a composition of different words and at least one of said different words being selected from a respective entity identifier;
b) transcribing said orthography set in the form of data elements forming a data structure capable of being processed by a speech recognition system characterized by an input for receiving a signal derived from a spoken utterance, said speech recognition system being capable of processing the signal and the data structure to select a data element corresponding to an orthography likely to match the spoken utterance and performing a determined action on the basis of the data element likely to match the spoken utterance selected by the speech recognition system.
Dependent claims: 19-29

30. A machine readable medium containing a program element for instructing a computer to generate a speech recognition vocabulary for use in a speech recognition system, said computer including:
- first memory means for holding a listing of a plurality of entity identifiers, each entity identifier including at least one word that symbolizes a particular meaning, said plurality of entity identifiers being distinguishable from one another based on either one of individual words and combinations of individual words, at least some of said entity identifiers including at least two separate words;
- a processor in operative relationship with said first memory means;
- a program element providing means for:
a) generating for at least some of said entity identifiers an orthography set including a plurality of orthographies, each said orthography in a given orthography set being a composition of different words and at least one of said different words being selected from a respective entity identifier;
b) transcribing said orthography set in the form of data elements forming a data structure capable of being processed by a speech recognition system characterized by an input for receiving a signal derived from a spoken utterance, said speech recognition system being capable of processing the signal and the data structure to select a data element corresponding to an orthography likely to match the spoken utterance and performing a determined action on the basis of the data element likely to match the spoken utterance selected by the speech recognition system.
Dependent claims: 31-41

44. A speech recognition system having a memory which contains a speech recognition vocabulary representing a plurality of orthographies, said speech recognition vocabulary generated by:
- providing a computer readable medium containing a listing of a plurality of entity identifiers wherein each entity identifier comprises at least one word that symbolizes a particular meaning, said plurality of entity identifiers being distinguishable from one another based on either one of individual words and combinations of individual words, at least some of said entity identifiers including at least two separate words;
- generating for at least some of said entity identifiers an orthography set including a plurality of orthographies, each said orthography in a given orthography set being a composition of different words and at least one of said different words being selected from a respective entity identifier;
- storing said orthography set on a computer readable medium in a format such that the orthographies of said orthography set are potentially recognizable by a speech recognition system on a basis of a spoken utterance by a user.
Dependent claims: 45-57
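
The transcribing and storing steps recited in the independent claims can be pictured with a short Python sketch. It is only an illustration under stated assumptions: the claims do not prescribe a storage format, so the JSON file, the text-keyed mapping from orthography to entity identifiers, and the names `build_vocabulary`, `store_vocabulary`, `load_vocabulary` and `lookup` are all hypothetical. An actual recognizer would typically work from acoustic or phonemic representations rather than plain text keys.

```python
import json
from pathlib import Path


def build_vocabulary(orthography_sets: dict[str, list[str]]) -> dict[str, list[str]]:
    """Transcribe orthography sets into data elements.

    Input maps an entity identifier to its orthography set; output maps each
    orthography to every entity identifier it could refer to.
    """
    vocabulary: dict[str, list[str]] = {}
    for entity_id, orthographies in orthography_sets.items():
        for orthography in orthographies:
            entities = vocabulary.setdefault(orthography, [])
            if entity_id not in entities:
                entities.append(entity_id)
    return vocabulary


def store_vocabulary(vocabulary: dict[str, list[str]], path: str) -> None:
    """Store the data structure on a computer readable medium (a JSON file here)."""
    Path(path).write_text(json.dumps(vocabulary, indent=2, sort_keys=True), encoding="utf-8")


def load_vocabulary(path: str) -> dict[str, list[str]]:
    """Read a previously stored vocabulary back into memory."""
    return json.loads(Path(path).read_text(encoding="utf-8"))


def lookup(vocabulary: dict[str, list[str]], recognized_text: str) -> list[str]:
    """Return the entity identifiers matching a recognized orthography, if any."""
    return vocabulary.get(recognized_text.strip().lower(), [])
```

Keying the structure by orthography makes it easy to see when one utterance, such as a bare category word, is shared by several listing entries and therefore cannot select a single entity on its own.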
Specification