Method and apparatus for automatically generating a speech recognition vocabulary from a white pages listing

US 5,839,107 A
Filed: 11/29/1996
Issued: 11/17/1998
Est. Priority Date: 11/29/1996
Status: Expired due to Term

Abstract

The invention relates to a method and apparatus for automatically generating a speech recognition vocabulary for a speech recognition system from a listing that contains a number of entries, each entry containing multi-word identification data that distinguishes that entry from other entries in the list. The method comprises the step of creating, for each entry in the listing, a plurality of orthographies in the speech recognition vocabulary that are formed by combining selected words from the entry. The word combination is effected by applying a heuristic model that mimics the way users formulate requests to an automated directory assistance system. The method is particularly useful for generating speech recognition vocabularies for automated directory assistance systems.
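
As a rough illustration of forming several orthographies from one multi-word entry, the sketch below enumerates order-preserving word combinations. This is an assumption made for illustration only: the abstract describes a heuristic model that selects combinations resembling real user requests, and exhaustive enumeration is merely a stand-in for that model. The function name `orthography_set`, the `min_words` parameter, and the sample entry are hypothetical.

```python
from itertools import combinations


def orthography_set(entry: str, min_words: int = 1) -> list[str]:
    """Return order-preserving word combinations of a listing entry.

    Stand-in for the heuristic model: every combination is emitted,
    whereas a real system would keep only the plausible ones.
    """
    words = entry.split()
    result: list[str] = []
    for size in range(min_words, len(words) + 1):
        for combo in combinations(words, size):
            phrase = " ".join(combo)
            if phrase not in result:  # skip duplicates from repeated words
                result.append(phrase)
    return result


# Hypothetical business listing with three words.
print(orthography_set("Acme Pizza Delivery", min_words=2))
# → ['Acme Pizza', 'Acme Delivery', 'Pizza Delivery', 'Acme Pizza Delivery']
```

Every generated orthography is built from words of the same entry, so each one can be mapped back to the listing record that produced it.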

109 Citations

57 Claims

1. A method for generating a speech recognition dictionary for use in a speech recognition system, the method comprising the steps of:
- providing a machine readable medium containing a listing of a plurality of entity identifiers, each entity identifier including at least one word that symbolizes a particular meaning, said plurality of entity identifiers being distinguishable from one another based on either one of individual words and combinations of individual words, at least some of said entity identifiers including at least two separate words;
  
  processing said machine readable medium by a computing device for generating for at least some of said entity identifiers an orthography set including a plurality of orthographies, each orthography being a representation of a spoken utterance, each orthography in a given orthography set being a composition of different words and at least one of said different words being selected from a respective entity identifier;
  
  transcribing said orthography set in the form of data elements forming a data structure capable of being processed by a speech recognition system characterized by an input for receiving a signal derived from a spoken utterance, said speech recognition system being capable of processing the signal and the data structure to select a data element corresponding to an orthography likely to match the spoken utterance and performing a determined action on the basis of the data element likely to match the spoken utterance selected by the speech recognition system;
  
  storing said data structure on a computer readable medium.
- Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 42, 43)
- - 2. A method as defined in claim 1, wherein said data element is a representation of a sound made when uttering the orthography associated with the data element.
  - 3. A method as defined in claim 2, wherein the step of transcribing said orthography set includes the step of generating phonemic transcriptions for each orthography in said set.
  - 4. A method as defined in claim 3, wherein the step of transcribing said orthography set further includes the step of converting the phonemic transcriptions into acoustic transcriptions.
  - 5. A method as defined in claim 2, wherein each orthography in said orthography set shares a common word with another orthography in said orthography set.
  - 6. A method as defined in claim 5, wherein each orthography in said orthography set includes a word that is common with every other orthography in said orthography set.
  - 7. A method as defined in claim 2, wherein said listing is a database including a plurality of records, each record including a plurality of information fields, data stored in the information fields for a certain record constituting an entity identifier.
  - 8. A method as defined in claim 7, wherein one of said entity identifiers includes data indicative of a name, said method comprising the step of generating an orthography set including at least one orthography that includes a word common with a word included in said data indicative of a name.
  - 9. A method as defined in claim 7, wherein one of said entity identifiers includes data indicative of a civic address, said method comprising the step of generating an orthography set including at least one orthography that includes a word common with a word included in said data indicative of civic address.
  - 10. A method as defined in claim 7, wherein one of said entity identifiers includes data indicative of a title, said method comprising the step of generating an orthography set including at least one orthography that includes a word common with a word included in said data indicative of a title.
  - 11. A method as defined in claim 7, wherein one of said entity identifiers includes data indicative of a name and data indicative of a civic address, said method comprising the step of generating an orthography set including at least one orthography that includes a word common with a word included in said data indicative of a name and a word common with a word included in said data indicative of a civic address.
  - 12. A method as defined in claim 7, wherein one of said entity identifiers includes data indicative of a name and data indicative of a title, said method comprising the step of generating an orthography set including at least one orthography that includes a word common with a word included in said data indicative of a name and a word common with a word included in said data indicative of a title.
  - 13. A method as defined in claim 7, wherein one of said information fields includes data indicative of a name, said method comprising prior to generating any orthography set effecting the steps of:
    - scanning said one information field to locate predetermined data;
      
      upon identification of said predetermined data in association with a given record negating said predetermined data from said given record.
  - 14. A method as defined in claim 13, wherein said predetermined data includes combination of words selected from a group consisting of "toll free", "day or night" and "24 hour".
  - 15. A method as defined in claim 7, wherein one of said information fields includes data indicative of a name, said method comprising prior to generating any orthography set effecting the steps of:
    - scanning said one information field to locate predetermined data;
      
      upon identification of said predetermined data in association with a given record replacing said predetermined data with new data.
  - 16. A method as defined in claim 15, comprising the step of consulting a table that establishes a correspondence between said predetermined data and said new data to identify the new data for replacing said predetermined table.
  - 17. A method as defined in claim 15, wherein said predetermined data is an abbreviation of a word.
  - 42. A machine readable medium containing a speech recognition vocabulary generated by the method defined in claim 1.
  - 43. A machine readable medium containing a speech recognition vocabulary generated by the method defined in claim 2.
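
The pre-processing steps of claims 13 to 17 (removing predetermined data from the name field, and replacing abbreviations by consulting a table) might be sketched as follows. The three removed phrases come from claim 14; the abbreviation table, the function name `clean_name_field`, and the sample inputs are hypothetical, and lower-casing the text is a simplification not stated in the claims.

```python
import re

# Phrases listed in claim 14 as predetermined data to be removed.
NOISE_PHRASES = ("toll free", "day or night", "24 hour")

# Hypothetical correspondence table (claim 16 requires only that one exists).
ABBREVIATIONS = {"dept": "department", "govt": "government", "assn": "association"}


def clean_name_field(text: str) -> str:
    """Remove noise phrases, then expand abbreviations found in the table."""
    lowered = text.lower()
    for phrase in NOISE_PHRASES:
        # Word boundaries keep the phrase from matching inside longer words.
        lowered = re.sub(r"\b" + re.escape(phrase) + r"\b", " ", lowered)
    # Look up each word without its trailing period; keep unknown words as-is.
    words = [ABBREVIATIONS.get(w.rstrip("."), w) for w in lowered.split()]
    return " ".join(words)


print(clean_name_field("Acme Plumbing 24 Hour Toll Free"))  # → acme plumbing
print(clean_name_field("Parks Dept."))                      # → parks department
```

Running this cleanup before orthography generation keeps phrases such as "toll free" out of the vocabulary, where they would otherwise appear in many entries without helping to distinguish them.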

18. An apparatus for generating a speech recognition vocabulary for use in a speech recognition system, said apparatus comprising:
- first memory means for holding a listing of a plurality of entity identifiers, each entity identifier including at least one word that symbolizes a particular meaning, said plurality of entity identifiers being distinguishable from one another based on either one of individual words and combinations of individual words, at least some of said entity identifiers including at least two separate words;
  
  a processor in operative relationship with said first memory means;
  
  a program element providing means for;
  
  a) generating for at least some of said entity identifiers an orthography set including a plurality of orthographies, each said orthography in a given orthography set being a composition of different words and at least one of said different words being selected from a respective entity identifier;
  
  b) transcribing said orthography set in the form of data elements forming a data structure capable of being processed by a speech recognition system characterized by an input for receiving a signal derived from a spoken utterance, said speech recognition system being capable of processing the signal and the data structure to select a data element corresponding to an orthography likely to match the spoken utterance and performing a determined action on the basis of the data element likely to match the spoken utterance selected by the speech recognition system.
- Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29)
- - 19. An apparatus as defined in claim 18, wherein each data element is a representation of a sound made when uttering the orthography associated with the data element.
  - 20. An apparatus as defined in claim 19, wherein said program element provides means for generating a phonemic transcription for the orthographies in said set.
  - 21. An apparatus as defined in claim 20, wherein said program element provides means for converting each phonemic transcription into an acoustic transcription.
  - 22. An apparatus as defined in claim 20, wherein each orthography in said orthography set shares a common word with another orthography in said orthography set.
  - 23. An apparatus as defined in claim 22, wherein each orthography in said orthography set includes a word that is common with every other orthography in said orthography set.
  - 24. An apparatus as defined in claim 23, wherein said listing is a database including a plurality of records, each record including a plurality of information fields, data stored in the information fields for a certain record constituting an entity identifier.
  - 25. An apparatus as defined in claim 24, wherein one of said entity identifiers includes data indicative of a name, said program element including means for generating an orthography set including at least one orthography that includes a word common with a word included in said data indicative of a name.
  - 26. An apparatus as defined in claim 24, wherein one of said entity identifiers includes data indicative of a civic address, said program element including means for generating an orthography set including at least one orthography that includes a word common with a word included in said data indicative of civic address.
  - 27. An apparatus as defined in claim 24, wherein one of said entity identifiers includes data indicative of a title, said program element including means for generating an orthography set including at least one orthography that includes a word common with a word included in said data indicative of a title.
  - 28. An apparatus as defined in claim 24, wherein one of said entity identifiers includes data indicative of a name and data indicative of a civic address, said program element including means for generating an orthography set including at least one orthography that includes a word common with a word included in said data indicative of a name and a word common with a word included in said data indicative of a civic address.
  - 29. An apparatus as defined in claim 24, wherein one of said entity identifiers includes data indicative of a name and data indicative of a title, said program element including means for generating an orthography set including at least one orthography that includes a word common with a word included in said data indicative of a name and a word common with a word included in said data indicative of a title.
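
The record-based claims (24 to 29) describe a listing whose records hold name, civic address, and title fields, with orthographies drawing words from one or more of those fields. A minimal sketch under assumptions: the `Record` class, its field names, and the three combination patterns below are hypothetical choices for illustration, not patterns prescribed by the claims.

```python
from dataclasses import dataclass


@dataclass
class Record:
    """One listing record; the fields together form the entity identifier."""
    name: str
    civic_address: str = ""
    title: str = ""


def field_orthographies(rec: Record) -> list[str]:
    """Build orthographies from the name alone and from field pairs."""
    out = [rec.name]                                  # name only
    if rec.title:
        out.append(f"{rec.title} {rec.name}")          # title + name
    if rec.civic_address:
        out.append(f"{rec.name} {rec.civic_address}")  # name + civic address
    return out


print(field_orthographies(Record("Smith", "Main Street", "Doctor")))
# → ['Smith', 'Doctor Smith', 'Smith Main Street']
```

In this sketch every orthography contains the name, so the set also has the property of claim 23: one word is common to all orthographies in the set.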

30. A machine readable medium containing a program element for instructing a computer to generate a speech recognition vocabulary for use in a speech recognition system, said computer including:
- first memory means for holding a listing of a plurality of entity identifiers, each entity identifier including at least one word that symbolizes a particular meaning, said plurality of entity identifiers being distinguishable from one another based on either one of individual words and combinations of individual words, at least some of said entity identifiers including at least two separate words;
  
  a processor in operative relationship with said first memory means;
  
  a program element providing means for;
  
  a) generating for at least some of said entity identifiers an orthography set including a plurality of orthographies, each said orthography in a given orthography set being a composition of different words and at least one of said different words being selected from a respective entity identifier;
  
  b) transcribing said orthography set in the form of data elements forming a data structure capable of being processed by a speech recognition system characterized by an input for receiving a signal derived from a spoken utterance, said speech recognition system being capable of processing the signal and the data structure to select a data element corresponding to an orthography likely to match the spoken utterance and performing a determined action on the basis of the data element likely to match the spoken utterance selected by the speech recognition system.
- Dependent Claims (31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41)
- - 31. A machine readable medium as defined in claim 30, wherein each data element is a representation of the sound made when uttering the orthography associated with the data element.
  - 32. A machine readable medium as defined in claim 31, wherein said program element provides means for generating a phonemic transcription for the orthographies in said set.
  - 33. An apparatus as defined in claim 32, wherein said program element provides means for converting the phonemic transcription into an acoustic transcription.
  - 34. A machine readable medium as defined in claim 31, wherein each orthography in said orthography set shares a common word with another orthography in said orthography set.
  - 35. A machine readable medium as defined in claim 34, wherein each orthography in said orthography set includes a word that is common with every other orthography in said orthography set.
  - 36. A machine readable medium as defined in claim 33, wherein said listing is a database including a plurality of records, each record including a plurality of information fields, data stored in the information fields for a certain record constituting an entity identifier.
  - 37. A machine readable medium as defined in claim 36, wherein one of said entity identifiers includes data indicative of a name, said program element including means for generating an orthography set including at least one orthography that includes a word common with a word included in said data indicative of a name.
  - 38. A machine readable medium as defined in claim 36, wherein one of said entity identifiers includes data indicative of a civic address, said program element including means for generating an orthography set including at least one orthography that includes a word common with a word included in said data indicative of civic address.
  - 39. A machine readable medium as defined in claim 36, wherein one of said entity identifiers includes data indicative of a title, said program element including means for generating an orthography set including at least one orthography that includes a word common with a word included in said data indicative of a title.
  - 40. A machine readable medium as defined in claim 36, wherein one of said entity identifiers includes data indicative of a name and data indicative of a civic address, said program element including means for generating an orthography set including at least one orthography that includes a word common with a word included in said data indicative of a name and a word common with a word included in said data indicative of a civic address.
  - 41. A machine readable medium as defined in claim 36, wherein one of said entity identifiers includes data indicative of a name and data indicative of a title, said program element including means for generating an orthography set including at least one orthography that includes a word common with a word included in said data indicative of a name and a word common with a word included in said data indicative of a title.
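
The phonemic transcription step of claims 31 and 32 could be approximated by a per-word dictionary lookup, as sketched below. The toy lexicon, its ARPAbet-style symbols, and the function name `phonemic_transcription` are assumptions made for illustration; the claims do not specify how transcriptions are produced, and the later conversion to an acoustic transcription (claim 33) is not shown.

```python
# Hypothetical toy pronunciation lexicon using ARPAbet-style phoneme symbols.
LEXICON = {
    "main": ["M", "EY", "N"],
    "street": ["S", "T", "R", "IY", "T"],
}


def phonemic_transcription(orthography: str) -> list[str]:
    """Concatenate the lexicon pronunciations of each word in the orthography."""
    phones: list[str] = []
    for word in orthography.lower().split():
        if word not in LEXICON:
            # A real system would fall back to letter-to-sound rules here.
            raise KeyError(f"no pronunciation for {word!r}")
        phones.extend(LEXICON[word])
    return phones


print(phonemic_transcription("Main Street"))
# → ['M', 'EY', 'N', 'S', 'T', 'R', 'IY', 'T']
```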

44. A speech recognition system having a memory which contains a speech recognition vocabulary representing a plurality of orthographies, said speech recognition vocabulary generated by:
- providing a computer readable medium containing a listing of a plurality of entity identifiers wherein each entity identifier comprises at least one word that symbolizes a particular meaning, said plurality of entity identifiers being distinguishable from one another based on either one of individual words and combinations of individual words, at least some of said entity identifiers including at least two separate words;
  
  generating for at least some of said entity identifiers an orthography set including a plurality of orthographies, each said orthography in a given orthography set being a composition of different words and at least one of said different words being selected from a respective entity identifier;
  
  storing said orthography set on a computer readable medium in a format such that the orthographies of said orthography set are potentially recognizable by a speech recognition system on a basis of a spoken utterance by a user.
- Dependent Claims (45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57)
- - 45. A speech recognition system as defined in claim 44, wherein each orthography in said orthography set shares a common word with another orthography in said orthography set.
  - 46. A speech recognition system as defined in claim 45, wherein each orthography in said set includes a word that is common with every other orthography in said set.
  - 47. A speech recognition system as defined in claim 44, wherein said listing is a database including a plurality of records, each record including a plurality of information fields, data stored in the information fields for a certain record constituting an entity identifier.
  - 48. A speech recognition system as defined in claim 47, wherein one of said entity identifiers includes data indicative of a name, said system comprising a means for generating an orthography set including at least one orthography that includes a word common with a word included in said data indicative of a name.
  - 49. A speech recognition system as defined in claim 47, wherein one of said entity identifiers includes data indicative of a civic address, said system comprising a means for generating an orthography set including at least one orthography that includes a word common with a word included in said data indicative of civic address.
  - 50. A speech recognition system as defined in claim 47, wherein one of said entity identifiers includes data indicative of a title, said system comprising a means for generating an orthography set including at least one orthography that includes a word common with a word included in said data indicative of a title.
  - 51. A speech recognition system as defined in claim 47, wherein one of said entity identifiers includes data indicative of a name and data indicative of a civic address, said system comprising a means for generating an orthography set including at least one orthography that includes a word common with a word included in said data indicative of a name and a word common with a word included in said data indicative of a civic address.
  - 52. A speech recognition system as defined in claim 47, wherein one of said entity identifiers includes data indicative of a name and data indicative of a title, said system comprising a means for generating an orthography set including at least one orthography that includes a word common with a word included in said data indicative of a name and a word common with a word included in said data indicative of a title.
  - 53. A speech recognition system as defined in claim 47, wherein one of said information fields includes data indicative of a name, said system comprising a means for effecting the steps of:
    - scanning said one information field to locate predetermined data;
      
      upon identification of said predetermined data in association with a given record negating said predetermined data from said given record.
  - 54. A speech recognition system as defined in claim 53, wherein said predetermined data includes combination of words selected from the group consisting of "toll free", "day or night" and "24 hour".
  - 55. A speech recognition system as defined in claim 47, wherein one of said information fields includes data indicative of a name, said system comprising a means for effecting the steps of:
    - scanning said one information field to locate predetermined data;
      
      upon identification of said predetermined data in association with a given record replacing said predetermined data with new data.
  - 56. A speech recognition system as defined in claim 55, comprising a means for consulting a table that establishes a correspondence between said predetermined data and said new data to identify the new data for replacing said predetermined table.
  - 57. A speech recognition system as defined in claim 55, wherein said predetermined data is an abbreviation of a word.
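
Claims 45 and 46 state two different common-word properties of an orthography set: each orthography shares a word with at least one other (claim 45), and one word is common to every orthography (claim 46). The sketch below checks both; the function names and sample sets are hypothetical, and comparing words case-insensitively is an assumption.

```python
def shares_word_with_another(orthographies: list[str]) -> bool:
    """Claim 45 property: each orthography shares a word with some other one."""
    sets = [set(o.lower().split()) for o in orthographies]
    return all(
        any(s & t for j, t in enumerate(sets) if j != i)
        for i, s in enumerate(sets)
    )


def has_word_common_to_all(orthographies: list[str]) -> bool:
    """Claim 46 property: at least one word appears in every orthography."""
    sets = [set(o.lower().split()) for o in orthographies]
    return bool(sets) and bool(set.intersection(*sets))


pairwise = ["Acme Pizza", "Acme Delivery", "Pizza Delivery"]
anchored = ["Doctor Smith", "Smith Main Street"]
print(shares_word_with_another(pairwise), has_word_common_to_all(pairwise))
# → True False
print(shares_word_with_another(anchored), has_word_common_to_all(anchored))
# → True True
```

The first sample shows why claim 46 is narrower: its orthographies overlap pairwise, yet no single word occurs in all three.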

Current Assignee: Avaya Incorporated
Original Assignee: Northern Telecom Limited (Nortel Networks Corporation)
Inventors: Sabourin, Michael; Gupta, Vishwa
Primary Examiner(s): Hudspeth, David R.
Assistant Examiner(s): Zintel, Harold
Application Number: US08/757,610
Time in Patent Office: 718 Days
Field of Search: 707/104, 707/102, 707/5, 707/3, 704/270, 704/243, 704/231
US Class Current: 704/270
CPC Class Codes:
G10L 2015/228   of application context
H04M 2201/40   using speech recognition sp...
H04M 3/4931   Directory assistance systems
Y10S 707/99943   Generating database or data...
