Methods and system for creating voice files using a VoiceXML application

US 7,577,568 B2
Filed: 06/10/2003
Issued: 08/18/2009
Est. Priority Date: 06/10/2003
Status: Expired due to Fees

Abstract

Methods and systems for automating the assembly or creation of audio files for providing to listeners or for use in voice interactive services are provided. A voice application script is prepared and text associated with a desired audio file statement is inserted in the voice application in place of an audio file name. A recording manager software program passes the voice application script to an Extensible Markup Language (XML) parser that locates audio file tags in the voice application script. The XML parser extracts voice properties, if any, for each found audio tag, such as age and gender properties. The XML parser extracts the text string, and the recording manager software module passes the text string and associated properties in a database query to an audio file recording library database for locating an audio file matching the text string and properties. If a matching audio file or combination of audio files is located, a file name for the located file or files may be populated into the voice application script so that upon execution of the voice application script, the located audio file will be called by the script for presentation to a user or for use in a voice interactive services system.
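The parsing step described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the `gender` and `age` attribute names on the audio tag, the sample script, and the default property values are all assumptions made for the example, and Python's standard `xml.etree.ElementTree` stands in for the XML parser.

```python
import xml.etree.ElementTree as ET

# Hypothetical voice application script: the text to be spoken sits
# inside each audio tag, where an audio file name would normally go.
SCRIPT = """<vxml version="2.0">
  <form>
    <block>
      <audio gender="male" age="senior">Welcome to the service.</audio>
      <audio>Please say your account number.</audio>
    </block>
  </form>
</vxml>"""

# Assumed defaults, used when a tag carries no voice property.
DEFAULT_PROPERTIES = {"gender": "female", "age": "adult"}


def extract_audio_requests(script_text):
    """Return a (text, properties) pair for every audio tag in the script."""
    root = ET.fromstring(script_text)
    requests = []
    for tag in root.iter("audio"):
        text = (tag.text or "").strip()
        # Fall back to the default for any property the tag omits.
        properties = {
            name: tag.get(name, default)
            for name, default in DEFAULT_PROPERTIES.items()
        }
        requests.append((text, properties))
    return requests
```

Each returned pair is what a recording manager would then hand to the recording library as a query: the text string plus its voice properties.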

27 Claims

1. A method of automating the preparation of a voice application, comprising:
- writing a voice software application for providing an audio announcement;
- applying markup language elements to the application;
- annotating the application with markup language audio tags;
- associating via a processor segments of a text string to be spoken in the voice application with a plurality of audio tags, the plurality of audio tags include matching voice properties for audio files corresponding to each segment of the text string, and the voice properties define an attribute of the audio files that including a gender of a speaker and an age group of the speaker;
- parsing the application to locate the plurality of audio tags;
- passing the segments of the text string associated with the plurality of audio tags to a database of audio files;
- if an audio file having content and voice properties matching all segments of the text string is located in the database of audio files, replacing the segments of the text string and the associated audio tag with a file name of the located audio file automatically; and
- if an audio file having content and voice properties matching a portion of the segments of the text string is located in the database of audio files, passing the located audio file to an audio file developer for review and replacing the segments of the text string and the associated audio tag with a file name of the located audio file upon file developer confirmation.
- Dependent claims (2-20):
  - 2. A method of claim 1 wherein annotating the application with markup language audio tags include annotating the application with voice Extensible Markup Language (VoiceXML) audio tags.
  - 3. The method of claim 1 wherein associating the segments of the text string with the plurality of audio tags includes inserting each segment of the text string within a corresponding audio tag.
  - 4. The method of claim 1 further comprising:
    - replacing the segments of the text string with corresponding audio files only if the corresponding audio file is located for each segment of the text string in the database of audio files.
  - 5. The method of claim 4, wherein passing the segments of the text string to a database of audio files includes passing the segments of the text string and the corresponding voice properties to the database of audio files.
  - 6. The method of claim 5, wherein replacing the segments of the text string with a file name of the located audio file includes replacing segments of the text string with a file name of the located audio file if the audio file has content matching the particular segment of the text string and matching the corresponding voice property.
  - 7. The method of claim 6, further comprising prior to parsing the application to locate the plurality of audio tags, passing the application to an application parser for locating the plurality of audio tags and for locating the text string.
  - 8. The method of claim 7, wherein the application parser includes a voice XML parsing application.
  - 9. The method of claim 8, wherein parsing the application to locate the plurality of audio tags includes extracting the segments of the text string associated with a portion of the plurality of audio tags from the application.
  - 10. The method of claim 9 further comprising extracting a voice property associated with the plurality of audio tags from the application.
  - 11. The method of claim 10, wherein if no voice property is associated with an audio tag, selecting a default voice property.
  - 12. The method of claim 7, wherein passing the application to an application parser includes passing the application to a recording manager application for locating the desired audio file content.
  - 13. The method of claim 5, wherein passing the segments of the text string and the voice properties to the database of audio files includes passing the segments of the text string and the voice properties to the database of audio files via a database query.
  - 14. The method of claim 13 whereby the database of audio files includes a recording library having a plurality of pre-recorded audio files.
  - 15. The method of claim 5, further comprising after passing the segments of the text string and the voice properties to the database of audio files, searching the database of audio files for one or more audio files having content matching the voice properties and matching all or part of the segments of the text string.
  - 16. The method of claim 1, wherein if one of the one or more audio files is acceptable, replacing the segment of the text string with a file name of the acceptable one of the one or more audio files.
  - 17. The method of claim 1, wherein if a combination of the one or more audio files is acceptable, replacing the text string with a file name representing the combination of the one or more audio files.
  - 18. The method of claim 16, further comprising prior to replacing the text string with a file name of the acceptable one of the one or more audio files, passing the one or more audio files to an audio file developer for review.
  - 19. The method of claim 1, wherein writing a voice software application includes writing the voice software application using a text editor.
  - 20. The method of claim 1, wherein writing a voice software application includes writing the voice software application using a graphical user interface-based software application editor.
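The two replacement branches of claim 1 (automatic replacement on a full match, developer-confirmed replacement on a partial match) can be sketched as below. This is an illustrative reading only: the in-memory `LIBRARY` dictionary, the file names, and the `confirm` callback standing in for the audio file developer's review are all hypothetical.

```python
from typing import Callable, Dict, List, Optional, Tuple

# Hypothetical recording library keyed by (text, gender, age group).
LIBRARY: Dict[Tuple[str, str, str], str] = {
    ("welcome to the service", "female", "adult"): "welcome_f_adult.wav",
    ("please hold", "female", "adult"): "hold_f_adult.wav",
}


def resolve_segments(
    segments: List[str],
    gender: str,
    age_group: str,
    confirm: Callable[[List[str]], bool],
) -> List[str]:
    """Replace text segments with audio file names where the library allows it."""
    located: List[Optional[str]] = [
        LIBRARY.get((segment.strip().lower(), gender, age_group))
        for segment in segments
    ]
    # Full match: every segment has a recording, so replace automatically.
    if all(name is not None for name in located):
        return [name for name in located if name is not None]
    # Partial match: ask the developer before replacing anything.
    candidates = [name for name in located if name is not None]
    if candidates and confirm(candidates):
        return [
            name if name is not None else segment
            for name, segment in zip(located, segments)
        ]
    # No match, or the developer declined: leave the text untouched.
    return list(segments)
```

In this sketch a partial match replaces only the segments that were found and leaves the rest as text to be recorded later; the claim itself does not spell out what happens to the unmatched segments, so that choice is an assumption.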

21. A system for automating the preparation of a voice application, comprising:
- a processor;
- an XML parser operative to control the processor to;
  - parse a voice software application to locate a plurality of audio tags;
  - pass a plurality of text strings to be spoken in the voice application associated with the corresponding plurality of audio tags to a recording manager application;
- the recording manager application operative to control the processor to;
  - pass the plurality of text strings associated with the corresponding plurality of audio tags to a database of audio files, wherein each audio tag includes at least one voice property;
  - determine if an audio file having content matching one of the plurality of text strings is located in the database of audio files, wherein the voice property of a matching text string defines an attribute of the corresponding audio file that includes at least one of a gender of a speaker and an age group of the speaker; and
  - replace the text string with a file name of the corresponding audio file located in the database of audio files automatically, if the content and associated voice properties of the located audio file match the text string and its voice property; and
  - replace the text string with a file name of a corresponding audio file located in the database of audio files after passing the located file to a developer for review and obtaining developer confirmation, if the content and associated voice properties of the located audio file partially match the text string and its voice property.
- Dependent claims (22-24):
  - 22. The system of claim 21, wherein the recording manager application is further operative to control the processor to determine whether one or more audio files are located in the database of audio files that partially match one of the plurality of text strings, if an audio file having content matching one of the plurality of text strings, if an audio file having content matching one of the plurality of text strings is not located in the database of audio files.
  - 23. The system of claim 22, wherein the recording manager application is further operative to control the processor to replace the text string with a file name representing a combination of one or more corresponding audio files, if a combination of the one or more audio files is located in the database of audio files having content matching the text string.
  - 24. The system of claim 21, further comprising a text editor operative to control the processor to write a voice software application.
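The recording manager's lookup against the database of audio files might look like the following sketch, which uses an in-memory SQLite database from Python's standard library. The `recordings` table, its column names, and the sample rows are assumptions made for illustration; the claims do not specify a schema or a database product.

```python
import sqlite3


def build_library():
    """Create a hypothetical in-memory recording library with sample rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE recordings ("
        "transcript TEXT, gender TEXT, age_group TEXT, file_name TEXT)"
    )
    conn.executemany(
        "INSERT INTO recordings VALUES (?, ?, ?, ?)",
        [
            ("Welcome to the service.", "female", "adult", "welcome_f_adult.wav"),
            ("Welcome to the service.", "male", "senior", "welcome_m_senior.wav"),
        ],
    )
    return conn


def find_recording(conn, text, gender, age_group):
    """Return the file name whose transcript and voice properties match, or None."""
    row = conn.execute(
        "SELECT file_name FROM recordings "
        "WHERE lower(transcript) = lower(?) AND gender = ? AND age_group = ?",
        (text, gender, age_group),
    ).fetchone()
    return row[0] if row else None
```

A full match on both the text and the voice properties would trigger the automatic replacement branch; a `None` result is the point at which a system like the one claimed would fall back to looking for partial matches.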

25. A method of automating the preparation of a voice application, comprising:
- annotating a voice Extensible Markup Language (XML) application with one or more audio tags, wherein each audio tag includes at least one voice property;
- associating a text string to be spoken in the voice application with a corresponding audio tag;
- parsing via a processor the application to locate the corresponding audio tag;
- passing the text string and the corresponding audio tag to a database of audio files; and
- if an audio file having content matching the text string and at least one voice property included in the associated audio tag is located in the database of audio files, replacing the text string with a file name of the located audio file automatically; and
- if an audio file having content matching the text string and the at least one voice property is not located in the database of audio files, determining whether a plurality of audio files are located in the database of audio files that match the at least one voice property and that partially match the text string, and replacing the text string with a file name representing the combination of the plurality of audio files after passing the file name representing the combination of the audio files to a developer for review and receiving developer confirmation.
- Dependent claims (26-27):
  - 26. The method of claim 25, wherein replacing the text string with a file name of the located audio file includes replacing the text string with a file name of the located audio file that has content matching the text string and a default voice property.
  - 27. The method of claim 26, further comprising after passing the text string and the at least one voice property to the database of audio files, searching the database of audio files for one or more audio files having content matching the at least one voice property and matching at least a portion of the text string.
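The fallback in claim 25, where several recordings that each match part of the text string are combined, could be approximated as below. The claims do not say how the combination is found or how its file name is formed, so everything here is an assumption: a greedy longest-phrase-first search over the words, a `PHRASES` dictionary holding recordings for a single voice property, a `+`-joined combined name, and a `confirm` callback in place of the developer's review.

```python
# Hypothetical recordings that all share the same voice property.
PHRASES = {
    "your balance is": "balance.wav",
    "fifty dollars": "fifty.wav",
    "your": "your.wav",
}


def find_combination(text, phrases):
    """Return file names that together cover the text, or None if they cannot."""
    words = text.lower().split()
    files = []
    start = 0
    while start < len(words):
        # Try the longest remaining phrase first, then shorter ones.
        for end in range(len(words), start, -1):
            candidate = " ".join(words[start:end])
            if candidate in phrases:
                files.append(phrases[candidate])
                start = end
                break
        else:
            return None  # no recording begins at this word
    return files


def replace_with_combination(text, phrases, confirm):
    """Return a combined file name if one exists and the developer approves it."""
    files = find_combination(text, phrases)
    if files and confirm(files):
        return "+".join(files)
    return text
```

Greedy matching is the simplest choice and can miss a valid combination that a backtracking search would find; it is used here only to keep the sketch short.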

Current Assignee: AT&T Intellectual Property I LP (AT&T, Inc.)
Original Assignee: AT&T Intellectual Property I LP (AT&T, Inc.)
Inventors: Chintrakulchai, Pichet; Busayapongchai, Senis
Primary Examiner(s): Edouard, Patrick N
Assistant Examiner(s): Godbold, Douglas
Application Number: US10/458,532
Publication Number: US 20040254792A1
Time in Patent Office: 2,261 Days
Field of Search: 704/258, 704/260, 704/270, 704/278, 715/513, 715/205-209, 715/231, 715/134, 715/246, 715/239, 715/255, 715/704, 715/216, 715/727
US Class Current: 704/260
CPC Class Codes: H04M 3/4938 comprising a voice browser ...
