Methods and system for creating voice files using a VoiceXML application
First Claim
1. A method of automating the preparation of a voice application, comprising:
- writing a voice software application for providing an audio announcement;
applying markup language elements to the application;
annotating the application with markup language audio tags;
associating via a processor segments of a text string to be spoken in the voice application with a plurality of audio tags, the plurality of audio tags include matching voice properties for audio files corresponding to each segment of the text string, and the voice properties define an attribute of the audio files that including a gender of a speaker and an age group of the speaker;
parsing the application to locate the plurality of audio tags;
passing the segments of the text string associated with the plurality of audio tags to a database of audio files;
if an audio file having content and voice properties matching all segments of the text string is located in the database of audio files, replacing the segments of the text string and the associated audio tag with a file name of the located audio file automatically; and
if an audio file having content and voice properties matching a portion of the segments of the text string is located in the database of audio files, passing the located audio file to an audio file developer for review and replacing the segments of the text string and the associated audio tag with a file name of the located audio file upon file developer confirmation.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods and systems for automating the assembly or creation of audio files for providing to listeners or for use in voice interactive services are provided. A voice application script is prepared and text associated with a desired audio file statement is inserted in the voice application in place of an audio file name. A recording manager software program passes the voice application script to an Extensible Markup Language (XML) parser that locates audio file tags in the voice application script. The XML parser extracts voice properties, if any, for each found audio tag, such as age and gender properties. The XML parser extracts the text string, and the recording manager software module passes the text string and associated properties in a database query to an audio file recording library database for locating an audio file matching the text string and properties. If a matching audio file or combination of audio files is located, a file name for the located file or files may be populated into the voice application script so that upon execution of the voice application script, the located audio file will be called by the script for presentation to a user or for use in a voice interactive services system.
-
Citations
27 Claims
-
1. A method of automating the preparation of a voice application, comprising:
-
writing a voice software application for providing an audio announcement; applying markup language elements to the application; annotating the application with markup language audio tags; associating via a processor segments of a text string to be spoken in the voice application with a plurality of audio tags, the plurality of audio tags include matching voice properties for audio files corresponding to each segment of the text string, and the voice properties define an attribute of the audio files that including a gender of a speaker and an age group of the speaker; parsing the application to locate the plurality of audio tags; passing the segments of the text string associated with the plurality of audio tags to a database of audio files; if an audio file having content and voice properties matching all segments of the text string is located in the database of audio files, replacing the segments of the text string and the associated audio tag with a file name of the located audio file automatically; and if an audio file having content and voice properties matching a portion of the segments of the text string is located in the database of audio files, passing the located audio file to an audio file developer for review and replacing the segments of the text string and the associated audio tag with a file name of the located audio file upon file developer confirmation. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A system for automating the preparation of a voice application, comprising:
-
a processor; an XML parser operative to control the processor to; parse a voice software application to locate a plurality of audio tags; pass a plurality of text strings to be spoken in the voice application associated with the corresponding plurality of audio tags to a recording manager application; the recording manager application operative to control the processor to; pass the plurality of text strings associated with the corresponding plurality of audio tags to a database of audio files, wherein each audio tag includes at least one voice property; determine if an audio file having content matching one of the plurality of text strings is located in the database of audio files, wherein the voice property of a matching text string defines an attribute of the corresponding audio file that includes at least one of a gender of a speaker and an age group of the speaker; and replace the text string with a file name of the corresponding audio file located in the database of audio files automatically, if the content and associated voice properties of the located audio file match the text string and its voice property; and replace the text string with a file name of a corresponding audio file located in the database of audio files after passing the located file to a developer for review and obtaining developer confirmation, if the content and associated voice properties of the located audio file partially match the text string and its voice property. - View Dependent Claims (22, 23, 24)
-
-
25. A method of automating the preparation of a voice application, comprising:
-
annotating a voice Extensible Markup Language (XML) application with one or more audio tags, wherein each audio tag includes at least one voice property; associating a text string to be spoken in the voice application with a corresponding audio tag; parsing via a processor the application to locate the corresponding audio tag; passing the text string and the corresponding audio tag to a database of audio files; and if an audio file having content matching the text string and at least one voice property included in the associated audio tag is located in the database of audio files, replacing the text string with a file name of the located audio file automatically; and if an audio file having content matching the text string and the at least one voice property is not located in the database of audio files, determining whether a plurality of audio files are located in the database of audio files that match the at least one voice property and that partially match the text string, and replacing the text string with a file name representing the combination of the plurality of audio files after passing the file name representing the combination of the audio files to a developer for review and receiving developer confirmation. - View Dependent Claims (26, 27)
-
Specification