Providing services for an information processing system using an audio interface
First Claim
1. A computer implemented method for providing services for a processing system using an audio interface, said method comprising:
- rendering a first name recording at an audio interface;
selecting a verb based on subject matter contained within a remainder of said phrase;
rendering a recording of said verb at the audio interface;
rendering a second name recording at the audio interface, wherein said second name recording commences with a predetermined word and wherein said verb recording is recorded such that its termination contains proper co-articulation for said predetermined word; and
rendering said remainder of said phrase at the audio interface, wherein said rendering said remainder includes;
rendering a first value associated with said first name; and
rendering a second value associated with said second name, and wherein said verb is selected based on a difference between said first and second values.
2 Assignments
0 Petitions
Accused Products
Abstract
A method and system for providing efficient menu services for an information processing system that uses a telephone or other form of audio user interface. In one embodiment, the menu services provide effective support for novice users by providing a full listing of available keywords and rotating house advertisements which inform novice users of potential features and information. For experienced users, cues are rendered so that at any time the user can say a desired keyword to invoke the corresponding application. The menu is flat to facilitate its usage. Full keyword listings are rendered after the user is given a brief cue to say a keyword. Service messages rotate words and word prosody. When listening to receive information from the user, after the user has been cued, soft background music or other audible signals are rendered to inform the user that a response may now be spoken to the service. Other embodiments determine default cities, on which to report information, based on characteristics of the caller or based on cities that were previously selected by the caller. Other embodiments provide speech concatenation processes that have co-articulation and real-time subject-matter-based word selection which generate human sounding speech. Other embodiments reduce the occurrences of falsely triggered barge-ins during content delivery by only allowing interruption for certain special words. Other embodiments offer special services and modes for calls having voice recognition trouble. The special services are entered after predetermined criterion have been met by the call. Other embodiments provide special mechanisms for automatically recovering the address of a caller.
427 Citations
20 Claims
-
1. A computer implemented method for providing services for a processing system using an audio interface, said method comprising:
-
rendering a first name recording at an audio interface; selecting a verb based on subject matter contained within a remainder of said phrase; rendering a recording of said verb at the audio interface; rendering a second name recording at the audio interface, wherein said second name recording commences with a predetermined word and wherein said verb recording is recorded such that its termination contains proper co-articulation for said predetermined word; and rendering said remainder of said phrase at the audio interface, wherein said rendering said remainder includes; rendering a first value associated with said first name; and rendering a second value associated with said second name, and wherein said verb is selected based on a difference between said first and second values. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method comprising:
-
providing a first speech segment, at an audio interface, a first value being associated with the first speech segment; providing a second speech segment at the audio interface, a second value being associated with the second speech segment; selecting a stored verb based on the first and second values; concatenating the first speech segment, the selected verb, and the second speech segment to form a phrase using co-articulation between the selected verb and the second speech segment; providing the concatenated phrase at the audio interface. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A system comprising:
-
means for comparing a first value associated with a first speech segment to a second value associated with a second speech segment; means for selecting a recorded verb based on the comparison of the first and second values; means for co-articulating the recorded verb and a recorded voice segment, the recorded voice segment including a leading phoneme and the recorded verb having been formed from uttering the verb and the leading phoneme together; means for concatenating the first speech segment, the second speech segment, and the co-articulated verb and voice segment to form a phrase; and means for rendering the phrase at an audio interface.
-
Specification