Establishing a multimodal personality for a multimodal application
First Claim
Patent Images
1. A method of establishing a multimodal personality for an application that provides vocal and visual output, the method comprising:
- with at least one processor;
selecting, by the application that provides vocal and visual output, matching vocal and visual demeanors; and
incorporating, by the application, the matching vocal and visual demeanors as a multimodal personality into the application by rendering a voice prompt and/or response generated by the application in the vocal demeanor and rendering a visual element generated by the application in the matching visual demeanor, the voice prompt or response being rendered in a voice having an age, gender and/or accent based on the selected vocal demeanor,wherein;
selecting matching vocal and visual demeanors further comprises selecting a vocal demeanor in dependence upon a history of a user'"'"'s navigation among web sites, the vocal demeanor being selected to match a visual demeanor determined based on at least one property of web pages previously visited, the at least one property comprising one or more of;
text font;
count of words on the web page;
proportion of white space;
ratio of graphics to screen area;
orratio of text space to graphic space.
3 Assignments
0 Petitions
Accused Products
Abstract
Methods, apparatus, and computer program products are described for establishing a multimodal personality for a multimodal application that include selecting, by the multimodal application, matching vocal and visual demeanors and incorporating, by the multimodal application, the matching vocal and visual demeanors as a multimodal personality into the multimodal application.
149 Citations
20 Claims
-
1. A method of establishing a multimodal personality for an application that provides vocal and visual output, the method comprising:
- with at least one processor;
selecting, by the application that provides vocal and visual output, matching vocal and visual demeanors; and incorporating, by the application, the matching vocal and visual demeanors as a multimodal personality into the application by rendering a voice prompt and/or response generated by the application in the vocal demeanor and rendering a visual element generated by the application in the matching visual demeanor, the voice prompt or response being rendered in a voice having an age, gender and/or accent based on the selected vocal demeanor, wherein; selecting matching vocal and visual demeanors further comprises selecting a vocal demeanor in dependence upon a history of a user'"'"'s navigation among web sites, the vocal demeanor being selected to match a visual demeanor determined based on at least one property of web pages previously visited, the at least one property comprising one or more of; text font; count of words on the web page; proportion of white space; ratio of graphics to screen area;
orratio of text space to graphic space. - View Dependent Claims (2, 3, 4, 5, 6)
- with at least one processor;
-
7. Apparatus for establishing a multimodal personality for an application that provides vocal and visual output, the apparatus comprising a computer processor and a computer memory operatively coupled to the computer processor, the computer memory having disposed within it computer program instructions capable of:
-
selecting, by the application that provides vocal and visual output, matching vocal and visual demeanors, the selected vocal demeanor and the selected visual demeanor having at least one matching characteristic, the at least one matching characteristic comprising at least one of an age, gender, location, time or application domain; and incorporating, by the application, the matching vocal and visual demeanors as a multimodal personality into the application, wherein; selecting the vocal demeanor comprises selecting at least one grammar for use in recognizing vocal inputs; and selecting matching vocal and visual demeanors further comprises selecting the vocal and visual demeanors in dependence upon visual aspects or vocal aspects of a history of a user'"'"'s navigation among multimodal web sites, the vocal aspects comprising one or more of; number of grammars per page; number of dialogs per page; dialog intensity;
ornumber of speech inputs per page; and the visual aspects comprising one or more of; text font; counts of words on web pages of the multimodal web sites; proportion of white space; ratio of graphics to screen area;
orratio of text space to graphic space. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A computer storage medium encoded with a computer program product for establishing a multimodal personality for an application that provides vocal and visual output, the computer program product comprising computer program instructions for, when executed by at least one processor:
-
selecting, by the application that provides vocal and visual output, matching vocal and visual demeanors; and incorporating, by the application, the matching vocal and visual demeanors as a multimodal personality into the application by; selecting one or more demeanors from a store comprising a plurality of demeanors, the selected one or more demeanors defining the matching vocal and visual demeanors; linking one or more styles to one or more markup elements of a markup document output by the application based on the selected demeanor; and rendering, using a multimodal browser, the markup document using the one or more linked styles, the rendering comprising rendering at least one visual aspect and one speech aspect, wherein; selecting matching vocal and visual demeanors further comprises selecting a visual demeanor in dependence upon vocal aspects of a history of a user'"'"'s navigation among multimodal web sites, the vocal aspects comprising one or more of; number of grammars per page; number of dialogs per page; dialog intensity;
ornumber of speech inputs per page. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
Specification