Methods and systems for speech-enabling a human-to-machine interface
First Claim
1. A method for speech-enabling a human-to-machine interface, the method comprising:
by a processor, loading content of the human-to-machine interface;
adding logic configured to enable speech interaction with the content to the interface;
presenting the content to a user of the interface; and
activating speech interaction with the content via the logic for the user;
the logic including;
uniquely identifying a plurality of input fields and corresponding input field identifications (IDs) associated with the content;
mapping the input field IDs to grammar slot names to produce a speech-to-field mapping, the grammar slot names associated with speech-to-text synthesis of speech expected to be received by the logic and outputting representations of the speech having correspondence with the grammar slot names; and
enabling a flow of representations of speech to the input fields via the speech-to-field mapping.
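The three logic steps recited in the claim (identifying input fields and their IDs, mapping IDs to grammar slot names, and routing recognized speech into the fields) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and does not come from the patent: the field dictionaries, the `SLOT_BY_LABEL` table, the slot names, and the three helper functions. A speech recognizer is assumed to return a dictionary of slot name to recognized text.

```python
from typing import Dict, List

# Hypothetical table linking a field's visible label to a grammar slot name.
SLOT_BY_LABEL = {
    "from": "origin_city",
    "to": "destination_city",
    "date": "travel_date",
}


def identify_input_fields(fields: List[dict]) -> Dict[str, str]:
    """Return {unique field ID: label}, generating an ID when one is missing or repeated."""
    identified: Dict[str, str] = {}
    for index, field in enumerate(fields):
        base = field.get("id") or f"field_{index}"
        field_id, suffix = base, 1
        while field_id in identified:  # keep every ID unique
            field_id = f"{base}_{suffix}"
            suffix += 1
        identified[field_id] = field["label"]
    return identified


def map_fields_to_slots(identified: Dict[str, str],
                        slot_by_label: Dict[str, str]) -> Dict[str, str]:
    """Build the speech-to-field mapping as {field ID: grammar slot name}."""
    return {
        field_id: slot_by_label[label.lower()]
        for field_id, label in identified.items()
        if label.lower() in slot_by_label
    }


def route_speech(recognized: Dict[str, str],
                 field_to_slot: Dict[str, str]) -> Dict[str, str]:
    """Turn recognizer output {slot name: text} into {field ID: text}."""
    return {
        field_id: recognized[slot]
        for field_id, slot in field_to_slot.items()
        if slot in recognized
    }


# Usage: the third field has no ID, so one is generated for it.
fields = [{"id": "src", "label": "From"}, {"id": "dst", "label": "To"}, {"label": "Date"}]
field_to_slot = map_fields_to_slots(identify_input_fields(fields), SLOT_BY_LABEL)
values = route_speech({"origin_city": "Boston", "destination_city": "Paris"}, field_to_slot)
# values == {"src": "Boston", "dst": "Paris"}
```

Keeping the mapping keyed by field ID mirrors the claim wording ("mapping the input field IDs to grammar slot names"); the routing step simply reads it in the other direction.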
Abstract
Generally, human-to-machine interfaces are configured to accept speech input from a user. However, such interfaces, e.g., web browsers, must be configured to enable acceptance of speech input from the user. Some interfaces, such as mobile browsers, have less configuration adaptability and are not able to be configured to accept speech input from a user. Embodiments of the present invention speech-enable human-to-machine interfaces by loading content of the human-to-machine interface and adding logic configured to enable speech interaction with the content to the interface. The embodiment then activates speech interaction with the content via the logic for the user. Thus, embodiments of the present invention enable speech interaction with interfaces that are not configured to be adapted to allow speech interaction and are able to enable the speech interaction in a seamless manner.
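As a rough illustration of "loading content" and "adding logic" before the content is presented, the sketch below assumes the content is an HTML page and the speech logic is delivered as a script reference. Both are assumptions for illustration only; the abstract does not specify either. The function name `add_speech_logic` and the script URL are hypothetical.

```python
from html import escape


def add_speech_logic(content: str, script_url: str) -> str:
    """Return the loaded content with a script reference added before </body>.

    If the content has no closing body tag, the reference is appended at the end.
    """
    tag = f'<script src="{escape(script_url, quote=True)}"></script>'
    index = content.lower().rfind("</body>")
    if index == -1:
        return content + tag
    return content[:index] + tag + content[index:]


# Usage: the page would be passed through this step after loading and before presentation.
page = "<html><body><form></form></body></html>"
speech_enabled = add_speech_logic(page, "https://example.invalid/speech.js")
```

Because the logic is added to the content rather than to the interface, this approach would not require the interface itself to be reconfigured, which is the situation the abstract describes for mobile browsers.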
19 Claims
1. A method for speech-enabling a human-to-machine interface, the method comprising:
by a processor, loading content of the human-to-machine interface;
adding logic configured to enable speech interaction with the content to the interface;
presenting the content to a user of the interface; and
activating speech interaction with the content via the logic for the user;
the logic including;
uniquely identifying a plurality of input fields and corresponding input field identifications (IDs) associated with the content;
mapping the input field IDs to grammar slot names to produce a speech-to-field mapping, the grammar slot names associated with speech-to-text synthesis of speech expected to be received by the logic and outputting representations of the speech having correspondence with the grammar slot names; and
enabling a flow of representations of speech to the input fields via the speech-to-field mapping.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 12, 13, 14
10. A system for speech-enabling a human-to-machine interface, the system comprising:
one or more modules implemented in hardware or as instructions executing on a processor, the one or more modules including;
a loading module configured to load content of the human-to-machine interface;
an applying module configured to apply logic to the content, the logic configured to enable speech interaction with the content;
a presenting module configured to present the content to a user of the interface; and
a speech interaction module configured to activate speech interaction with the content via the logic for the user;
the logic including;
an identifying module configured to uniquely identify a plurality of input fields and corresponding input field identifications (IDs) associated with the content;
a mapping module configured to map the input field IDs to grammar slot names to produce a speech-to-field mapping, the grammar slot names associated with speech-to-text synthesis of speech expected to be received by the logic and output representations of the speech having correspondence with the grammar slot names; and
a transcribing module configured to enable a flow of representations of speech to the input fields via the speech-to-field mapping.
Dependent claims: 11, 15, 16, 17, 18
19. A non-transitory computer readable medium having computer readable program codes embodied therein for speech-enabling a human-to-machine interface, the computer readable program codes including instructions that, when executed by a processor, cause the processor to:
load content of the human-to-machine interface;
apply logic to the content, the logic configured to enable speech interaction with the content;
present the content to a user of the interface; and
activate speech interaction with the content via the logic for the user;
the logic further configured to cause the processor to;
uniquely identify a plurality of input fields and corresponding input field identifications (IDs) associated with the content;
map the input field IDs to grammar slot names to produce a speech-to-field mapping, the grammar slot names associated with speech-to-text synthesis of speech expected to be received by the logic and outputting representations of the speech having correspondence with the grammar slot names; and
enable a flow of representations of speech to the input fields via the speech-to-field mapping.
Specification