Enabling Natural Language Understanding In An X+V Page Of A Multimodal Application
First Claim
1. A method of enabling natural language understanding using an X+V page of a multimodal application, the method implemented with a statistical language model (‘
- SLM’
) grammar of the multimodal application in an automatic speech recognition (‘
ASR’
) engine, with the multimodal application operating in a multimodal browser on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal application operatively coupled to the ASR engine through a VoiceXML interpreter, the method comprising;
receiving, in the ASR engine from the multimodal application, a voice utterance;
generating, by the ASR engine according to the SLM grammar, at least one recognition result for the voice utterance;
determining, by an action classifier for the VoiceXML interpreter, an action identifier in dependence upon the recognition result, the action identifier specifying an action to be performed by the multimodal application; and
interpreting, by the VoiceXML interpreter, the multimodal application in dependence upon the action identifier.
3 Assignments
0 Petitions
Accused Products
Abstract
Enabling natural language understanding using an X+V page of a multimodal application implemented with a statistical language model (‘SLM’) grammar of the multimodal application in an automatic speech recognition (‘ASR’) engine, with the multimodal application operating in a multimodal browser on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal application operatively coupled to the ASR engine through a VoiceXML interpreter, including: receiving, in the ASR engine from the multimodal application, a voice utterance; generating, by the ASR engine according to the SLM grammar, at least one recognition result for the voice utterance; determining, by an action classifier for the VoiceXML interpreter, an action identifier in dependence upon the recognition result, the action identifier specifying an action to be performed by the multimodal application; and interpreting, by the VoiceXML interpreter, the multimodal application in dependence upon the action identifier.
-
Citations
20 Claims
-
1. A method of enabling natural language understanding using an X+V page of a multimodal application, the method implemented with a statistical language model (‘
- SLM’
) grammar of the multimodal application in an automatic speech recognition (‘
ASR’
) engine, with the multimodal application operating in a multimodal browser on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal application operatively coupled to the ASR engine through a VoiceXML interpreter, the method comprising;receiving, in the ASR engine from the multimodal application, a voice utterance; generating, by the ASR engine according to the SLM grammar, at least one recognition result for the voice utterance; determining, by an action classifier for the VoiceXML interpreter, an action identifier in dependence upon the recognition result, the action identifier specifying an action to be performed by the multimodal application; and interpreting, by the VoiceXML interpreter, the multimodal application in dependence upon the action identifier. - View Dependent Claims (2, 3, 4, 5, 6)
- SLM’
-
7. Apparatus for enabling natural language understanding using an X+V page of a multimodal application, the apparatus implemented with a statistical language model (‘
- SLM’
) grammar of the multimodal application in an automatic speech recognition (‘
ASR’
) engine, with the multimodal application operating in a multimodal browser on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal application operatively coupled to the ASR engine through a VoiceXML interpreter, the apparatus comprising a computer processor and a computer memory operatively coupled to the computer processor, the computer memory having disposed within it computer program instructions capable of;receiving, in the ASR engine from the multimodal application, a voice utterance; generating, by the ASR engine according to the SLM grammar, at least one recognition result for the voice utterance; determining, by an action classifier for the VoiceXML interpreter, an action identifier in dependence upon the recognition result, the action identifier specifying an action to be performed by the multimodal application; and interpreting, by the VoiceXML interpreter, the multimodal application in dependence upon the action identifier. - View Dependent Claims (8, 9, 10, 11, 12)
- SLM’
-
13. A computer program product for enabling natural language understanding using an X+V page of a multimodal application, the computer program product implemented with a statistical language model (‘
- SLM’
) grammar of the multimodal application in an automatic speech recognition (‘
ASR’
) engine, with the multimodal application operating in a multimodal browser on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal application operatively coupled to the ASR engine through a VoiceXML interpreter, the computer program product disposed upon a computer-readable, signal-bearing medium, the computer program product comprising computer program instructions capable of;receiving, in the ASR engine from the multimodal application, a voice utterance; generating, by the ASR engine according to the SLM grammar, at least one recognition result for the voice utterance; determining, by an action classifier for the VoiceXML interpreter, an action identifier in dependence upon the recognition result, the action identifier specifying an action to be performed by the multimodal application; and interpreting, by the VoiceXML interpreter, the multimodal application in dependence upon the action identifier. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
- SLM’
Specification