SELECTIVE SPEECH RECOGNITION FOR CHAT AND DIGITAL PERSONAL ASSISTANT SYSTEMS
First Claim
1. A method for speech recognition in a chat information system (CIS), the method comprising:
receiving, by a processor operatively coupled to a memory, an audio input;
recognizing, by a first speech recognizer of a plurality of speech recognizers, a first part of the audio input to generate a first recognized input;
identifying, by the processor, at least one trigger in the first recognized input;
based on the identification, selecting, by the processor, a second speech recognizer of the plurality of speech recognizers; and
recognizing, by the second speech recognizer, a second part of the audio input to generate a second recognized input.
Abstract
Disclosed are computer-implemented methods and systems for dynamic selection of speech recognition systems for use in Chat Information Systems (CIS) based on multiple criteria and the context of human-machine interaction. Specifically, once a first user audio input is received, it is analyzed to locate specific triggers, determine the context of the interaction, or predict the subsequent user audio inputs. Based on at least one of these criteria, one of a free-diction recognizer, pattern-based recognizer, address book based recognizer, or dynamically created recognizer is selected for recognizing the subsequent user audio input. The methods described herein increase the accuracy of automatic recognition of user voice commands, thereby enhancing the overall user experience of CIS, chat agents, and similar digital personal assistant systems.
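The trigger-based selection summarized in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the "audio" parts are plain strings standing in for waveforms, and the recognizer functions, the contact list, and the trigger table are all hypothetical names introduced only for illustration.

```python
from typing import Callable, Dict, Tuple

# Assumption: each "audio part" is already a string, so these stand-in
# recognizers only normalize text instead of decoding a waveform.
Recognizer = Callable[[str], str]

ADDRESS_BOOK = ["Alice Smith", "Bob Jones"]  # illustrative contacts


def free_form_recognizer(audio_part: str) -> str:
    """Stand-in for a free-diction recognizer: accepts any words."""
    return audio_part.lower().strip()


def address_book_recognizer(audio_part: str) -> str:
    """Stand-in for an address book based recognizer: snaps to a known contact."""
    heard = audio_part.lower().strip()
    for name in ADDRESS_BOOK:
        if heard and heard in name.lower():
            return name
    return heard


RECOGNIZERS: Dict[str, Recognizer] = {
    "free_form": free_form_recognizer,
    "address_book": address_book_recognizer,
}

# Hypothetical trigger table: trigger word -> recognizer for the next part.
TRIGGERS: Dict[str, str] = {"call": "address_book", "write": "free_form"}


def recognize(first_part: str, second_part: str) -> Tuple[str, str, str]:
    """Recognize the first part, look for a trigger, then pick the
    recognizer that handles the second part."""
    first = free_form_recognizer(first_part)
    selected = "free_form"  # default when no trigger is found
    for word in first.split():
        if word in TRIGGERS:
            selected = TRIGGERS[word]
            break
    second = RECOGNIZERS[selected](second_part)
    return first, selected, second
```

In this sketch, `recognize("Call", "alice")` routes the second part to the address-book stand-in, while an input with no known trigger falls back to the free-form stand-in.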
45 Citations
32 Claims
1. A method for speech recognition in a chat information system (CIS), the method comprising:
receiving, by a processor operatively coupled to a memory, an audio input;
recognizing, by a first speech recognizer of a plurality of speech recognizers, a first part of the audio input to generate a first recognized input;
identifying, by the processor, at least one trigger in the first recognized input;
based on the identification, selecting, by the processor, a second speech recognizer of the plurality of speech recognizers; and
recognizing, by the second speech recognizer, a second part of the audio input to generate a second recognized input.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13
14. A method for speech recognition in a CIS, the method comprising:
receiving, by a processor operatively coupled with a memory, a first audio input;
recognizing, by a first speech recognizer of a plurality of speech recognizers, at least a part of the first audio input to generate a first recognized input;
receiving, by the processor, a second audio input;
identifying, by the processor, at least one trigger in the first recognized input;
based on the identification, selecting, by the processor, a second speech recognizer of the plurality of speech recognizers; and
recognizing, by the second speech recognizer, at least a part of the second audio input to generate a second recognized input.
Dependent claims: 15, 16, 17, 18, 19, 20, 21, 22, 23, 24
25. A method for speech recognition in a CIS, the method comprising:
receiving, by a processor operatively coupled with a memory, a first audio input;
recognizing, by a first speech recognizer of a plurality of speech recognizers, at least a part of the first audio input to generate a first recognized input;
providing, by the processor, a response to the first recognized input utilizing the CIS;
determining, by the processor, a type of the response;
receiving, by the processor, a second audio input;
based on the determination, selecting, by the processor, a second speech recognizer of the plurality of speech recognizers; and
recognizing, by the second speech recognizer, at least a part of the second audio input to generate a second recognized input.
Dependent claims: 26, 27, 28, 29, 30, 31
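Claim 25 selects the second recognizer from the type of the response the CIS gave, rather than from a trigger in the user's words. A minimal sketch of that idea follows. The response types, the keyword heuristics used to classify a response, and the recognizer names are all assumptions made for illustration; the claim itself does not specify them.

```python
from enum import Enum
from typing import Dict


class ResponseType(Enum):
    """Hypothetical response types; the claim does not enumerate them."""
    ASKS_FOR_CONTACT = "asks_for_contact"
    ASKS_YES_NO = "asks_yes_no"
    OPEN_ENDED = "open_ended"


# Illustrative mapping from response type to the recognizer used for the
# user's next audio input.
RECOGNIZER_FOR_TYPE: Dict[ResponseType, str] = {
    ResponseType.ASKS_FOR_CONTACT: "address_book",
    ResponseType.ASKS_YES_NO: "pattern_based",
    ResponseType.OPEN_ENDED: "free_form",
}


def determine_response_type(response: str) -> ResponseType:
    """Classify a CIS response with simple keyword heuristics (an assumption)."""
    text = response.lower().strip()
    words = text.replace("?", " ").replace(",", " ").split()
    if "who" in words or "whom" in words:
        return ResponseType.ASKS_FOR_CONTACT
    if text.endswith("?") and bool(words) and words[0] in {"do", "should", "is", "are"}:
        return ResponseType.ASKS_YES_NO
    return ResponseType.OPEN_ENDED


def select_second_recognizer(response: str) -> str:
    """Pick the recognizer for the next audio input from the response type."""
    return RECOGNIZER_FOR_TYPE[determine_response_type(response)]
```

The design point is that the system already knows what it just asked, so it can narrow the expected vocabulary of the answer before the user speaks.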
32. A system for speech recognition, the system comprising:
a communication module configured to receive one or more audio inputs;
two or more speech recognizers configured to generate recognized inputs; and
a decision making logic configured to identify at least one trigger in one of the recognized inputs and, based on the at least one trigger, select one of the two or more speech recognizers for performing speech recognition of at least a part of the one or more audio inputs;
wherein the at least one trigger includes a type of the one or more audio inputs or prediction regarding a type of the one or more audio inputs.
Specification