Multi-level speech recognition
First Claim
1. A method comprising:
- obtaining, by a first electronic device, context data comprising at least one of location, time and activity, wherein the context data is associated with the first electronic device;
transmitting, by the first electronic device, the context data to a second electronic device;
receiving, by the first electronic device, a first speech recognition model, the first speech recognition model being a subset of a second speech recognition model present at the second electronic device, wherein the first speech recognition model is selected based on the context data;
determining, by the first electronic device, whether an utterance can be recognized by a speech recognition process, wherein the speech recognition process is performed by the first electronic device and uses the first speech recognition model;
in response to determining that the utterance cannot be recognized by the speech recognition process, sending, by the first electronic device, at least a portion of the utterance to the second electronic device; and
in response to determining that the utterance can be recognized by the speech recognition process, causing, by the first electronic device, performance of an action associated with the utterance.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and device for recognizing an utterance. The method includes transmitting context data associated with a first device to a second device. A first speech recognition model is received from the second device. The first speech recognition model is a subset of a second speech recognition model present at the second device. The first speech recognition model is based on the context data. It is determined whether the utterance can be recognized at the first device based on the first speech recognition model. If the utterance cannot be recognized at the first device, then at least a portion of the utterance is sent to the second device. If the utterance can be recognized at the first device, then an action associated with the recognized utterance is performed.
-
Citations
31 Claims
-
1. A method comprising:
-
obtaining, by a first electronic device, context data comprising at least one of location, time and activity, wherein the context data is associated with the first electronic device; transmitting, by the first electronic device, the context data to a second electronic device; receiving, by the first electronic device, a first speech recognition model, the first speech recognition model being a subset of a second speech recognition model present at the second electronic device, wherein the first speech recognition model is selected based on the context data; determining, by the first electronic device, whether an utterance can be recognized by a speech recognition process, wherein the speech recognition process is performed by the first electronic device and uses the first speech recognition model; in response to determining that the utterance cannot be recognized by the speech recognition process, sending, by the first electronic device, at least a portion of the utterance to the second electronic device; and in response to determining that the utterance can be recognized by the speech recognition process, causing, by the first electronic device, performance of an action associated with the utterance. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A first electronic device comprising:
-
a processor device configured to obtain context data comprising at least one of location, time and activity, wherein the context data is associated with the first electronic device; a transmitter configured to transmit the context data to a second electronic device; a microphone configured to capture an utterance; and a speech processor configured to;
receive a first speech recognition model, wherein the first speech recognition model is a subset of a second speech recognition model present at the second electronic device, and the first speech recognition model is selected based on the context data, determine whether the utterance can be recognized by a speech recognition process, wherein the speech recognition process is performed by the speech processor and uses the first speech recognition model, send at least a portion of the utterance to the second electronic device in response to a determination that the utterance cannot be recognized by the speech recognition process, and upon a determination that the utterance can be recognized by the speech recognition process, causing by the first electronic device, performance of an action associated with the utterance. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22, 23)
-
-
24. A non-transitory processor-readable medium that includes a program that, when executed by a processor, performs a method comprising:
-
obtaining, by a first electronic device, context data comprising at least one of location, time and activity, wherein the context data is associated with the first electronic device; transmitting, by the first electronic device, the context data to a second electronic device; receiving, by the first electronic device, a first speech recognition model, wherein the first speech recognition model is a subset of a second speech recognition model present at the second electronic device, wherein the first speech recognition model is selected based on the context data; determining, by the first electronic device, whether an utterance can be recognized by a speech recognition process, wherein the speech recognition process is performed by the first electronic device and uses the first speech recognition model; in response to determining that the utterance cannot be recognized by the speech recognition process, sending, by the first electronic device, at least a portion of the utterance to the second electronic device; and in response to determining that the utterance can be recognized by the speech recognition process, causing the first electronic device to perform an action associated with the utterance. - View Dependent Claims (25, 26, 27, 28, 29, 30)
-
-
31. A method comprising:
-
obtaining, by a first electronic device, context data comprising at least one of location, time and activity, wherein the context data is associated with the first electronic device; transmitting, by the first electronic device, the context data to a second electronic device; receiving, by the first electronic device, a first speech recognition model, wherein the first speech recognition model is different than a second speech recognition model present at the second electronic device, wherein the first speech recognition model is selected based on the context data; determining, by the first electronic device, whether an utterance can be recognized by a speech recognition process, wherein the speech recognition process is performed by the first electronic device and uses the first speech recognition model; in response to determining that the utterance cannot be recognized by the speech recognition process, sending, by the first electronic device, at least a portion of the utterance to the second electronic device; and in response to determining that the utterance can be recognized by the speech recognition process, causing the first electronic device to perform an action associated with the utterance.
-
Specification