System and method of performing automatic speech recognition using local private data
First Claim
1. A method comprising:
- receiving, on a device, text as part of a message, a placeholder within the text and audio, wherein the device comprises an embedded speech recognition system that accesses private user data on the device and wherein the private user data is not available to a remote speech recognition system in communication with the device;
receiving a component comprising one of a garbage model, phonemic language model, a language model according to a standard list, and a language model built on the private user data;
identifying a location of the device;
determining a privacy level of the private user data according to the location of the device;
recognizing the audio using the component, the embedded speech recognition system and by accessing the private user data according to the privacy level to yield a recognition result;
replacing the placeholder with the recognition result in the text to yield an updated message; and
presenting the updated message on the device.
3 Assignments
0 Petitions
Accused Products
Abstract
A method of providing hybrid speech recognition between a local embedded speech recognition system and a remote speech recognition system relates to receiving speech from a user at a device communicating with a remote speech recognition system. The system recognizes a first part of speech by performing a first recognition of the first part of the speech with the embedded speech recognition system that accesses private user data, wherein the private user data is not available to the remote speech recognition system. The system recognizes the second part of the speech by performing a second recognition of the second part of the speech with the remote speech recognition system. The final recognition result is a combination of these two recognition processes. The private data can be such local information as a user location, a playlist, frequently dialed numbers or texted people, user contact list information, and so forth.
-
Citations
8 Claims
-
1. A method comprising:
-
receiving, on a device, text as part of a message, a placeholder within the text and audio, wherein the device comprises an embedded speech recognition system that accesses private user data on the device and wherein the private user data is not available to a remote speech recognition system in communication with the device; receiving a component comprising one of a garbage model, phonemic language model, a language model according to a standard list, and a language model built on the private user data; identifying a location of the device; determining a privacy level of the private user data according to the location of the device; recognizing the audio using the component, the embedded speech recognition system and by accessing the private user data according to the privacy level to yield a recognition result; replacing the placeholder with the recognition result in the text to yield an updated message; and presenting the updated message on the device. - View Dependent Claims (2, 3)
-
-
4. A computer-readable storage device storing instructions which, when executed by a processor, cause the processor to perform operations comprising:
-
receiving, on a device, text as part of a message, a placeholder within the text and audio, wherein the device comprises an embedded speech recognition system that accesses private user data on the device and wherein the private user data is not available to a remote speech recognition system in communication with the device; receiving a component comprising one of a garbage model, phonemic language model, a language model according to a standard list, and a language model built on the private user data; identifying a location of the device; determining a privacy level of the private user data according to the location of the device; recognizing the audio using the component, the embedded speech recognition system and by accessing the private user data according to the privacy level to yield a recognition result; replacing the placeholder with the recognition result in the text to yield an updated message; and presenting the updated message on the device. - View Dependent Claims (5, 6)
-
-
7. A system comprising:
-
a processor; and computer-readable storage medium storing instructions which, when executed by the processor, cause the processor to perform operations comprising; receiving, on a device, text as part of a message, a placeholder within the text and audio, wherein the device comprises an embedded speech recognition system that accesses private user data on the device and wherein the private user data is not available to a remote speech recognition system in communication with the device; receiving a component comprising one of a garbage model, phonemic language model, a language model according to a standard list, and a language model built on the private user data; identifying a location of the device; determining a privacy level of the private user data according to the location of the device; recognizing the audio using the component, the embedded speech recognition system and by accessing the private user data according to the privacy level to yield a recognition result; replacing the placeholder with the recognition result in the text to yield an updated message; and presenting the updated message on the device. - View Dependent Claims (8)
-
Specification