System and method of performing automatic speech recognition using local private data
First Claim
1. A method comprising:
receiving, on a device, text as part of a message, a placeholder within the text and audio, wherein the device comprises an embedded speech recognition system that accesses private user data on the device;
receiving one of a garbage model, phonemic language model, a language model according to a standard list, and a language model built on the private user data to yield a received component;
determining a privacy level of the private user data;
recognizing the audio using the received component, the embedded speech recognition system and by accessing the private user data according to the privacy level to yield a recognition result; and
replacing the placeholder with the recognition result in the text to yield an updated message.
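The steps of the claim can be illustrated with a small sketch. This is not the patented implementation. It is a toy model under stated assumptions: the "audio" is a rough phonetic string rather than a waveform, the recognizer is a fuzzy string match from the Python standard library, the privacy level has only two values, and every name (`PrivacyLevel`, `PrivateUserData`, `recognize`, `fill_placeholder`, `COMPONENT_TYPES`) is hypothetical.

```python
import difflib
from dataclasses import dataclass
from enum import Enum
from typing import List

# Hypothetical labels for the four component types listed in the claim.
COMPONENT_TYPES = {"garbage", "phonemic", "standard-list", "private-data"}


class PrivacyLevel(Enum):
    # Assumed two-level scheme; the claim does not define the levels.
    RESTRICTED = 0
    ALLOWED = 1


@dataclass
class PrivateUserData:
    contacts: List[str]
    privacy_level: PrivacyLevel


def recognize(audio: str, component: str, data: PrivateUserData) -> str:
    """Toy embedded recognizer: 'audio' is a rough phonetic string."""
    if component not in COMPONENT_TYPES:
        raise ValueError(f"unknown component: {component}")
    # Consult private data only as far as the privacy level permits.
    if data.privacy_level is PrivacyLevel.ALLOWED:
        by_lower = {name.lower(): name for name in data.contacts}
        hits = difflib.get_close_matches(
            audio.lower(), list(by_lower), n=1, cutoff=0.6
        )
        if hits:
            return by_lower[hits[0]]
    # Without access to private data, fall back to the raw guess.
    return audio


def fill_placeholder(text: str, placeholder: str, audio: str,
                     component: str, data: PrivateUserData) -> str:
    """Replace the first placeholder in the text with the recognition result."""
    result = recognize(audio, component, data)
    return text.replace(placeholder, result, 1)


data = PrivateUserData(["John Smith", "Maria Garcia"], PrivacyLevel.ALLOWED)
updated = fill_placeholder(
    "Lunch with <NAME> at noon", "<NAME>", "jon smith", "private-data", data
)
```

In this sketch the privacy level gates the lookup: when it is `RESTRICTED`, the contact list is never read and the placeholder is filled with the unrefined guess.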
Abstract
A method of providing hybrid speech recognition between a local embedded speech recognition system and a remote speech recognition system relates to receiving speech from a user at a device communicating with a remote speech recognition system. The system recognizes a first part of speech by performing a first recognition of the first part of the speech with the embedded speech recognition system that accesses private user data, wherein the private user data is not available to the remote speech recognition system. The system recognizes the second part of the speech by performing a second recognition of the second part of the speech with the remote speech recognition system. The final recognition result is a combination of these two recognition processes. The private data can be such local information as a user location, a playlist, frequently dialed numbers or texted people, user contact list information, and so forth.
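The hybrid arrangement described in the abstract can be sketched as follows. This is a minimal illustration, not the method as implemented: strings stand in for audio segments, the split into a first and second part is assumed to be given, and the names `make_embedded_recognizer`, `make_remote_recognizer`, and `hybrid_recognize` are hypothetical. The remote stub records what it receives, which makes it easy to confirm that the private playlist never leaves the device.

```python
from typing import Callable, List

Recognizer = Callable[[str], str]


def make_embedded_recognizer(playlist: List[str]) -> Recognizer:
    """On-device recognizer with access to a private playlist."""
    def recognize(audio: str) -> str:
        for title in playlist:
            if title.lower() == audio.lower():
                return title  # resolved against private data
        return audio
    return recognize


def make_remote_recognizer(received_log: List[str]) -> Recognizer:
    """Stand-in for a network recognizer; logs everything sent to it."""
    def recognize(audio: str) -> str:
        received_log.append(audio)
        return audio.lower()
    return recognize


def hybrid_recognize(first_part: str, second_part: str,
                     embedded: Recognizer, remote: Recognizer) -> str:
    local_text = embedded(first_part)    # first recognition, on the device
    remote_text = remote(second_part)    # second recognition, remote
    return f"{local_text} {remote_text}"  # combined final result


sent_to_remote: List[str] = []
embedded = make_embedded_recognizer(["Blue in Green", "So What"])
remote = make_remote_recognizer(sent_to_remote)
result = hybrid_recognize("BLUE IN GREEN", "PLAY THAT NEXT", embedded, remote)
```

Only the second segment appears in `sent_to_remote`; the playlist entries stay local to the embedded recognizer.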
18 Claims
1. A method comprising:
receiving, on a device, text as part of a message, a placeholder within the text and audio, wherein the device comprises an embedded speech recognition system that accesses private user data on the device;
receiving one of a garbage model, phonemic language model, a language model according to a standard list, and a language model built on the private user data to yield a received component;
determining a privacy level of the private user data;
recognizing the audio using the received component, the embedded speech recognition system and by accessing the private user data according to the privacy level to yield a recognition result; and
replacing the placeholder with the recognition result in the text to yield an updated message.
Dependent claims: 2, 3, 4, 5, 6
7. A computer-readable storage device storing instructions which, when executed by a processor, cause the processor to perform operations comprising:
receiving text as part of a message, a placeholder within the text and audio, wherein the device comprises an embedded speech recognition system that accesses private user data on a computing device;
receiving one of a garbage model, phonemic language model, a language model according to a standard list, and a language model built on the private user data to yield a received component;
determining a privacy level of the private user data;
recognizing the audio using the received component, the embedded speech recognition system and by accessing the private user data according to the privacy level to yield a recognition result; and
replacing the placeholder with the recognition result in the text to yield an updated message.
Dependent claims: 8, 9, 10, 11, 12
13. A system comprising:
a processor; and
a computer-readable storage medium storing instructions which, when executed by the processor, cause the processor to perform operations comprising:
receiving text as part of a message, a placeholder within the text and audio, wherein the system comprises an embedded speech recognition system that accesses private user data on the system;
receiving one of a garbage model, phonemic language model, a language model according to a standard list, and a language model built on the private user data to yield a received component;
determining a privacy level of the private user data;
recognizing the audio using the received component, the embedded speech recognition system and by accessing the private user data according to the privacy level to yield a recognition result; and
replacing the placeholder with the recognition result in the text to yield an updated message.
Dependent claims: 14, 15, 16, 17, 18
Specification