System and method of performing automatic speech recognition using local private data

US 9,666,188 B2
Filed: 10/29/2013
Issued: 05/30/2017
Est. Priority Date: 10/29/2013
Status: Active Grant

First Claim

Patent Images

1. A method comprising:

receiving, on a device, text as part of a message, a placeholder within the text and audio, wherein the device comprises an embedded speech recognition system that accesses private user data on the device and wherein the private user data is not available to a remote speech recognition system in communication with the device;

receiving a component comprising one of a garbage model, phonemic language model, a language model according to a standard list, and a language model built on the private user data;

identifying a location of the device;

determining a privacy level of the private user data according to the location of the device;

recognizing the audio using the component, the embedded speech recognition system and by accessing the private user data according to the privacy level to yield a recognition result;

replacing the placeholder with the recognition result in the text to yield an updated message; and

presenting the updated message on the device.

View all claims

3 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A method of providing hybrid speech recognition between a local embedded speech recognition system and a remote speech recognition system relates to receiving speech from a user at a device communicating with a remote speech recognition system. The system recognizes a first part of speech by performing a first recognition of the first part of the speech with the embedded speech recognition system that accesses private user data, wherein the private user data is not available to the remote speech recognition system. The system recognizes the second part of the speech by performing a second recognition of the second part of the speech with the remote speech recognition system. The final recognition result is a combination of these two recognition processes. The private data can be such local information as a user location, a playlist, frequently dialed numbers or texted people, user contact list information, and so forth.

Citations

8 Claims

1. A method comprising:
- receiving, on a device, text as part of a message, a placeholder within the text and audio, wherein the device comprises an embedded speech recognition system that accesses private user data on the device and wherein the private user data is not available to a remote speech recognition system in communication with the device;
  
  receiving a component comprising one of a garbage model, phonemic language model, a language model according to a standard list, and a language model built on the private user data;
  
  identifying a location of the device;
  
  determining a privacy level of the private user data according to the location of the device;
  
  recognizing the audio using the component, the embedded speech recognition system and by accessing the private user data according to the privacy level to yield a recognition result;
  
  replacing the placeholder with the recognition result in the text to yield an updated message; and
  
  presenting the updated message on the device.
- View Dependent Claims (2, 3)
- - 2. The method of claim 1, wherein individual names from a user contact list are from the private user data.
  - 3. The method of claim 1, wherein the private user data comprises one of data in a user contact list, frequently dialed phone numbers, frequently used texted names, data associated with a user location, data associated with a playlist, user history, and multiple hypothesis associated with private information.

4. A computer-readable storage device storing instructions which, when executed by a processor, cause the processor to perform operations comprising:
- receiving, on a device, text as part of a message, a placeholder within the text and audio, wherein the device comprises an embedded speech recognition system that accesses private user data on the device and wherein the private user data is not available to a remote speech recognition system in communication with the device;
  
  receiving a component comprising one of a garbage model, phonemic language model, a language model according to a standard list, and a language model built on the private user data;
  
  identifying a location of the device;
  
  determining a privacy level of the private user data according to the location of the device;
  
  recognizing the audio using the component, the embedded speech recognition system and by accessing the private user data according to the privacy level to yield a recognition result;
  
  replacing the placeholder with the recognition result in the text to yield an updated message; and
  
  presenting the updated message on the device.
- View Dependent Claims (5, 6)
- - 5. The computer-readable storage device of claim 4, wherein individual names from a user contact list are from the private user data.
  - 6. The computer-readable storage device of claim 4, wherein the private user data comprises one of data in a user contact list, frequently dialed phone numbers, frequently used texted names, data associated with a user location, data associated with a playlist, user history, and multiple hypothesis associated with private information.

7. A system comprising:
- a processor; and
  
  computer-readable storage medium storing instructions which, when executed by the processor, cause the processor to perform operations comprising;
  
  receiving, on a device, text as part of a message, a placeholder within the text and audio, wherein the device comprises an embedded speech recognition system that accesses private user data on the device and wherein the private user data is not available to a remote speech recognition system in communication with the device;
  
  receiving a component comprising one of a garbage model, phonemic language model, a language model according to a standard list, and a language model built on the private user data;
  
  identifying a location of the device;
  
  determining a privacy level of the private user data according to the location of the device;
  
  recognizing the audio using the component, the embedded speech recognition system and by accessing the private user data according to the privacy level to yield a recognition result;
  
  replacing the placeholder with the recognition result in the text to yield an updated message; and
  
  presenting the updated message on the device.
- View Dependent Claims (8)
- - 8. The system of claim 7, wherein the private user data comprises one of data in a user contact list, frequently dialed phone numbers, frequently used texted names, data associated with a user location, user history, data associated with a playlist, and multiple hypothesis associated with private information.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Microsoft Technology Licensing LLC (Microsoft Corporation)
Original Assignee
Nuance Communications, Inc. (Microsoft Corporation)
Inventors
Johnston, Michael J., Thomson, David, Rangarajan Sridhar, Vivek Kumar
Primary Examiner(s)
Armstrong, Angela A

Application Number

US14/066,079
Publication Number

US 20150120288A1
Time in Patent Office

1,309 Days
Field of Search

704231, 704270, 704275
US Class Current
CPC Class Codes

G10L 15/22   Procedures used during a sp...

G10L 15/30   Distributed recognition, e....

G10L 2015/228   of application context

System and method of performing automatic speech recognition using local private data

First Claim

3 Assignments

0 Petitions

Accused Products

Abstract

Citations

8 Claims

Specification

Solutions

Use Cases

Quick Links

System and method of performing automatic speech recognition using local private data

First Claim

3 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

8 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links