Forming speech recognition over a network and using speech recognition results based on determining that a network connection exists

US 9,380,155 B1
Filed: 02/13/2014
Issued: 06/28/2016
Est. Priority Date: 11/30/2000
Status: Active Grant

4 Assignments

0 Petitions

Abstract

Systems, methods and apparatus for generating, distributing, and using speech recognition models. A shared speech processing facility is used to support speech recognition for a wide variety of devices with limited capabilities including business computer systems, personal data assistants, etc., which are coupled to the speech processing facility via a communications channel, e.g., the Internet. Devices with audio capture capability record and transmit to the speech processing facility, via the Internet, digitized speech and receive speech processing services, e.g., speech recognition model generation and/or speech recognition services, in response. The Internet is used to return speech recognition models and/or information identifying recognized words or phrases. The speech processing facility can be used to provide speech recognition capabilities to devices without such capabilities and/or to augment a device's speech processing capability. Voice dialing, telephone control and/or other services are provided by the speech processing facility in response to speech recognition results.
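The exchange described in the abstract, in which a device sends digitized speech and receives either a speech recognition model or recognized words in return, can be illustrated with a minimal sketch. Everything named here (`Service`, `SpeechRequest`, `SpeechResponse`, `SpeechFacility`, the per-device model store, and the byte-prefix stand-in for training) is hypothetical and chosen only for illustration; the patent does not specify message formats, storage, or any particular recognition algorithm.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Dict, List, Optional, Tuple


class Service(Enum):
    RECOGNIZE = "recognize"            # speech recognition service
    GENERATE_MODEL = "generate_model"  # speech recognition model generation


@dataclass
class SpeechRequest:
    device_id: str
    service: Service
    audio: bytes     # digitized speech captured by the device
    label: str = ""  # assumed: the word or phrase the audio is an example of


@dataclass
class SpeechResponse:
    recognized: Optional[List[str]] = None  # recognized words or phrases
    model: Optional[bytes] = None           # returned speech recognition model


class SpeechFacility:
    """Toy shared facility; real training and recognition are replaced by stand-ins."""

    def __init__(self) -> None:
        # Assumed storage layout: (device_id, label) -> model bytes.
        self.models: Dict[Tuple[str, str], bytes] = {}

    def handle(self, req: SpeechRequest) -> SpeechResponse:
        if req.service is Service.GENERATE_MODEL:
            model = b"model:" + req.audio  # stand-in for real model training
            self.models[(req.device_id, req.label)] = model
            return SpeechResponse(model=model)
        # Stand-in recognition: a label matches only if its stored model
        # was generated from byte-identical audio for the same device.
        hits = [
            label
            for (device, label), model in self.models.items()
            if device == req.device_id and model == b"model:" + req.audio
        ]
        return SpeechResponse(recognized=hits)
```

A device would first request model generation for a labeled sample and could later submit audio for recognition; in this sketch both services share one entry point so that the response carries either a model or recognized words, mirroring the two kinds of results the abstract says are returned over the Internet.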

227 Citations

20 Claims

1. A computer-implemented method comprising:
- receiving, at a server and from a mobile device, a request including a speech data representation of an utterance or feature data extracted from the speech data representation of the utterance;
  
  obtaining, by the server, a transcription of the utterance by applying a speech recognition model to the speech data representation of the utterance or the feature data extracted from the speech data representation of the utterance;
  
  identifying, by the server, a keyword based on the transcription of the utterance; and
  
  initiating a communication between the mobile device and another device based on the identified keyword.
- Dependent claims (2, 3, 4, 5, 6, 7)
- - 2. The method of claim 1, comprising:
    - updating the speech recognition model using the speech data representation of the utterance or the feature data extracted from the speech data representation of the utterance.
  - 3. The method of claim 1, wherein the keyword comprises a name of a contact.
  - 4. The method of claim 1, wherein initiating a communication between the mobile device and another device based on the identified keyword, comprises:
    - initiating a voice dialing operation based on the identified keyword.
  - 5. The method of claim 1, wherein initiating a communication between the mobile device and another device based on the identified keyword, comprises:
    - providing voice dialing information to the mobile device based on the identified keyword.
  - 6. The method of claim 1, wherein identifying, by the server, a keyword based on the transcription of the utterance comprises:
    - determining that the keyword matches a word in the transcription of the utterance.
  - 7. The method of claim 1, wherein obtaining, by the server, a transcription of the utterance by applying a speech recognition model to the speech data representation of the utterance or the feature data extracted from the speech data representation of the utterance, comprises:
    - performing automated speech recognition of the utterance using the speech recognition model to generate the transcription of the utterance.
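The steps of claim 1, together with the keyword matching of claim 6, the contact-name keyword of claim 3, and the voice dialing information of claim 5, can be sketched roughly as follows. This is an illustrative sketch only, not the patented implementation: `DialInfo`, `identify_keyword`, `handle_request`, `fake_recognizer`, and the sample contact list are all hypothetical, and the recognizer is a placeholder that simply decodes bytes as text rather than applying a real speech recognition model.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, Optional


@dataclass
class DialInfo:
    contact: str  # keyword identified in the transcription (a contact name)
    number: str   # voice dialing information to provide to the mobile device


def identify_keyword(transcription: str, keywords: Iterable[str]) -> Optional[str]:
    """Return the first keyword that matches a word in the transcription."""
    # Simplification: whitespace tokenization, single-word keywords, no punctuation handling.
    words = transcription.lower().split()
    for keyword in keywords:
        if keyword.lower() in words:
            return keyword
    return None


def handle_request(
    speech_data: bytes,
    recognize: Callable[[bytes], str],
    contacts: Dict[str, str],
) -> Optional[DialInfo]:
    """Server-side handling of one request received from a mobile device."""
    transcription = recognize(speech_data)               # obtain a transcription
    keyword = identify_keyword(transcription, contacts)  # identify a keyword
    if keyword is None:
        return None
    # Provide dialing information so the communication can be initiated.
    return DialInfo(contact=keyword, number=contacts[keyword])


def fake_recognizer(speech_data: bytes) -> str:
    # Placeholder for applying a speech recognition model (assumption).
    return speech_data.decode("utf-8")


contacts = {"Alice": "555-0100", "Bob": "555-0101"}
result = handle_request(b"please call bob now", fake_recognizer, contacts)
```

Passing the recognizer in as a callable keeps the keyword and dialing logic independent of whichever speech recognition model the server applies; whether the server itself places the call (claim 4) or returns dialing information to the device (claim 5) would only change what is done with the returned `DialInfo`.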

8. A system comprising:
- one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising:
  
  receiving, at a server and from a mobile device, a request including a speech data representation of an utterance or feature data extracted from the speech data representation of the utterance;
  
  obtaining, by the server, a transcription of the utterance by applying a speech recognition model to the speech data representation of the utterance or the feature data extracted from the speech data representation of the utterance;
  
  identifying, by the server, a keyword based on the transcription of the utterance; and
  
  initiating a communication between the mobile device and another device based on the identified keyword.
- Dependent claims (9, 10, 11, 12, 13, 14)
- - 9. The system of claim 8, the operations comprising:
    - updating the speech recognition model using the speech data representation of the utterance or the feature data extracted from the speech data representation of the utterance.
  - 10. The system of claim 8, wherein the keyword comprises a name of a contact.
  - 11. The system of claim 8, wherein initiating a communication between the mobile device and another device based on the identified keyword, comprises:
    - initiating a voice dialing operation based on the identified keyword.
  - 12. The system of claim 8, wherein initiating a communication between the mobile device and another device based on the identified keyword, comprises:
    - providing voice dialing information to the mobile device based on the identified keyword.
  - 13. The system of claim 8, wherein identifying, by the server, a keyword based on the transcription of the utterance comprises:
    - determining that the keyword matches a word in the transcription of the utterance.
  - 14. The system of claim 8, wherein obtaining, by the server, a transcription of the utterance by applying a speech recognition model to the speech data representation of the utterance or the feature data extracted from the speech data representation of the utterance, comprises:
    - performing automated speech recognition of the utterance using the speech recognition model to generate the transcription of the utterance.

15. A non-transitory computer-readable medium storing software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising:
- receiving, at a server and from a mobile device, a request including a speech data representation of an utterance or feature data extracted from the speech data representation of the utterance;
  
  obtaining, by the server, a transcription of the utterance by applying a speech recognition model to the speech data representation of the utterance or the feature data extracted from the speech data representation of the utterance;
  
  identifying, by the server, a keyword based on the transcription of the utterance; and
  
  initiating a communication between the mobile device and another device based on the identified keyword.
- Dependent claims (16, 17, 18, 19, 20)
- - 16. The medium of claim 15, wherein obtaining, by the server, a transcription of the utterance by applying a speech recognition model to the speech data representation of the utterance or the feature data extracted from the speech data representation of the utterance, comprises:
    - performing automated speech recognition of the utterance using the speech recognition model to generate the transcription of the utterance.
  - 17. The medium of claim 15, wherein the keyword comprises a name of a contact.
  - 18. The medium of claim 15, wherein initiating a communication between the mobile device and another device based on the identified keyword, comprises:
    - initiating a voice dialing operation based on the identified keyword.
  - 19. The medium of claim 15, wherein initiating a communication between the mobile device and another device based on the identified keyword, comprises:
    - providing voice dialing information to the mobile device based on the identified keyword.
  - 20. The medium of claim 15, wherein identifying, by the server, a keyword based on the transcription of the utterance comprises:
    - determining that the keyword matches a word in the transcription of the utterance.

Current Assignee
Google LLC (Alphabet Inc.)
Original Assignee
Google Inc. (Alphabet Inc.)
Inventors
Reding, Craig L., Levas, Suzi
Primary Examiner(s)
McFadden, Susan

Application Number

US14/179,725
Time in Patent Office

866 Days
Field of Search

379/88.17, 704/270.1
US Class Current

1/1
CPC Class Codes

G10L 13/08   Text analysis or generation...

G10L 15/02   Feature extraction for spee...

G10L 15/063   Training

G10L 15/08   Speech classification or se...

G10L 15/22   Procedures used during a sp...

G10L 15/26   Speech to text systems G10L...

G10L 15/30   Distributed recognition, e....

G10L 17/04   Training, enrolment or mode...

G10L 2015/221   Announcement of recognition...

G10L 2015/223   Execution procedure of a sp...

H04M 2201/40   using speech recognition sp...

H04M 2207/18   wireless networks

H04M 3/42204   Arrangements at the exchang...
