Environmentally aware speech recognition
First Claim
1. A method, comprising:
- receiving, via a device, one or more spoken utterances;
- based on the one or more spoken utterances, identifying a language of the one or more spoken utterances;
- determining an acoustic model for a particular language based on the identified language, wherein the acoustic model for the particular language is configured for use in speech recognition;
- determining a location of the device;
- determining one or more environmental conditions regarding an environment of the location of the device;
- determining from among a plurality of data sets at least one adaptation data set based on the one or more environmental conditions and the location of the device, wherein the at least one adaptation data set includes information that enables recognition of speech preferences associated with the location; and
- using the at least one adaptation data set, adapting the acoustic model for the particular language to obtain another acoustic model that is adapted to the one or more environmental conditions and the location of the device.
Abstract
Examples of methods and systems for implementing environmentally aware speech recognition are described. In some examples, a method may be performed by a computing device within a system to adapt an acoustic model for a particular language to one or more environmental conditions. A device may receive one or more spoken utterances and based on the utterances, a system containing the device may determine an acoustic model for the particular language. The system may adapt the acoustic model using one or more data sets depending on the environmental conditions at the location of the device or may obtain another acoustic model that is adapted to the environmental conditions. In some examples, the system may also adapt the acoustic model using one or more data sets based on the voice characteristics of the speaker of the one or more spoken utterances.
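The flow described in the abstract can be sketched roughly as follows. This is a minimal illustration only, not the patented implementation: the names `AcousticModel`, `AdaptationDataSet`, `select_adaptation_sets`, and `adapt_for_environment` are hypothetical, language identification and location sensing are assumed to have happened upstream, and the additive parameter offsets are a toy stand-in for whatever adaptation technique a real system would use.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class AdaptationDataSet:
    """Hypothetical adaptation data keyed by location and environmental condition."""
    location: str
    condition: str
    offsets: dict[str, float]  # toy stand-in for real adaptation statistics


@dataclass(frozen=True)
class AcousticModel:
    """Hypothetical acoustic model: a language tag plus named parameters."""
    language: str
    params: dict[str, float]
    adapted_to: tuple[tuple[str, str], ...] = ()


def select_adaptation_sets(data_sets, location, conditions):
    """Pick the data sets matching both the device location and a current condition."""
    return [d for d in data_sets
            if d.location == location and d.condition in conditions]


def adapt_for_environment(base_models, data_sets, language, location, conditions):
    """Return a new model adapted to the location and conditions; the base is untouched."""
    base = base_models[language]           # model chosen from the identified language
    chosen = select_adaptation_sets(data_sets, location, conditions)
    params = dict(base.params)             # copy so the base model is not modified
    for data_set in chosen:
        for name, offset in data_set.offsets.items():
            params[name] = params.get(name, 0.0) + offset
    tags = tuple((d.location, d.condition) for d in chosen)
    return AcousticModel(base.language, params, tags)


if __name__ == "__main__":
    base_models = {"en": AcousticModel("en", {"mean_shift": 0.0})}
    data_sets = [
        AdaptationDataSet("cafe", "crowd_noise", {"mean_shift": 0.5}),
        AdaptationDataSet("car", "road_noise", {"mean_shift": 1.0}),
    ]
    adapted = adapt_for_environment(
        base_models, data_sets, "en", "cafe", {"crowd_noise"})
    print(adapted)
```

Returning a separate model object mirrors the claim wording "to obtain another acoustic model": the language-specific base model stays available for other locations and conditions.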
20 Claims
1. A method, comprising:
- receiving, via a device, one or more spoken utterances;
- based on the one or more spoken utterances, identifying a language of the one or more spoken utterances;
- determining an acoustic model for a particular language based on the identified language, wherein the acoustic model for the particular language is configured for use in speech recognition;
- determining a location of the device;
- determining one or more environmental conditions regarding an environment of the location of the device;
- determining from among a plurality of data sets at least one adaptation data set based on the one or more environmental conditions and the location of the device, wherein the at least one adaptation data set includes information that enables recognition of speech preferences associated with the location; and
- using the at least one adaptation data set, adapting the acoustic model for the particular language to obtain another acoustic model that is adapted to the one or more environmental conditions and the location of the device.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A non-transitory computer readable medium having stored therein instructions, that when executed by a computing system, cause the computing system to perform functions comprising:
- receiving, via a device, one or more spoken utterances;
- based on the one or more spoken utterances, identifying a language of the one or more spoken utterances;
- determining an acoustic model for a particular language based on the identified language, wherein the acoustic model for the particular language is configured for use in speech recognition;
- determining a location of the device;
- determining one or more environmental conditions regarding an environment of the location of the device;
- determining from among a plurality of data sets at least one adaptation data set based on the one or more environmental conditions and the location of the device, wherein the at least one adaptation data set includes information that enables recognition of speech preferences associated with the location; and
- using the at least one adaptation data set, adapting the acoustic model for the particular language to obtain another acoustic model that is adapted to the one or more environmental conditions and the location of the device.
Dependent claims: 9, 10, 11, 12, 13, 14
15. A system, comprising:
- at least one processor; and
- data storage comprising program instructions executable by the at least one processor to cause the system to perform functions comprising:
  - receiving, via a device, one or more spoken utterances;
  - based on the one or more spoken utterances, identifying a language of the one or more spoken utterances;
  - determining an acoustic model for a particular language based on the identified language, wherein the acoustic model for the particular language is configured for use in speech recognition;
  - determining a location of the device;
  - determining one or more environmental conditions regarding an environment of the location of the device;
  - determining from among a plurality of data sets at least one adaptation data set based on the one or more environmental conditions and the location of the device, wherein the at least one adaptation data set includes information that enables recognition of speech preferences associated with the location; and
  - using the at least one adaptation data set, adapting the acoustic model for the particular language to obtain another acoustic model that is adapted to the one or more environmental conditions and the location of the device.
Dependent claims: 16, 17, 18, 19, 20
Specification