Disambiguation of a spoken query term
First Claim
1. A system comprising:
- one or more computers; and
a computer-readable medium coupled to the one or more computers having instructions stored thereon which, when executed by the one or more computers, cause the one or more computers to perform operations comprising;
receiving an audio signal that corresponds to a spoken query term;
performing speech recognition on the audio signal to select two or more textual, candidate transcriptions that match the spoken query term, and to establish a speech recognition confidence value for each candidate transcription;
obtaining a search history for a user who spoke the spoken query term, wherein the search history references one or more past search queries that have been submitted by the user;
generating one or more n-grams from each candidate transcription, wherein each n-gram is a subsequence of n phonemes, syllables, letters, characters, words or terms from a respective candidate transcription;
determining, for each n-gram, a frequency with which the n-gram occurs in the past search queries, and a weighting value that is based on the respective frequency;
generating, for each of the candidate transcriptions, a combined value based on combining the speech recognition confidence value for the candidate transcription with the weighting value for one or more of the n-grams that are generated from the candidate transcription;
selecting an intended query term from among the candidate transcriptions based on the combined values; and
causing a search engine to perform a search query that includes the intended query term.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for processing spoken query terms. In one aspect, a method includes performing speech recognition on an audio signal to select two or more textual, candidate transcriptions that match a spoken query term, and to establish a speech recognition confidence value for each candidate transcription, obtaining a search history for a user who spoke the spoken query term, where the search history references one or more past search queries that have been submitted by the user, generating one or more n-grams from each candidate transcription, where each n-gram is a subsequence of n phonemes, syllables, letters, characters, words or terms from a respective candidate transcription, and determining, for each n-gram, a frequency with which the n-gram occurs in the past search queries, and a weighting value that is based on the respective frequency.
145 Citations
20 Claims
-
1. A system comprising:
-
one or more computers; and a computer-readable medium coupled to the one or more computers having instructions stored thereon which, when executed by the one or more computers, cause the one or more computers to perform operations comprising; receiving an audio signal that corresponds to a spoken query term; performing speech recognition on the audio signal to select two or more textual, candidate transcriptions that match the spoken query term, and to establish a speech recognition confidence value for each candidate transcription; obtaining a search history for a user who spoke the spoken query term, wherein the search history references one or more past search queries that have been submitted by the user; generating one or more n-grams from each candidate transcription, wherein each n-gram is a subsequence of n phonemes, syllables, letters, characters, words or terms from a respective candidate transcription; determining, for each n-gram, a frequency with which the n-gram occurs in the past search queries, and a weighting value that is based on the respective frequency; generating, for each of the candidate transcriptions, a combined value based on combining the speech recognition confidence value for the candidate transcription with the weighting value for one or more of the n-grams that are generated from the candidate transcription; selecting an intended query term from among the candidate transcriptions based on the combined values; and causing a search engine to perform a search query that includes the intended query term. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A non-transitory computer storage medium encoded with a computer program, the program comprising instructions that when executed by data processing apparatus cause the data processing apparatus to perform operations comprising:
-
receiving an audio signal that corresponds to a spoken query term; performing speech recognition on the audio signal to select two or more textual, candidate transcriptions that match the spoken query term, and to establish a speech recognition confidence value for each candidate transcription; obtaining a search history for a user who spoke the spoken query term, wherein the search history references one or more past search queries that have been submitted by the user; generating one or more n-grams from each candidate transcription, wherein each n-gram is a subsequence of n phonemes, syllables, letters, characters, words or terms from a respective candidate transcription; determining, for each n-gram, a frequency with which the n-gram occurs in the past search queries, and a weighting value that is based on the respective frequency; generating, for each of the candidate transcriptions, a combined value based on combining the speech recognition confidence value for the candidate transcription with the weighting value for one or more of the n-grams that are generated from the candidate transcription; selecting an intended query term from among the candidate transcriptions based on the combined values; and causing a search engine to perform a search query that includes the intended query term. - View Dependent Claims (15, 16, 17, 18, 19)
-
-
20. A method comprising:
-
receiving an audio signal that corresponds to a spoken query term; performing speech recognition on the audio signal to select two or more textual, candidate transcriptions that match the spoken query term, and to establish a speech recognition confidence value for each candidate transcription; obtaining a search history for a user who spoke the spoken query term, wherein the search history references one or more past search queries that have been submitted by the user; generating one or more n-grams from each candidate transcription, wherein each n-gram is a subsequence of n phonemes, syllables, letters, characters, words or terms from a respective candidate transcription; determining, for each n-gram, a frequency with which the n-gram occurs in the past search queries, and a weighting value that is based on the respective frequency; generating, by one or more computers, and for each of the candidate transcriptions, a combined value based on combining the speech recognition confidence value for the candidate transcription with the weighting value for one or more of the n-grams that are generated from the candidate transcription; selecting an intended query term from among the candidate transcriptions based on the combined values; and causing a search engine to perform a search query that includes the intended query term.
-
Specification