ACCURACY IMPROVEMENT OF SPOKEN QUERIES TRANSCRIPTION USING CO-OCCURRENCE INFORMATION
First Claim
1. A computer-implemented method for executing a voice search, the computer-implemented method comprising:
- receiving a spoken query;
identifying, via an automated speech recognition process, multiple transcription hypotheses based on the spoken query, each respective transcription hypothesis having a speech recognition score;
evaluating a plurality of the transcription hypotheses using a co-occurrence identification process, the co-occurrence identification process;
identifying a frequency that proposed query terms, from each respective transcription hypothesis, co-occur based on a corpus of documents;
assigning a co-occurrence score to each respective transcription hypothesis; and
selecting a best transcription hypothesis based on at least non-sequential co-occurrences of the proposed query terms within the corpus documents; and
generating a text query corresponding to the best transcription hypothesis.
1 Assignment
0 Petitions
Accused Products
Abstract
Techniques disclosed herein include systems and methods for voice-enabled searching. Techniques include a co-occurrence based approach to improve accuracy of the 1-best hypothesis for non-phrase voice queries, as well as for phrased voice queries. A co-occurrence model is used in addition to a statistical natural language model and acoustic model to recognize spoken queries, such as spoken queries for searching a search engine. Given an utterance and an associated list of automated speech recognition n-best hypotheses, the system rescores the different hypotheses using co-occurrence information. For each hypothesis, the system estimates a frequency of co-occurrence within web documents. Combined scores from a speech recognizer and a co-occurrence engine can be combined to select a best hypothesis with a lower word error rate.
52 Citations
15 Claims
-
1. A computer-implemented method for executing a voice search, the computer-implemented method comprising:
-
receiving a spoken query; identifying, via an automated speech recognition process, multiple transcription hypotheses based on the spoken query, each respective transcription hypothesis having a speech recognition score; evaluating a plurality of the transcription hypotheses using a co-occurrence identification process, the co-occurrence identification process; identifying a frequency that proposed query terms, from each respective transcription hypothesis, co-occur based on a corpus of documents; assigning a co-occurrence score to each respective transcription hypothesis; and selecting a best transcription hypothesis based on at least non-sequential co-occurrences of the proposed query terms within the corpus documents; and generating a text query corresponding to the best transcription hypothesis. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A computer system for executing a voice search, the computer system comprising:
-
a processor; and a memory coupled to the processor, the memory storing instructions that, when executed by the processor, cause the system to perform the operations of; receiving a spoken query; identifying, via an automated speech recognition process, multiple transcription hypotheses based on the spoken query, each respective transcription hypothesis having a speech recognition score; evaluating a plurality of the transcription hypotheses using a co-occurrence identification process, the co-occurrence identification process; identifying a frequency that proposed query terms, from each respective transcription hypothesis, co-occur based on a corpus of documents; assigning a co-occurrence score to each respective transcription hypothesis; and selecting a best transcription hypothesis based on at least non-sequential co-occurrences of the proposed query terms within the corpus documents; and generating a text query corresponding to the best transcription hypothesis.
-
Specification