Accuracy improvement of spoken queries transcription using co-occurrence information

  • US 9,330,661 B2
  • Filed: 01/16/2014
  • Issued: 05/03/2016
  • Est. Priority Date: 07/31/2011
  • Status: Active Grant
First Claim

1. A method comprising:

  • receiving a spoken query;

    identifying, via an automated speech recognition process, a plurality of transcription hypotheses based on the spoken query, each respective transcription hypothesis having a speech recognition score;

    evaluating the plurality of transcription hypotheses using a co-occurrence identification process, the co-occurrence identification process comprising:

    identifying a frequency that proposed query terms, from each respective transcription hypothesis, co-occur based on a corpus of documents;

    assigning a co-occurrence score to each respective transcription hypothesis;

    evaluating a weighting of the speech recognition score of each respective transcription hypothesis and a weighting of the co-occurrence score of each respective transcription hypothesis;

    increasing a weighting of a co-occurrence score of a given transcription hypothesis of the plurality of transcription hypotheses to be greater than a weighting of a speech recognition score of the given transcription hypothesis of the plurality of transcription hypotheses when proposed query terms from the given transcription hypothesis are more than a threshold phrase length; and

    selecting a best transcription hypothesis based on at least the weighting of the speech recognition score of each respective transcription hypothesis and the weighting of the co-occurrence score of each respective transcription hypothesis;

    generating a text query corresponding to the best transcription hypothesis; and

    receiving, from an information retrieval system, search results based on the text query corresponding to the best transcription hypothesis.
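The scoring and selection steps of claim 1 can be illustrated with a minimal sketch. It is not the patented implementation. The names `cooccurrence_score` and `select_best`, the pairwise document-frequency measure, the toy corpus, and the weight values are all assumptions chosen for illustration; the claim only requires that the co-occurrence weighting exceed the speech recognition weighting when a hypothesis has more terms than a threshold phrase length. Speech recognition, text query generation, and retrieval of search results are outside the sketch.

```python
from itertools import combinations
from typing import List, Sequence, Tuple


def cooccurrence_score(terms: Sequence[str], corpus: Sequence[Sequence[str]]) -> float:
    """Average, over all pairs of distinct terms, of the fraction of
    documents that contain both terms (an assumed co-occurrence measure)."""
    pairs = list(combinations(sorted(set(terms)), 2))
    if not pairs or not corpus:
        return 0.0
    doc_sets = [set(doc) for doc in corpus]
    total = 0.0
    for a, b in pairs:
        hits = sum(1 for doc in doc_sets if a in doc and b in doc)
        total += hits / len(doc_sets)
    return total / len(pairs)


def select_best(
    hypotheses: List[Tuple[str, float]],
    corpus: Sequence[Sequence[str]],
    threshold_len: int = 3,
    base_weights: Tuple[float, float] = (0.9, 0.1),  # (speech, co-occurrence), assumed
    long_weights: Tuple[float, float] = (0.3, 0.7),  # co-occurrence weight is larger
) -> str:
    """Return the hypothesis text with the highest weighted combined score."""
    best_text, best_score = "", float("-inf")
    for text, asr_score in hypotheses:
        terms = text.lower().split()
        co_score = cooccurrence_score(terms, corpus)
        # Past the threshold phrase length, weight co-occurrence above speech score.
        w_asr, w_co = long_weights if len(terms) > threshold_len else base_weights
        combined = w_asr * asr_score + w_co * co_score
        if combined > best_score:
            best_text, best_score = text, combined
    return best_text


# Toy data (hypothetical): tokenized documents and (hypothesis, speech score) pairs.
CORPUS = [
    ["cheap", "flights", "to", "boston", "deals"],
    ["flights", "to", "boston", "from", "denver"],
    ["boxing", "fights", "tonight"],
    ["cheap", "hotels", "boston"],
]
HYPOTHESES = [
    ("cheap fights to boston", 0.62),
    ("cheap flights to boston", 0.58),
]

if __name__ == "__main__":
    print(select_best(HYPOTHESES, CORPUS, threshold_len=3))
    print(select_best(HYPOTHESES, CORPUS, threshold_len=10))
```

With a threshold of 3, both four-term hypotheses exceed the threshold, so co-occurrence dominates and "cheap flights to boston" is selected even though its speech recognition score is lower. With a threshold of 10, neither hypothesis exceeds it, the speech recognition score dominates, and "cheap fights to boston" is selected.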
