Method of producing alternate utterance hypotheses using auxiliary information on close competitors
First Claim
1. A method of constructing a choice list of alternate versions of a recognized transcript from a speech recognition system, said method comprising:
- during speech recognition, generating a list of close call records, wherein each record includes histories for each of two competing partial hypotheses;
initializing the close call list from at least one output of the speech recognition system;
selecting one of the close call records from the list of close call records;
selecting a transcript from the choice list;
determining whether one of the two histories for the selected record matches a partial subhistory of the transcript from the choice list;
if one of the two histories for the selected close call record matches a partial subhistory of the transcript, substituting the other of the two histories for the partial subhistory of the transcript to generate an alternative version of the transcript; and
, adding the alternative version of the transcript to the choice list.
10 Assignments
0 Petitions
Accused Products
Abstract
A method of constructing a list of alternate transcripts from a recognized transcript includes generating a list of close call records, matching partial sub-histories from the recognized transcript with one of the history pairs stored in each of the records, and substituting the other of the history pairs for the partial sub-history of the recognized transcript. A close call record is generated each time a pair of partial hypotheses attempt to seed a common word. Each close call record includes history information and scoring information associated with a particular pair of partial hypotheses seeding a common word. Alternate transcripts are constructed by substituting close call histories for partial histories of the recognized transcripts, and also by substituting close call histories for partial histories of other alternate transcript.
-
Citations
33 Claims
-
1. A method of constructing a choice list of alternate versions of a recognized transcript from a speech recognition system, said method comprising:
-
during speech recognition, generating a list of close call records, wherein each record includes histories for each of two competing partial hypotheses;
initializing the close call list from at least one output of the speech recognition system;
selecting one of the close call records from the list of close call records;
selecting a transcript from the choice list;
determining whether one of the two histories for the selected record matches a partial subhistory of the transcript from the choice list;
if one of the two histories for the selected close call record matches a partial subhistory of the transcript, substituting the other of the two histories for the partial subhistory of the transcript to generate an alternative version of the transcript; and
,adding the alternative version of the transcript to the choice list. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A method of constructing a list of alternate versions of a recognized transcript, said method comprising:
-
during speech recognition, generating a list of close call records, wherein each record includes histories for each of two competing partial hypotheses that seed a common word;
adding the recognized transcript to a choice list;
selecting a record from the close call list;
for each entry on the choice list, (a) determining whether one of the two histories for the selected record matches a partial subhistory of that entry on the choice list;
(b) if one of the two histories for the selected record matches a partial subhisory of that entry, substituting the other of the two histories for the partial subhistory of that entry to generate an alternative version of the transcript; and
,(c) adding the alternative version of the transcript to the choice list. - View Dependent Claims (14)
-
-
15. A method of constructing a list of alternate transcripts from a recognized transcript, comprising:
-
performing speech recognition on a spoken transcripts to generate a best scoring hypothesis, wherein performing speech recognition involves at each of a plurality of different times throughout the transcript generating two partial hypotheses each seeding a common word, said two partial hypotheses including a primary hypothesis having a first score and corresponding to a primary partial history and a competing hypothesis having a second score and corresponding to a competing partial history;
at each of the plurality of different times, storing a close call record, wherein said close call record includes the primary partial history, the competing partial history, and a measure of how close the two competing hypotheses are; and
after performing speech recognition, using the stored close call records to generate a choice list of alternative versions of the best scoring hypothesis.
-
-
16. A method of constructing a list of alternate transcript from a recognized utterance, comprising
generating a list of close call records, wherein each record includes history information and scoring information associated with a particular pair of partial hypotheses seeding a common word; -
generating one or more alternate transcripts from the list of close call records by evaluating each record in the list for a match between a partial sub-history of the recognized utterance and one of the histories stored in the record, and upon finding such a match, substituting the other of the histories stored in the record for the partial sub-history in the recognized transcript; and
,storing the one or more alternate transcripts in a choice list. - View Dependent Claims (17, 18, 19)
-
-
20. A method of constructing a list of alternate transcripts from a recognized transcript, comprising:
-
providing two or more partial hypotheses of an acoustic transcript;
for each pair of partial hypotheses characterized by a first partial hypothesis having an associated first score and a second partial hypothesis having an associated second score being less than the first score, both ending at a common time and both seeding a common continuation word, evaluating the first partial hypothesis and the second partial hypothesis at each acoustic frame following the seeding of the common continuation word, and storing in a close call list a record of the first and second partial hypotheses, the record corresponding to the acoustic frame resulting in a smallest score difference between a current best overall scoring hypothesis and the second score, wherein the record includes at least (i) a history of the first partial hypothesis, (ii) a history of the second partial hypothesis, (iii) a score difference being a difference between the first score and the second score, and (iv) a global score difference being a difference between the current best overall scoring hypothesis and the second score; and
,generating one or more alternate hypotheses by combining information from at least one record in the close call list with the recognized transcript.
-
-
21. A method of constructing a list of alternate utterance hypotheses from a complete utterance hypothesis, comprising:
-
providing two or more partial hypotheses of an acoustic utterance;
for each pair of partial hypotheses characterized by a first partial hypothesis having an associated first score and a second partial hypothesis having an associated second score being less than the first score, both ending at a common time and both seeding a common continuation word, evaluating the first partial hypothesis and the second partial hypothesis at each acoustic frame following the seeding of the common continuation word, and storing in a close call list a record of the first and second partial hypotheses, the record corresponding to the acoustic frame resulting in a smallest score difference between a current best overall scoring hypothesis and the second score, wherein the record includes at least (i) a history of the first partial hypothesis, (ii) a history of the second partial hypothesis, (iii) a score difference being a difference between the first score and the second score, and (iv) a global score difference being a difference between the current best overall scoring hypothesis and the second score;
for each acoustic frame, updating the two or more partial hypotheses until the acoustic utterance ends, and selecting a best scoring complete hypothesis;
evaluating the records in the close call list for potential alternate utterance hypotheses, beginning with a record in the close call list having a smallest score difference and subsequently with each record in the close call list in an order of ascending score difference, by;
(i) comparing a set of first words from the first hypothesis and a set of first words from one or more complete hypotheses from a choice list;
(ii) if a set of first words from a history of the first partial hypothesis matches a set of first words from one or more complete hypotheses from the choice list, substituting the history of the second partial hypothesis for the history of the first partial hypothesis within the one or more complete hypotheses from the choice list so as to generate one or more alternate utterance hypotheses, and placing the alternate hypotheses in the choice list; and
,(iii) continuing evaluating the records in the close call list until filling the choice list. - View Dependent Claims (22, 23, 24, 25)
-
-
26. A method of creating an alternate utterance hypothesis from a complete utterance hypothesis, comprising:
-
for a first partial hypothesis having an associated first score and a second partial hypothesis having an associated second score being less than the first score, both ending at a common time and both seeding a common continuation word, storing information characterizing the first partial hypothesis and the second partial hypothesis at each frame following the seeding of the common continuation word, the information including at least a history of the first partial hypothesis and a history of the second partial hypothesis;
comparing a set of first words from the first hypothesis and a set of first words from the complete utterance hypothesis; and
,if a set of first words from the history of the first partial hypothesis matches a set of first words from the complete utterance hypothesis, substituting the history of the second partial hypothesis for the history of the first partial hypothesis within the complete utterance hypothesis. - View Dependent Claims (27, 28, 29)
-
-
30. A computer readable medium including stored instructions adapted for execution on a processor, comprising:
-
instructions for generating a list of close call records, wherein each record includes history information and scoring information associated with a particular pair of partial hypotheses seeding a common word;
instructions for generating one or more alternate transcripts from the list of close call records by evaluating each record in the list for a match between a partial sub-history of the recognized transcript and one of the histories stored in the record, and upon finding such a match, substituting the other of the histories stored in the record for the partial sub-history in the recognized transcript; and
,instructions for storing the one or more alternate trancripts in a choice list. - View Dependent Claims (31, 32)
-
-
33. A computer readable medium including stored instructions adapted for execution on a processor, comprising:
-
instructions for generating, during speech recognition, a close call list of close call records, wherein each record includes histories for each of two competing partial hypotheses;
instructions for initializing the close call list from at least one output of the speech recognition system;
instructions for selecting one of the close call records from the close call list;
instructions for selecting a transcript from the choice list;
instructions for determining whether one of the two histories for the selected record matches a partial subhistory of the transcript from the choice list;
instructions for substituting the other of the two histories for the partial subhistory of the transcript to generate an alternative version of the transcript, if one of the two histories for the selected close call record matches a partial subhistory of the transcript; and
, instructions for adding the alternative version of the transcript to the choice list.
-
Specification