SPEECH RECOGNITION SYSTEM, SPEECH RECOGNITION METHOD, AND SPEECH RECOGNITION PROGRAM
Abstract
A speech recognition system has: hypothesis search means which searches for an optimal solution of inputted speech data by generating a hypothesis which is a bundle of words which are searched for as recognition result candidates; self-repair decision means which calculates a self-repair likelihood of a word or a word sequence included in the hypothesis which is being searched for by the hypothesis search means, and decides whether or not self-repair of the word or the word sequence is performed; and transparent word hypothesis generation means which, when it is decided that the self-repair is performed, generates a transparent word hypothesis which is a hypothesis which regards as a transparent word a word or a word sequence included in a disfluency interval or a repair interval of a self-repair interval including the word or the word sequence.
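To illustrate what regarding a word as "transparent" can mean in practice, the following is a minimal sketch, not the patented implementation. It assumes the common speech-recognition convention that a transparent word stays in the hypothesis but is skipped when the language-model history is built. The names `Word`, `mark_transparent`, and `lm_context`, the trigram order, and the hand-picked interval are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Word:
    surface: str
    transparent: bool = False  # assumption: transparent words are skipped in LM history


def mark_transparent(words: List[Word], start: int, end: int) -> List[Word]:
    """Return a copy in which the words at indices [start, end) are transparent."""
    return [
        Word(w.surface, True) if start <= i < end else w
        for i, w in enumerate(words)
    ]


def lm_context(words: List[Word], order: int = 3) -> Tuple[str, ...]:
    """Return the last (order - 1) non-transparent words as n-gram history."""
    if order <= 1:
        return ()
    visible = [w.surface for w in words if not w.transparent]
    return tuple(visible[-(order - 1):])


# Hypothetical utterance; the interval "uh i mean" (indices 3..5) is chosen by hand.
words = [Word(s) for s in "flight to boston uh i mean denver".split()]
repaired = mark_transparent(words, 3, 6)

print(lm_context(words))     # history with every word visible
print(lm_context(repaired))  # history that looks through the transparent interval
```

With the interval marked transparent, the history for the next word is built from "boston denver" rather than "mean denver", so the words around the interval are scored as if they were adjacent.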
12 Claims
1. A speech recognition system comprising:
a hypothesis search unit which searches for an optimal solution of inputted speech data by generating a hypothesis which is a bundle of words which are searched for as recognition result candidates;
a self-repair decision unit which calculates a self-repair likelihood of a word or a word sequence included in the hypothesis which is being searched for by the hypothesis search unit, and decides whether or not self-repair of the word or the word sequence is performed; and
a transparent word hypothesis generation unit which, when the self-repair decision unit decides that the self-repair is performed, generates a transparent word hypothesis which is a hypothesis which regards as a transparent word a word or a word sequence included in a disfluency interval or a repair interval of a self-repair interval including the word or the word sequence, wherein the hypothesis search unit searches for an optimal solution by including as search target hypotheses the transparent word hypothesis generated by the transparent word hypothesis generation unit.
Dependent claims: 2, 3, 4, 5, 6, 11, 12
7. A speech recognition method comprising in process in which a hypothesis search unit searches for an optimal solution of inputted speech data by generating a hypothesis which is a bundle of words which are searched for as recognition result candidates:
calculating a self-repair likelihood of a word or a word sequence included in a hypothesis which is being searched for and deciding whether or not self-repair of the word or the word sequence is performed; and
when it is decided that the self-repair is performed, generating a transparent word hypothesis which is a hypothesis which regards as a transparent word a word or a word sequence included in a disfluency interval or a repair interval of a self-repair interval including the word or the word sequence, wherein the hypothesis search unit searches for an optimal solution by including as search target hypotheses the generated transparent word hypothesis.
Dependent claims: 8
9. A non-transitory computer readable information recording medium storing a speech recognition program, when executed by a processor, performs a method for,
in process of hypothesis search processing of searching for an optimal solution of inputted speech data by generating a hypothesis which is a bundle of words which are searched for as recognition result candidates:
calculating a self-repair likelihood of a word or a word sequence included in a hypothesis which is being searched for and deciding whether or not self-repair of the word or the word sequence is performed; and
generating a transparent word hypothesis which is a hypothesis which regards as a transparent word a word or a word sequence included in a disfluency interval or a repair interval of a self-repair interval including the word or the word sequence when it is decided that the self-repair is performed, searching for an optimal solution by including as search target hypotheses the generated transparent word hypothesis.
Dependent claims: 10
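The steps recited in the method and program claims above (calculate a self-repair likelihood, decide, generate a transparent word hypothesis, add it to the search targets) can be sketched roughly as follows. This is an illustrative sketch under stated assumptions, not the claimed implementation: the claims do not say how the likelihood is computed, so the scorer is passed in as a parameter, and `Hypothesis`, `expand_with_transparent`, `toy_scorer`, and the 0.5 threshold are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, FrozenSet, List, Optional, Tuple

Span = Tuple[int, int]  # half-open [start, end) range of word indices


@dataclass(frozen=True)
class Hypothesis:
    words: Tuple[str, ...]
    transparent: FrozenSet[int] = frozenset()  # indices regarded as transparent


# Assumption: a scorer returns (self-repair likelihood, interval to make transparent).
Scorer = Callable[[Hypothesis], Tuple[float, Optional[Span]]]


def expand_with_transparent(
    hyps: List[Hypothesis], scorer: Scorer, threshold: float = 0.5
) -> List[Hypothesis]:
    """Keep every original hypothesis and add a transparent-word variant
    for each one whose self-repair likelihood reaches the threshold."""
    out = list(hyps)
    for h in hyps:
        likelihood, span = scorer(h)
        if span is not None and likelihood >= threshold:
            start, end = span
            marked = h.transparent | frozenset(range(start, end))
            out.append(Hypothesis(h.words, marked))
    return out


def toy_scorer(h: Hypothesis) -> Tuple[float, Optional[Span]]:
    """Stand-in scorer: treats a filler word as a sign of self-repair."""
    fillers = {"uh", "um"}
    for i, w in enumerate(h.words):
        if w in fillers and i not in h.transparent:
            return 0.9, (i, i + 1)
    return 0.0, None


beam = [Hypothesis(("to", "boston", "uh", "denver"))]
targets = expand_with_transparent(beam, toy_scorer)
for h in targets:
    print(h.words, sorted(h.transparent))
```

The original hypothesis is kept alongside the transparent variant, so the search can still prefer the unmodified reading when it scores better.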
Specification