Speech recognition system and method for speech recognition
First Claim
1. A speech recognition system comprising:
- an identifier for adding an identifying code to utterance data corresponding to signals generated by utterances of each of a plurality of users, the identifying code being available for identifying each of the users,a calculator for rating the utterance data by a value for each of the identifying code, the value being determined on the basis of comparison of characteristics of the utterance data with characteristics of word information selected from a plurality of sets of word information stored;
storage for storing N pieces of vocabulary information corresponding to N sets of the utterance data, the utterance data having a same identifying code, the N sets of utterance data having the value within top N, N being an integer equal to one or more;
a selector for selecting posterior N pieces of word information posterior in time to prior N pieces of word information, the identifying codes of the utterance data relative to the posterior and prior N pieces of word information being spoken by the users that are different from each other;
a relational calculator for calculating a degree of relationship between the prior and posterior N pieces of word information, the degree of relationship being capable of rating a fact of the utterance relative to the posterior N pieces of word information being performed later than the utterance relative to the prior N pieces of word information;
a first determiner for determining the posterior N pieces of word information corresponding to an utterance performed later than the utterance relative to the prior N pieces of word information; and
a second determiner for determining the posterior N pieces of word relative to an utterance as a response to the utterance relating to the prior N pieces of word information on the basis of a predetermined condition.
1 Assignment
0 Petitions
Accused Products
Abstract
A recognition result extraction unit and an agreement determination unit are provided. The recognition result extraction unit extracts, from a recognition result storage unit, N best solutions A and B obtained by an utterance B. The utterance B follows an utterance A corresponding to the N best solutions A and made by a speaker b who is different from a speaker of the utterance A. In a case where a repeat utterance determination unit determines that the N best solutions B are N best solutions obtained by a repeat utterance B according to the utterance A corresponding to the N best solutions A, when the best solution A and B are different each other, the agreement determination unit determines that some or all of the N best solutions A can be replaced with some or all of the N best solutions B.
-
Citations
17 Claims
-
1. A speech recognition system comprising:
-
an identifier for adding an identifying code to utterance data corresponding to signals generated by utterances of each of a plurality of users, the identifying code being available for identifying each of the users, a calculator for rating the utterance data by a value for each of the identifying code, the value being determined on the basis of comparison of characteristics of the utterance data with characteristics of word information selected from a plurality of sets of word information stored; storage for storing N pieces of vocabulary information corresponding to N sets of the utterance data, the utterance data having a same identifying code, the N sets of utterance data having the value within top N, N being an integer equal to one or more; a selector for selecting posterior N pieces of word information posterior in time to prior N pieces of word information, the identifying codes of the utterance data relative to the posterior and prior N pieces of word information being spoken by the users that are different from each other; a relational calculator for calculating a degree of relationship between the prior and posterior N pieces of word information, the degree of relationship being capable of rating a fact of the utterance relative to the posterior N pieces of word information being performed later than the utterance relative to the prior N pieces of word information; a first determiner for determining the posterior N pieces of word information corresponding to an utterance performed later than the utterance relative to the prior N pieces of word information; and a second determiner for determining the posterior N pieces of word relative to an utterance as a response to the utterance relating to the prior N pieces of word information on the basis of a predetermined condition. - View Dependent Claims (2, 3)
-
-
4. A speech recognition system comprising:
-
an input identification means for identifying each of a plurality of users of received signals of utterance; recognition result storage for storing top N recognition vocabularies having high recognition scores starting from the best solution as N best solutions, N being an integer equal to one or more, the recognition scores being calculated by comparing data corresponding to the utterance with a plurality of recognition vocabularies, a recognition word having the highest recognition score being the best solution; a recognition result extraction means for extracting N best solutions extracted as following N best solutions from the recognition result storage, the following N best solutions following chronologically the utterance corresponding to a preceding N best solutions, the following N best solutions having been made by one of the users different from the user of the utterance corresponding to the preceding N best solutions; a degree of association calculation means for calculating a degree of association representing a likelihood that the following N best solutions are N best solutions obtained by a response utterance in response to the utterance corresponding to the preceding N best solutions; a response utterance determination means for determining that the following N best solutions are N best solutions obtained by a response utterance in response to the utterance corresponding to the preceding N best solutions in the case of the degree of association being equal to or more than a threshold value; a repeat utterance determination means for determining whether the following N best solutions are N best solutions obtained by a repeat utterance in response to the utterance corresponding to the preceding N best solution, in the case that the following N best solutions are N best solutions obtained by a response utterance in response to the utterance corresponding to the preceding N best solutions; and an agreement determination means for;
determining whether a preceding best solution and a following best solution agree with each other in the case of the following N best solutions being best solutions obtained by a repeat utterance in response to the utterance corresponding to the preceding N best solutions, the preceding best solution being a best solution of the preceding N best solutions, the following best solution being a best solution of the following N best solutions is the following best solution; and
determining that some or all of the preceding N best solutions can be replaced with some or all of the following N best solutions in the case that the preceding best solution and the following best solution do not agree with each other. - View Dependent Claims (5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A speech recognition method comprising:
-
adding an identifying code to utterance data corresponding to signals generated by utterances of each of a plurality of users, the identifying code being available for identifying each of the users; rating the utterance data by a value for each of the identifying codes, the value being determined on the basis of comparison of a characteristics of the utterance data with characteristics of word information selected from a plurality of sets of word information stored; storing N pieces of word information corresponding to N sets of the utterance data, the utterance data having a same identifying code, the N sets of utterance data having the value within top N, N being an integer equal to one or more; selecting posterior N pieces of word information posterior in time to prior N pieces of word information, the identifying codes of the utterance data relative to the posterior and prior N pieces of word information being spoken by the users that are different from each other; calculating a degree of relationship between the prior and posterior N pieces of word information, the degree of relationship being capable of rating a fact of the utterance relative to the posterior N pieces of word information being performed later than the utterance relative to the prior N pieces of word information; determining the posterior N pieces of word information corresponding to an utterance performed later than the utterance relative to the prior N pieces of word information; and determining the posterior N pieces of word relative to an utterance as a response to the utterance relating to the prior N pieces of word information on the basis of a predetermined condition. - View Dependent Claims (16, 17)
-
Specification