Interactive robot, speech recognition method and computer program product
First Claim
1. An interactive robot capable of speech recognition, comprising:
a sound-source-direction estimating unit that estimates a direction of a sound source for target voices which are required to undergo speech recognition;
a moving unit that moves the interactive robot in the sound-source direction;
a target-voice acquiring unit that acquires the target voices at a position after moving;
a target-voice holding unit that holds voice patterns of the target voices, the target voices including misrecognition-notification voices signifying that speech recognition by the speech recognizing unit is erroneous;
a speech recognizing unit that performs speech recognition of the target voices by pattern matching of the voice patterns of the target voices, which are held in the target-voice holding unit, with the target voices acquired by the target-voice acquiring unit;
a recognition-accuracy evaluating unit that calculates, as an accuracy of recognition results, an agreement accuracy between the acquired target voices and the voice patterns of the target voices held in the target-voice holding unit;
wherein the moving unit moves the interactive robot itself in the direction of the sound source when the recognition accuracy for results of speech recognition of the target voices is smaller than a predetermined recognition-accuracy threshold and when the misrecognition-notification voices held in the target-voice holding unit are recognized.
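The holding, recognizing and accuracy-evaluating units recited above can be illustrated with a minimal sketch. It assumes each voice has already been reduced to a fixed-length feature vector and uses cosine similarity as the agreement accuracy; the claim specifies neither choice. The phrases, the vectors and the names HELD_PATTERNS, MISRECOGNITION_NOTIFICATIONS and recognize are hypothetical.

```python
import math

# Hypothetical held voice patterns (stands in for the target-voice holding unit).
# Each phrase maps to an assumed fixed-length feature vector.
HELD_PATTERNS = {
    "come here": [0.9, 0.1, 0.3],
    "stop": [0.1, 0.8, 0.2],
    "that is wrong": [0.2, 0.2, 0.9],
}

# Hypothetical subset of held phrases that act as misrecognition-notification voices.
MISRECOGNITION_NOTIFICATIONS = {"that is wrong"}


def cosine_similarity(a, b):
    """Assumed agreement measure between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def recognize(features, held_patterns):
    """Return (label, accuracy) for the held pattern that best matches features."""
    if not held_patterns:
        raise ValueError("no voice patterns are held")
    best_label, best_accuracy = None, float("-inf")
    for label, pattern in held_patterns.items():
        accuracy = cosine_similarity(features, pattern)
        if accuracy > best_accuracy:
            best_label, best_accuracy = label, accuracy
    return best_label, best_accuracy


label, accuracy = recognize([0.85, 0.15, 0.3], HELD_PATTERNS)
print(label, round(accuracy, 3), label in MISRECOGNITION_NOTIFICATIONS)
```

The returned accuracy is the value that would be compared against the recognition-accuracy threshold, and membership in MISRECOGNITION_NOTIFICATIONS marks a recognized misrecognition-notification voice.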
Abstract
An interactive robot capable of speech recognition includes a sound-source-direction estimating unit that estimates a direction of a sound source for target voices which are required to undergo speech recognition; a moving unit that moves the interactive robot in the sound-source direction; a target-voice acquiring unit that acquires the target voices at a position after moving; and a speech recognizing unit that performs speech recognition of the target voices.
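The abstract does not say how the sound-source direction is estimated. One common approach, shown here purely as an assumption, is to measure the time difference of arrival between two microphones and convert it to a bearing under a far-field model. The function names, the two-microphone layout and the sign convention (positive angles point toward the first microphone) are all hypothetical.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, approximate value in air at room temperature


def best_lag(left, right, max_lag):
    """Return the lag k (in samples) that maximizes sum(left[i] * right[i + k]).

    A positive k means the right channel is delayed relative to the left one.
    """
    best_k, best_score = 0, float("-inf")
    for k in range(-max_lag, max_lag + 1):
        score = 0.0
        for i in range(len(left)):
            j = i + k
            if 0 <= j < len(right):
                score += left[i] * right[j]
        if score > best_score:
            best_k, best_score = k, score
    return best_k


def estimate_bearing(left, right, sample_rate, mic_spacing):
    """Estimate the bearing in degrees from two microphone signals (far-field model).

    0 degrees is straight ahead; positive angles point toward the left microphone.
    """
    # The physical delay cannot exceed the travel time between the microphones.
    max_lag = int(mic_spacing / SPEED_OF_SOUND * sample_rate) + 1
    delay = best_lag(left, right, max_lag) / sample_rate
    # Clamp to the valid domain of asin to absorb rounding of the lag.
    ratio = max(-1.0, min(1.0, SPEED_OF_SOUND * delay / mic_spacing))
    return math.degrees(math.asin(ratio))


# Toy signals: the same impulse reaches the right microphone 3 samples later.
left = [0.0] * 20
right = [0.0] * 20
left[5] = 1.0
right[8] = 1.0
print(estimate_bearing(left, right, sample_rate=16000, mic_spacing=0.2))
```

The brute-force correlation keeps the sketch dependency-free; a practical system would more likely use an FFT-based correlation and more than two microphones.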
15 Claims
1. An interactive robot capable of speech recognition, as recited in full under First Claim above. Claims 2 to 13 depend on this claim.
14. A computer-implemented method for an interactive robot capable of speech recognition, the method comprising:
estimating a direction of the sound source of target voices which are required to undergo speech recognition;
moving the interactive robot in the direction of the sound source;
acquiring the target voices when the interactive robot is located at a position after moving;
performing speech recognition of the target voices by pattern matching of voice patterns of the target voices, which are held in a target-voice holding unit, with the acquired target voices, where the target voices held in the target-voice holding unit include misrecognition-notification voices signifying that speech recognition is erroneous;
calculating, as an accuracy of recognition results, an agreement accuracy between the acquired target voices and the voice patterns of the target voices held in the target-voice holding unit; and
moving the interactive robot itself in the direction of the sound source when the recognition accuracy for results of speech recognition of the target voices is smaller than a predetermined recognition-accuracy threshold and when the misrecognition-notification voices held in the target-voice holding unit are recognized.
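The method steps of claim 14 can be sketched as a control loop. This is an illustration under stated assumptions, not the patented implementation: the callbacks, the Result type, the default threshold of 0.7 and the max_moves safety limit are hypothetical, and the sketch folds the recited moves into a single conditional loop. It also treats either condition (low accuracy, or a recognized misrecognition-notification voice) as sufficient to trigger a move; a conjunctive reading of the claim would replace `or` with `and`.

```python
from dataclasses import dataclass


@dataclass
class Result:
    label: str       # best-matching held voice pattern
    accuracy: float  # agreement accuracy, assumed to lie in [0, 1]


def recognition_loop(acquire_and_recognize, estimate_direction, move_toward,
                     misrecognition_labels, threshold=0.7, max_moves=5):
    """Approach the sound source until recognition is trusted or moves run out.

    All three callbacks are hypothetical stand-ins for robot subsystems.
    Returns the final Result and the number of moves made.
    """
    result = acquire_and_recognize()
    moves = 0
    while moves < max_moves and (
        result.accuracy < threshold or result.label in misrecognition_labels
    ):
        move_toward(estimate_direction())  # move in the estimated direction
        moves += 1
        result = acquire_and_recognize()   # acquire again at the new position
    return result, moves


class SimulatedSpeaker:
    """Toy environment in which accuracy improves as the robot gets closer."""

    def __init__(self, distance_m):
        self.distance_m = distance_m

    def acquire_and_recognize(self):
        return Result("come here", 1.0 / (1.0 + self.distance_m))

    def estimate_direction(self):
        return 0.0  # the speaker is assumed to be straight ahead

    def move_toward(self, bearing_deg):
        self.distance_m = max(0.0, self.distance_m - 1.0)


env = SimulatedSpeaker(distance_m=3.0)
final, moves = recognition_loop(
    env.acquire_and_recognize, env.estimate_direction, env.move_toward,
    misrecognition_labels={"that is wrong"},
)
print(final, moves)
```

The max_moves bound is a design choice of the sketch so that a persistently noisy environment cannot keep the robot moving indefinitely.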
15. A computer program product having a computer readable medium including programmed instructions for performing speech recognition processing on an interactive robot capable of speech recognition, wherein the instructions, when executed by a computer, cause the computer to perform:
estimating a direction of the sound source of target voices which are required to undergo speech recognition;
moving the interactive robot in the direction of the sound source;
acquiring the target voices when the interactive robot is located at a position after moving;
performing speech recognition of the target voices by pattern matching of voice patterns of the target voices, which are held in a target-voice holding unit, with the acquired target voices, where the target voices held in the target-voice holding unit include misrecognition-notification voices signifying that speech recognition is erroneous;
calculating, as an accuracy of recognition results, an agreement accuracy between the acquired target voices and the voice patterns of the target voices held in the target-voice holding unit; and
moving the interactive robot itself in the direction of the sound source when the recognition accuracy for results of speech recognition of the target voices is smaller than a predetermined recognition-accuracy threshold and when the misrecognition-notification voices held in the target-voice holding unit are recognized.
Specification