Voice recognition method and voice recognition apparatus
First Claim
Patent Images
1. A voice recognition method comprising:
- detecting a vocal section including a vocal sound in a voice, based on a feature value of an audio signal representing the voice;
identifying a word expressed by the vocal sound in the vocal section, by matching the feature value of the audio signal of the vocal section and an acoustic model of each of a plurality of words; and
selecting, with a processor, the word expressed by the vocal sound in a word section based on a comparison result between a signal characteristic of the word section and a signal characteristic of the vocal section, whereinthe selecting includes selecting the word expressed by the vocal sound in the word section having a signal characteristic not less than a given lower limit threshold value and not greater than a given upper limit threshold value with respect to the signal characteristic of the vocal section, andthe signal characteristic of the word section and the signal characteristic of the vocal section are one of Signal-to-Noise Ratio (SNR) or Average Power.
1 Assignment
0 Petitions
Accused Products
Abstract
A voice recognition method includes: detecting a vocal section including a vocal sound in a voice, based on a feature value of an audio signal representing the voice; identifying a word expressed by the vocal sound in the vocal section, by matching the feature value of the audio signal of the vocal section and an acoustic model of each of a plurality of words; and selecting, with a processor, the word expressed by the vocal sound in a word section based on a comparison result between a signal characteristic of the word section and a signal characteristic of the vocal section.
-
Citations
17 Claims
-
1. A voice recognition method comprising:
-
detecting a vocal section including a vocal sound in a voice, based on a feature value of an audio signal representing the voice; identifying a word expressed by the vocal sound in the vocal section, by matching the feature value of the audio signal of the vocal section and an acoustic model of each of a plurality of words; and selecting, with a processor, the word expressed by the vocal sound in a word section based on a comparison result between a signal characteristic of the word section and a signal characteristic of the vocal section, wherein the selecting includes selecting the word expressed by the vocal sound in the word section having a signal characteristic not less than a given lower limit threshold value and not greater than a given upper limit threshold value with respect to the signal characteristic of the vocal section, and the signal characteristic of the word section and the signal characteristic of the vocal section are one of Signal-to-Noise Ratio (SNR) or Average Power. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A voice recognition apparatus comprising:
-
a processor, coupled to a memory, configured to; detect a vocal section including a vocal sound in a voice, based on a feature value of an audio signal representing the voice, identify a word expressed by the vocal sound in the vocal section, by matching the feature value of the audio signal of the vocal section and an acoustic model of each of a plurality of words, and select the word expressed by the vocal sound in a word section based on a comparison result between a signal characteristic of the word section and a signal characteristic of the vocal section, wherein the processor is configured to select the word expressed by the vocal sound in the word section having a signal characteristic not less than a given lower limit threshold value and not greater than a given upper limit threshold value with respect to the signal characteristic of the vocal section, and the signal characteristic of the word section and the signal characteristic of the vocal section are one of Signal-to-Noise Ratio (SNR) or Average Power. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A non-transitory computer-readable recording medium having stored therein a program for causing a computer to execute a voice recognition process comprising:
-
detecting a vocal section including a vocal sound in a voice, based on feature value of an audio signal representing the voice; identifying a word expressed by the vocal sound in the vocal section, by matching the feature value of the audio signal of the vocal section and an acoustic model of each of a plurality of words; and selecting the word expressed by the vocal sound in a word section based on a comparison result between a signal characteristic of the word section and a signal characteristic of the vocal section, wherein the selecting includes selecting the word expressed by the vocal sound in the word section having a signal characteristic not less than a given lower limit threshold value and not greater than a given upper limit threshold value with respect to the signal characteristic of the vocal section, and the signal characteristic of the word section and the signal characteristic of the vocal section are one of Signal-to-Noise Ratio (SNR) or Average Power.
-
Specification