SPEECH RECOGNITION DEVICE AND METHOD, AND SEMICONDUCTOR INTEGRATED CIRCUIT DEVICE
Abstract
A semiconductor integrated circuit device for speech recognition includes: a scenario setting unit that receives a command designating scenario flow information and, in accordance with the scenario flow information, selects prescribed speech reproduction data from a speech reproduction data storage and a prescribed conversion list; a standard pattern extraction unit that extracts, from a speech recognition database, a standard pattern corresponding to at least part of individual words or sentences included in the prescribed conversion list; a speech signal synthesizer that synthesizes an output speech signal; a signal processor that generates a feature pattern representing the distribution state of the frequency component of an input speech signal; and a match detector that compares the feature pattern with the standard pattern and outputs a speech recognition result.
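The selection performed by the scenario setting unit can be illustrated with a minimal sketch. Every name here (`ScenarioStep`, `SCENARIO_FLOW`, `set_scenario`, the sample prompts and word lists) is hypothetical and chosen for illustration only; the source does not specify how the scenario flow information or the two storages are encoded.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class ScenarioStep:
    """One entry of the (assumed) scenario flow information."""
    speech_data_id: str       # key into the speech reproduction data storage
    conversion_list_id: str   # key into the conversion list storage


# Hypothetical stand-ins for the two storages named in the abstract.
SPEECH_REPRODUCTION_DATA: Dict[str, str] = {
    "ask_drink": "What would you like to drink?",
    "ask_size": "Which size would you like?",
}
CONVERSION_LISTS: Dict[str, List[str]] = {
    "drinks": ["coffee", "tea", "juice"],
    "sizes": ["small", "medium", "large"],
}

# Assumed encoding: the scenario flow relates each command to one prompt
# and to the list of words that are valid answers to that prompt.
SCENARIO_FLOW: Dict[str, ScenarioStep] = {
    "order_drink": ScenarioStep("ask_drink", "drinks"),
    "order_size": ScenarioStep("ask_size", "sizes"),
}


def set_scenario(command: str) -> Tuple[str, List[str]]:
    """Select the prompt and the conversion list designated by a command."""
    step = SCENARIO_FLOW[command]
    prompt = SPEECH_REPRODUCTION_DATA[step.speech_data_id]
    words = CONVERSION_LISTS[step.conversion_list_id]
    return prompt, words


prompt, words = set_scenario("order_size")
```

Restricting recognition to the words of the selected conversion list is what lets the standard pattern extraction unit pull only the patterns relevant to the current question.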
11 Claims
1. A semiconductor integrated circuit device that is used in a speech recognition device that issues a question or a message to a user based on speech reproduction data and performs speech recognition processing on speech of the user, comprising:
a scenario setting unit that receives a command designating scenario flow information representing a relationship between a plurality of the speech reproduction data and a plurality of conversion lists, and, in accordance with the scenario flow information, selects prescribed speech reproduction data from among the plurality of speech reproduction data which are stored in a speech reproduction data storage, and selects a prescribed conversion list from among the plurality of conversion lists which are stored in a conversion list storage;
a standard pattern extraction unit that extracts a standard pattern corresponding to at least part of individual words or sentences included in the prescribed conversion list, from a speech recognition database containing standard patterns representing a distribution state of frequency components of a plurality of phonemes that are used in a prescribed language;
a speech signal synthesizer that synthesizes an output speech signal based on the prescribed speech reproduction data;
a signal processor that extracts the frequency component of an input speech signal by performing a Fourier-transform on the speech signal, and generates a feature pattern representing the distribution state of the frequency component of the speech signal; and
a match detector that compares the feature pattern generated from at least part of the speech signal with the standard pattern extracted from the speech recognition database, and outputs a speech recognition result.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10.
11. A speech recognition method that is used in a speech recognition device that issues a question or a message to a user based on speech reproduction data and performs speech recognition processing on speech of the user, comprising:
(a) receiving a command designating scenario flow information representing a relationship between a plurality of the speech reproduction data and a plurality of conversion lists;
(b) in accordance with the scenario flow information, selecting prescribed speech reproduction data from among the plurality of speech reproduction data which are stored in a speech reproduction data storage, and selecting a prescribed conversion list from among the plurality of conversion lists which are stored in a conversion list storage;
(c) extracting a standard pattern corresponding to at least part of individual words or sentences included in the prescribed conversion list, from a speech recognition database containing standard patterns representing a distribution state of frequency components of a plurality of phonemes that are used in a prescribed language;
(d) synthesizing an output speech signal based on the prescribed speech reproduction data;
(e) extracting the frequency component of an input speech signal by performing a Fourier-transform on the speech signal, and generating a feature pattern representing the distribution state of the frequency component of the speech signal; and
(f) comparing the feature pattern generated from at least part of the speech signal with the standard pattern extracted from the speech recognition database, and outputting a speech recognition result.
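Steps (e) and (f) of claim 11 can be sketched as follows. This is an illustrative simplification under stated assumptions, not the claimed implementation: the claims do not specify the feature representation or the comparison metric, so the band-energy feature, the Euclidean distance, the whole-word templates (the claims describe phoneme-level standard patterns), and the names `magnitude_spectrum`, `feature_pattern`, `match` and `tone` are all hypothetical.

```python
import cmath
import math


def magnitude_spectrum(signal):
    """Naive discrete Fourier transform; returns magnitudes of bins 0..n/2-1."""
    n = len(signal)
    return [
        abs(sum(signal[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n)))
        for k in range(n // 2)
    ]


def feature_pattern(signal, bands=8):
    """Assumed feature: share of spectral magnitude falling in each frequency band."""
    spectrum = magnitude_spectrum(signal)
    width = len(spectrum) // bands
    energy = [sum(spectrum[b * width:(b + 1) * width]) for b in range(bands)]
    total = sum(energy) or 1.0  # avoid dividing by zero on a silent input
    return [e / total for e in energy]


def match(feature, standard_patterns):
    """Return the entry whose standard pattern is closest to the feature pattern."""
    def distance(a, b):
        return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))
    return min(standard_patterns, key=lambda word: distance(feature, standard_patterns[word]))


def tone(cycles, n=64):
    """Synthetic test signal: a sine wave with the given number of cycles in n samples."""
    return [math.sin(2 * math.pi * cycles * t / n) for t in range(n)]


# Hypothetical standard patterns for the words of the selected conversion list,
# built here from synthetic tones rather than from a real phoneme database.
standard_patterns = {
    "yes": feature_pattern(tone(5)),
    "no": feature_pattern(tone(20)),
}

result = match(feature_pattern(tone(6)), standard_patterns)
```

In this toy setup the input tone falls in the same frequency band as the "yes" template, so `match` selects "yes". A practical recognizer would frame the signal over time and compare sequences of feature patterns rather than a single vector.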
Specification