Automatic Speech Recognition System
Abstract
An automatic speech recognition system includes: a sound source localization module for localizing a sound direction of a speaker based on acoustic signals detected by a plurality of microphones; a sound source separation module for separating a speech signal of the speaker from the acoustic signals according to the sound direction; an acoustic model memory which stores direction-dependent acoustic models that are adjusted to a plurality of directions at intervals; an acoustic model composition module which composes an acoustic model adjusted to the sound direction localized by the sound source localization module, based on the direction-dependent acoustic models, and which stores the composed acoustic model in the acoustic model memory; and a speech recognition module which recognizes features extracted by a feature extractor as character information using the acoustic model composed by the acoustic model composition module.
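The abstract does not state how the composition module derives a model for a direction that falls between the stored ones. As a minimal sketch, assuming each direction-dependent model can be reduced to a flat list of numeric parameters and that linear interpolation between the two nearest stored directions is acceptable, the idea might look like the following Python. The class name AcousticModelMemory, the compose method, the parameter lists and the 30-degree spacing are all hypothetical and are not taken from the patent.

```python
from bisect import bisect_right


class AcousticModelMemory:
    """Hypothetical store for direction-dependent and composed models."""

    def __init__(self, direction_models):
        # direction in degrees -> list of model parameters (assumed representation)
        self.direction_models = dict(direction_models)
        # composed models are kept here, keyed by the localized direction
        self.composed = {}

    def compose(self, direction_deg):
        """Interpolate a model for direction_deg (assumed method, not the patent's)."""
        angles = sorted(self.direction_models)
        if direction_deg <= angles[0]:
            # outside the stored range: fall back to the nearest stored model
            model = list(self.direction_models[angles[0]])
        elif direction_deg >= angles[-1]:
            model = list(self.direction_models[angles[-1]])
        else:
            hi = bisect_right(angles, direction_deg)
            lo_angle, hi_angle = angles[hi - 1], angles[hi]
            # weight of the upper neighbour, between 0 and 1
            w = (direction_deg - lo_angle) / (hi_angle - lo_angle)
            lo_model = self.direction_models[lo_angle]
            hi_model = self.direction_models[hi_angle]
            model = [(1 - w) * p + w * q for p, q in zip(lo_model, hi_model)]
        # store the composed model back in the memory
        self.composed[direction_deg] = model
        return model


# Illustrative values only: three stored directions, two parameters each.
memory = AcousticModelMemory({-30: [0.0, 1.0], 0: [1.0, 2.0], 30: [3.0, 4.0]})
model_15 = memory.compose(15)  # halfway between the 0 and 30 degree models
```

Keeping composed models in a separate dictionary is a design choice of this sketch; it leaves the stored direction-dependent models untouched so that later compositions always interpolate from the original set.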
12 Claims
1. An automatic speech recognition system, which recognizes speeches in acoustic signals detected by a plurality of microphones as character information, the system comprising:
a sound source localization module which localizes a sound direction corresponding to a specified speaker based on the acoustic signals detected by the plurality of microphones;
a feature extractor which extracts features of speech signals contained in one or more pieces of information detected by the plurality of microphones;
an acoustic model memory which stores direction-dependent acoustic models that are adjusted to a plurality of directions at intervals;
an acoustic model composition module which composes an acoustic model adjusted to the sound direction, which is localized by the sound source localization module, based on the direction-dependent acoustic models in the acoustic model memory, the acoustic model composition module storing the acoustic model in the acoustic model memory; and
a speech recognition module which recognizes the features extracted by the feature extractor as character information using the acoustic model composed by the acoustic model composition module.
Dependent claims: 3, 4, 6, 7
2. An automatic speech recognition system, which recognizes speeches of a specified speaker in acoustic signals detected by a plurality of microphones as character information, the system comprising:
a sound source localization module which localizes a sound direction corresponding to the specified speaker based on the acoustic signals detected by the plurality of microphones;
a sound source separation module which separates speech signals of the specified speaker from the acoustic signals based on the sound direction localized by the sound source localization module;
a feature extractor which extracts features of the speech signals separated by the sound source separation module;
an acoustic model memory which stores direction-dependent acoustic models that are adjusted to a plurality of directions at intervals;
an acoustic model composition module which composes an acoustic model adjusted to the sound direction, which is localized by the sound source localization module, based on the direction-dependent acoustic models in the acoustic model memory, the acoustic model composition module storing the acoustic model in the acoustic model memory; and
a speech recognition module which recognizes the features extracted by the feature extractor as character information using the acoustic model composed by the acoustic model composition module.
Dependent claims: 5, 9, 10, 11, 12
8. An automatic speech recognition system, which recognizes speeches of a specified speaker in acoustic signals detected by a plurality of microphones as character information, the system comprising:
a sound source localization module which localizes a sound direction corresponding to the specified speaker based on the acoustic signals detected by the plurality of microphones;
a stream tracking module which stores the sound direction localized by the sound source localization module so as to estimate a direction in which the specified speaker is moving, the stream tracking module estimating a current position of the speaker according to the estimated direction;
a sound source separation module which separates speech signals of the specified speaker from the acoustic signals based on a sound direction, which is determined by the current position of the speaker estimated by the stream tracking module;
a feature extractor which extracts features of the speech signals separated by the sound source separation module;
an acoustic model memory which stores direction-dependent acoustic models that are adjusted to a plurality of directions at intervals;
an acoustic model composition module which composes an acoustic model adjusted to the sound direction, which is localized by the sound source localization module, based on the direction-dependent acoustic models in the acoustic model memory, the acoustic model composition module storing the acoustic model in the acoustic model memory; and
a speech recognition module which recognizes the features extracted by the feature extractor as character information using the acoustic model composed by the acoustic model composition module.
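Claim 8 recites a stream tracking module that stores localized directions, estimates the direction of movement and from it the speaker's current position, but it does not say how the estimate is computed. The following Python is a minimal sketch under stated assumptions: directions are azimuth angles in degrees, each observation carries a timestamp in seconds, and the movement is approximated by a constant angular velocity over a short history. The class StreamTracker and its methods are hypothetical names, and angle wrap-around at 360 degrees is ignored.

```python
from collections import deque


class StreamTracker:
    """Hypothetical tracker that extrapolates a speaker's direction."""

    def __init__(self, history=5):
        # most recent (time_s, direction_deg) observations, oldest dropped first
        self.history = deque(maxlen=history)

    def observe(self, time_s, direction_deg):
        """Store a direction reported by the localization module."""
        self.history.append((time_s, direction_deg))

    def angular_velocity(self):
        """Estimated movement in degrees per second (0.0 if unknown)."""
        if len(self.history) < 2:
            return 0.0
        (t_first, d_first), (t_last, d_last) = self.history[0], self.history[-1]
        if t_last == t_first:
            return 0.0
        return (d_last - d_first) / (t_last - t_first)

    def estimate(self, now_s):
        """Extrapolate the current direction from the last observation."""
        if not self.history:
            raise ValueError("no localized direction has been observed yet")
        t_last, d_last = self.history[-1]
        return d_last + self.angular_velocity() * (now_s - t_last)


# Illustrative values only: the speaker moves 10 degrees in one second.
tracker = StreamTracker()
tracker.observe(0.0, 10.0)
tracker.observe(1.0, 20.0)
current_direction = tracker.estimate(1.5)
```

In a system shaped like claim 8, a value such as current_direction would be what the sound source separation module receives in place of the raw localized direction.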
Specification