VOICE RECOGNITION APPARATUS, VOICE RECOGNITION METHOD AND PROGRAM
First Claim
1. A voice recognition apparatus, comprising:
- a tracking unit for detecting a sound source direction and a voice segment to execute a sound source extraction process; and
a voice recognition unit for inputting a sound source extraction result from the tracking unit to execute a voice recognition process,the tracking unit creating a segment being created management unit that creates and manages a voice segment per unit of sound source,each segment being created management unit createdsequentially detecting a sound source direction to execute a voice segment creation process that sequentially updates a voice segment estimated by connecting a detection result to a time direction,creating an extraction filter for a sound source extraction after a predetermined time is elapsed from a voice segment beginning, andsequentially applying the extraction filter created to an input voice signal to sequentially create a partial sound source extraction result of a voice segment,the tracking unitsequentially outputting the partial sound source extraction result created by the segment being created management unit to the voice recognition unit,the voice recognition unitsequentially executing the voice recognition process to the partial sound source extraction result inputted from the tracking unit to output a voice recognition result.
1 Assignment
0 Petitions
Accused Products
Abstract
There is provided an apparatus and a method for rapidly extracting a target sound from a sound signal where a variety of sounds are mixed generated from a plurality of the sound sources. There is a voice recognition unit including a tracking unit for detecting a sound source direction and a voice segment to execute a sound source extraction process, and a voice recognition unit for inputting a sound source extraction result to execute a voice recognition process. In the tracking unit, a segment being created management unit that creates and manages a voice segment per unit of sound source sequentially detects a sound source direction, sequentially updates a voice segment estimated by connecting a detection result to a time direction, creates an extraction filter for a sound source extraction after a predetermined time is elapsed, and sequentially creates a sound source extraction result by sequentially applying the extraction filter to an input voice signal. The voice recognition unit sequentially executes the voice recognition process to a partial sound source extraction result to output a voice recognition result.
-
Citations
20 Claims
-
1. A voice recognition apparatus, comprising:
-
a tracking unit for detecting a sound source direction and a voice segment to execute a sound source extraction process; and a voice recognition unit for inputting a sound source extraction result from the tracking unit to execute a voice recognition process, the tracking unit creating a segment being created management unit that creates and manages a voice segment per unit of sound source, each segment being created management unit created sequentially detecting a sound source direction to execute a voice segment creation process that sequentially updates a voice segment estimated by connecting a detection result to a time direction, creating an extraction filter for a sound source extraction after a predetermined time is elapsed from a voice segment beginning, and sequentially applying the extraction filter created to an input voice signal to sequentially create a partial sound source extraction result of a voice segment, the tracking unit sequentially outputting the partial sound source extraction result created by the segment being created management unit to the voice recognition unit, the voice recognition unit sequentially executing the voice recognition process to the partial sound source extraction result inputted from the tracking unit to output a voice recognition result. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A voice recognition method executed by a voice recognition apparatus, the voice recognition apparatus, comprising:
-
a tracking unit for detecting a sound source direction and a voice segment to execute a sound source extraction process; and a voice recognition unit for inputting a sound source extraction result from the tracking unit to execute a voice recognition process, the tracking unit creating a segment being created management unit that creates and manages a voice segment per unit of sound source, each segment being created management unit created sequentially detecting a sound source direction to execute a voice segment creation process that sequentially updates a voice segment by connecting a detection result to a time direction, creating an extraction filter for a sound source extraction after a predetermined time is elapsed from a voice segment beginning, and sequentially applying the extraction filter created to an input voice signal to sequentially create a partial sound source extraction result of a voice segment, the tracking unit sequentially outputting the partial sound source extraction result created by the segment being created management unit to the voice recognition unit, the voice recognition unit sequentially executing the voice recognition process to the partial sound source extraction result inputted from the tracking unit to output a voice recognition result.
-
-
20. A program for executing a voice recognition method executed by a voice recognition apparatus, the voice recognition apparatus, comprising:
-
a tracking unit for detecting a sound source direction and a voice segment to execute a sound source extraction process; and a voice recognition unit for inputting a sound source extraction result from the tracking unit to execute a voice recognition process, the program allows the tracking unit to create a segment being created management unit that creates and manages a voice segment per unit of sound source, each segment being created management unit created to sequentially detect a sound source direction to execute a voice segment creation process that sequentially updates a voice segment by connecting a detection result to a time direction, to create an extraction filter for a sound source extraction after a predetermined time is elapsed from a voice segment beginning, and to sequentially apply the extraction filter created to an input voice signal to sequentially create a partial sound source extraction result of a voice segment, the tracking unit to sequentially output the partial sound source extraction result created by the segment being created management unit to the voice recognition unit, the voice recognition unit to sequentially execute the voice recognition process to the partial sound source extraction result inputted from the tracking unit to output a voice recognition result.
-
Specification