Word spotting using both filler and phone recognition
First Claim
1. A computer implemented method for finding a keyword in acoustic data, the method comprising a filler recognition phase and a keyword recognition phase that occurs subsequent to initiation of said filler recognition phase, comprising the steps of:
- processing the acoustic data during the filler recognition phase to identify phones and to generate (i) temporal delimiters within said acoustic data and (ii) likelihood scores for the phones;
processing the acoustic data during the keyword recognition phase to identify instances of a specified keyword comprising a sequence of phones, said processing employing said temporal delimiters to restrict search space for instances of said specified keyword and further employing said likelihood scores generated in the filler recognition phase as an aid in the keyword recognition phase.
3 Assignments
0 Petitions
Accused Products
Abstract
The present invention relates to a word-spotting system and a method for finding a keyword in ascoustic data. The method includes a filler recognition phase and a keyword recognition phase wherein: during the filler recognition phase the acoustic data is processed to identify phones and to generate temporal delimiters and likelihood scores for the phones; during the keyword recognition phase, the acoustic data is processed to identify instances of a specified keyword including a sequence of phones; wherein the temporal delimiters and likelihood scores generated in the filler recognition phase are used in the keyword recognition phase.
40 Citations
10 Claims
-
1. A computer implemented method for finding a keyword in acoustic data, the method comprising a filler recognition phase and a keyword recognition phase that occurs subsequent to initiation of said filler recognition phase, comprising the steps of:
-
processing the acoustic data during the filler recognition phase to identify phones and to generate (i) temporal delimiters within said acoustic data and (ii) likelihood scores for the phones; processing the acoustic data during the keyword recognition phase to identify instances of a specified keyword comprising a sequence of phones, said processing employing said temporal delimiters to restrict search space for instances of said specified keyword and further employing said likelihood scores generated in the filler recognition phase as an aid in the keyword recognition phase. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A system for finding a keyword in acoustic data, the system employing a filler recognition phase and a keyword recognition phase that occurs subsequent to said initiation of said filler recognition phase and comprising:
-
a) means for processing the acoustic data during the filler recognition phase to identify phones and to generate a temporal delimiters for said acoustic data and likelihood scores for the phones; and b) means for processing the acoustic data during the keyword recognition phase, using the temporal delimiters to restrict search space for instances of said keyword and likelihood scores identitfied by means a), to identify instances of said keyword comprising a sequence of phones. - View Dependent Claims (7, 8, 9, 10)
-
Specification