Speech recognition system using spectrogram analysis
Abstract
Computer comparison of one or more dictionary entries with a sound record of a human utterance to determine whether and where each dictionary entry is contained within the sound record. The record is segmented; for each vocalized segment a spectrogram is obtained, and for each other segment symbolic and numeric data are obtained. The spectrogram of a vocalized segment is then processed to decrease noise and to eliminate variations in pronunciation. Each entry in the dictionary is then compared with every sequence of segments of substantially the same length in the sound record. The comparison takes into account the formant profiles within each vocalized segment and the symbolic and numeric data obtained for the other segments, both in the record and in the dictionary entries.
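The two kinds of per-segment data named in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the `Segment` layout, the field names, and the mean relative formant difference used as the measure are hypothetical and are not taken from the specification.

```python
from dataclasses import dataclass
from math import inf
from typing import Optional, Sequence


@dataclass(frozen=True)
class Segment:
    """One segment of a sound record or dictionary entry (hypothetical layout)."""
    vocalized: bool
    formants: Sequence[float] = ()  # formant profile in Hz, vocalized segments only
    symbol: Optional[str] = None    # symbolic datum for other segments, e.g. "pause"
    value: float = 0.0              # numeric datum for other segments, e.g. duration in ms


def segment_distance(a: Segment, b: Segment) -> float:
    """Return 0.0 for identical segments and larger values for worse matches."""
    if a.vocalized != b.vocalized:
        return inf  # a vocalized segment never matches a non-vocalized one
    if a.vocalized:
        if not a.formants or len(a.formants) != len(b.formants):
            return inf
        # Assumed measure: mean relative difference between corresponding formants.
        diffs = [abs(x - y) / max(x, y, 1e-9) for x, y in zip(a.formants, b.formants)]
        return sum(diffs) / len(diffs)
    if a.symbol != b.symbol:
        return inf
    # Non-vocalized segments with the same symbol differ only by the numeric datum.
    return abs(a.value - b.value)
```

A per-segment measure of this kind could serve as the building block when a dictionary entry is compared with a stretch of the record; the actual measure used by the method is defined in the specification, not here.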
50 Claims
1. A data processing method for recognizing a sound record of a human utterance, comprising:
dividing the sound record into a sequence of one or more segments; and
comparing a plurality of dictionary entries with the sound record, each dictionary entry being incrementally compared with a continuous stretch of segments of the sound record.
Dependent claims: 2, 3, 4, 5

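One possible reading of "incrementally compared" in claim 1 is a segment-by-segment comparison that is abandoned as soon as the accumulated mismatch becomes too large. The sketch below illustrates that reading only. The function name `compare_stretch`, the caller-supplied `dist` function, and the `budget` parameter are hypothetical and do not appear in the claims.

```python
from typing import Callable, Optional, Sequence, TypeVar

S = TypeVar("S")


def compare_stretch(
    record: Sequence[S],
    entry: Sequence[S],
    start: int,
    dist: Callable[[S, S], float],
    budget: float,
) -> Optional[float]:
    """Compare `entry` with the continuous stretch of `record` beginning at `start`.

    Segments are compared one at a time. Returns the accumulated distance,
    or None if the stretch does not fit in the record or the accumulated
    distance exceeds `budget` (assumed stopping rule).
    """
    if start < 0 or start + len(entry) > len(record):
        return None
    total = 0.0
    for offset, wanted in enumerate(entry):
        total += dist(record[start + offset], wanted)
        if total > budget:
            return None  # abandon this stretch without looking at later segments
    return total


# Usage with segments reduced to plain labels (illustrative data only).
label_dist = lambda a, b: 0.0 if a == b else 1.0
record = ["k", "a", "t", "s"]
hit = compare_stretch(record, ["a", "t"], 1, label_dist, budget=0.5)
miss = compare_stretch(record, ["a", "t"], 0, label_dist, budget=0.5)
```

Here `hit` is `0.0` because the entry matches the stretch starting at index 1, and `miss` is `None` because the first segment already exceeds the budget.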
6. A data processing method for recognizing a sound record of a human utterance, comprising:
dividing the sound record into a sequence of at least one segment;
processing a plurality of stored dictionary entries against the sound record, comprising for each of a plurality of dictionary entries:
determining an expected number of segments associated with the dictionary entry; and
comparing the dictionary entry against the sound record by incrementally testing each continuous stretch of the sound record having a length substantially equal to the expected number of segments.
Dependent claims: 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 27, 28, 29

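The search recited in claim 6 can be sketched as an enumeration of candidate stretches followed by a test of each one. This is a minimal illustration under stated assumptions: "substantially equal" is modeled as an integer `tolerance` on the stretch length, the expected number of segments is taken to be the length of the stored entry, and `candidate_stretches`, `locate_entries`, and the caller-supplied `matches` predicate are hypothetical names.

```python
from typing import Callable, Dict, Iterator, List, Sequence, Tuple


def candidate_stretches(
    n_segments: int, expected: int, tolerance: int = 0
) -> Iterator[Tuple[int, int]]:
    """Yield (start, end) for each continuous stretch whose length is
    within `tolerance` segments of `expected`."""
    shortest = max(1, expected - tolerance)
    longest = min(n_segments, expected + tolerance)
    for length in range(shortest, longest + 1):
        for start in range(n_segments - length + 1):
            yield start, start + length


def locate_entries(
    record: Sequence[str],
    dictionary: Dict[str, Sequence[str]],
    matches: Callable[[Sequence[str], Sequence[str]], bool],
    tolerance: int = 0,
) -> Dict[str, List[Tuple[int, int]]]:
    """Report, for each dictionary entry, the stretches of the record it matches."""
    found: Dict[str, List[Tuple[int, int]]] = {}
    for word, entry in dictionary.items():
        expected = len(entry)  # assumed: expected segment count is the entry length
        hits = [
            (s, e)
            for s, e in candidate_stretches(len(record), expected, tolerance)
            if matches(record[s:e], entry)
        ]
        if hits:
            found[word] = hits
    return found


# Usage with segments reduced to plain labels (illustrative data only).
record = ["k", "a", "t", "s"]
dictionary = {"cat": ["k", "a", "t"], "at": ["a", "t"], "dog": ["d", "o", "g"]}
found = locate_entries(record, dictionary, lambda s, e: list(s) == list(e))
```

With exact label equality as the predicate, `found` maps `"cat"` to the stretch `(0, 3)` and `"at"` to `(1, 3)`, answering both whether and where each entry occurs.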
25. A data processing system for recognizing a sound record of a human utterance, comprising:
a segmentation engine for dividing the sound record into a sequence of one or more segments; and
a comparison engine for comparing a plurality of dictionary entries with the sound record, each dictionary entry being incrementally compared with a continuous stretch of segments of the sound record.

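The claims do not state how the segmentation engine divides the record. As a stand-in, the sketch below uses a simple short-time energy threshold, a swapped-in technique chosen only to show the shape of the output (a sequence of segments); it is not the segmentation described in the specification. The name `segment_by_energy` and the parameters `frame_len` and `threshold` are hypothetical.

```python
from typing import List, Sequence, Tuple


def segment_by_energy(
    samples: Sequence[float], frame_len: int, threshold: float
) -> List[Tuple[int, int, bool]]:
    """Split `samples` into (start, end, is_loud) runs of consecutive frames.

    Each frame's mean squared amplitude is compared with `threshold`;
    neighboring frames with the same label are merged into one segment.
    """
    if frame_len <= 0:
        raise ValueError("frame_len must be positive")
    segments: List[Tuple[int, int, bool]] = []
    for start in range(0, len(samples), frame_len):
        frame = samples[start:start + frame_len]
        energy = sum(x * x for x in frame) / len(frame)
        loud = energy >= threshold
        end = start + len(frame)
        if segments and segments[-1][2] == loud:
            # Same label as the previous frame: extend the current segment.
            segments[-1] = (segments[-1][0], end, loud)
        else:
            segments.append((start, end, loud))
    return segments


# Usage on a toy signal: silence, a burst, then silence again.
toy = [0.0] * 4 + [1.0] * 4 + [0.0] * 2
toy_segments = segment_by_energy(toy, frame_len=2, threshold=0.5)
```

For the toy signal, `toy_segments` is `[(0, 4, False), (4, 8, True), (8, 10, False)]`, a sequence of three segments that a comparison engine could then consume.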
30. A data processing system for recognizing a sound record of a human utterance, comprising:
a segmentation engine for dividing the sound record into a sequence of at least one segment;
an algorithm for processing a plurality of stored dictionary entries against the sound record, comprising for each of a plurality of dictionary entries:
determining an expected number of segments associated with the dictionary entry; and
comparing the dictionary entry against the sound record by incrementally testing each continuous stretch of the sound record having a length substantially equal to the expected number of segments.
Dependent claims: 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48

49. A computer program product, comprising:
a computer-usable medium; and
a data processing method stored on the medium for recognizing a sound record of a human utterance, comprising computer instructions for:
dividing the sound record into a sequence of one or more segments; and
comparing a plurality of dictionary entries with the sound record, each dictionary entry being incrementally compared with a continuous stretch of segments of the sound record.

50. A computer program product, comprising:
a computer-usable medium; and
a data processing method stored on the medium for recognizing a sound record of a human utterance, comprising computer instructions for:
dividing the sound record into a sequence of at least one segment;
processing a plurality of stored dictionary entries against the sound record, comprising for each of a plurality of dictionary entries:
determining an expected number of segments associated with the dictionary entry; and
comparing the dictionary entry against the sound record by incrementally testing each continuous stretch of the sound record having a length substantially equal to the expected number of segments.

Specification