Method and system for automated speech recognition that rearranges an information data base to disrupt formation of recognition artifacts
First Claim
1. A method for automated speech recognition, wherein audio input information is matched to data base information stored in a data base by at least one matching algorithm, and wherein before the input information is matched to the data base information, the data base information is arranged in the data base in a data base information structure, the method comprising:
- inputting audio input information from an audio input device;
performing a structural analysis of the content of the data base by comparing structural parameters of the data base content to predefined requirements and using the results of the structural analysis to decide whether a rearrangement procedure of the data base information from a data base information structure to a matching information structure is required;
if a rearrangement procedure of the data base information is required;
selecting one of multiple rearrangement procedures based on the result of the structural analysis, the selected rearrangement procedure rearranging the data base information from the data base information structure into a matching information structure which differs from the data base information structure, wherein the selected one of the multiple rearrangement procedures in the step of rearranging performs an algorithm that addresses the relationship between entries of the data base information, which are elements in a word list, by quantifying a degree of similarity between the entries by measuring a relevant phonetic distance between the entries and rearranging the entries in a way that is a function of the degree of similarity;
redistributing entries corresponding to words whose phonetic distance is below a phonetic distance threshold into subdirectories so that the entries whose phonetic distance is below the phonetic distance threshold are separated in different subdirectories in order to disrupt the forming of recognition artifacts due to their similarity; and
applying a speech recognition program that includes the at least one matching algorithm to the rearranged data base information and matching the audio input information to the rearranged data base information to recognize the speech content of the audio input information from the audio input device.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and a method perform information recognition. The method arranges data base information in a data base information structure. The method matches input information to the data base information using at least one matching algorithm and using a matching information structure. In accordance with the system and the method, the matching information structure differs from the data base information structure.
25 Citations
15 Claims
-
1. A method for automated speech recognition, wherein audio input information is matched to data base information stored in a data base by at least one matching algorithm, and wherein before the input information is matched to the data base information, the data base information is arranged in the data base in a data base information structure, the method comprising:
-
inputting audio input information from an audio input device; performing a structural analysis of the content of the data base by comparing structural parameters of the data base content to predefined requirements and using the results of the structural analysis to decide whether a rearrangement procedure of the data base information from a data base information structure to a matching information structure is required; if a rearrangement procedure of the data base information is required; selecting one of multiple rearrangement procedures based on the result of the structural analysis, the selected rearrangement procedure rearranging the data base information from the data base information structure into a matching information structure which differs from the data base information structure, wherein the selected one of the multiple rearrangement procedures in the step of rearranging performs an algorithm that addresses the relationship between entries of the data base information, which are elements in a word list, by quantifying a degree of similarity between the entries by measuring a relevant phonetic distance between the entries and rearranging the entries in a way that is a function of the degree of similarity; redistributing entries corresponding to words whose phonetic distance is below a phonetic distance threshold into subdirectories so that the entries whose phonetic distance is below the phonetic distance threshold are separated in different subdirectories in order to disrupt the forming of recognition artifacts due to their similarity; and applying a speech recognition program that includes the at least one matching algorithm to the rearranged data base information and matching the audio input information to the rearranged data base information to recognize the speech content of the audio input information from the audio input device. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 13)
-
-
11. A system for automated speech recognition with a data base containing data base information being stored in the data base in a data base information structure, the system comprising:
-
an audio input device that provides audio input information; at least one matching means containing at least one matching algorithm as computer program to match the audio input information to data base information; a rearranging means to rearrange the data base information into a matching information structure to be matched with the audio input information by the at least one matching means, the rearranging means performing an algorithm that addresses the relationship between entries of the data base information, which are elements in a word list, by quantifying a degree of similarity between the entries by measuring a relevant phonetic distance between the entries and rearranging the entries in a way that is a function of the degree of similarity;
wherein;entries corresponding to words sounding too similar, the rearranging means redistribute the entries either within the data base or within or between subdirectories in order to disrupt the forming of recognition artifacts due to their similarity, by grouping words having a measured relevant phonetic distance below a selected threshold level into different subsets; and a plurality of speech recognition programs each with a matching algorithm applied to the data base information by matching the audio input information to each subset of the rearranged data base information, wherein the audio input information is matched to each subset by different speech recognition programs, and wherein each information subset match results in a candidate set of match candidates. - View Dependent Claims (12)
-
-
14. A method for automated speech recognition, wherein audio input information is matched to data base information stored in a data base by at least one matching algorithm, the data base information comprising a plurality of entries stored in the data base, wherein before the input information is matched to the data base information, the data base information is arranged in the data base in a data base information structure;
- the method comprising;
inputting audio input information from an audio input device; rearranging the data base information into a plurality of information subsets, rearranging the data base information from the data base information structure into a matching information structure in the subsets which differs from the data base information structure by redistributing data base information corresponding to words whose phonetic distance is below a phonetic distance threshold into subdirectories so that the entries whose phonetic distance is below the phonetic distance threshold are separated in different subdirectories in order to disrupt the forming of recognition artifacts due to their similarity, and applying a plurality of speech recognition programs each with a matching algorithm to the data base information by matching the audio input information to each subset of the rearranged data base information, wherein the audio input information is matched to each subset by different speech recognition programs, and wherein each information subset match results in a candidate set of match candidates.
- the method comprising;
-
15. A method for automated speech recognition, wherein audio input information is matched to data base information stored in a data base by at least one matching algorithm, and wherein before the input information is matched to the data base information, the data base information is arranged in the data base in a data base information structure, the method comprising:
-
inputting audio input information from an audio input device; performing a structural analysis of the content of the data base by comparing structural parameters of the data base content to predefined requirements; deciding, based on the result of the structural analysis, whether a rearrangement procedure of the data base information from a data base information structure to a matching information structure is required, and if a rearrangement procedure of the data base information is required; selecting one of multiple rearrangement procedures based on the result of the structural analysis procedure; and rearranging the data base information from the data base information structure into a matching information structure which differs from the data base information structure by redistributing data base information corresponding to words whose phonetic distance is below a phonetic distance threshold into subdirectories so that the entries whose phonetic distance is below the phonetic distance threshold are separated in different subdirectories in order to disrupt the forming of recognition artifacts due to their similarity; and applying a plurality of speech recognition programs each with a matching algorithm to the data base information by matching the audio input information to each subset of the rearranged data base information, wherein the audio input information is matched to each subset by different speech recognition programs, and wherein each information subset match results in a candidate set of match candidates.
-
Specification