AUTOMATIC SPEECH RECOGNITION BASED UPON INFORMATION RETRIEVAL METHODS
First Claim
Patent Images
1. In a computing environment, a system comprising:
- a recognition mechanism that processes audio input into acoustic units;
a feature extraction mechanism that processes the acoustic units into features derived from the acoustic units; and
an information retrieval-based scoring mechanism that inputs the features and determines one or more words or acoustic scores associated with words based upon the features.
2 Assignments
0 Petitions
Accused Products
Abstract
Described is a technology in which information retrieval (IR) techniques are used in a speech recognition (ASR) system. Acoustic units (e.g., phones, syllables, multi-phone units, words and/or phrases) are decoded, and features found from those acoustic units. The features are then used with IR techniques (e.g., TF-IDF based retrieval) to obtain a target output (a word or words). Also described is the use of IR techniques to provide a full large vocabulary continuous speech (LVCSR) recognizer
90 Citations
20 Claims
-
1. In a computing environment, a system comprising:
-
a recognition mechanism that processes audio input into acoustic units; a feature extraction mechanism that processes the acoustic units into features derived from the acoustic units; and an information retrieval-based scoring mechanism that inputs the features and determines one or more words or acoustic scores associated with words based upon the features. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
- 14. In a computing environment, a method performed on at least one processor, comprising, processing audio input into acoustic units, extracting features corresponding to the acoustic units, and using information retrieval-based scoring to determine acoustic scores for words based upon the features.
-
18. One or more computer-readable media having computer-executable instructions, which when executed perform steps, comprising:
-
receiving speech; extracting units based upon the speech and hypothesized word boundaries; determining candidate words that are associated with the units; computing an information-retrieval based acoustic score for each candidate word and associating that acoustic score with that candidate word; and sorting the candidate words by acoustic score. - View Dependent Claims (19, 20)
-
Specification