Recognition of Speech With Different Accents
First Claim
1. A method for recognizing speech, comprising:
loading a digital representation of a first human utterance;
processing the digital first utterance with a first accent category model;
processing the digital first utterance with a second accent category model;
selecting a category of accents based on results from the processing the first accent category model and the processing the second accent category model;
selecting a plurality of accent models belonging to the selected category of accents;
loading a digital representation of a second human utterance;
processing the digital second utterance with each of the selected plurality of accent models; and
fusing the results of the processing the digital second utterance to produce a recognition output.
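The two-stage flow recited in claim 1 can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented implementation: the claim does not say how models are represented or how results are fused, so the callable model types, the summed-confidence fusion rule, and all names and toy values below are hypothetical.

```python
from collections import defaultdict
from typing import Callable, Dict, List, Tuple

Utterance = List[float]                                  # stand-in for digitized audio
CategoryModel = Callable[[Utterance], float]             # assumed: higher score = better match
AccentModel = Callable[[Utterance], Tuple[str, float]]   # assumed: (hypothesis, confidence)

def select_category(utterance: Utterance, models: Dict[str, CategoryModel]) -> str:
    """Process the first utterance with every category model and keep the best scorer."""
    return max(models, key=lambda name: models[name](utterance))

def fuse(results: List[Tuple[str, float]]) -> str:
    """Assumed fusion rule: sum confidences per hypothesis and return the top one."""
    totals: Dict[str, float] = defaultdict(float)
    for text, confidence in results:
        totals[text] += confidence
    return max(totals, key=lambda text: totals[text])

def recognize(first: Utterance, second: Utterance,
              category_models: Dict[str, CategoryModel],
              accents_by_category: Dict[str, List[AccentModel]]) -> Tuple[str, str]:
    category = select_category(first, category_models)
    # Only the accent models belonging to the selected category process the second utterance.
    results = [model(second) for model in accents_by_category[category]]
    return category, fuse(results)

def mean(u: Utterance) -> float:
    return sum(u) / len(u)

# Toy stand-ins for trained models (purely illustrative values).
category_models = {
    "british": lambda u: -abs(mean(u) - 1.0),
    "american": lambda u: -abs(mean(u) - 3.0),
}
accents_by_category = {
    "british": [lambda u: ("tomato", 0.6), lambda u: ("tomahto", 0.5), lambda u: ("tomahto", 0.3)],
    "american": [lambda u: ("tomato", 0.9)],
}
category, output = recognize([0.9, 1.1], [1.0, 1.0], category_models, accents_by_category)
print(category, output)
```

With these toy models the first utterance selects the "british" category, and the three accent models in that category are fused into a single hypothesis. A real system could fuse in other ways (for example, lattice combination or weighted voting); the claim language leaves that open.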
Abstract
Computer-based speech recognition can be improved by recognizing words with an accurate accent model. In order to provide a large number of possible accents, while providing real-time speech recognition, a language tree data structure of possible accents is provided in one embodiment such that a computerized speech recognition system can benefit from choosing among accent categories when searching for an appropriate accent model for speech recognition.
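One way to picture the language tree of accents mentioned in the abstract is a tree whose internal nodes are accent categories and whose leaves are individual accent models. The sketch below is a minimal illustration under assumptions: the `AccentNode` class, the `score` callables, the accent names, and the centroid-based toy scoring are all hypothetical and are not taken from the specification.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple

Utterance = List[float]  # stand-in for digitized audio

@dataclass
class AccentNode:
    name: str
    score: Callable[[Utterance], float]          # assumed: higher score = better match
    children: List["AccentNode"] = field(default_factory=list)

def descend(root: AccentNode, utterance: Utterance) -> Tuple[AccentNode, int]:
    """Walk from the root to a leaf, scoring only the children of the chosen node at each level.

    Returns the selected leaf and the number of models that were evaluated.
    """
    node, evaluated = root, 0
    while node.children:
        evaluated += len(node.children)
        node = max(node.children, key=lambda child: child.score(utterance))
    return node, evaluated

def near(center: float) -> Callable[[Utterance], float]:
    """Toy model: scores an utterance by how close its mean is to a center value."""
    return lambda u: -abs(sum(u) / len(u) - center)

tree = AccentNode("English", lambda u: 0.0, [
    AccentNode("British", near(1.0), [
        AccentNode("Scottish", near(0.8)),
        AccentNode("Welsh", near(1.2)),
    ]),
    AccentNode("American", near(3.0), [
        AccentNode("Southern", near(2.8)),
        AccentNode("Boston", near(3.2)),
    ]),
])

leaf, evaluated = descend(tree, [3.1, 3.3])
print(leaf.name, evaluated)
```

The point of the structure is that each level prunes whole subtrees, so the number of models evaluated grows with the depth and branching of the chosen path rather than with the total number of accents, which is what makes a large accent inventory compatible with real-time recognition.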
20 Claims
1. A method for recognizing speech, comprising:
loading a digital representation of a first human utterance;
processing the digital first utterance with a first accent category model;
processing the digital first utterance with a second accent category model;
selecting a category of accents based on results from the processing the first accent category model and the processing the second accent category model;
selecting a plurality of accent models belonging to the selected category of accents;
loading a digital representation of a second human utterance;
processing the digital second utterance with each of the selected plurality of accent models; and
fusing the results of the processing the digital second utterance to produce a recognition output.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. An apparatus for speech processing, comprising:
a first comparison module configured to determine a selected accent category based on whether a first accent category model or a second accent category model is a better match for a first human sound to be captured from an audio transducer; and
a second comparison module configured to determine which accent model of a plurality of accent models is a best match for a second human sound to be captured from the audio transducer, wherein the plurality of accent models is associated with the selected accent category.
Dependent claims: 12, 13, 14, 15
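The two comparison modules of claim 11 might be modeled as two small classes, the first choosing a category and the second choosing a single best accent model within it (a best match, rather than a fusion of several results). This is a hedged sketch only: the class names, the `Scorer` callables, the labels, and the toy scoring function are assumptions made for illustration, and audio capture from a transducer is replaced by plain lists of numbers.

```python
from typing import Callable, Dict, List

Sound = List[float]                   # stand-in for audio captured from a transducer
Scorer = Callable[[Sound], float]     # assumed: higher score = better match

class FirstComparisonModule:
    """Determines the accent category whose model better matches the first sound."""
    def __init__(self, category_models: Dict[str, Scorer]) -> None:
        self.category_models = category_models

    def select_category(self, sound: Sound) -> str:
        return max(self.category_models, key=lambda name: self.category_models[name](sound))

class SecondComparisonModule:
    """Determines the best-matching accent model within an already selected category."""
    def __init__(self, accent_models: Dict[str, Dict[str, Scorer]]) -> None:
        self.accent_models = accent_models

    def best_accent(self, category: str, sound: Sound) -> str:
        candidates = self.accent_models[category]
        return max(candidates, key=lambda name: candidates[name](sound))

def closeness(center: float) -> Scorer:
    """Toy model: scores a sound by how close its mean is to a center value."""
    return lambda s: -abs(sum(s) / len(s) - center)

first_module = FirstComparisonModule({"A": closeness(1.0), "B": closeness(3.0)})
second_module = SecondComparisonModule({
    "A": {"A1": closeness(0.8), "A2": closeness(1.2)},
    "B": {"B1": closeness(2.8), "B2": closeness(3.2)},
})

chosen_category = first_module.select_category([1.0, 1.2])               # first human sound
chosen_accent = second_module.best_accent(chosen_category, [1.2, 1.4])   # second human sound
print(chosen_category, chosen_accent)
```

Keeping the modules separate mirrors the claim structure: the second module never looks at accent models outside the category the first module selected.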
16. A non-transitory computer readable storage medium, comprising:
instructions for a processor to process a first accent category model and a second accent category model;
conditional instructions to process a first plurality of accent models based on a result of the first accent category model;
wherein accents represented in the first plurality of accent models are within a category represented by the first accent category model.
Dependent claims: 17, 18, 19, 20
Specification