CONTENT SELECTION USING SPEECH RECOGNITION
First Claim
1. A method used with a wireless communication device for selecting a content file from a set of content files using speech recognition, the method comprising:
- establishing a set of tagged text items wherein each tagged text item is uniquely associated with one content file of the set of content files;
receiving at least one audible utterance from a user;
identifying a set of phonemes associated with the received audible utterance;
generating a phoneme lattice based on the identified set of phonemes;
generating a phoneme lattice statistical model based on the phoneme lattice;
assigning a score to each tagged text item in a subset of the set of tagged text items based on the phoneme lattice statistical model; and
presenting one or more of the tagged text items having a score that is above a threshold.
2 Assignments
0 Petitions
Accused Products
Abstract
Disclosed are a method and wireless device for selecting a content file using speech recognition. The method includes establishing a set of tagged text items wherein each tagged text item is uniquely associated with one content file of the set of content files. At least one audible utterance (226) is received (804) from a user. A phoneme lattice (302) is generated (808) based on the audible utterance (226). A phoneme lattice statistical model is generated (810) based on the phoneme lattice (302). A score is assigned (1008) to the tagged text items based on probabilistic estimates in the phoneme lattice statistical model. A list of high scoring tagged text items is presented (1014) so that a selection of a content file may be made. A word lattice (402) and a word lattice statistical model are also used in some embodiments
66 Citations
20 Claims
-
1. A method used with a wireless communication device for selecting a content file from a set of content files using speech recognition, the method comprising:
-
establishing a set of tagged text items wherein each tagged text item is uniquely associated with one content file of the set of content files; receiving at least one audible utterance from a user; identifying a set of phonemes associated with the received audible utterance; generating a phoneme lattice based on the identified set of phonemes; generating a phoneme lattice statistical model based on the phoneme lattice; assigning a score to each tagged text item in a subset of the set of tagged text items based on the phoneme lattice statistical model; and presenting one or more of the tagged text items having a score that is above a threshold. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method used with a wireless communication device for selecting a content file from a set of content files, the method comprising:
-
establishing a set of tagged text items wherein each tagged text item is uniquely associated with one content file of the set of content files; generating a set of indexing N-grams from the set of tagged text items; receiving at least one audible utterance from a user; generating a phoneme lattice based on the received at least one audible utterance; generating a phoneme lattice statistical model based on the phoneme lattice; assigning a score to each indexing N-gram in the set of indexing N-grams based on the phoneme lattice statistical model; determining a subset of the set of indexing N-grams, wherein the indexing N-grams in the subset have an assigned score greater than a first threshold;
generating a word lattice based on the subset of indexing N-grams;generating a word lattice statistical model based on the word lattice; assigning a score to each tagged text item in a subset of the set of tagged text items, wherein the subset comprises tagged test items that are associated with the subset of indexing N-grams, and wherein the score assigned to each tagged text item is based on the word lattice statistical model; and presenting one or more of the tagged text items having scores above a second threshold. - View Dependent Claims (9, 10, 11, 12)
-
-
13. A wireless communication device comprising:
-
a memory; a processor communicatively coupled to the memory; and a speech responsive search engine communicatively coupled to the memory and the processor, the speech responsive search engine for; establishing a set of tagged text items wherein each tagged text item is uniquely associated with one content file of the set of content files; receiving at least one audible utterance from a user; identifying a set of phonemes associated with the received audible utterance; generating a phoneme lattice based on the identified set of phonemes; creating a phoneme lattice statistical model based on the phoneme lattice; assigning a score to each tagged text item in a subset of the set of tagged text items based on the phoneme lattice statistical model; and presenting one or more of the tagged text items having a score that is above a threshold. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
Specification