Systems and methods for performing ASR in the presence of heterographs
First Claim
Patent Images
1. A method for performing automatic speech recognition (ASR) when a heterographic word is present, the method comprising:
- receiving verbal input from a user that comprises a plurality of utterances;
matching a first of the plurality of utterances to a first word;
determining a word that describes the context for the first word;
determining that a second utterance in the plurality of utterances matches a plurality of words that are in a same heterograph set;
combining a second word chosen from the plurality of words with the word that describes the context for the first word to generate a first combined set of words;
storing a first value representing a distance between words in the first combined set of words;
combining a third word chosen from the plurality of words with the word that describes the context for the first word to generate a second combined set of words;
storing a second value representing a distance between words in the second combined set of words;
in response to determining that the second value is smaller than the first value, performing a media guidance application function on an available media asset based on the second combined set of words.
9 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for performing ASR in the presence of heterographs are provided. Verbal input is received from the user that includes a plurality of utterances. A first of the plurality of utterances is matched to a first word. It is determined that a second utterance in the plurality of utterances matches a plurality of words that is in a same heterograph set. It is identified which one of the plurality of words is associated with a context of the first word. A function is performed based on the first word and the identified one of the plurality of words.
-
Citations
18 Claims
-
1. A method for performing automatic speech recognition (ASR) when a heterographic word is present, the method comprising:
-
receiving verbal input from a user that comprises a plurality of utterances; matching a first of the plurality of utterances to a first word; determining a word that describes the context for the first word; determining that a second utterance in the plurality of utterances matches a plurality of words that are in a same heterograph set; combining a second word chosen from the plurality of words with the word that describes the context for the first word to generate a first combined set of words; storing a first value representing a distance between words in the first combined set of words; combining a third word chosen from the plurality of words with the word that describes the context for the first word to generate a second combined set of words; storing a second value representing a distance between words in the second combined set of words; in response to determining that the second value is smaller than the first value, performing a media guidance application function on an available media asset based on the second combined set of words. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system for performing automatic speech recognition (ASR) when a heterographic word is present, the system comprising:
-
control circuitry configured to; receive verbal input from a user that comprises a plurality of utterances; match a first of the plurality of utterances to a first word; determine a word that describes the context for the first word; determine that a second utterance in the plurality of utterances matches a plurality of words that are in a same heterograph set; combine a second word chosen from the plurality of words with the word that describes the context for the first word to generate a first combined set of words; store a first value representing a distance between words in the first combined set of words; combine a third word chosen from the plurality of words with the word that describes the context for the first word to generate a second combined set of words; store a second value representing a distance between words in the second combined set of words; and in response to determining that the second value is smaller than the first value, perform a media guidance application function on an available media asset based on the second combined set of words. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
Specification