Word matching with context sensitive character to sound correlating
Abstract
Systems, methods, media, and other embodiments associated with word matching with context sensitive character to sound correlating are described. One exemplary method embodiment includes automatically generating context sensitive character to sound correlation rules, making the rules available to a query processing logic, converting words into sets of sounds using the rules, and storing a data entry linking the word and set of sounds in a data store searchable by the query processing logic.
25 Claims
1. A method, comprising:
automatically generating one or more context sensitive character to sound correlation rules;
providing the one or more rules to a query processing logic;
converting a word into a first set of sounds using the one or more rules; and
storing the word and first set of sounds in a data store searchable by the query processing logic.
(Dependent claims: 2-17)
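The four steps of claim 1 can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the patented implementation: the hand-aligned word list `ALIGNED`, the sound symbols, and the function names are hypothetical, and a simple frequency table keyed on the previous and following letter stands in for whatever rule learner an embodiment might use.

```python
from collections import Counter, defaultdict

# Hypothetical, hand-aligned training data: exactly one sound symbol per
# letter. The words and symbols are invented for illustration.
ALIGNED = [
    ("cat", ["k", "ae", "t"]),
    ("city", ["s", "ih", "t", "iy"]),
    ("cot", ["k", "aa", "t"]),
    ("kit", ["k", "ih", "t"]),
]


def contexts(word):
    """Yield (previous letter, letter, following letter); '#' pads the ends."""
    padded = "#" + word + "#"
    for i in range(1, len(padded) - 1):
        yield padded[i - 1], padded[i], padded[i + 1]


def generate_rules(aligned):
    """Step 1: derive context sensitive rules plus a per-letter fallback."""
    by_context = defaultdict(Counter)
    by_letter = defaultdict(Counter)
    for word, sounds in aligned:
        for ctx, sound in zip(contexts(word), sounds):
            by_context[ctx][sound] += 1
            by_letter[ctx[1]][sound] += 1
    rules = {ctx: c.most_common(1)[0][0] for ctx, c in by_context.items()}
    fallback = {ch: c.most_common(1)[0][0] for ch, c in by_letter.items()}
    return rules, fallback


def to_sounds(word, rules, fallback):
    """Step 3: convert a word into a set of sounds using the rules."""
    return tuple(
        rules.get(ctx, fallback.get(ctx[1], "?")) for ctx in contexts(word)
    )


# Step 2: the rules are handed to whatever performs lookups (here, plain
# variables). Step 4: words are stored under their sounds for later search.
rules, fallback = generate_rules(ALIGNED)
data_store = defaultdict(set)
for word in ["cat", "kit", "city"]:
    data_store[to_sounds(word, rules, fallback)].add(word)
```

In this toy setup the letter "c" maps to "s" before "i" and to "k" elsewhere, which is the kind of context sensitivity the claim refers to. A later lookup for an unseen spelling such as "kat" would convert to the same key as "cat" through the per-letter fallback.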
18. A computer-readable medium storing processor executable instructions operable to perform a method, the method comprising:
automatically generating one or more recall biased context sensitive character to sound correlation rules using one or more culturally aware pronunciation dictionaries during machine learning training, the culturally aware pronunciation dictionaries including words having characters described in a phonetically characterized training set of characters, where automatically generating the one or more rules includes controlling a text-to-phoneme conversion logic to build grapheme-to-phoneme rules in the form of decision trees and includes providing as input to the text-to-phoneme conversion logic one or more pronunciation dictionaries, where the text-to-phoneme conversion logic relies on alignment where letters are matched with phonemes and a mapping is made between ordered lists of letters and phonemes;
creating a character specific training table for a character in the training set of characters, the character specific training table including one or more words in which the character is found, one or more grams for the character, and one or more sounds associated with the character, the character specific training table including one or more entries containing related words, grams, and sounds;
producing one or more feature vectors for a letter based, at least in part, on alignment, the feature vectors being configured to provide a context for the letter, where the context includes a relationship to one or more of, a previous letter, and a following letter;
providing the one or more rules to a query processing logic;
converting a word into a first set of sounds using the one or more rules;
storing the word and first set of sounds in a data store searchable by the query processing logic;
accepting a query term to match on pronunciation;
converting the query term into a second set of sounds using the one or more rules;
controlling the query processing logic to input a string of grams associated with the query term;
accessing the data store;
controlling the query processing logic to select one or more words from the data store based, at least in part, on matching the second set of sounds to one or more first set of sounds;
controlling the query processing logic to provide one or more confidences related to the one or more words; and
computing an overall confidence for a match for a word selected from the data store from confidences related to the letters in the word.
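The query-side steps at the end of claim 18 (converting the query term, matching sound sets, and deriving an overall confidence from per-letter confidences) might look roughly like the sketch below. Everything concrete here is an assumption made for illustration: the `RULES` table and its confidence values are invented, rules look only at the following letter, and the overall confidence is taken as the product of the letter confidences, a combination the claim itself does not specify.

```python
import math

# Hypothetical rule table: (letter, following letter) -> (sound, confidence).
# A following letter of None marks the default rule for that letter; an
# empty sound means the letter is silent. All values are invented.
RULES = {
    ("j", None): ("jh", 0.95),
    ("o", None): ("aa", 0.80),
    ("a", None): ("ae", 0.80),
    ("n", None): ("n", 0.95),
    ("h", "n"): ("", 0.90),
    ("h", None): ("hh", 0.90),
}


def convert(word):
    """Return (sounds, overall confidence) for a word."""
    sounds, confidences = [], []
    for i, ch in enumerate(word):
        nxt = word[i + 1] if i + 1 < len(word) else "#"
        # Prefer the context-specific rule, then the letter's default rule.
        sound, conf = RULES.get((ch, nxt), RULES.get((ch, None), ("?", 0.1)))
        if sound:
            sounds.append(sound)
        confidences.append(conf)
    # Assumption: overall confidence is the product of letter confidences.
    return tuple(sounds), math.prod(confidences)


def build_store(words):
    """Index each word, with its confidence, under its first set of sounds."""
    store = {}
    for word in words:
        sounds, conf = convert(word)
        store.setdefault(sounds, []).append((word, conf))
    return store


def match(query, store):
    """Convert the query to a second set of sounds and look it up."""
    sounds, _ = convert(query)
    return sorted(store.get(sounds, []), key=lambda m: m[1], reverse=True)


store = build_store(["john", "jan"])
matches = match("jon", store)
```

With this table, "jon" and "john" convert to the same sounds because "h" is treated as silent before "n", so the query selects "john" and reports a confidence built from the four letter-level values for that word.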
19. A system, comprising:
one or more data stores configured to store one or more text to sound pronunciation data entries, one or more text training words, one or more text to sound conversion rules, and one or more text and sound representation data entries; and
a machine learning logic configured to automatically generate one or more text to sound conversion rules from the text to sound pronunciation data entries and the text training words, to store the text to sound conversion rules, to automatically generate one or more text and sound representation data entries, and to store the one or more text and sound representation data entries.
(Dependent claims: 20-23)
24. A system, comprising:
means for computing a control data for selectively controlling a text to sound conversion logic;
means for computing a set of sounds from a word; and
means for matching a first set of sounds to a second set of sounds, the first set of sounds being computed from a first word and the second set of sounds being computed from a second word.
25. A set of application programming interfaces embodied on a computer-readable medium for execution by a computer component in conjunction with word matching with context sensitive character to sound correlating, comprising:
a first interface for communicating a text to sound pronunciation data; and
a second interface for communicating a text to sound conversion rule that is based, at least in part, on the text to sound pronunciation data.
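One way to picture the two interfaces of claim 25 is as a pair of structural types, one carrying pronunciation data in and one carrying derived conversion rules out. The sketch below is an assumption-laden illustration only: the names `PronunciationEntry`, `ConversionRule`, `PronunciationDataInterface`, `ConversionRuleInterface`, and `InMemoryComponent` are hypothetical, and the rule derivation simply records every observed (letter, following letter, sound) triple.

```python
from dataclasses import dataclass
from typing import List, Protocol, Tuple


@dataclass(frozen=True)
class PronunciationEntry:
    """Text to sound pronunciation data: one sound per letter (assumed)."""
    word: str
    sounds: Tuple[str, ...]


@dataclass(frozen=True)
class ConversionRule:
    """A rule keyed on a letter and the letter that follows it."""
    letter: str
    next_letter: str
    sound: str


class PronunciationDataInterface(Protocol):
    """First interface: communicates pronunciation data."""
    def send_pronunciation_data(self, entries: List[PronunciationEntry]) -> None: ...


class ConversionRuleInterface(Protocol):
    """Second interface: communicates rules based on that data."""
    def receive_conversion_rules(self) -> List[ConversionRule]: ...


class InMemoryComponent:
    """Toy component that satisfies both protocols structurally."""

    def __init__(self) -> None:
        self._entries: List[PronunciationEntry] = []

    def send_pronunciation_data(self, entries: List[PronunciationEntry]) -> None:
        self._entries.extend(entries)

    def receive_conversion_rules(self) -> List[ConversionRule]:
        rules = set()
        for entry in self._entries:
            for i, (ch, sound) in enumerate(zip(entry.word, entry.sounds)):
                nxt = entry.word[i + 1] if i + 1 < len(entry.word) else "#"
                rules.add(ConversionRule(ch, nxt, sound))
        return sorted(rules, key=lambda r: (r.letter, r.next_letter, r.sound))


component = InMemoryComponent()
component.send_pronunciation_data([PronunciationEntry("cat", ("k", "ae", "t"))])
derived_rules = component.receive_conversion_rules()
```

Keeping the two directions as separate protocols mirrors the claim's wording, in which the second interface depends on data supplied through the first.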
Specification