Updating phonetic dictionaries
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for updating phonetic dictionaries. In one aspect, a method includes accessing a phonetic dictionary that identifies terms and one or more phonetic representations associated with each term, determining that a particular term that is identified in the phonetic dictionary is a spelling correction for another term that is identified in the phonetic dictionary, and storing, in the phonetic dictionary, one or more of the phonetic representations associated with the other term, with the particular term that is a spelling correction for the other term.
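The method summarized in the abstract can be illustrated with a minimal Python sketch. It assumes the phonetic dictionary is an in-memory mapping from each term to a set of phonetic representations, and that the spelling-correction relationships are already known. The function name `share_pronunciations`, the `correction_of` mapping, and the phoneme strings are hypothetical and chosen only for illustration; they do not come from the patent.

```python
from typing import Dict, Set

# Assumed shape of the phonetic dictionary: term -> set of phonetic representations.
PhoneticDict = Dict[str, Set[str]]


def share_pronunciations(phonetic_dict: PhoneticDict,
                         correction_of: Dict[str, str]) -> PhoneticDict:
    """Store the other term's pronunciations with its spelling correction.

    correction_of maps an "other term" to the "particular term" that is its
    spelling correction. Both terms must already be in the dictionary.
    """
    for other_term, particular_term in correction_of.items():
        if other_term in phonetic_dict and particular_term in phonetic_dict:
            # Union keeps the particular term's own representations as well.
            phonetic_dict[particular_term] |= phonetic_dict[other_term]
    return phonetic_dict


# Illustrative data only; the phoneme strings are placeholders.
example = {
    "grey": {"G R EY"},
    "gray": {"G R AE"},
}
share_pronunciations(example, {"grey": "gray"})
```

After the call, the entry for "gray" holds both representations, while the entry for "grey" is unchanged.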
14 Claims
1. A computer-implemented method comprising:
identifying, from among a set of query terms, a particular query term that (i) does not occur in a lexicon of terms, and (ii) has no designated, canonical phonetic representation in a pronunciation phonetic dictionary, wherein a canonical phonetic representation comprises a sequence of phonemes;
generating a phonetic representation estimate for the particular query term that (i) does not occur in the lexicon of terms, and (ii) has no designated, canonical phonetic representation in the pronunciation phonetic dictionary;
transmitting data identifying at least a portion of a term that does occur in the lexicon of terms and the particular query term to a spelling correction server;
receiving, from the spelling correction server, data that specifies a spelling correction confidence score, wherein the spelling correction confidence score reflects a probability that the term that does occur in the lexicon of terms is a correct spelling of the particular query term;
determining that the spelling correction confidence score satisfies a predetermined threshold; and
in response to determining that the spelling correction confidence score satisfies a predetermined threshold, designating, by one or more computing devices, the phonetic representation estimate for the particular query term as a canonical phonetic representation, in the phonetic dictionary, of the term that does occur in the lexicon of terms.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
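The steps recited in claim 1 can be sketched in Python as follows. This is a rough illustration under stated assumptions, not the patented implementation: `toy_g2p` stands in for whatever produces the phonetic representation estimate, `toy_spelling_score` stands in for the spelling correction server (it uses `difflib.SequenceMatcher` string similarity, which is not a true probability), the threshold of 0.8 is arbitrary, "satisfies" is read as greater than or equal to, and choosing the best-scoring lexicon term is a simplification that the claim does not specify.

```python
from difflib import SequenceMatcher
from typing import Callable, Dict, Iterable, List, Optional, Set

Phonemes = List[str]


def toy_g2p(term: str) -> Phonemes:
    # Hypothetical estimator: one pseudo-phoneme per letter.
    return [ch.upper() for ch in term if ch.isalpha()]


def toy_spelling_score(lexicon_term: str, query_term: str) -> float:
    # Hypothetical stand-in for the spelling correction server's score.
    return SequenceMatcher(None, lexicon_term, query_term).ratio()


def update_phonetic_dictionary(
    query_terms: Iterable[str],
    lexicon: Set[str],
    phonetic_dict: Dict[str, Phonemes],
    score_fn: Callable[[str, str], float] = toy_spelling_score,
    g2p: Callable[[str], Phonemes] = toy_g2p,
    threshold: float = 0.8,  # assumed value
) -> Dict[str, Phonemes]:
    for query_term in query_terms:
        # Identify a query term absent from both the lexicon and the dictionary.
        if query_term in lexicon or query_term in phonetic_dict:
            continue
        # Generate a phonetic representation estimate for that query term.
        estimate = g2p(query_term)
        # Send (lexicon term, query term) pairs to the scorer and keep the best.
        best_term: Optional[str] = None
        best_score = 0.0
        for lexicon_term in sorted(lexicon):  # sorted for deterministic ties
            score = score_fn(lexicon_term, query_term)
            if score > best_score:
                best_term, best_score = lexicon_term, score
        # If the score satisfies the threshold, designate the estimate as the
        # canonical representation of the lexicon term.
        if best_term is not None and best_score >= threshold:
            phonetic_dict[best_term] = estimate
    return phonetic_dict


result = update_phonetic_dictionary(
    ["brittney", "madonna", "xyzzy"], {"britney", "madonna"}, {}
)
```

In this run, "madonna" is skipped because it occurs in the lexicon, "xyzzy" has no sufficiently similar lexicon term, and the estimate generated for the misspelling "brittney" is stored under the lexicon term "britney".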
9. A system comprising:
one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising:
identifying, from among a set of query terms, a particular query term that (i) does not occur in a lexicon of terms, and (ii) has no designated, canonical phonetic representation in a pronunciation phonetic dictionary, wherein a canonical phonetic representation comprises a sequence of phonemes;
generating a phonetic representation estimate for the particular query term that (i) does not occur in the lexicon of terms, and (ii) has no designated, canonical phonetic representation in the pronunciation phonetic dictionary;
transmitting data identifying at least a portion of a term that does occur in the lexicon of terms and the particular query term to a spelling correction server;
receiving, from the spelling correction server, data that specifies a spelling correction confidence score, wherein the spelling correction confidence score reflects a probability that the term that does occur in the lexicon of terms is a correct spelling of the particular query term;
determining that the spelling correction confidence score satisfies a predetermined threshold; and
in response to determining that the spelling correction confidence score satisfies a predetermined threshold, designating, by one or more computing devices, the phonetic representation estimate for the particular query term as a canonical phonetic representation, in the phonetic dictionary, of the term that does occur in the lexicon of terms.
Dependent claims: 10, 11
12. A non-transitory computer-readable storage device storing software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising:
identifying, from among a set of query terms, a particular query term that (i) does not occur in a lexicon of terms, and (ii) has no designated, canonical phonetic representation in a pronunciation phonetic dictionary, wherein a canonical phonetic representation comprises a sequence of phonemes;
generating a phonetic representation estimate for the particular query term that (i) does not occur in the lexicon of terms, and (ii) has no designated, canonical phonetic representation in the pronunciation phonetic dictionary;
transmitting data identifying at least a portion of a term that does occur in the lexicon of terms and the particular query term to a spelling correction server;
receiving, from the spelling correction server, data that specifies a spelling correction confidence score, wherein the spelling correction confidence score reflects a probability that the term that does occur in the lexicon of terms is a correct spelling of the particular query term;
determining that the spelling correction confidence score satisfies a predetermined threshold; and
in response to determining that the spelling correction confidence score satisfies a predetermined threshold, designating, by one or more computing devices, the phonetic representation estimate for the particular query term as a canonical phonetic representation, in the phonetic dictionary, of the term that does occur in the lexicon of terms.
Dependent claims: 13, 14
Specification