Understanding spoken location information based on intersections
First Claim
Patent Images
1. A system for recognizing location information input by a user, the location information including an intersection of streets, the system comprising:
- a grammar having a street grammar component that maps various different pronunciations of street names to semantic tags;
a recognition system configured to receive the location information input, access the grammar, and generate a street name recognition hypothesis with associated semantic tags;
an intersection data store storing a plurality of valid street intersections, each street intersection having street names of intersecting streets;
a tokenization component configured to receive the street name recognition hypothesis and the associated semantic tags and to generate, for each of the associated semantic tags, a tokenized query having phonemes that are labeled with a digits position;
a search component configured to search the intersection data store based on the tokenized queries and output at least one recognized intersection result identifying a street intersection recognized from the location information input by the user; and
wherein the phonemes are labeled with a “
100”
for a 100'"'"'s place, with a “
10”
for a 10'"'"'s place, with a “
1”
for a 1'"'"'s place, and with a “
0”
for an ordinal suffix.
2 Assignments
0 Petitions
Accused Products
Abstract
In one embodiment, the present system recognizes a user'"'"'s speech input using an automatically generated probabilistic context free grammar for street names that maps all pronunciation variations of a street name to a single canonical representation during recognition. A tokenizer expands the representation using position-dependent phonetic tokens and an intersection classifier classifies an intersection, despite the presence of recognition errors and incomplete street names.
-
Citations
18 Claims
-
1. A system for recognizing location information input by a user, the location information including an intersection of streets, the system comprising:
-
a grammar having a street grammar component that maps various different pronunciations of street names to semantic tags; a recognition system configured to receive the location information input, access the grammar, and generate a street name recognition hypothesis with associated semantic tags; an intersection data store storing a plurality of valid street intersections, each street intersection having street names of intersecting streets; a tokenization component configured to receive the street name recognition hypothesis and the associated semantic tags and to generate, for each of the associated semantic tags, a tokenized query having phonemes that are labeled with a digits position; a search component configured to search the intersection data store based on the tokenized queries and output at least one recognized intersection result identifying a street intersection recognized from the location information input by the user; and wherein the phonemes are labeled with a “
100”
for a 100'"'"'s place, with a “
10”
for a 10'"'"'s place, with a “
1”
for a 1'"'"'s place, and with a “
0”
for an ordinal suffix. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A method of generating a street names grammar for use in an intersection recognition system, the method comprising:
-
receiving an intersection with street names; parsing each street name into a sequence of entities representing individual components of the street name; utilizing a processor of a computing device to construct a graph based on the entities, the graph having a plurality of paths representing a plurality of possible pronunciations of the street name, each path through the graph representing a different pronunciation; generating an utterance representation for each of the plurality of possible pronunciations; mapping each utterance representation corresponding to a given pronunciation to a common semantic tag; assigning a prior probability score to each pronunciation; outputting the plurality of possible pronunciations, utterance representations, semantic tags and prior probabilities as the street names grammar for use in a street names understanding system; and generating additional utterance representations for each path through the graph, the additional utterance representations including alternative pronunciations and prefix substitutions for the given path.
-
Specification