Spatially indexed grammar and methods of use
First Claim
1. A non-transitory computer readable medium having stored thereon one or more sequences of instructions for causing one or more processors to perform the steps for speech recognition using tiles, each tile defines an area on the surface of the earth and the defined area is a more or less symmetrical shape defined by a coordinate system and can have a spatially proximate relationship to other tiles, a grammar is associated with each tile and is derived from feature names of first level identifiers in the area defined by the tile and, each first level identifier corresponds to a point on the surface of the earth, the steps comprising:
- identifying a candidate area location;
choosing an initial tile based on the candidate area location;
comparing an utterance of a first level identifier against a first grammar associated with the initial tile to determine if the utterance corresponds to a feature from which the first grammar was derived, wherein the grammar comprises an audio file associated with each of the feature names of the first level identifiers associated with the tile, and wherein the utterance corresponds to a feature from which the first grammar was derived if the utterance matches an audio file associated with the first grammar;
if the utterance does not correspond to a feature in the first grammar,determining a plurality of second tiles; and
comparing the utterance of the first level identifier against the plurality of grammars associated with the plurality of second tiles to determine if the utterance corresponds to a feature from which the plurality of grammars was derived; and
determining a point coordinate associated with the feature which matches the utterance.
16 Assignments
0 Petitions
Accused Products
Abstract
Improved systems and methods are described which simplify the individual'"'"'s interaction with speech recognition software, expand the database of spoken point names that can be recognized, and increase the quality and therefore likelihood of success of speech recognition applications. The present systems and methods apply to various uses, such as providing driving directions, finding the nearest location based service, and finding the nearest “Where Am I?” type of location based services.
-
Citations
39 Claims
-
1. A non-transitory computer readable medium having stored thereon one or more sequences of instructions for causing one or more processors to perform the steps for speech recognition using tiles, each tile defines an area on the surface of the earth and the defined area is a more or less symmetrical shape defined by a coordinate system and can have a spatially proximate relationship to other tiles, a grammar is associated with each tile and is derived from feature names of first level identifiers in the area defined by the tile and, each first level identifier corresponds to a point on the surface of the earth, the steps comprising:
-
identifying a candidate area location; choosing an initial tile based on the candidate area location; comparing an utterance of a first level identifier against a first grammar associated with the initial tile to determine if the utterance corresponds to a feature from which the first grammar was derived, wherein the grammar comprises an audio file associated with each of the feature names of the first level identifiers associated with the tile, and wherein the utterance corresponds to a feature from which the first grammar was derived if the utterance matches an audio file associated with the first grammar; if the utterance does not correspond to a feature in the first grammar, determining a plurality of second tiles; and comparing the utterance of the first level identifier against the plurality of grammars associated with the plurality of second tiles to determine if the utterance corresponds to a feature from which the plurality of grammars was derived; and determining a point coordinate associated with the feature which matches the utterance. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A computerized system for generating a spatially indexed grammar associated with place names, the computerized system comprising:
-
a processor; a memory coupled to the processor; a tile-code generation module which partitions a geographical area into a plurality of finite non-overlapping regions and assigns an index to each of the finite regions, the index configured to be determined using a coordinate system by interleaving a first digit of a horizontal coordinate associated with one of the finite regions with a first digit of a vertical coordinate associated with one of the finite regions, interleaving a second digit of the horizontal coordinate with a second digit of the vertical coordinate, interleaving a third digit of the horizontal coordinate with a third digit of the vertical coordinate, and adding a fourth digit of the horizontal coordinate; a feature generation module which classifies a number of points on each of the place names within each of the finite regions, each of the points comprising names of first level identifiers in an area defined by the finite regions; a sorting and de-duping module which sorts the place names associated with the place names in each of the finite regions and removes one or more duplicate points within each of the finite regions; and a tile file generation module which generates a list of features and associated speech recognition grammars for at least a first and a second of the finite regions, after the features are sorted and the duplicate features are removed from the first and the second of the finite regions. - View Dependent Claims (14, 15, 16, 17)
-
-
18. A method for matching a spoken location description with data associated with a feature that exists geographically within the coordinate location on the surface of the earth, the method comprising:
-
determining a coordinate associated with a specific location; using one or more processors to choose an initial tile and a first grammar associated with the initial tile, the initial tile being substantially centered around the coordinate and defining an area on the surface of the earth and the defined area is a more or less symmetrical shape defined by a coordinate system and can have a spatially proximate relationship to other tiles, the grammar being associated with the initial tile and being derived from feature names of first level identifiers in the area defined by the initial tile and each first level identifier corresponding to a point on the surface of the earth; obtaining a spoken language description of a point within the specific location; using one or more processors to compare the point with the data associated with the feature, wherein the first grammar comprises an audio file associated with each of the feature names of the first level identifiers associated with the tile, and wherein the utterance corresponds to a feature from which the first grammar was derived if the spoken language description matches an audio file associated with the first grammar; querying a geocoding module with text associated with the feature, the text associated with the feature comprises a text representation of the audio file that matched the spoken language description; and providing information associated with the feature. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26, 27, 28)
-
-
29. A spatially indexed grammar apparatus comprising:
-
a place grammar to output a candidate area location associated with a wireless device; a point coordinate database to output a point coordinate associated with the candidate area location; an initial tile from a set of tiles, each tile representing a finite non-overlapping region, the initial tile having an associated speech recognition grammar, the initial tile having a relationship to the point coordinate; a plurality of additional tiles associated with the grammar, the plurality of additional tiles having a spatially proximate relationship to the initial tile; a voice response system configured to match an utterance to a feature in the speech recognition grammar; and a geocoding module to receive a text-based representation of a feature from the grammar, the feature matching a spoken location description of a first level identifier from a user. - View Dependent Claims (30, 31, 32, 33, 34, 35)
-
-
36. A method for speech recognition using tiles, each tile defines an area on the surface of the earth and the defined area is a more or less symmetrical shape defined by a coordinate system and can have a spatially proximate relationship to other tiles, the method comprising:
-
one or more processors using a place grammar to output a candidate area location associated with a wireless device; one or more processors using a point coordinate database to output a point coordinate associated with the candidate area location; choosing an initial tile from a set of tiles, the initial tile having an associated first speech recognition grammar, the initial tile having a relationship to the point coordinate; choosing a plurality of additional tiles, the plurality of additional tiles having a spatially proximate relationship to the initial tile, and an associated second speech recognition grammar; and providing a text-based representation of a feature from the first or the second grammar to a geocoding module, the feature matching a spoken location description of a first level identifier from a user, wherein the spoken location description matches an audio file associated with a feature name from the first or the second grammar. - View Dependent Claims (37)
-
-
38. A method for speech recognition using tiles, each tile defines an area on the surface of the earth and the defined area is a more or less symmetrical shape defined by a coordinate system and can have a spatially proximate relationship to other tiles, a grammar is associated with each tile and is derived from feature names of first level identifiers in the area defined by the tile and, each first level identifier corresponds to a point on the surface of the earth, the method comprising:
-
prompting a user for a general location; capturing a first utterance from the user; matching the first utterance against a place grammar to determine if the first utterance corresponds to a first feature from which the place grammar was derived, wherein the place grammar comprises an audio file associated with features that fall within the area defined by the tile; prompting a user for a specific location; capturing a second utterance from the user; selecting a portion of the place grammar such that the portion of the place grammar contains feature names of first level identifiers that exist geographically within the geographic area associated with the first feature; matching the second utterance against the portion of the place grammar to determine if the second utterance corresponds to a second feature from which the portion of the place grammar was derived; determining a coordinate location associated with the second feature; using the coordinate location as a basis for an index into a spatially indexed grammar; selecting an initial tile using the index; prompting the user for a point; capturing a third utterance from the user; matching the third utterance against a grammar associated with the initial tile to determine if the third utterance corresponds to a third feature from which the grammar associated with the initial tile was derived; if the third utterance does not correspond to the third feature determining a plurality of second tiles and comparing the third utterance against a plurality of grammars associated with the plurality of second tiles to determine if the third utterance corresponds to a fourth feature from which the plurality of grammars was derived; sending a text-based representation of the point to a geocoding module; and playing back information from the geocoding module associated with the text-based representation of the point.
-
-
39. A method for speech recognition using tiles, each tile defines an area on the surface of the earth and the defined area is a more or less symmetrical shape defined by a coordinate system and can have a spatially proximate relationship to other tiles, a grammar is associated with each tile and is derived from feature names of first level identifiers in the area defined by the tile and, each first level identifier corresponds to a point on the surface of the earth, the method comprising:
-
using a general and a specific location to obtain an initial tile; sending a grammar associated with the initial tile to speech recognition software of an interactive voice response system, wherein the place grammar comprises an audio file associated with features that fall within the area defined by the tile; matching a spoken utterance of a first level identifier with a feature in the grammar; determining if the spoken utterance matches the feature; if the spoken utterance does not correspond to the feature determining a plurality of second tiles and comparing the spoken utterance against a plurality of grammars associated with the plurality of second tiles to determine if the spoken utterance corresponds to a feature from which the plurality of grammars was derived; sending a text representation of the feature to a geocoding module to determine coordinates for a point matching the spoken utterance; and providing the coordinates for the point to a location based service.
-
Specification