Systems, methods, and computer program products for location salience modeling for multimodal search

US 8,700,655 B2
Filed: 11/08/2010
Issued: 04/15/2014
Est. Priority Date: 11/08/2010
Status: Active Grant



Abstract

Computational models of dialog context have often focused on unimodal spoken dialog or text, using the language itself as the primary locus of contextual information. But as spoken unimodal interaction is replaced by situated multimodal interaction on mobile platforms supporting a combination of spoken dialog with graphical interaction, touch-screen input, geolocation, and other non-linguistic contextual factors, a need arises for more sophisticated models of context that capture the influence of these factors on semantic interpretation and dialog flow. The systems, methods, and computer program products disclosed herein address this need. A method for multimodal search includes, in part, determining an intended location of a search query based upon information received from a remote mobile device that issued the search query.

11 Claims

1. A method, for performing a multimodal search, comprising:
- receiving, by a multimodal search system having a processor, from a remote mobile device, a query information package comprising:
  
  (I) a search query text string comprising a search topic component and a location component, the location component comprising a type of information selected from a first group consisting of:
  
  (a) precise location information;
  
  (b) ambiguous location information; and
  
  (c) no location information;
  
  (II) map state information comprising boundary information of a map displayed on the remote mobile device when the search query text string was issued and zoom-level information indicating a zoom level of the map when the search query text string was issued;
  
  (III) touch tracing input information when the search query text string was issued;
  
  (IV) prior query location information, if available;
  
  (V) history of recent map movements information; and
  
  (VI) history of recent utterances information;
  
  determining which type of information, of the first group, the location component includes;
  
  if it is determined that the location component comprises one or both of (b) ambiguous location information and (c) no location information:
  
  determining, by the multimodal search system, based at least partially upon a decision tree model trained prior to the multimodal search system receiving the query information package, a search location in which the multimodal search system should search for the search topic component, including determining the search location to be one of a second group consisting of:
  
  a first specific location determined using (II) the map state information when the search query text string was issued;
  
  a second specific location determined using (III) the touch tracing input information when the search query text string was issued;
  
  a third specific location determined using (IV) the prior query location information;
  
  a fourth specific location determined using (V) the history of recent map movements information; and
  
  a fifth specific location determined using (VI) the history of recent utterances information;
  
  determining, by the multimodal search system, a set of search results based upon the search location and the search topic component; and
  
  sending the set of search results to the remote mobile device;
  
  training the decision tree model, comprising:
  
  receiving, by the multimodal search system, a plurality of training query information packages from a training remote mobile device, each training query information package comprising:
  
  a training search query text string comprising a training search topic component and a training location component, the training location component comprising one of training ambiguous location information and no training location information;
  
  training map state information;
  
  training touch tracing input information;
  
  training prior query location information;
  
  training map manipulation information; and
  
  training geographic location information;
  
  instructing the training remote mobile device from which at least one of the plurality of training query information packages is received to provide a disambiguation interface configured to request an intended location of the training location component via a plurality of selectable options, the selectable options comprising a current location, a currently displayed location, a last spoken location, and a last touched location;
  
  receiving, by the multimodal search system, the intended location from the training remote mobile device from which at least one of the plurality of training query information packages is received as one of the current location, the currently displayed location, the last spoken location, and the last touched location;
  
  storing the intended locations in combination with the training map state information, the training touch tracing input information, the training prior query location information, the training map manipulation information, and the training geographic location information to create a decision tree instance for each of the intended locations;
  
  training, using the decision tree instances, the decision tree model; and
  
  performing, until a threshold value of decision tree instances is at least reached, operations to create the decision tree instances.
- View Dependent Claims (2, 3)
- - 2. The method of claim 1, wherein:
    - the set of search results is a first set of search results; and
      
      the method further comprises performing, in response to determining that the location component comprises (a) precise location information, operations comprising:
      
      determining, by the multimodal search system, a second set of search results based upon the precise location information and the search topic component; and
      
      sending the second set of search results to the remote mobile device.
  - 3. The method of claim 1, wherein:
    - the query information package further comprises (VII) information related to a map manipulation that occurred before the search query text string was issued and a time since the map manipulation occurred and (VIII) a geographic location of the remote mobile device when the search query text string was issued; and
      
      determining, by the multimodal search system, the search location in which the multimodal search system should search for the search topic component is further based upon at least one of (VII) and (VIII).
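
The query-time flow recited in claims 1 to 3 can be illustrated with a minimal Python sketch. It is not the patented implementation: the `QueryPackage` fields, the `DEICTIC` word list, and the fixed order of checks in `choose_search_location` are all assumptions made for illustration. In particular, the claims call for a trained decision tree model, whereas the hand-written cascade below only stands in for one.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional, Tuple

LatLon = Tuple[float, float]


class LocationType(Enum):
    PRECISE = "precise"      # (a) precise location information
    AMBIGUOUS = "ambiguous"  # (b) ambiguous location information
    NONE = "none"            # (c) no location information


@dataclass
class QueryPackage:
    """Hypothetical container for items (I) to (VI) of the query information package."""
    topic: str                                       # search topic component
    location_text: Optional[str]                     # location component, if any
    map_bounds: Tuple[float, float, float, float]    # (south, west, north, east)
    zoom_level: int
    touch_location: Optional[LatLon] = None
    prior_query_location: Optional[LatLon] = None
    recent_map_move_location: Optional[LatLon] = None
    recent_utterance_location: Optional[LatLon] = None


# Assumed word list; the claims do not define what counts as "ambiguous".
DEICTIC = {"here", "there", "nearby", "near me", "this area"}


def classify_location(text: Optional[str]) -> LocationType:
    """Decide which member of the first group the location component is."""
    if not text or not text.strip():
        return LocationType.NONE
    if text.strip().lower() in DEICTIC:
        return LocationType.AMBIGUOUS
    return LocationType.PRECISE


def map_center(bounds: Tuple[float, float, float, float]) -> LatLon:
    """Center point of the displayed map, derived from its boundary information."""
    south, west, north, east = bounds
    return ((south + north) / 2, (west + east) / 2)


def choose_search_location(pkg: QueryPackage):
    """Return (source, location). A precise location is used directly (claim 2)."""
    if classify_location(pkg.location_text) is LocationType.PRECISE:
        return ("precise", pkg.location_text)
    # Hand-written stand-in for the trained decision tree model (assumption).
    if pkg.touch_location is not None:
        return ("touch", pkg.touch_location)
    if pkg.recent_utterance_location is not None:
        return ("utterance", pkg.recent_utterance_location)
    if pkg.recent_map_move_location is not None:
        return ("map_move", pkg.recent_map_move_location)
    if pkg.prior_query_location is not None:
        return ("prior_query", pkg.prior_query_location)
    return ("map", map_center(pkg.map_bounds))
```

For a query such as "pizza here" with no touch, utterance, map-movement, or prior-query context, this sketch falls back to the center of the displayed map. A trained model would instead weigh the same signals according to what users actually intended in the training data.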

4. A non-transitory computer-readable storage medium comprising computer-executable instructions that, when executed by a processor, cause the processor to perform operations comprising:
- receiving, from a remote mobile device, a query information package comprising:
  
  (I) a search query text string comprising a search topic component and a location component, the location component comprising a type of information selected from a first group consisting of:
  
  (a) precise location information;
  
  (b) ambiguous location information; and
  
  (c) no location information;
  
  (II) map state information comprising boundary information of a map displayed on the remote mobile device when the search query text string was issued and zoom-level information indicating a zoom level of the map when the search query text string was issued;
  
  (III) touch tracing input information when the search query text string was issued;
  
  (IV) prior query location information, if available;
  
  (V) history of recent map movements information; and
  
  (VI) history of recent utterances information;
  
  determining which type of information, of the first group, the location component includes;
  
  if the processor determines that the location component comprises one or both of (b) ambiguous location information and (c) no location information:
  
  determining, based at least partially upon a decision tree model trained prior to receiving the query information package, a search location in which to search for the search topic component, including determining the search location to be one of a second group consisting of:
  
  a first specific location determined using (II) the map state information when the search query text string was issued;
  
  a second specific location determined using (III) the touch tracing input information when the search query text string was issued;
  
  a third specific location determined using (IV) the prior query location information;
  
  a fourth specific location determined using (V) the history of recent map movements information; and
  
  a fifth specific location determined using (VI) the history of recent utterances information;
  
  determining a set of search results based upon the search location and the search topic component; and
  
  sending the set of search results to the remote mobile device;
  
  training the decision tree model, comprising:
  
  receiving a plurality of training query information packages from a training remote mobile device, each training query information package comprising:
  
  a training search query text string comprising a training search topic component and a training location component, the training location component comprising one of training ambiguous location information and no training location information;
  
  training map state information;
  
  training touch tracing input information;
  
  training prior query location information;
  
  training map manipulation information; and
  
  training geographic location information;
  
  instructing the training remote mobile device from which at least one of the plurality of training query information packages is received to provide a disambiguation interface configured to request an intended location of the training location component via a plurality of selectable options, the selectable options comprising a current location, a currently displayed location, a last spoken location, and a last touched location;
  
  receiving the intended location from the training remote mobile device from which at least one of the plurality of training query information packages is received as one of the current location, the currently displayed location, the last spoken location, and the last touched location;
  
  storing the intended locations in combination with the training map state information, the training touch tracing input information, the training prior query location information, the training map manipulation information, and the training geographic location information to create a decision tree instance for each of the intended locations;
  
  training, using the decision tree instances, the decision tree model; and
  
  performing, until a threshold value of decision tree instances is at least reached, operations to create the decision tree instances.
- View Dependent Claims (5, 6)
- - 5. The non-transitory computer-readable storage medium of claim 4, wherein:
    - the set of search results is a first set of search results; and
      
      the operations further comprise, in response to determining that the location component comprises (a) the precise location information:
      
      determining a second set of search results based upon the precise location information and the search topic component; and
      
      sending the second set of search results to the remote mobile device.
  - 6. The non-transitory computer-readable storage medium of claim 4, wherein:
    - the query information package further comprises (VII) information related to a map manipulation that occurred before the search query text string was issued and a time since the map manipulation occurred and (VIII) a geographic location of the remote mobile device when the search query text string was issued; and
      
      the instructions that, when executed by the processor, cause the processor to determine the search location in which to search for the search topic component, cause the processor to determine the search location based upon at least one of (VII) and (VIII).
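
The training steps recited in the independent claims (collect training packages, ask the user which location was intended, store labeled instances until a threshold count is reached, then train) can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented training procedure: the feature dictionaries, the `ask_intended_location` callback standing in for the disambiguation interface, and the small Gini-based tree learner are all hypothetical.

```python
from collections import Counter

# The four selectable options named in the claims for the disambiguation interface.
OPTIONS = ("current", "displayed", "last_spoken", "last_touched")


def collect_instances(packages, ask_intended_location, threshold):
    """Create labeled instances until at least `threshold` of them exist."""
    instances = []
    for features in packages:
        if len(instances) >= threshold:
            break
        label = ask_intended_location(features)  # stand-in for the on-device prompt
        if label in OPTIONS:
            instances.append((features, label))
    return instances


def gini(labels):
    """Gini impurity of a list of labels."""
    total = len(labels)
    return 1.0 - sum((n / total) ** 2 for n in Counter(labels).values())


def train_tree(instances, depth=0, max_depth=3):
    """Tiny categorical decision tree learner (illustrative only)."""
    labels = [label for _, label in instances]
    majority = Counter(labels).most_common(1)[0][0]
    if depth == max_depth or len(set(labels)) == 1:
        return majority
    best = None
    for name in instances[0][0]:
        groups = {}
        for feats, label in instances:
            groups.setdefault(feats[name], []).append((feats, label))
        if len(groups) < 2:
            continue  # this feature does not split the data
        score = sum(len(g) / len(instances) * gini([l for _, l in g])
                    for g in groups.values())
        if best is None or score < best[0]:
            best = (score, name, groups)
    if best is None:
        return majority
    _, name, groups = best
    return {
        "feature": name,
        "default": majority,
        "children": {value: train_tree(group, depth + 1, max_depth)
                     for value, group in groups.items()},
    }


def predict(tree, feats):
    """Walk the tree; unseen feature values fall back to the node's majority label."""
    while isinstance(tree, dict):
        tree = tree["children"].get(feats[tree["feature"]], tree["default"])
    return tree
```

A short usage example with two assumed boolean features:

```python
def simulated_user(f):
    # Simulated answers from the disambiguation prompt (assumption).
    if f["touched_recently"]:
        return "last_touched"
    return "displayed" if f["map_moved_recently"] else "current"

packages = [
    {"touched_recently": True, "map_moved_recently": False},
    {"touched_recently": True, "map_moved_recently": True},
    {"touched_recently": False, "map_moved_recently": True},
    {"touched_recently": False, "map_moved_recently": False},
]
model = train_tree(collect_instances(packages, simulated_user, threshold=4))
print(predict(model, {"touched_recently": False, "map_moved_recently": True}))
```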

7. A multimodal search system comprising:
- a processor; and
  
  a memory, communicative with the processor, comprising computer-executable instructions that, when executed by the processor, cause the processor to perform operations comprising:
  
  receiving, from a remote mobile device, a query information package comprising:
  
  (I) a search query text string comprising a search topic component and a location component, the location component comprising a type of information selected from a first group consisting of:
  
  (a) precise location information;
  
  (b) ambiguous location information; and
  
  (c) no location information;
  
  (II) map state information comprising boundary information of a map displayed on the remote mobile device when the search query text string was issued and zoom-level information indicating a zoom level of the map when the search query text string was issued;
  
  (III) touch tracing input information when the search query text string was issued;
  
  (IV) prior query location information, if available;
  
  (V) history of recent map movements information; and
  
  (VI) history of recent utterances information;
  
  determining which type of information, of the first group, the location component includes;
  
  performing, if the processor determines that the location component comprises one or both of (b) ambiguous location information and (c) no location information, additional operations comprising:
  
  determining, based at least partially upon a decision tree model trained prior to receiving the query information package, a search location in which the multimodal search system should search for the search topic component, including determining the search location to be one of a second group consisting of:
  
  a first specific location determined using (II) the map state information when the search query text string was issued;
  
  a second specific location determined using (III) the touch tracing input information when the search query text string was issued;
  
  a third specific location determined using (IV) the prior query location information;
  
  a fourth specific location determined using (V) the history of recent map movements information; and
  
  a fifth specific location determined using (VI) the history of recent utterances information;
  
  determining a set of search results based upon the search location and the search topic component; and
  
  sending, via a communications interface, the set of search results to the remote mobile device;
  
  training the decision tree model, comprising:
  
  receiving a plurality of training query information packages from a training remote mobile device, each training query information package comprising:
  
  a training search query text string comprising a training search topic component and a training location component, the training location component comprising one of training ambiguous location information and no training location information;
  
  training map state information;
  
  training touch tracing input information;
  
  training prior query location information;
  
  training map manipulation information; and
  
  training geographic location information;
  
  instructing the training remote mobile device from which at least one of the plurality of training query information packages is received to provide a disambiguation interface configured to request an intended location of the training location component via a plurality of selectable options, the selectable options comprising a current location, a currently displayed location, a last spoken location, and a last touched location;
  
  receiving the intended location from the training remote mobile device from which at least one of the plurality of training query information packages is received as one of the current location, the currently displayed location, the last spoken location, and the last touched location;
  
  storing the intended locations in combination with the training map state information, the training touch tracing input information, the training prior query location information, the training map manipulation information, and the training geographic location information to create a decision tree instance for each of the intended locations;
  
  training, using the decision tree instances, the decision tree model; and
  
  performing, until a threshold value of decision tree instances is at least reached, operations to create the decision tree instances.
- View Dependent Claims (8, 9, 10, 11)
- - 8. The multimodal search system of claim 7, wherein:
    - the set of search results is a first set of search results; and
      
      the operations further comprise performing, in response to determining that the location component comprises (a) the precise location information, additional operations comprising:
      
      determining a second set of search results based upon the precise location information and the search topic component; and
      
      sending the second set of search results to the remote mobile device.
  - 9. The multimodal search system of claim 8, wherein:
    - the search query text string is received as speech audio or a combination of text and speech audio; and
      
      the system further comprises a speech recognition module configured to receive the speech audio and convert the speech audio to text.
  - 10. The multimodal search system of claim 8, further comprising a listings index comprising a plurality of listings from which the first set of search results and the second set of search results are determined.
  - 11. The multimodal search system of claim 7, wherein:
    - the query information package further comprises (VII) information related to a map manipulation that occurred before the search query text string was issued and a time since the map manipulation occurred and (VIII) a geographic location of the remote mobile device when the search query text string was issued; and
      
      the instructions that, when executed by the processor, cause the processor to determine the search location in which the multimodal search system should search for the search topic component, cause the processor to determine the search location based upon at least one of (VII) and (VIII).
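
Claim 10 adds a listings index from which the search results are determined, using the search location and the search topic component. A minimal sketch of that lookup is shown below. The listing fields (`name`, `category`, `coords`), the substring topic match, and the 5 km default radius are assumptions for illustration; the claims do not specify how listings are matched or ranked.

```python
import math

EARTH_RADIUS_KM = 6371.0


def haversine_km(a, b):
    """Great-circle distance in kilometers between two (lat, lon) points in degrees."""
    lat1, lon1, lat2, lon2 = map(math.radians, (a[0], a[1], b[0], b[1]))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(h))


def search_listings(index, topic, location, radius_km=5.0):
    """Return listings matching the topic within radius_km, nearest first."""
    topic = topic.lower()
    hits = []
    for listing in index:
        if topic not in listing["category"].lower():
            continue
        distance = haversine_km(location, listing["coords"])
        if distance <= radius_km:
            hits.append((distance, listing))
    return [listing for _, listing in sorted(hits, key=lambda pair: pair[0])]
```

The `location` argument is whichever search location was chosen upstream, whether a precise location from the query text or one inferred from map state, touch input, or dialog history.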

Current Assignee: Microsoft Technology Licensing LLC (Microsoft Corporation)
Original Assignee: AT&T Intellectual Property I LP (AT&T, Inc.)
Inventors: Johnston, Michael; Ehlen, Patrick
Primary Examiner(s): Uddin, Md. I
Application Number: US12/941,312
Publication Number: US 20120117112A1
Time in Patent Office: 1,254 Days
Field of Search: 707/706, 707/771, 707/918, 707/713, 707/718, 707/728, 707/731, 707/758, 707/769, 707/999.003
US Class Current: 707/769
CPC Class Codes:
G01C 21/3608 using speech input, e.g. us...
G06F 16/9537 Spatial or temporal depende...
