SYSTEM, DEVICE AND METHOD FOR PROCESSING INTERLACED MULTIMODAL USER INPUT

US 20150019227A1
Filed: 05/15/2013
Published: 01/15/2015
Est. Priority Date: 05/16/2012
Status: Active Grant

First Claim

Patent Images

1-35. -35. (canceled)

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A device, method and system are provided for interpreting and executing operations based on multimodal input received at a computing device. The multimodal input can include one or more verbal and non-verbal inputs, such as a combination of speech and gesture inputs received substantially concurrently via suitable user interface means provided on the computing device. One or more target objects is identified from the non-verbal input, and text is recognized from the verbal input. An interaction object is generated using the recognized text and identified target objects, and thus comprises a natural language expression with embedded target objects. The interaction object is then processed to identify one or more operations to be executed.

Citations

57 Claims

1-35. -35. (canceled)

36. A method implemented at a computing device, the method comprising:
- receiving verbal input using a verbal input interface of the computing device;
  
  receiving, concurrently with at least part of the verbal input, at least one secondary input using a non-verbal input interface of the computing device;
  
  identifying one or more target objects from the at least one secondary input;
  
  recognizing text from the received verbal input;
  
  generating an interaction object, the interaction object comprising a natural language expression having references to the one or more identified target objects embedded within the recognized text, the generating of the interaction object comprising identifying at least one attribute associated with each of the one or more identified target objects or at least one operation associated with each of the one or more identified target objects;
  
  processing the interaction object to identify at least one operation to be executed on at least one of the one or more identified target objects; and
  
  executing the operation on the at least one of the one or more identified target objects.
- View Dependent Claims (37, 38, 39, 40, 41, 42, 43, 44, 45, 46)
- - 37. The method of claim 36, wherein the one or more target objects are identified prior to completion of recognition of the text from the received verbal input.
  - 38. The method of claim 36, wherein each of the one or more identified target objects is associated with a metaobject defining the associated at least one attribute or at least one operation.
  - 39. The method of claim 36, wherein the processing the interaction object comprises correlating at least a part of the recognized text with at least one identified attribute of at least one of the one or more identified target objects.
  - 40. The method of claim 36, further comprising:
    - displaying a text or graphical representation of the interaction object for user confirmation prior to processing the interaction object;
      
      receiving an indication of an error in the text recognized from the received verbal input; and
      
      providing a selection of one or more options to correct the indicated error, the one or more options being determined from at least one attribute associated with the one or more identified target objects.
  - 41. The method of claim 36, further comprising sending the interaction object to a further computing device for processing.
  - 42. The method of claim 36, wherein the at least one secondary input comprises a touch-based input.
  - 43. The method of claim 36, wherein the non-verbal input interface is selected from the group consisting of:
    - a kinetic input interface;
      
      an inertial input interface;
      
      a perceptual input interface;
      
      a touch input interface; and
      
      a sensor input interface.
  - 44. The method of claim 36, wherein the verbal input comprises speech input.
  - 45. The method of claim 36, wherein the verbal input comprises text input.
  - 46. The method of claim 36, wherein the interaction object comprises a plurality of operations to be executed on the at least one of the one or more identified target objects, the method further comprising:
    - executing a first one of the plurality of operations on the at least one of the one or more identified target objects while buffering remaining ones of the plurality of operations; and
      
      sequentially executing the remaining ones of the plurality of operations after execution of the first one of the plurality of operations.

47. A computing device, comprising:
- at least one verbal input interface;
  
  at least one non-verbal input interface;
  
  at least one processor in communication with the at least one verbal input interface and the at least one non-verbal input interface, the at least one processor being configured to;
  
  receive verbal input using the verbal input interface;
  
  receive, concurrently with at least part of the verbal input, at least one secondary input using the at least one non-verbal input interface;
  
  identify one or more target objects from the at least one secondary input;
  
  recognize text from the received verbal input;
  
  generate an interaction object, the interaction object comprising a natural language expression having references to the one or more identified target objects embedded within the recognized text, the generation of the interaction object comprising identification of at least one attribute associated with each of the one or more identified target objects or at least one operation associated with each of the one or more identified target objects;
  
  process the interaction object to identify at least one operation to be executed on at least one of the one or more identified target objects; and
  
  execute the operation on the at least one of the one or more identified target objects.
- View Dependent Claims (48, 49, 50, 51, 52, 53, 54, 55, 56, 57)
- - 48. The computing device of claim 47, wherein the one or more target objects are identified prior to completion of recognition of the text from the received verbal input.
  - 49. The computing device of claim 47, wherein each of the one or more identified target objects is associated with a metaobject defining the associated at least one attribute or at least one operation.
  - 50. The computing device of claim 47, wherein the at least one processor is configured to process the interaction object by correlating at least a part of the recognized text with at least one identified attribute of at least one of the one or more identified target objects.
  - 51. The computing device of claim 47, wherein the at least one processor is further configured to:
    - display a text or graphical representation of the interaction object for user confirmation on a display of the computing device, prior to processing the interaction object;
      
      receive an indication of an error in the text recognized from the received speech input; and
      
      provide a selection of one or more options to correct the indicated error, the one or more options being determined from at least one attribute associated with the one or more identified target objects.
  - 52. The computing device of claim 47, wherein the at least one processor is further configured to send the interaction object to a further computing device for processing.
  - 53. The computing device of claim 47, wherein the at least one secondary input comprises a touch-based input.
  - 54. The computing device of claim 47, wherein the non-verbal input interface is selected from the group consisting of:
    - a kinetic input interface;
      
      an inertial input interface;
      
      a perceptual input interface;
      
      a touch input interface; and
      
      a sensor input interface.
  - 55. The computing device of claim 47, wherein the verbal input comprises speech input.
  - 56. The computing device of claim 47, wherein the verbal input comprises text input.
  - 57. The computing device of claim 47, wherein the interaction object comprises a plurality of operations to be executed on the at least one of the one or more identified target objects, and the at least one processor is further configured to:
    - execute a first one of the plurality of operations on the at least one of the one or more identified target objects while buffering remaining ones of the plurality of operations; and
      
      sequentially execute the remaining ones of the plurality of operations after execution of the first one of the plurality of operations.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Xtreme Interactions Inc.
Original Assignee
Xtreme Interactions Inc.
Inventors
Anandarajah, Joe

Granted Patent

US 9,601,113 B2
Time in Patent Office

Days
Field of Search
US Class Current

704/257
CPC Class Codes

G06F 2203/0381   Multimodal input, i.e. inte...

G06F 2203/04806   Zoom, i.e. interaction tech...

G06F 3/038   Control and interface arran...

G06F 3/0481   based on specific propertie...

G06F 3/04845   for image manipulation, e.g...

G06F 3/04883   for inputting data by handw...

G06F 3/167   Audio in a user interface, ...

G10L 15/18   using natural language mode...

G10L 15/19   Grammatical context, e.g. d...

G10L 15/22   Procedures used during a sp...

G10L 17/22   Interactive procedures; Man...

SYSTEM, DEVICE AND METHOD FOR PROCESSING INTERLACED MULTIMODAL USER INPUT

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

Citations

57 Claims

Specification

Solutions

Use Cases

Quick Links

SYSTEM, DEVICE AND METHOD FOR PROCESSING INTERLACED MULTIMODAL USER INPUT

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

57 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links