System and method for initiating multi-modal speech recognition using a long-touch gesture

US 10,497,371 B2
Filed: 04/29/2019
Issued: 12/03/2019
Est. Priority Date: 10/31/2014
Status: Expired due to Fees

First Claim

Patent Images

1. A method comprising:

receiving a multi-modal input comprising speech and a single touch on a display, the single touch being at a single point;

identifying, based at least in part on a pronoun in the speech and based on the single touch, a first object;

identifying, based at least in part on the pronoun in the speech and based on the single touch, a second object; and

performing an action based on the speech and an association of the first object and the second object as identified by the single touch.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A system, method and computer-readable storage devices are disclosed for multi-modal interactions with a system via a long-touch gesture on a touch-sensitive display. A system operating per this disclosure can receive a multi-modal input comprising speech and a touch on a display, wherein the speech comprises a pronoun. When the touch on the display has a duration longer than a threshold duration, the system can identify an object within a threshold distance of the touch, associate the object with the pronoun in the speech, to yield an association, and perform an action based on the speech and the association.

Citations

20 Claims

1. A method comprising:
- receiving a multi-modal input comprising speech and a single touch on a display, the single touch being at a single point;
  
  identifying, based at least in part on a pronoun in the speech and based on the single touch, a first object;
  
  identifying, based at least in part on the pronoun in the speech and based on the single touch, a second object; and
  
  performing an action based on the speech and an association of the first object and the second object as identified by the single touch.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
- - 2. The method of claim 1, wherein identifying the pronoun in the speech is further based on the single touch being of a duration longer than a threshold duration.
  - 3. The method of claim 1, further comprising:
    - identifying, based at least in part on the pronoun in the speech, a first set of coordinates having a first meaning for the first object; and
      
      identifying, based at least in part on the pronoun in the speech, a second set of coordinates having a second meaning for the second object.
  - 4. The method of claim 1, wherein the pronoun comprises one of I, you, he, she, her, him, they, them, their, my, me, it, we, who, us, what, which, whose, whom, himself, herself, itself, myself, someone, anybody, anyone, ours, this, some, none, whichever, those, that, these, neither, nothing, one, each, everyone, everybody, everything, all, some, and most.
  - 5. The method of claim 1, wherein the pronoun is implied in the speech.
  - 6. The method of claim 2, wherein the threshold duration is based on a context for the single touch on the display.
  - 7. The method of claim 2, wherein the threshold duration is based on a recognition certainty of a command recognized in the speech.
  - 8. The method of claim 1, wherein the speech of the multi-modal input is received simultaneously with initiation of the single touch on the display.
  - 9. The method of claim 1, wherein the speech of the multi-modal input is received after a duration of the single touch on the display is determined to meet a press and hold threshold.

10. A system comprising:
- a processor; and
  
  a computer-readable storage medium having instructions stored which, when executed by the processor, cause the processor to perform operations comprising;
  
  receiving a multi-modal input comprising speech and a single touch on a display, the single touch being at a single point;
  
  identifying, based at least in part on a pronoun in the speech and based on the single touch, a first object;
  
  identifying, based at least in part on the pronoun in the speech and based on the single touch, a second object; and
  
  performing an action based on the speech and an association of the first object and the second object as identified by the single touch.
- View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
- - 11. The system of claim 10, wherein identifying the pronoun in the speech is further based on the single touch being of a duration longer than a threshold duration.
  - 12. The system of claim 10, wherein the computer-readable storage medium stores additional instructions stored which, when executed by the processor, cause the processor to perform operations further comprising:
    - identifying, based at least in part on the pronoun in the speech, a first set of coordinates having a first meaning for the first object; and
      
      identifying, based at least in part on the pronoun in the speech, a second set of coordinates having a second meaning for the second object.
  - 13. The system of claim 10, wherein the pronoun comprises one of I, you, he, she, her, him, they, them, their, my, me, it, we, who, us, what, which, whose, whom, himself, herself, itself, myself, someone, anybody, anyone, ours, this, some, none, whichever, those, that, these, neither, nothing, one, each, everyone, everybody, everything, all, some, and most.
  - 14. The system of claim 10, wherein the pronoun is implied in the speech.
  - 15. The system of claim 11, wherein the threshold duration is based on a context for the single touch on the display.
  - 16. The system of claim 11, wherein the threshold duration is based on a recognition certainty of a command recognized in the speech.
  - 17. The system of claim 10, wherein the speech of the multi-modal input is received simultaneously with initiation of the single touch on the display.
  - 18. The system of claim 10, wherein the speech of the multi-modal input is received after a duration of the single touch on the display is determined to meet a press and hold threshold.

19. A computer-readable storage device having instructions stored which, when executed by a computing device, cause the computing device to perform operations comprising:
- receiving a multi-modal input comprising speech and a single touch on a display, the single touch being at a single point;
  
  identifying, based at least in part on a pronoun in the speech and based on the single touch, a first object;
  
  identifying, based at least in part on the pronoun in the speech and based on the single touch, a second object; and
  
  performing an action based on the speech and an association of the first object and the second object as identified by the single touch.
- View Dependent Claims (20)
- - 20. The computer-readable storage device of claim 19, wherein identifying the pronoun in the speech is further based on the single touch being of a duration longer than a threshold duration.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
AT&T Intellectual Property I LP (AT&T, Inc.)
Original Assignee
AT&T Intellectual Property I LP (AT&T, Inc.)
Inventors
Vasilieff, Brant J., Ehlen, Patrick, Johnston, Michael J.
Primary Examiner(s)
Roberts, Shaun

Application Number

US16/397,374
Publication Number

US 20190251969A1
Time in Patent Office

218 Days
Field of Search

704270, 704275
US Class Current
CPC Class Codes

G06F 3/04842   Selection of displayed obje...

G06F 3/0488   using a touch-screen or dig...

G06F 3/167   Audio in a user interface, ...

G10L 15/22   Procedures used during a sp...

G10L 2015/223   Execution procedure of a sp...

G10L 2015/228   of application context

System and method for initiating multi-modal speech recognition using a long-touch gesture

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

Citations

20 Claims

Specification

Solutions

Use Cases

Quick Links

System and method for initiating multi-modal speech recognition using a long-touch gesture

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

20 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links