Method and system for determining user input based on gesture
3 Assignments
0 Petitions
Abstract
A waveguide apparatus includes a planar waveguide and at least one optical diffraction element (DOE) that provides a plurality of optical paths between an exterior and interior of the planar waveguide. A phase profile of the DOE may combine a linear diffraction grating with a circular lens, to shape a wave front and produce beams with desired focus. Waveguide apparatuses may be assembled to create multiple focal planes. The DOE may have a low diffraction efficiency, and planar waveguides may be transparent when viewed normally, allowing passage of light from an ambient environment (e.g., real world) useful in AR systems. Light may be returned for temporally sequential passes through the planar waveguide. The DOE(s) may be fixed or may have dynamically adjustable characteristics. An optical coupler system may couple images to the waveguide apparatus from a projector, for instance a biaxially scanning cantilevered optical fiber tip.
285 Citations
20 Claims
1. A method for determining a user input, comprising:
capturing, at one or more image capturing sensors, an image of a field of view of a user, the image comprising a gesture created by the user;
determining a sequence for a plurality of gesture analysis processes based in part or in whole upon computational resource utilization of the plurality of gesture analysis processes;
analyzing, at least by a microprocessor, the image to determine a set of candidates and to identify a set of points associated with the gesture;
removing at least one candidate from gesture recognition with at least a first gesture analysis process of the plurality of gesture analysis processes to reduce the set of candidates to a remaining set of one or more remaining candidates while skipping one or more remaining gesture analysis processes of the plurality of gesture analysis processes for the at least one candidate;
generating respective scoring values for the one or more remaining candidates based in part or in whole on matching results of the one or more remaining candidates with predetermined gestures in a database; and
determining a user input based at least in part on a recognized gesture that is recognized by at least a second gesture analysis process.
View Dependent Claims (2, 3, 4)
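The flow of claim 1 can be sketched in code: processes are sequenced by computational cost, the cheapest process prunes candidates early (pruned candidates skip all remaining processes), and survivors are scored against a database of predetermined gestures. Every function name, data shape, and the cost model below is an assumption for illustration; the claim does not specify any of them.

```python
# Hypothetical sketch of the claimed method. The dict-based "process" and
# "candidate" representations are invented for this example.

def order_by_cost(processes):
    """Determine a sequence for the processes based on their estimated
    computational resource utilization (cheapest first)."""
    return sorted(processes, key=lambda p: p["cost"])

def match_score(candidate, gesture):
    """Toy matching: fraction of the gesture's points found in the candidate."""
    shared = set(candidate["points"]) & set(gesture["points"])
    return len(shared) / max(len(gesture["points"]), 1)

def determine_input(candidates, processes, gesture_db):
    seq = order_by_cost(processes)
    first, rest = seq[0], seq[1:]
    # First (cheapest) process removes candidates; removed candidates
    # skip all of the remaining processes in `rest`.
    remaining = [c for c in candidates if first["accept"](c)]
    # Score each remaining candidate against the predetermined gestures.
    scores = {c["name"]: max(match_score(c, g) for g in gesture_db)
              for c in remaining}
    best = max(remaining, key=lambda c: scores[c["name"]])
    # A second gesture analysis process must also recognize the gesture.
    if all(p["accept"](best) for p in rest):
        return best["name"], scores[best["name"]]
    return None
```

The point of the sketch is the ordering step: because the processes run cheapest-first, most candidates are rejected before any expensive analysis executes.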
5. A system for determining a user input, comprising:
one or more image capturing sensors configured to capture an image of a field of view of a user, the image comprising a gesture created by the user;
at least one microprocessor configured to determine a sequence for a plurality of gesture analysis processes based in part or in whole upon computational resource utilization of the plurality of gesture analysis processes;
the at least one microprocessor further configured to analyze the image with at least one of the plurality of gesture analysis processes according to the sequence to determine a set of candidates and to identify a set of points associated with the gesture;
the at least one microprocessor further configured to remove at least one candidate from gesture recognition to reduce the set of candidates to a remaining set of one or more remaining candidates while skipping one or more remaining gesture analysis processes of the plurality of gesture analysis processes for the at least one candidate;
the at least one microprocessor further configured to generate respective scoring values for the one or more remaining candidates based in part or in whole on matching results of the one or more remaining candidates with predetermined gestures in a database; and
the at least one microprocessor further configured to determine a user input based at least in part on a recognized gesture of the one or more remaining candidates.
6. A computer program product comprising a non-transitory computer-usable storage medium storing thereupon executable code which, when executed by at least one microprocessor, causes the at least one microprocessor to perform a set of acts for determining a user input, the set of acts comprising:
capturing, at one or more image capturing sensors, an image of a field of view of a user, the image comprising a gesture created by the user;
determining a sequence for a plurality of gesture analysis processes based in part or in whole upon computational resource utilization of the plurality of gesture analysis processes;
analyzing, at least by a microprocessor, the image with at least one of the plurality of gesture analysis processes according to the sequence to determine a set of candidates and to identify a set of points associated with the gesture;
removing at least one candidate from gesture recognition to reduce the set of candidates to a remaining set of one or more remaining candidates while skipping one or more remaining gesture analysis processes of the plurality of gesture analysis processes for the at least one candidate;
generating respective scoring values for the one or more remaining candidates based in part or in whole on matching results of the one or more remaining candidates with predetermined gestures in a database; and
determining a user input based at least in part on a recognized gesture of the one or more remaining candidates.
7. A method of identifying a gesture, comprising:
capturing, at one or more image capturing sensors, a plurality of images of respective fields of view of a user;
determining a predetermined processing order for a plurality of gesture analysis processes based in part or in whole upon computational resource utilization of the plurality of gesture analysis processes;
analyzing, with at least one microprocessor, the plurality of images with at least one of the plurality of gesture analysis processes according to the predetermined processing order at least by performing a rejection cascade processing on a set of candidates to remove at least one candidate from the set of candidates for the plurality of images to generate a reduced set of one or more remaining candidates while skipping one or more gesture analysis processes based in part or in whole upon the predetermined processing order, the rejection cascade processing comprising:
a relatively less computationally intensive stage using relatively less expensive computations and configured to remove one or more candidates to transform the set of candidates into a reduced set of candidates; and
a later, more computationally intensive stage using relatively more expensive computations and configured to analyze the reduced set of candidates to determine one or more gestures from the plurality of images; and
identifying at least one gesture by performing at least a second gesture analysis process of the plurality of gesture analysis processes on the plurality of images.
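The rejection cascade of claim 7 can be sketched as a chain of predicates ordered from cheap to expensive: a candidate rejected at any stage skips all later, more expensive stages. The stage logic below (a bounding-box area test and a pretend template-matching score) is invented for illustration; the claim does not specify the stages.

```python
# Minimal rejection cascade sketch. Stages are assumed pre-sorted from
# least to most computationally intensive, each stage a boolean predicate.

def rejection_cascade(candidates, stages):
    """Apply stages in order; a candidate rejected at any stage is never
    passed to the later, more expensive stages."""
    remaining = list(candidates)
    for stage in stages:
        remaining = [c for c in remaining if stage(c)]
        if not remaining:
            break  # nothing left for the expensive stages to analyze
    return remaining

# Example stages (hypothetical): a cheap bounding-box area check, then a
# (pretend) expensive template-matching check.
cheap_stage = lambda c: c["bbox_area"] > 100
expensive_stage = lambda c: c["template_score"] > 0.8
```

The design mirrors classic cascade classifiers: because the cheap stage rejects most candidates, the expensive stage only ever sees the reduced set.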
8. A method of identifying a gesture, comprising:
capturing, at one or more image capturing sensors, a plurality of images of respective fields of view of a user;
generating a plurality of gesture candidates from the plurality of images at least by performing a depth segmentation analysis based in part or in whole upon depth data provided by the one or more image capturing sensors;
determining a sequence for a plurality of gesture analysis processes based in part or in whole upon computational resource utilization of the plurality of gesture analysis processes;
generating analysis data values corresponding to each of the plurality of gesture candidates;
sorting the plurality of gesture candidates based on the analysis data values;
eliminating, with at least a first gesture analysis process, one or more gesture candidates with analysis data values less than a threshold to generate a reduced set of gesture candidates while skipping one or more remaining gesture analysis processes of the plurality of gesture analysis processes; and
identifying at least one gesture candidate from the reduced set of gesture candidates as the gesture for interaction with at least a second gesture analysis process executing on a computing system.
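The sort-and-threshold pruning in claim 8 can be sketched directly: score every candidate, sort by the analysis data values, and eliminate those below a threshold so they skip the remaining analysis process. How the values are computed is not specified by the claim, so the scoring function here is a stand-in.

```python
# Hypothetical sketch of claim 8's pruning step. `score_fn` is an assumed
# stand-in for whatever produces the "analysis data values".

def prune_candidates(candidates, score_fn, threshold):
    """Sort candidates by their analysis data values (descending) and
    keep only those at or above the threshold."""
    scored = [(score_fn(c), c) for c in candidates]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    # Eliminated candidates skip the later gesture analysis processes.
    return [c for value, c in scored if value >= threshold]
```

Sorting first means the surviving set comes out ranked, so the later, more expensive analysis process can also stop early at the best-scoring candidate.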
9. A method for classifying a gesture, comprising:
capturing, at one or more image capturing sensors, an image of a field of view of a user;
determining a sequence for a plurality of gesture analysis processes based in part or in whole upon computational resource utilization of the plurality of gesture analysis processes;
reducing a set of gesture candidates into a reduced set of gesture candidates at least by removing one or more gesture candidates with at least a first gesture analysis process of the plurality of gesture analysis processes while skipping one or more remaining gesture analysis processes of the plurality of gesture analysis processes for the image;
performing, at least by a microprocessor operatively coupled to the one or more image capturing sensors, depth segmentation on the image at least by performing a line search with a series of lines on data in the image to generate a depth map;
analyzing the depth map using a classifier mechanism to identify a part of a hand corresponding to a point in the depth map;
skeletonizing the depth map into a skeletonized depth map based at least in part on an identification of the part of the hand; and
classifying the image as a gesture in the reduced set of gesture candidates with at least a second gesture analysis process of the plurality of gesture analysis processes based in part or in whole on the skeletonized depth map.
View Dependent Claims (10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
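The depth pipeline of claim 9 can be sketched in a greatly reduced form on a single scan line of a toy depth image: a line search segments foreground depth values, a trivial "classifier" labels the nearest point as a fingertip, and the "skeleton" keeps only the segment endpoints plus that labeled point. Everything below is a stand-in for the real depth-segmentation, classification, and skeletonization stages, whose details the claim does not give.

```python
# Hypothetical one-row sketch of the claim-9 stages. The sentinel value
# and the fingertip heuristic are assumptions for this example.

BACKGROUND = 255  # assumed sentinel for "no depth return"

def line_search_segment(depth_row):
    """Line search along one row: indices with foreground depth."""
    return [i for i, d in enumerate(depth_row) if d < BACKGROUND]

def classify_fingertip(depth_row, hand_indices):
    """Toy classifier: the closest (smallest-depth) hand point is
    labeled as the fingertip."""
    return min(hand_indices, key=lambda i: depth_row[i])

def skeletonize(hand_indices, fingertip):
    """Reduce the segment to its endpoints plus the identified part,
    standing in for a skeletonized depth map."""
    return sorted({hand_indices[0], hand_indices[-1], fingertip})
```

A real implementation would run the line search over a series of lines to build a 2-D depth map and skeletonize the whole hand; the one-row version only shows how the stages feed each other.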
Specification