Voice-output reading system with gesture-based navigation
First Claim
1. A method for electronically reading text under interactive control by a user, the method comprising:
- obtaining a first digital image of at least a portion of the text to be read;
performing symbology recognition on the first digital image;
capturing a temporal sequence of digital images of user-generated spatial configurations of at least one pointer;
determining a command signal from the temporal sequence of digital images;
choosing a subset of the recognized symbology to read at a spatial location on the text to be read on the basis of the determined command signal;
feeding back to the user the spatial location of the chosen subset to be read and the spatial location of at least one pointer relative to the recognized text, wherein the spatial location of the chosen subset and the pointer aids the user in selecting the text to be read; and
converting the chosen subset of recognized symbology into a humanly understandable version.
1 Assignment
0 Petitions
Accused Products
Abstract
An optical-input print reading device with voice output for people with impaired or no vision in which the user provides input to the system from hand gestures. Images of the text to be read, on which the user performs finger- and hand-based gestural commands, are input to a computer, which decodes the text images into their symbolic meanings through optical character recognition, and further tracks the location and movement of the hand and fingers in order to interpret the gestural movements into their command meaning. In order to allow the user to select text and align printed material, feedback is provided to the user through audible and tactile means. Through a speech synthesizer, the text is spoken audibly. For users with residual vision, visual feedback of magnified and image enhanced text is provided. Multiple cameras of the same or different field of view can improve performance. In addition, alternative device configurations allow portable operation, including the use of cameras located on worn platforms, such as eyeglasses, or on a fingertip system. The use of gestural commands is natural, allowing for rapid training and ease of use. The device also has application as an aid in learning to read, and for data input and image capture for home and business uses.
614 Citations
33 Claims
-
1. A method for electronically reading text under interactive control by a user, the method comprising:
-
obtaining a first digital image of at least a portion of the text to be read; performing symbology recognition on the first digital image; capturing a temporal sequence of digital images of user-generated spatial configurations of at least one pointer; determining a command signal from the temporal sequence of digital images; choosing a subset of the recognized symbology to read at a spatial location on the text to be read on the basis of the determined command signal; feeding back to the user the spatial location of the chosen subset to be read and the spatial location of at least one pointer relative to the recognized text, wherein the spatial location of the chosen subset and the pointer aids the user in selecting the text to be read; and converting the chosen subset of recognized symbology into a humanly understandable version. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30)
-
-
31. An electronic reading apparatus for converting text to spoken words for a user, comprising:
-
a pointer that is operated by the user to indicate commands, wherein commands are encoded in the location and movement of the pointer relative to the text; a digital imaging device that converts text to a digital imaging signal and which additionally captures a temporal sequence of digital images of the pointer, wherein the temporal sequence of images contains information about the location and movement of the pointer relative to the text; a character recognizer receptive of the digital imaging signal, the recognizer generating a recognized character signal comprising the symbolic identity of the recognized text and the location of the recognized text relative to the digital imaging signal; a pointer tracker that determines the pointer location and movement, the tracker generating a pointer location and movement signal relative to the text; a command interpreter receptive of the pointer location and movement signal and the recognized character signal, the interpreter generating a command signal indicating the text to be converted to spoken words; a feedback device receptive of the command signal, and providing feedback to the user about the location of the text to be converted to spoken words and the location of the pointer relative to recognized text; a controller receptive of the command signal and the recognized character signal, the controller generating an output signal representative of at least portions of the text recognized; and a transducer receptive of the output signal for converting the output signal to speech. - View Dependent Claims (32)
-
-
33. A method for electronically reading aloud text under interactive control by a user with a computer-based system, the method comprising:
-
obtaining a first image of at least a portion of the text and converting the image to a first signal representative thereof; capturing a temporal sequence of images of at least one pointer under the control of the user and converting the temporal sequence of images to a second signal representative thereof; supplying the first and second signals to the computer-based system; performing symbology recognition on the first signal; determining the position and movement of the pointer relative to the text from the first and second signals; choosing a subset of the recognized symbology to read aloud on the basis of the determined position and movement of the pointer relative to the text and generating a command signal; converting the chosen subset of recognized symbology into speech, additionally including providing feedback to the user of the particular subset of recognized symbology chosen and of the location of the pointer relative to the recognized symbology.
-
Specification