VOICE-ENABLED DOCUMENTS FOR FACILITATING OPERATIONAL PROCEDURES
First Claim
1. A method of producing voice-enabled documents using a processor-based mobile computing system, including at least one processor and at least one non-transitory processor-readable medium communicatively coupled to the at least one processor, the method comprising:
- acquiring a digital image of a document;
parsing digital image data associated with the digital image into segments;
decoding text-containing segments of the image data to extract a number of text data objects;
accepting input interactively from a user;
identifying at least one of the extracted text data objects as user-selectable field;
displaying the image data on a display screen and visually emphasizing the user-selectable fields; and
for each user-selectable field;
transforming the text data object of the respective user-selectable field to an audio playback file, by the at least one processor;
storing the audio playback file to the at least one non-transitory processor-readable medium;
storing at least one voice command name for the respective user-selectable field to the at least one non-transitory processor-readable medium;
logically associating the at least one voice command name for the respective user-selectable field as a trigger with the audio playback file for the respective user-selectable field, by the at least one processor.
0 Assignments
0 Petitions
Accused Products
Abstract
A voice-enabled document system facilitates execution of service delivery operations by eliminating the need for manual or visual interaction during information retrieval by an operator. Access to voice-enabled documents can facilitate operations for mobile vendors, on-site or field-service repairs, medical service providers, food service providers, and the like. Service providers can access the voice-enabled documents by using a client device to retrieve the document, display it on a screen, and, via voice commands initiate playback of selected audio files containing information derived from text data objects selected from the document. Data structures that are components of a voice-enabled document include audio playback files and a logical association that links the audio playback files to user-selectable fields, and to a set of voice commands.
536 Citations
20 Claims
-
1. A method of producing voice-enabled documents using a processor-based mobile computing system, including at least one processor and at least one non-transitory processor-readable medium communicatively coupled to the at least one processor, the method comprising:
-
acquiring a digital image of a document; parsing digital image data associated with the digital image into segments; decoding text-containing segments of the image data to extract a number of text data objects; accepting input interactively from a user; identifying at least one of the extracted text data objects as user-selectable field; displaying the image data on a display screen and visually emphasizing the user-selectable fields; and for each user-selectable field; transforming the text data object of the respective user-selectable field to an audio playback file, by the at least one processor; storing the audio playback file to the at least one non-transitory processor-readable medium; storing at least one voice command name for the respective user-selectable field to the at least one non-transitory processor-readable medium; logically associating the at least one voice command name for the respective user-selectable field as a trigger with the audio playback file for the respective user-selectable field, by the at least one processor. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method of accessing information in a voice-enabled document, using a processor-based system, including at least one processor and at least one non-transitory processor-readable medium communicatively coupled to the at least one processor, the method comprising:
-
causing an image of at least a part of a digital image of the voice-enabled document to appear on a display screen, the voice-enabled document including a number of user-selectable fields; receiving a voice command input by the at least one processor, the voice command input is indicative of a selection of one of the user-selectable fields; and initiating a playback of an audio playback file logically associated with the selected user-selectable field, by the at least one processor. - View Dependent Claims (7, 8, 9, 10, 11)
-
-
12. A system for producing voice-enabled documents, the system comprising:
-
a non-transitory processor-readable medium comprising data structures associated with voice-enabled electronic documents, wherein the data structures include; image data representing the voice-enabled electronic document for display on an electronic display screen; at least one voice command name associated with each of a plurality of embedded document fields; and a logical association between each voice command name and an audio data file, such that voice recognition of a voice command name triggers an audible presentation of the logically associated audio data file; a digital camera that captures an image of a document and stores associated image data in the non-transitory processor-readable medium; at least one processor programmed to extract text data objects from the image, and to produce corresponding audio data files for storage in the non-transitory processor-readable medium; a display that presents the text data objects as user-selectable fields; a microphone, the display being responsive to voice commands received via the microphone; at least one audio speaker that receives input from the audio player; and a logical association generator that logically assigns one or more voice command names to each user-selectable field, and further associates the voice command names with corresponding audio data files. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20)
-
Specification