Voice annotation of scanned images for portable scanning applications
First Claim
1. A scanning device comprising:
- an image pickup component wherein said image pickup component reads a plurality of sample points from a portion of a surface of a document and generates a value for each of said plurality of sample points;
a controller connected to said image pickup component wherein said controller receives and assembles said values into an image array;
a memory connected to said controller wherein said controller stores said image array as an image data file in said memory;
a voice pickup for capturing a user'"'"'s speech and converting said user'"'"'s speech to an incoming analog signal; and
an analog-to-digital converter connected to said voice pickup and connected to said controller, wherein said incoming analog signal from said voice pickup is converted to a digital signal and passed to said controller, wherein said controller stores said digital signal in said memory as a voice annotation file, and connects said voice annotation file and said image data file.
3 Assignments
0 Petitions
Accused Products
Abstract
A scanning device that scans a document and allows a user to voice annotate the scanned document by speaking into a voice pickup located in the device. The data from scanning is saved as an image data file in device memory. The device digitizes the speech input, compresses the speech, and saves the speech as a voice clip file in device memory. The device establishes a connection between the voice clip file and the image data file. When the user uploads the image data file to a host computer, the voice clip file is automatically transferred. When the user selects an image data file through a user interface, the voice clip is automatically played back or the user is notified of the voice clip. The play back process involves decompressing the speech and then reformatting the speech into a format which the host computer sound card can recognize.
-
Citations
19 Claims
-
1. A scanning device comprising:
-
an image pickup component wherein said image pickup component reads a plurality of sample points from a portion of a surface of a document and generates a value for each of said plurality of sample points;
a controller connected to said image pickup component wherein said controller receives and assembles said values into an image array;
a memory connected to said controller wherein said controller stores said image array as an image data file in said memory;
a voice pickup for capturing a user'"'"'s speech and converting said user'"'"'s speech to an incoming analog signal; and
an analog-to-digital converter connected to said voice pickup and connected to said controller, wherein said incoming analog signal from said voice pickup is converted to a digital signal and passed to said controller, wherein said controller stores said digital signal in said memory as a voice annotation file, and connects said voice annotation file and said image data file. - View Dependent Claims (2, 3, 4, 5, 6, 7)
a digital-to-analog converter connected to said controller, wherein said digital signal stored as said (voice annotation file is passed from said controller to said digital-to-analog converter and is converted to an outgoing analog signal; and
a speaker connected to said digital-to-analog converter, wherein said outgoing analog signal from said digital-to-analog converter is sent to said speaker for audio output.
-
-
3. A scanning device according to claim 1 further comprising:
a display connected to said controller, wherein said image array stored as said image data file is passed from said controller to said display wherein a visual representation of said portion of said surface of said document is displayed.
-
4. A scanning device according to claim 1 further comprising:
a host connection connected to said controller, wherein said image data file and said voice annotation file are uploaded through said host connection to a communications interface of a host computer.
-
5. A scanning device according to claim 4 further comprising:
-
a system bus connected to said communications interface;
a host memory connected to said system bus;
a storage device connected to said system bus;
a speaker/sound card connected to said system bus;
a key board connected to said system bus;
a mouse connected to said system bus;
a graphics display connected to said system bus; and
a processing element connected to said system bus wherein said processing element communicates with said communications interface, said host memory, said storage device, said speaker/sound card, and said display over said system bus.
-
-
6. A scanning device according to claim 5 further comprising:
file transfer software located in said host memory of said host computer wherein said file transfer software receives said uploaded image data file and said uploaded voice annotation file from said communications interface and saves said uploaded image data file and said uploaded voice annotation file to said storage device.
-
7. A scanning device according to claim 6 further comprising:
a user interface program in said host memory of said host computer wherein a user selects said uploaded image data file through said keyboard or through said mouse, and further wherein file conversion software located in said host memory converts said uploaded voice annotation file to an audio file format recognized by sound card driver software located in said host memory, wherein said sound card driver software converts said uploaded voice annotation file in said audio file format to an analog signal for audio output through said speaker/sound card, and said uploaded image data file is displayed on said graphics display as a visual representation of said portion of said surface of said document.
-
8. A method of scanning images comprising the steps of:
-
(a) scanning a portion of a surface of a document with an image pickup component of a scanning device wherein said image pickup component reads a plurality of sample points from said portion of said surface of said document;
(b) generating a grey scale value for each of said plurality of sample points;
(c) assembling each of said grey scale values into an image array;
(d) storing said image array as an image data file in a memory;
(e) capturing a user'"'"'s speech and converting said captured user'"'"'s speech to an incoming analog signal;
(f) converting said incoming analog signal to a digital signal;
(g) storing said digital signal into said memory as a voice annotation file; and
(h) connecting said image data file and said voice annotation file. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
(c) converting each of said grey scale values into a binary value and assembling each of said binary values into an image array.
-
-
10. A method of scanning images according to claim 8 wherein step (g) further comprises compressing said digital signal prior to storing as said voice annotation file.
-
11. A method of scanning images according to claim 8 wherein step (g) further comprises storing said digital signal as said voice annotation file in an industry standard audio file format.
-
12. A method of scanning images according to claim 8 wherein step (h) further comprises connecting said image data file and said voice annotation file by giving said image data file and said voice annotation file a same file name and giving said image data file a first file extension name and giving said voice annotation file a second file extension name.
-
13. A method of scanning images according to claim 8 further comprising the following step:
(i) repeating steps (a) through (h) for a portion of a surface of a next document.
-
14. A method of scanning images according to claim 8 further comprising the following steps:
-
(i) converting said digital signal stored as said voice annotation file to an outgoing analog signal; and
(j) sending said outgoing analog signal to a speaker for audio output.
-
-
15. A method of scanning images according to claim 8 further comprising the following step:
(i) displaying said image array stored as said image data file on a display wherein a visual representation of said portion of said surface of said document is displayed.
-
16. A method of scanning images according to claim 8 further comprising the following step:
-
(i) uploading said image data file and said voice annotation file stored in said memory through a host connection connected to said memory to a communication interface of a host computer, wherein said host connection communicates with said communication interface; and
(j) storing said uploaded image data file and said uploaded voice annotation file in a storage device connected to said host computer.
-
-
17. A method of scanning images according to claim 16 further comprising the following steps:
-
(k) receiving input into a user interface program to select said uploaded image data file from said storage device;
(l) decompressing said uploaded voice annotation file from said storage device;
(m) converting said decompressed uploaded voice annotation file to an audio file format;
(n) processing said converted decompressed uploaded voice annotation file in said audio file format in sound card drive software through a speaker/sound card for audio output; and
(o) repeating steps (k) through (n) for a next uploaded image data file.
-
-
18. A method of scanning images according to claim 17 wherein step (k) further comprises the following steps (k1) and (k2):
-
(k1) displaying on a graphics display in said host computer an indication that said uploaded image data file has a connected said uploaded voice annotation file; and
(k2) receiving input to select said uploaded voice annotation file for audio play back.
-
-
19. A method of scanning images according to claim 17 wherein step (n) further comprises the following steps (n1) and (n2):
-
(n1) selecting said uploaded image data file for display; and
(n2) displaying said uploaded image data file on said graphics display in said host computer.
-
Specification