Voice annotation of scanned images for portable scanning applications

US 6,226,422 B1
Filed: 02/19/1998
Issued: 05/01/2001
Est. Priority Date: 02/19/1998
Status: Expired due to Term

First Claim

Patent Images

1. A scanning device comprising:

an image pickup component wherein said image pickup component reads a plurality of sample points from a portion of a surface of a document and generates a value for each of said plurality of sample points;

a controller connected to said image pickup component wherein said controller receives and assembles said values into an image array;

a memory connected to said controller wherein said controller stores said image array as an image data file in said memory;

a voice pickup for capturing a user'"'"'s speech and converting said user'"'"'s speech to an incoming analog signal; and

an analog-to-digital converter connected to said voice pickup and connected to said controller, wherein said incoming analog signal from said voice pickup is converted to a digital signal and passed to said controller, wherein said controller stores said digital signal in said memory as a voice annotation file, and connects said voice annotation file and said image data file.

View all claims

3 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A scanning device that scans a document and allows a user to voice annotate the scanned document by speaking into a voice pickup located in the device. The data from scanning is saved as an image data file in device memory. The device digitizes the speech input, compresses the speech, and saves the speech as a voice clip file in device memory. The device establishes a connection between the voice clip file and the image data file. When the user uploads the image data file to a host computer, the voice clip file is automatically transferred. When the user selects an image data file through a user interface, the voice clip is automatically played back or the user is notified of the voice clip. The play back process involves decompressing the speech and then reformatting the speech into a format which the host computer sound card can recognize.

Citations

19 Claims

1. A scanning device comprising:
- an image pickup component wherein said image pickup component reads a plurality of sample points from a portion of a surface of a document and generates a value for each of said plurality of sample points;
  
  a controller connected to said image pickup component wherein said controller receives and assembles said values into an image array;
  
  a memory connected to said controller wherein said controller stores said image array as an image data file in said memory;
  
  a voice pickup for capturing a user'"'"'s speech and converting said user'"'"'s speech to an incoming analog signal; and
  
  an analog-to-digital converter connected to said voice pickup and connected to said controller, wherein said incoming analog signal from said voice pickup is converted to a digital signal and passed to said controller, wherein said controller stores said digital signal in said memory as a voice annotation file, and connects said voice annotation file and said image data file.
- View Dependent Claims (2, 3, 4, 5, 6, 7)
- - 2. A scanning device according to claim 1 further comprising:
3. A scanning device according to claim 1 further comprising:
- a display connected to said controller, wherein said image array stored as said image data file is passed from said controller to said display wherein a visual representation of said portion of said surface of said document is displayed.
4. A scanning device according to claim 1 further comprising:
- a host connection connected to said controller, wherein said image data file and said voice annotation file are uploaded through said host connection to a communications interface of a host computer.
5. A scanning device according to claim 4 further comprising:
- a system bus connected to said communications interface;
  
  a host memory connected to said system bus;
  
  a storage device connected to said system bus;
  
  a speaker/sound card connected to said system bus;
  
  a key board connected to said system bus;
  
  a mouse connected to said system bus;
  
  a graphics display connected to said system bus; and
  
  a processing element connected to said system bus wherein said processing element communicates with said communications interface, said host memory, said storage device, said speaker/sound card, and said display over said system bus.
6. A scanning device according to claim 5 further comprising:
- file transfer software located in said host memory of said host computer wherein said file transfer software receives said uploaded image data file and said uploaded voice annotation file from said communications interface and saves said uploaded image data file and said uploaded voice annotation file to said storage device.
7. A scanning device according to claim 6 further comprising:
- a user interface program in said host memory of said host computer wherein a user selects said uploaded image data file through said keyboard or through said mouse, and further wherein file conversion software located in said host memory converts said uploaded voice annotation file to an audio file format recognized by sound card driver software located in said host memory, wherein said sound card driver software converts said uploaded voice annotation file in said audio file format to an analog signal for audio output through said speaker/sound card, and said uploaded image data file is displayed on said graphics display as a visual representation of said portion of said surface of said document.

8. A method of scanning images comprising the steps of:
- (a) scanning a portion of a surface of a document with an image pickup component of a scanning device wherein said image pickup component reads a plurality of sample points from said portion of said surface of said document;
  
  (b) generating a grey scale value for each of said plurality of sample points;
  
  (c) assembling each of said grey scale values into an image array;
  
  (d) storing said image array as an image data file in a memory;
  
  (e) capturing a user'"'"'s speech and converting said captured user'"'"'s speech to an incoming analog signal;
  
  (f) converting said incoming analog signal to a digital signal;
  
  (g) storing said digital signal into said memory as a voice annotation file; and
  
  (h) connecting said image data file and said voice annotation file.
- View Dependent Claims (9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
- - 9. A method of scanning images according to claim 8 wherein step (c) is replaced by the following new step (c):
10. A method of scanning images according to claim 8 wherein step (g) further comprises compressing said digital signal prior to storing as said voice annotation file.
11. A method of scanning images according to claim 8 wherein step (g) further comprises storing said digital signal as said voice annotation file in an industry standard audio file format.
12. A method of scanning images according to claim 8 wherein step (h) further comprises connecting said image data file and said voice annotation file by giving said image data file and said voice annotation file a same file name and giving said image data file a first file extension name and giving said voice annotation file a second file extension name.
13. A method of scanning images according to claim 8 further comprising the following step:
- (i) repeating steps (a) through (h) for a portion of a surface of a next document.
14. A method of scanning images according to claim 8 further comprising the following steps:
- (i) converting said digital signal stored as said voice annotation file to an outgoing analog signal; and
  
  (j) sending said outgoing analog signal to a speaker for audio output.
15. A method of scanning images according to claim 8 further comprising the following step:
- (i) displaying said image array stored as said image data file on a display wherein a visual representation of said portion of said surface of said document is displayed.
16. A method of scanning images according to claim 8 further comprising the following step:
- (i) uploading said image data file and said voice annotation file stored in said memory through a host connection connected to said memory to a communication interface of a host computer, wherein said host connection communicates with said communication interface; and
  
  (j) storing said uploaded image data file and said uploaded voice annotation file in a storage device connected to said host computer.
17. A method of scanning images according to claim 16 further comprising the following steps:
- (k) receiving input into a user interface program to select said uploaded image data file from said storage device;
  
  (l) decompressing said uploaded voice annotation file from said storage device;
  
  (m) converting said decompressed uploaded voice annotation file to an audio file format;
  
  (n) processing said converted decompressed uploaded voice annotation file in said audio file format in sound card drive software through a speaker/sound card for audio output; and
  
  (o) repeating steps (k) through (n) for a next uploaded image data file.
18. A method of scanning images according to claim 17 wherein step (k) further comprises the following steps (k1) and (k2):
- (k1) displaying on a graphics display in said host computer an indication that said uploaded image data file has a connected said uploaded voice annotation file; and
  
  (k2) receiving input to select said uploaded voice annotation file for audio play back.
19. A method of scanning images according to claim 17 wherein step (n) further comprises the following steps (n1) and (n2):
- (n1) selecting said uploaded image data file for display; and
  
  (n2) displaying said uploaded image data file on said graphics display in said host computer.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Hewlett-Packard Development Company, L.P. (HP Inc.)
Original Assignee
Hewlett-Packard Company (HP Inc.)
Inventors
Oliver, Thomas C.
Primary Examiner(s)
Tadaton, Bijan
Assistant Examiner(s)
Do, Anh Hong

Application Number

US09/026,082
Time in Patent Office

1,167 Days
Field of Search

382/312, 382/313, 235/472.01, 386/95, 386/117, 375/202
US Class Current

382/313
CPC Class Codes

G10L 19/00   Speech or audio signals ana...

H04M 3/42204   Arrangements at the exchang...

H04N 1/00127   Connection or combination o...

H04N 1/00204   with a digital computer or ...

H04N 1/00326   with a data reading, recogn...

H04N 1/32122   in a separate device, e.g. ...

H04N 1/32128   attached to the image data,...

H04N 2201/0081   Image reader H04N2201/0091 ...

H04N 2201/3222   of processing required or p...

H04N 2201/3264   of sound signals

H04N 2201/3274   Storage or retrieval of pre...

H04N 2201/3278   Transmission

Voice annotation of scanned images for portable scanning applications

First Claim

3 Assignments

0 Petitions

Accused Products

Abstract

Citations

19 Claims

Specification

Solutions

Use Cases

Quick Links

Voice annotation of scanned images for portable scanning applications

First Claim

3 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

19 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links