Systems and methods for detection and high-quality capture of documents on a cluttered tabletop with an automatically controlled camera
First Claim
1. A computer-implemented method being performed in a computerized system comprising a processing unit, a memory and a camera, the camera being mounted on a turret operatively coupled to the processing unit, the computer-implemented method comprising:
- a. scanning a surface with the camera to acquire a first plurality of images of the surface;
b. using the acquired first plurality of images of the surface to determine at least one location of a text on the surface;
c. capturing a second plurality of images using the camera based on the determined location of the text on the surface; and
d. extracting the text using the second captured plurality of images.
2 Assignments
0 Petitions
Accused Products
Abstract
Described are systems and methods for recognizing paper documents on a tabletop using an overhead camera mounted on pan-tilt servos. The described automated system first finds paper documents on a cluttered desk based on a text probability map, constructed using multiple images acquired at fixed grid positions, and then captures a sequence of high-resolution overlapping frames of the located document(s), which are then fused together and perspective-rectified, using computed homography, to reconstruct a high quality and fronto-parallel document image that is of sufficient quality required for optical character recognition. The extracted textual information may be used, for example, for indexing and search, document repository and/or language translation applications.
19 Citations
20 Claims
-
1. A computer-implemented method being performed in a computerized system comprising a processing unit, a memory and a camera, the camera being mounted on a turret operatively coupled to the processing unit, the computer-implemented method comprising:
-
a. scanning a surface with the camera to acquire a first plurality of images of the surface; b. using the acquired first plurality of images of the surface to determine at least one location of a text on the surface; c. capturing a second plurality of images using the camera based on the determined location of the text on the surface; and d. extracting the text using the second captured plurality of images. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A non-transitory computer-readable medium embodying a set of computer-executable instructions, which, when executed in a computerized system comprising a processing unit, a memory and a camera, the camera being mounted on a turret operatively coupled to the processing unit, cause the computerized system to perform a method comprising:
-
a. scanning a surface with the camera to acquire a first plurality of images of the surface; b. using the acquired first plurality of images of the surface to determine at least one location of a text on the surface; c. capturing a second plurality of images using the camera based on the determined location of the text on the surface; and d. extracting the text using the second captured plurality of images. - View Dependent Claims (17, 18, 19)
-
-
20. A computerized system comprising a processing unit, a memory and a camera, the camera being mounted on a turret operatively coupled to the processing unit, the memory storing a set of computer-executable instructions causing the computerized system to perform a method comprising:
-
a. scanning a surface with the camera to acquire a first plurality of images of the surface; b. using the acquired first plurality of images of the surface to determine at least one location of a text on the surface; c. capturing a second plurality of images using the camera based on the determined location of the text on the surface; and d. extracting the text using the second captured plurality of images.
-
Specification