SEGREGATION OF HANDWRITTEN INFORMATION FROM TYPOGRAPHIC INFORMATION ON A DOCUMENT
First Claim
1. A method of segregating handwritten information from typographic information on a document, the method comprising:
- receiving an electronic document image of a document, the electronic document image comprising a plurality of pixels, wherein each of the plurality of pixels comprises a characteristic of a plurality of characteristics;
identifying a first, a second and a third most frequently occurring characteristic of the plurality of pixels, wherein the pixels of the plurality of pixels comprising the first most frequently occurring characteristic of the plurality of characteristics represent a background of the document;
determining typographic information of the document, wherein the typographic information is represented by the pixels of the plurality of pixels of the electronic document image which comprise the second most frequently occurring characteristic of the plurality of characteristics;
determining, by a processor, handwritten information of the document, wherein the handwritten information is represented by the pixels of the plurality of pixels of the electronic document image which comprise the third most frequently occurring characteristic of the plurality of characteristics; and
deriving a first representation of the handwritten information and a second representation of the typographic information.
2 Assignments
0 Petitions
Accused Products
Abstract
A system for segregating handwritten information from typographic information on a document may include a memory, an interface, and a processor. The memory stores an electronic document image of a document where the electronic document image includes pixels and each pixel has a characteristic. The processor may receive, via the interface, the electronic document image and may identify first, second and third most frequently occurring characteristics of the pixels of the electronic document image. The pixels having the first most frequently occurring characteristic represent a background of the document. The processor may determine the typographic information of the document as represented by pixels having the second most frequently occurring characteristic. The processor may determine the handwritten information of the document as represented by pixels having the third most frequently occurring characteristic. The processor may derive a first representation of the handwritten information and a second representation of the typographic information.
79 Citations
45 Claims
-
1. A method of segregating handwritten information from typographic information on a document, the method comprising:
-
receiving an electronic document image of a document, the electronic document image comprising a plurality of pixels, wherein each of the plurality of pixels comprises a characteristic of a plurality of characteristics; identifying a first, a second and a third most frequently occurring characteristic of the plurality of pixels, wherein the pixels of the plurality of pixels comprising the first most frequently occurring characteristic of the plurality of characteristics represent a background of the document; determining typographic information of the document, wherein the typographic information is represented by the pixels of the plurality of pixels of the electronic document image which comprise the second most frequently occurring characteristic of the plurality of characteristics; determining, by a processor, handwritten information of the document, wherein the handwritten information is represented by the pixels of the plurality of pixels of the electronic document image which comprise the third most frequently occurring characteristic of the plurality of characteristics; and deriving a first representation of the handwritten information and a second representation of the typographic information. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A method of segregating handwritten information from typographic information on a document, the method comprising:
-
receiving an electronic document image of a document, the electronic document image comprising a plurality of pixels, wherein each of the plurality of pixels comprises a characteristic of a plurality of characteristics; determining a document type of the electronic document image, wherein the document type is indicative of a first characteristic of the plurality of pixels associated with typographic information and a second characteristic of the plurality of pixels associated with handwritten information; determining typographic information of the document, wherein the typographic information is represented by the pixels of the plurality of pixels of the electronic document image which comprise the first characteristic; determining, by a processor, handwritten information of the document, wherein the handwritten information is represented by the pixels of the plurality of pixels of the electronic document image which comprise the second characteristic; and deriving a representation of the handwritten information and the typographic information.
-
-
17. A method of separating a first information applied to a medium at a first date/time from a second information applied to the medium at a second date/time, the method comprising:
-
receiving an electronic image of a medium, the medium comprising a first information applied at a first date/time and a second information applied at a second date/time, wherein the electronic image comprises a plurality of pixels and each pixel of the plurality of pixels comprises a characteristic of a plurality of characteristics; determining the first information applied to the medium, wherein the first information applied to the medium comprises the pixels of the plurality of pixels of the electronic image which comprise a second most frequently occurring characteristic of the plurality of characteristics; determining, by a processor, the second data applied to the medium, wherein the second data applied to the medium comprises pixels of the plurality of pixels of the electronic image which comprise a third most frequently occurring characteristic of the plurality of characteristics; and deriving a first representation of the first information and a second representation of the second image. - View Dependent Claims (18, 19)
-
-
20. A method for delineating regions of an electronic document image comprising handwritten information and typographic information, the method comprising:
-
(a) determining a top edge, a bottom edge, a right edge and a left edge of an electronic document image comprising typographic information and handwritten information, wherein the edges are determined based on an orientation of the typographic information; (b) identifying an upper-left most pixel of a plurality of pixels corresponding to the handwritten information, wherein the upper-left most pixel comprises a pixel closest to the top edge and closest to the left edge of the electronic document image; (c) determining a top bound of a region of the electronic document image based on a top line running through the upper-left most pixel, wherein the top line is parallel to the top edge and the bottom edge of the electronic document image; (d) determining a bottom pixel of the plurality of pixels corresponding to the handwritten information wherein the bottom pixel is located a height number of pixels below the upper-left most pixel; (e) determining a bottom bound of the region of the electronic document image based on a bottom line running through the bottom pixel, wherein the bottom line is parallel to the top edge and the bottom edge of the electronic document image; (f) determining a leftmost pixel of the plurality of pixels corresponding to the handwritten information located within the top bound and the bottom bound, wherein the leftmost pixel is located at the left edge of the electronic document image or the leftmost pixel comprises a closet pixel to the left edge of the electronic document image which is located within top bound and the bottom bound and has no other pixels within a buffer number of pixels to the left; (g) determining a left bound of the region of the electronic document image based on a left line running through the leftmost pixel, wherein the left line is parallel to the left edge and the right edge of the electronic document image; (h) determining a rightmost pixel of the plurality of pixels corresponding to the handwritten information located within the top bound and the bottom bound, wherein the rightmost pixel is located at the right edge of the electronic document image or the rightmost pixel comprises a closest pixel to the right edge of the electronic document image which is located within the top bound and the bottom bound and has no other pixels with the buffer number of pixels to the right; (i) determining a right bound of the region of the electronic document image based on a right line running through the rightmost pixel, wherein the right line is parallel to the left edge and the right edge of the electronic document image; (j) determining the bottom pixel of the plurality of pixels corresponding to the handwritten information which is located within the left bound and the right bound, wherein the bottom pixel is located at the bottom edge of the electronic document image or the bottom pixel is a closest pixel to the bottom edge of the electronic document image with no other pixels within the buffer number of pixels below; (k) determining the bottom bound of the region of the electronic document image based on a bottom line running through the bottom pixel, wherein the bottom line is parallel to the top edge and the bottom edge of the electronic document image; (l) repeating, by a processor, steps (d)-(k) using the bottom bound determined in step (k), the left bound determined in step (g), and the right bound determined in step (i) until the bottom bound, left bound and right bound remain constant; and (m) determining the region of the electronic document image based on an area enclosed by the top bound, the bottom bound, the left bound and the right bound. - View Dependent Claims (21, 22)
-
-
23. A method for classifying handwritten information in an electronic document image, the method comprising:
-
receiving an electronic document image, wherein the electronic document image comprises typographic information and handwritten information; determining a plurality of characters corresponding to the handwritten information; determining a document type corresponding to the typographic information; determining a location of the handwritten information within the electronic document image relative to the typographic information; determining, by a processor, a data field corresponding to the handwritten information based on the document type corresponding to the typographic information and the location of the handwritten information within the electronic document image relative to the typographic information; and storing the plurality of characters in a database record corresponding to the data field. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30, 31, 32)
-
-
33. A system for classifying handwritten information in an electronic document image, the system comprising:
-
means for receiving an electronic document image, wherein the electronic document image comprises typographic information and handwritten information; means for determining a plurality of characters corresponding to the handwritten information; means for determining a document type corresponding to the typographic information; means for determining a location of the handwritten information within the electronic document image relative to the typographic information; means for determining, by a processor, a data field corresponding to the handwritten information based on the document type corresponding to the typographic information and the location of the handwritten information within the electronic document image relative to the typographic information; and means for storing the plurality of characters in a database record corresponding to the data field.
-
-
34. A system for segregating handwritten information from typographic information on a document, the system comprising:
-
a memory operative to store an electronic document image of a document, the electronic document image comprising a plurality of pixels, wherein each of the plurality of pixels comprises a characteristic of a plurality of characteristics; an interface coupled with the memory and operative to receive the electronic document image; and a processor coupled with the interface and operative to receive, via the interface, the electronic document image of the document, identify a first, a second, and a third most frequently occurring characteristic of the plurality of pixels, wherein the pixels of the plurality of pixels comprising the first most frequently occurring characteristic of the plurality of characteristics represent a background of the document, determine typographic information of the document, wherein the typographic information is represented by the pixels of the plurality of pixels of the electronic document image which comprise the second most frequently occurring characteristic of the plurality of characteristics, determine handwritten information of the document, wherein the handwritten information is represented by the pixels of the plurality of pixels of the electronic document image which comprise the third most frequently occurring characteristic of the plurality of characteristics, and derive a first representation of the handwritten information and a second representation of the typographic information. - View Dependent Claims (35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45)
-
Specification