Automated method for extracting highlighted regions in scanned source
First Claim
1. A method for extracting highlighted regions in a scanned text document, comprising:
- converting a scanned text document into a highlighted region comprising a highlighted text;
extracting said highlighted text from said highlighted region; and
optically recognizing said highlighted text in order to recognize text extracted from said highlighted region of said scanned text document.
7 Assignments
0 Petitions
Accused Products
Abstract
An automated method for extracting highlighted regions in a scanned text documents includes color masking of highlight regions, extracting text from highlighted regions, recognizing the characters in extracted text optically and inserting the recognized characters to new document in order to easily identify highlighted text in scanned images. Using a two-layer multi-mask compression technology configured in a scanned export image path, edges and text regions can be extracted and together with the use of mask coordinates and associated mask colors, all highlighted texts can be easily identified and extracted. Optical Character Recognition (OCR) can then be utilized to appropriate summarization of different extracted highlighted texts.
-
Citations
20 Claims
-
1. A method for extracting highlighted regions in a scanned text document, comprising:
-
converting a scanned text document into a highlighted region comprising a highlighted text;
extracting said highlighted text from said highlighted region; and
optically recognizing said highlighted text in order to recognize text extracted from said highlighted region of said scanned text document. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 19)
-
-
10. A method for extracting highlighted regions in a scanned image document, comprising:
-
converting a scanned image document into a plurality of background regions and a plurality of mask regions;
analyzing said plurality of background regions utilizing at least one mask coordinate, wherein said plurality of background regions are located beneath at least one mask region among said plurality of mask regions; and
optically recognizing a highlighted text in said at least one mask region, if the said plurality of background regions beneath said at least one mask region comprises a uniform color.
-
-
11. A system for extracting highlighted regions in a scanned text document, comprising:
-
a module for converting a scanned text document into a highlighted region comprising a highlighted text;
a module for extracting said highlighted text from said highlighted region; and
a module for optically recognizing said highlighted text in order to recognize text extracted from said highlighted region of said scanned text document. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18)
-
-
20. A system for extracting highlighted regions in a scanned image document, comprising:
-
a module for converting a scanned image document into a plurality of background regions and a plurality of mask regions;
a module for analyzing said plurality of background regions utilizing at least one mask coordinate, wherein said plurality of background regions are located beneath at least one mask region among said plurality of mask regions; and
a module for optically recognizing a highlighted text in said at least one mask region, if the said plurality of background regions beneath said at least one mask region comprises a uniform color.
-
Specification