System and method of managing documents
First Claim
1. A process for managing documents which comprises the steps of:
- recognizing text in each image;
extracting the text to form a text file;
verifying the text file which comprises the sub steps of generating an adjustable score threshold and scoring each text file to determine if the text file exceeds the score threshold; and
using the text file to form an inventory of every word;
wherein the process occurs without manual coding.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method of managing documents wherein after document preparation, documents may be scanned to form a digital document image. After optical character recognition, a compressed digital image file with a text layer is created so that a separate text file may be extracted from the document image and tethered together by a unique identifier. The compressed digital image file and its corresponding extracted text file may be sent to a server and where an inventory of each word of every document is created. The images and text inventory are then inserted into a database such that users manipulating the system may use Boolean searches and/or activate hyperlinks tethered to document images for the purposes of navigation or the creation of index entries that may contain additional information about the documents. In the preferred method, the system allows the management of a plurality of documents over a wide area network such as the Internet.
-
Citations
20 Claims
-
1. A process for managing documents which comprises the steps of:
-
recognizing text in each image;
extracting the text to form a text file;
verifying the text file which comprises the sub steps of generating an adjustable score threshold and scoring each text file to determine if the text file exceeds the score threshold; and
using the text file to form an inventory of every word;
wherein the process occurs without manual coding. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 18, 19, 20)
-
-
13. A system for managing documents comprising:
-
extraction software capable of extracting text from images to form text files;
verification software capable of scoring each text file to determine if the text file exceeds an adjustable score threshold;
text indexing software capable indexing each text file to form an inventory; and
a file server capable of containing each image and the text inventory.
-
-
17. A method of using a system for managing documents, wherein the system comprises at least one server containing a database having images and a text inventory created from extracted and verified text layer exceeding an adjustable scored threshold from each image accessible via a wide area network, the method which comprises the steps of:
-
accessing via the wide area network; and
searching the inventory via the wide area network.
-
Specification