Systems and methods for data indexing and processing
First Claim
1. A method for indexing a document file comprising:
- receiving the document file, wherein the document file comprises a plurality of characters;
organizing the plurality of characters into an array of strings;
comparing a first set of strings from the array of strings against a comparison reference database comprising a plurality of records wherein each record comprises at least one data field element; and
responsive to at least a portion of the first set of strings exceeding a threshold match with at least a portion of a record in the comparison reference database, associating the document file with the record.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods are disclosed that allow for indexing, processing, or both of information from physical media or electronic media, which may be received from a plurality of sources. In embodiments, a document file may be matched using pattern matching methods and may include comparisons with a comparison reference database to improve or accelerate the indexing process. In embodiments, information may be presented to a user as potential matches thereby improving manual indexing processes. In embodiments, one or more additional actions may occur as part of the processing, including without limitation, association additional data with a document file, making observations from the document file, notifying individuals, creating composite messages, and billing events. In an embodiment, data from a document file may be associated with a key word, key phrase, or word frequency value that enables adaptive learning so that unindexed data may be automatically indexed based on user interaction history.
89 Citations
32 Claims
-
1. A method for indexing a document file comprising:
-
receiving the document file, wherein the document file comprises a plurality of characters;
organizing the plurality of characters into an array of strings;
comparing a first set of strings from the array of strings against a comparison reference database comprising a plurality of records wherein each record comprises at least one data field element; and
responsive to at least a portion of the first set of strings exceeding a threshold match with at least a portion of a record in the comparison reference database, associating the document file with the record. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A method for indexing a document file comprising:
-
receiving a document file, wherein the document file comprises a plurality of characters;
organizing the plurality of characters into an array of strings;
receiving at least a portion of a reference database from a client, wherein the reference database comprise a plurality of records wherein each record comprises at least one data field element;
comparing a first set of strings from the array of strings against a comparison reference database obtained from the reference database;
responsive to at least a portion of the first set of strings exceeding a threshold match with at least a portion of a record in the comparison reference database, generating a structure message that associates the document file with the record. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26, 27)
-
-
28. A system for indexing a document file comprising:
-
a communications services module coupled to receive a document file;
an extraction services module, communicatively coupled to the communications services module, that obtains from the document file a plurality of characters arranged into an array of strings;
indexing services module, communicatively coupled to the extraction services module, that compares a first set of strings from the array of strings against a comparison reference database comprising a plurality of records wherein each record comprises at least one data field element, and responsive to at least a portion of the first set of strings exceeding a threshold match with at least a portion of a record in the comparison reference database, associates the document file with the record. - View Dependent Claims (29, 30, 31, 32)
-
Specification