×

Method and apparatus for template-based processing of electronic documents

  • US 8,521,757 B1
  • Filed: 09/26/2008
  • Issued: 08/27/2013
  • Est. Priority Date: 09/26/2008
  • Status: Active Grant
First Claim
Patent Images

1. A computer implemented method of processing electronic documents, comprising:

  • analyzing text content of the electronic documents to identify whether each of the electronic documents matches any of a plurality of predefined document templates, wherein one or more of the electronic documents conforms to a structure of at least one of the plurality of predefined document templates, and wherein the step of analyzing comprises executing at least one machine learning algorithm, the at least one machine learning algorithm trained using at least one sample electronic document having a predefined template;

    generating a template index that relates at least one of the electronic documents with at least one of the plurality of predefined document templates based at least in part upon an identified match between the at least one of the electronic documents and the at least one of the plurality of predefined document templates;

    generating a search query using at least one of the plurality of predefined document templates as at least one search parameter;

    searching an archive having the electronic documents using the template index to locate one or more of the electronic documents that match the at least one predefined document template of the search query; and

    providing access to the one or more of the electronic documents that match the at least one predefined document template of the search query.

View all claims
  • 7 Assignments
Timeline View
Assignment View
    ×
    ×