×

Method and system for processing structured data and unstructured data

  • US 7,668,849 B1
  • Filed: 12/09/2005
  • Issued: 02/23/2010
  • Est. Priority Date: 12/09/2005
  • Status: Active Grant
First Claim
Patent Images

1. A computer implemented method for processing data, the method comprising:

  • (a) executing, on a data processing, programming for providing a plurality of data processing modules, said data processing modules comprising;

    an e-mail capture and parsing engine for intercepting, copying, and processing e-mails transmitted from a gateway to an email server, said processing comprising dividing a copy of an email into sections including at least a header section and a body section, and dividing the header section into sections comprising one or more of sender email address, sender name, recipient email address, recipient name, summary of email contents, date the email was sent, or time the email was sent;

    an email reload engine for loading data associated with archived or stored emails;

    a web crawler engine for Internet crawling and capturing Internet web pages;

    a document gathering engine for capturing data from external sources comprising one or more of an application data warehouse, application server, or file system;

    one or more data staging areas for temporary storage of data associated with said email capture and parsing engine, said web crawler engine, and said document gathering engine;

    a text extraction and parsing engine for receiving data from said data staging areas, extracting structured data from unstructured data, and correlating extracted structured data and associated unstructured data to define a link between said structured data and said associated unstructured data;

    a data holding area for temporary storage of said structured data and said unstructured data from said text extraction and parsing engine;

    a data loading engine for loading and storing into a database management system said structured data and said unstructured data from said data holding area based on said link;

    an email account management engine for bypassing said text extraction and parsing engine and directly copying structured data from said email server to said database management system;

    (b) using said plurality of data processing modules, carrying out operations comprising;

    capturing unstructured data from an unstructured data source and structured data from a structured data source, the unstructured data being associated with the structured data, wherein the unstructured data source and the structured data source are each associated with at least an email, the email including a header;

    parsing the header into at least a sending email address, a receiving email address, a date and time of transmission associated with the email, and a carbon copy email address;

    evaluating the email using the email capture and parsing engine, wherein the email capture and parsing engine generates a summary of content associated with the email, the sending email address, the receiving email address, the date and the time of transmission associated with the email, the carbon copy email address, and a summary used to classify the email;

    correlating the unstructured data and the structured data to establish a link between the unstructured data and the structured data, wherein the link integrates the unstructured data and the structured data; and

    storing the unstructured data and the structured data in a first data structure, wherein the unstructured data is stored in an unstructured portion of the first data structure, wherein the structured data is stored in a structured portion of the first data structure, wherein the link is preserved in the first data structure, and wherein the storing of the unstructured data and the structured data enables access of the unstructured data and the structured data from the first data structure.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×