Systems and methods for processing data

  • US 9,501,455 B2
  • Filed: 06/30/2011
  • Issued: 11/22/2016
  • Est. Priority Date: 06/30/2011
  • Status: Active Grant
First Claim

1. A method for processing data, the method comprising:

    receiving, at a data processing tool, at least one data file including at least partially unstructured data from at least one data source, wherein the at least partially unstructured data includes actual data from a main application;

    processing, by a processor, the at least partially unstructured data to generate at least partially structured data that includes tagged data, wherein the tagged data includes a tag inserted to precede at least one identified term of interest, and wherein processing the at least partially unstructured data comprises at least one of:

    processing the at least partially unstructured data using an associative memory application that tags the at least one term of interest based on a generated identification score exceeding a predetermined threshold where the score is determined based on the number of matching terms between a segment of unstructured text and a segment of text in the associative memory application; and

    processing the at least partially unstructured data using a regular expression processing program;

    transmitting the at least one data file including the at least partially structured data to the main application;

    incorporating the at least partially structured data into the main application based at least in part on the tagged data, wherein incorporating the at least partially structured data comprises at least one of including and excluding data based on at least one of existence, content and type of a tag;

    displaying, at a user interface, the at least partially structured data, wherein at least partially structured data includes at least one segment of misidentified data that is at least one of incorrectly tagged and incorrectly not tagged;

    receiving, at the user interface, a user selection of at least one segment of misidentified data;

    updating the misidentified data to form re-identified data;

    updating the associative memory application to include the re-identified data that includes data that has been correctly tagged or correctly not tagged;

    receiving, at the data processing tool, text segments generated by parsing the at least partially unstructured data into discrete text segments;

    identifying one or more of the text segments as boilerplate data based on a comparison between the text segments and strings of text in a column incorporated in an associative memory application, wherein the text segments need not exactly match the strings of text in the associative memory application; and

    incorporating data including text segments parsed from the at least partially structured data into the main application, wherein the text identified as boilerplate data is excluded from the data incorporated into the main application.
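
The two tagging alternatives recited in the claim (score-based tagging against an associative memory, and regular expression processing) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration only: the dictionary `MEMORY` is a hypothetical stand-in for the associative memory application, the word-overlap `score` function is one possible reading of "number of matching terms", and the threshold value, tag strings, and function names are invented. It is not the patented implementation.

```python
import re

# Hypothetical stand-in for the associative memory: stored text segments
# mapped to the term of interest each one is known to contain.
MEMORY = {
    "replaced hydraulic pump on left wing": "hydraulic pump",
    "inspected fuel valve for leaks": "fuel valve",
}

THRESHOLD = 2  # assumed value; a term is tagged only if the score exceeds it


def score(segment: str, stored: str) -> int:
    """Count the distinct lowercase words shared by the two segments."""
    return len(set(segment.lower().split()) & set(stored.lower().split()))


def tag_with_memory(segment: str, tag: str = "<TERM>") -> str:
    """Insert `tag` before each stored term whose segment score exceeds THRESHOLD."""
    for stored, term in MEMORY.items():
        if score(segment, stored) > THRESHOLD:
            # The tag is inserted so that it precedes the term of interest.
            segment = re.sub(
                re.escape(term),
                lambda m: tag + m.group(0),
                segment,
                flags=re.IGNORECASE,
            )
    return segment


def tag_with_regex(segment: str, pattern: str, tag: str = "<PART>") -> str:
    """Insert `tag` before every match of a regular expression."""
    return re.sub(pattern, lambda m: tag + m.group(0), segment)


tagged = tag_with_memory("Mechanic replaced the hydraulic pump on wing")
untagged = tag_with_memory("pump failed")
part = tag_with_regex("Installed PN-12345 today", r"PN-\d+")
```

In this sketch the first segment shares five words with a stored segment, so the term is tagged, while "pump failed" shares only one word and is left unchanged.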

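The boilerplate step in the claim, where text segments "need not exactly match" the stored strings, can be sketched with an approximate string comparison. This is a hedged illustration only: the list `BOILERPLATE_COLUMN` is a hypothetical stand-in for the column in the associative memory application, the standard-library `difflib.SequenceMatcher` is swapped in as the similarity measure, and the 0.85 cutoff is an assumed value. The claim does not specify any of these.

```python
from difflib import SequenceMatcher

# Hypothetical stand-in for the column of known boilerplate strings.
BOILERPLATE_COLUMN = [
    "This message is confidential and intended only for the recipient.",
    "Please do not reply to this automated message.",
]


def is_boilerplate(segment: str, cutoff: float = 0.85) -> bool:
    """Return True if the segment approximately matches any known boilerplate string."""
    candidate = segment.strip().lower()
    return any(
        SequenceMatcher(None, candidate, known.lower()).ratio() >= cutoff
        for known in BOILERPLATE_COLUMN
    )


def incorporate(segments: list[str]) -> list[str]:
    """Keep only the segments that are not identified as boilerplate."""
    return [s for s in segments if not is_boilerplate(s)]


segments = [
    "Pump pressure dropped during taxi.",
    "Please do not reply to this automated mesage.",  # misspelled, still matches
]
kept = incorporate(segments)
```

Here the misspelled notice is close enough to a stored string to be excluded, while the maintenance note is kept for incorporation.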