×

Financial event and relationship extraction

  • US 10,049,100 B2
  • Filed: 01/30/2009
  • Issued: 08/14/2018
  • Est. Priority Date: 01/30/2008
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method of identifying and extracting by a computer financial information from tables in documents, the method comprising:

  • automatically, without further intervention from a user, identifying by a computer a document from a set of documents retrieved by the computer from a document source database;

    screening the identified document by a support vector machine classifier to distinguish between tables and non-tables and identify one or more tables that contain a desired relation without performing a detailed extraction process;

    identifying within the identified document a table from a set of tables that contains at least one predetermined desired relation, wherein the at least one predetermined desired relation comprises a plurality of desired attributes and desired values;

    partitioning by the computer the identified table into a plurality of labels and one or more values, with one or more of the labels identified as a column label and one or more identified as a row label;

    determining by the computer a set of attribute-value pairs by associating each value of the one or more values partitioned from the identified table with a plurality of the labels, with an abstract table including the set of attribute-value pairs; and

    generating by the computer a set of data for inclusion into a database of financial information, the set of data generated for inclusion in the database of financial information based on the determined set of attribute-value pairs.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×