Please download the dossier by clicking on the dossier button x
×

Machine learning system for extracting structured records from web pages and other text sources

  • US 20060123000A1
  • Filed: 12/02/2005
  • Published: 06/08/2006
  • Est. Priority Date: 12/03/2004
  • Status: Abandoned Application
First Claim
Patent Images

1. A method for extracting a structured record from a document, said structured record including information related to a predetermined subject matter, said information to be organized into categories within said structured record, said method comprising the steps of:

  • identifying a span of text in said document according to criteria associated with said predetermined subject matter; and

    processing said span of text to extract at least one text element associated with at least one of said categories of said structured record from said document.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×