×

Computer method and apparatus for extracting data from web pages

  • US 20070027672A1
  • Filed: 05/18/2006
  • Published: 02/01/2007
  • Est. Priority Date: 07/31/2000
  • Status: Abandoned Application
First Claim
Patent Images

1. A method for extracting data from a Web page comprising the computer-implemented steps of:

  • using natural language processing, finding possible formal names on a given Web page, the step of finding producing a first found set of formal names;

    searching the given Web page for formal names not found by the natural language processing step of finding, said searching producing a second set of formal names; and

    refining a combined set of formal names formed of the first found set and the second set, said refining producing a working set of people and organization names extracted from the given Web page.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×