Computer method and apparatus for extracting data from web pages

  • US 20020091688A1
  • Filed: 07/20/2001
  • Published: 07/11/2002
  • Est. Priority Date: 07/31/2000
  • Status: Active Grant
First Claim

1. A method for extracting data from a Web page comprising the computer-implemented steps of:

  • using natural language processing, finding possible formal names on a given Web page, the step of finding producing a first found set of formal names;

  • searching the given Web page for formal names not found by the natural language processing step of finding, said searching producing a second set of formal names; and

  • refining a combined set of formal names formed of the first found set and the second set, said refining producing a working set of people and organization names extracted from the given Web page.
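
The three steps of the claim can be pictured as a small pipeline. The Python sketch below is illustrative only and is not taken from the patent: the function names, the title and suffix cue lists, and the regular expressions are all assumptions, and the first function is a crude placeholder where a real system would run a natural language processing name tagger. It also assumes the page has already been reduced to plain text.

```python
import re

# Hypothetical cue lists; the claim does not specify any of these.
ORG_SUFFIXES = ("Inc.", "Corp.", "LLC", "Ltd.")
TITLES = ("Mr.", "Ms.", "Dr.")


def find_names_nlp(text):
    """Step 1 placeholder: a real system would call an NLP name tagger here.

    This stand-in only picks runs of two or more capitalized words.
    """
    return set(re.findall(r"\b[A-Z][a-z]+(?: [A-Z][a-z]+)+\b", text))


def search_missed_names(text, already_found):
    """Step 2: search the page for names the first pass did not find."""
    found = set()
    # A word following a personal title is treated as a candidate person name.
    title_pat = "|".join(re.escape(t) for t in TITLES)
    for m in re.finditer(rf"(?:{title_pat}) ([A-Z][a-z]+)", text):
        found.add(m.group(1))
    # Capitalized words followed by a company suffix are a candidate organization.
    suffix_pat = "|".join(re.escape(s) for s in ORG_SUFFIXES)
    org_re = rf"[A-Z][A-Za-z]*(?: [A-Z][A-Za-z]*)* (?:{suffix_pat})"
    for m in re.finditer(org_re, text):
        found.add(m.group(0))
    return found - already_found


def refine(first_set, second_set):
    """Step 3: merge both sets, drop fragments, split people from organizations."""
    combined = first_set | second_set
    # Drop a name when a longer name in the combined set already contains it.
    kept = {n for n in combined if not any(n != o and n in o for o in combined)}
    orgs = sorted(n for n in kept if n.endswith(ORG_SUFFIXES))
    people = sorted(kept - set(orgs))
    return {"people": people, "organizations": orgs}


def extract_names(text):
    first = find_names_nlp(text)
    second = search_missed_names(text, first)
    return refine(first, second)


page_text = "Jane Smith joined IBM Corp. last year. She reports to Dr. Patel."
print(extract_names(page_text))
```

Keeping the two finders separate mirrors the claim's structure: the second pass only contributes names the first pass missed, and all merging and classification is deferred to the refining step.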
