Data extraction system, terminal, server, programs, and media for extracting data via a morphological analysis

  • US 8,321,198 B2
  • Filed: 10/27/2005
  • Issued: 11/27/2012
  • Est. Priority Date: 09/06/2005
  • Status: Active Grant
First Claim

1. A data extraction system for extracting and accumulating prescribed data from web pages on the web, the data extraction system comprising:

    a plurality of terminals; and

    a server connected to the plurality of terminals, wherein the server comprises:

    a receiver for receiving the prescribed data, the prescribed data being extracted by at least one of the plurality of terminals and being a phrase having at least one part of speech of a morpheme;

    a part-of-speech accumulator for accumulating the at least one part of speech of the morpheme;

    a data accumulator for accumulating the prescribed data extracted by the at least one of the plurality of terminals and received by the receiver with extracted data; and

    a verifier for verifying whether the prescribed data extracted by the at least one of the plurality of terminals and received by the receiver is already accumulated with the extracted data by the data accumulator, the data accumulator accumulating the prescribed data with the extracted data when the prescribed data is determined by the verifier to not be already accumulated with the extracted data, and

    wherein each terminal of the plurality of terminals comprises:

    a searcher for searching for one of the web pages on the web;

    a morphological analyzer for performing a morphological analysis on text data in the one of the web pages searched for by the searcher, the morphological analyzer receiving the at least one part of speech of the morpheme accumulated by the part-of-speech accumulator from the server in advance;

    an extractor for extracting, as the prescribed data and from the text data in the one of the web pages on which the morphological analyzer performed the morphological analysis, the phrase that has the at least one part of speech of the morpheme that is received from the server in advance;

    a sender for sending the prescribed data extracted by the extractor to the server; and

    an interface for receiving, from the server, the prescribed data only when the prescribed data is determined by the verifier to not be already accumulated with the extracted data by the data accumulator and after the accumulator accumulates the prescribed data with the extracted data and not when the prescribed data is determined by the verifier to be already accumulated with the extracted data; and

    a display for displaying the prescribed data on a display screen via the interface when the prescribed data is received from the server.
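
The data flow recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the class names `Server` and `Terminal`, the `TOY_LEXICON` table, and the `analyze` function are hypothetical stand-ins introduced here, web searching is replaced by passing page text directly, and each "phrase" is assumed to be a single whitespace-separated token. A real system would use an actual morphological analyzer and a network link between terminals and server.

```python
# Hypothetical toy lexicon standing in for a real morphological analyzer.
TOY_LEXICON = {
    "new": "adjective", "fast": "adjective",
    "phone": "noun", "camera": "noun",
    "is": "verb", "released": "verb",
}


def analyze(text):
    """Toy morphological analysis: return (morpheme, part_of_speech) pairs."""
    return [(tok, TOY_LEXICON.get(tok, "unknown")) for tok in text.lower().split()]


class Server:
    def __init__(self, parts_of_speech):
        self.parts_of_speech = set(parts_of_speech)  # part-of-speech accumulator
        self.extracted = set()                       # data accumulator

    def receive(self, phrase):
        """Verifier plus accumulator.

        Returns the phrase only when it was not yet accumulated (after
        storing it); returns None when it was already accumulated.
        """
        if phrase in self.extracted:
            return None
        self.extracted.add(phrase)
        return phrase


class Terminal:
    def __init__(self, server):
        self.server = server
        # Parts of speech are obtained from the server in advance.
        self.target_pos = set(server.parts_of_speech)
        self.displayed = []  # stands in for the display screen

    def process_page(self, text):
        """Analyze page text, extract matching phrases, send them to the server."""
        for morpheme, pos in analyze(text):
            if pos in self.target_pos:
                reply = self.server.receive(morpheme)  # sender + interface
                if reply is not None:
                    self.displayed.append(reply)       # display only new data


# Two terminals share one server; the second one does not re-display "phone".
server = Server(parts_of_speech={"noun"})
t1, t2 = Terminal(server), Terminal(server)
t1.process_page("new phone is released")
t2.process_page("phone camera is fast")
print(t1.displayed, t2.displayed)
```

In this sketch the duplicate check lives entirely on the server, so terminals need no knowledge of what other terminals have already extracted; that mirrors the claim's placement of the verifier on the server side.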
