Method and system for extracting homoionym in network

Method and system for extracting homoionym in network

  • CN 101,226,532 B
  • Filed: 12/28/2007
  • Issued: 10/03/2012
  • Est. Priority Date: 12/28/2007
  • Status: Active Grant
First Claim
Patent Images

1. a method of on network, extracting near synonym is characterized in that, comprising:

  • Obtain the anchor text of each backward chaining on the webpage;

    Calculate the weight of said anchor text, remove the anchor text that weight is lower than default value;

    Wherein, for the backward chaining anchor text of subpage frame, said anchor text weight does not belong to number with father'"'"'s webpage in main territory and multiply by separately sum behind the weight coefficient respectively for belonging to number, this sub-pages with father'"'"'s webpage in main territory with this sub-pages;

    The anchor text is contrasted in twos, remove overlapping word respectively;

    The near synonym set formed in remaining word, extract near synonym based on said near synonym set;

    Wherein, if webpage A uses anchor text S linked web pages B, then webpage A is father'"'"'s webpage, and webpage B is a sub-pages, and link is forward chaining for webpage A, is backward chaining for webpage B.

View all claims
    ×
    ×

    Thank you for your feedback

    ×
    ×