×

Verifying relevance between keywords and web site contents

  • US 7,260,568 B2
  • Filed: 04/15/2004
  • Issued: 08/21/2007
  • Est. Priority Date: 04/15/2004
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for verifying relevance between terms and Web site contents, the method comprising:

  • retrieving site contents from a bid URL;

    formulating expanded term(s) comprising at least one of semantically or contextually related to bid term(s), which are mined from a search engine in view of high-frequency of occurrence historical query terms;

    generating content similarity and expanded similarity measurements from respective combinations of the bid term(s), the site contents, and the expanded terms, wherein the similarity measurements indicate relatedness between respective ones of the bid term(s), site contents, or expanded terms;

    calculating category similarity measurements between the expanded terms and the site contents in view of a similarity classifier, wherein the similarity classifier has been trained from mined web site content associated with directory data;

    calculating a confidence value from combined ones of multiple similarity measurements, wherein the combined ones comprise content, expanded, and category similarity measurements, wherein the confidence value provides objective measure of relevance between the bid term(s) and the site contents;

    analyzing the confidence value to identify the bid term(s); and

    using the bid term(s) identified to increase traffic to a site to obtain site exposure;

    wherein generating the category similarity measurements further comprises;

    extracting features from Web site content associated with the directory data, the features comprising a combination of at least one of title, metadata, body, hypertext link(s), visual feature(s), and summarization by page layout analysis information;

    reducing dimensionality of the features via feature selection;

    categorizing the features via a classifier model to generate the similarity classifier;

    generating respective term vectors from the bid term(s), the site contents, and the expanded terms; and

    calculating similarity between the respective term vectors as a function of the similarity classifier to determine the category similarity measurements.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×