×

Method and system for generating web pages for topics unassociated with a dominant URL

  • US 8,799,260 B2
  • Filed: 12/17/2010
  • Issued: 08/05/2014
  • Est. Priority Date: 12/17/2010
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for identifying a query topic unassociated with a dominant URL (uniform resource locator), the method comprising:

  • receiving an identification of a set of keywords associated with a query topic;

    scanning a search log to identify search queries associated with the set of keywords;

    grouping the identified search queries into clusters, wherein each of the clusters is associated with at least one URL returned by a search engine when performing a search for information using one or more of the identified search queries;

    merging the clusters to generate an extended seed query string;

    determining whether the extended seed query string is associated with a dominant URL by calculating a URL dominance score for queries of the extended seed query; and

    if the calculated URL dominance score is within a threshold range, the extended seed query is unassociated with a dominant URL, generating a web page associated with the query topic, includingretrieving documents associated with queries of the extended seed query string;

    generating a compilation document that includes the retrieved documents;

    calculating term frequency-inverse document frequency (TF-IDF) scores for n-grams (w) of phrase consisting of n consecutive words within the compilation document; and

    outputting the n-grams (w) having TF-IDF scores above a predetermined threshold;

    wherein the TF-IDF score for the n-grams (w) is calculated by multiplying term frequency (TF) and inverse document frequency (IDF), where TF represents the number of times the n-gram (w) is contained within the document d, and IDF represents the total number of documents (by URL) divided by the number of documents that contain the n-gram.

View all claims
  • 9 Assignments
Timeline View
Assignment View
    ×
    ×