×

System and method for identifying website verticals

  • US 9,330,168 B1
  • Filed: 02/13/2014
  • Issued: 05/03/2016
  • Est. Priority Date: 02/19/2010
  • Status: Active Grant
First Claim
Patent Images

1. A method, comprising:

  • receiving, by at least one server communicatively coupled to a network, a list of a plurality of first keywords, the plurality of first keywords obtained by scraping each web page of a plurality of web pages in a target website, each of the plurality of web pages having at least one of the first keywords obtained therefrom;

    converting, by the at least one server, the list into a target vector representing the target website, the target vector comprising a plurality of elements each associated with a corresponding second keyword of a plurality of second keywords, the plurality of second keywords being selected from a corpus of websites, by;

    counting the number of times each second keyword of the plurality of second keywords appears in the list to produce a corresponding frequency of appearance of each second keyword in the target website; and

    storing, in each element of the plurality of elements, the corresponding frequency of appearance of the corresponding second keyword;

    comparing, by the at least one server, the target vector to a plurality of reference vectors each being assigned one or more categories of a category structure; and

    assigning, by the at least one server, the assigned one or more categories of the closest matching reference vector to the target website.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×