×

Method and apparatus for building sales tools by mining data from websites

  • US 8,359,307 B2
  • Filed: 04/18/2011
  • Issued: 01/22/2013
  • Est. Priority Date: 12/23/2005
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for characterizing a plurality of extensible markup language documents, each of the plurality of extensible markup language documents comprising a link to another extensible markup language document of the plurality of extensible markup language documents, the method comprising:

  • traversing the plurality of extensible markup language documents by following each link of the plurality of extensible markup language documents;

    parsing the plurality of extensible markup language documents to determine a structural hierarchy of the plurality of extensible markup language documents;

    associating a task complexity with the plurality of extensible markup language documents based on the structural hierarchy, the associating comprising;

    extracting a plurality of blocks of information from the plurality of extensible markup language documents; and

    assigning a block of information in the plurality of blocks of information to a category in a plurality of categories, the plurality of categories comprising a task complexity category, the block of information comprising a value indicative of a number of extensible markup language documents associated with the plurality of extensible markup language documents and a value indicative of a number of links associated with the plurality of extensible markup language documents; and

    characterizing the plurality of extensible markup language documents based on the task complexity.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×