Personalization engine for classifying unstructured documents

  • US 8,214,346 B2
  • Filed: 01/30/2009
  • Issued: 07/03/2012
  • Est. Priority Date: 06/27/2008
  • Status: Active Grant
First Claim

1. A computer-implemented method for classifying an electronic document, the method comprising:

    analyzing, with a computing device, author-generated classification information regarding a document and assigning a set of first taxonomic nouns to characterize the document based upon the author-generated classification information;

    examining, with a computing device, a user-generated tag from a client computer characterizing a portion of the document and assigning a set of second taxonomic nouns to characterize the document based upon the user-generated tag characterization;

    identifying, with a computing device, a method of access through which the document has been accessed from a content provider and assigning a set of third taxonomic nouns to characterize the document based upon the method of access;

    evaluating, with a computing device, attributes related to the method of access and assigning a set of fourth taxonomic nouns to characterize the document based upon the attributes related to the method of access;

    processing, with a computing device, the document to extract a set of fifth taxonomic nouns to characterize the document based upon a predetermined pattern rule;

    aggregating, with a computing device, the taxonomic nouns to determine at least one term vector that represents the document; and

    categorizing, with a computing device, the document based upon the taxonomic nouns, the author-generated classification information, and at least one of the term vectors.
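
The steps recited in claim 1 can be illustrated with a minimal Python sketch. Everything in it is hypothetical and chosen only for illustration: the `Document` fields, the `ACCESS_METHOD_NOUNS` lookup table, the capitalized-word regex standing in for the "predetermined pattern rule", and the overlap-based scoring in `categorize` are assumptions, not the implementation disclosed in the patent.

```python
import re
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class Document:
    text: str
    author_keywords: list[str]   # author-generated classification information
    user_tags: list[str]         # user-generated tags from a client computer
    access_method: str           # how the document was accessed, e.g. "rss"
    access_attributes: dict[str, str] = field(default_factory=dict)


# Hypothetical table mapping a method of access to taxonomic nouns.
ACCESS_METHOD_NOUNS = {"rss": ["feed"], "search": ["query"], "email": ["message"]}

# Hypothetical "predetermined pattern rule": capitalized words of 3+ letters.
PATTERN_RULE = re.compile(r"\b[A-Z][a-z]{2,}\b")


def normalize(terms):
    """Lowercase and strip terms, dropping empty ones."""
    return [t.strip().lower() for t in terms if t.strip()]


def term_vector(doc):
    """Aggregate the five sets of taxonomic nouns into one term-count vector."""
    first = normalize(doc.author_keywords)                # from author metadata
    second = normalize(doc.user_tags)                     # from user tags
    third = ACCESS_METHOD_NOUNS.get(doc.access_method, [])  # from access method
    fourth = normalize(doc.access_attributes.values())    # from access attributes
    fifth = normalize(PATTERN_RULE.findall(doc.text))     # from the pattern rule
    return Counter(first + second + third + fourth + fifth)


def categorize(doc, categories):
    """Pick the category whose nouns best match the term vector,
    with extra weight for matches on the author's own classification."""
    vector = term_vector(doc)
    author_terms = set(normalize(doc.author_keywords))

    def score(nouns):
        return sum(vector[n] for n in nouns) + len(author_terms & nouns)

    return max(categories, key=lambda name: score(categories[name]))


doc = Document(
    text="Stocks rallied as Markets reopened.",
    author_keywords=["Finance"],
    user_tags=["stocks"],
    access_method="rss",
    access_attributes={"section": "business"},
)
categories = {
    "finance": {"finance", "stocks", "markets"},
    "sports": {"football", "score"},
}
print(categorize(doc, categories))  # prints "finance"
```

In this sketch the term vector is a simple bag-of-words count, so a noun contributed by several sources (here "stocks", from both the user tag and the pattern rule) carries more weight. A real system could weight each source differently; the claim itself does not specify a weighting.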
