×

CLASSIFYING FUNCTIONS OF WEB BLOCKS BASED ON LINGUISTIC FEATURES

  • US 20080270334A1
  • Filed: 04/30/2007
  • Published: 10/30/2008
  • Est. Priority Date: 04/30/2007
  • Status: Active Grant
First Claim
Patent Images

1. A method in a computing device for classifying a block of a document based on its function, the method comprising:

  • identifying (203) blocks of training documents;

    for each identified block,receiving (207) a classification label for the identified block indicating its function; and

    generating (206) a feature vector for the identified block, the feature vector including a linguistic feature;

    training a classifier using the feature vectors and classification labels; and

    classifying a block of a document based on its function by applying the trained classifier to a feature vector for the block.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×