
System and Method for Web Content Extraction

  • US 20120303636A1
  • Filed: 12/14/2009
  • Published: 11/29/2012
  • Est. Priority Date: 12/14/2009
  • Status: Active Grant
First Claim

1. A computer implemented method for Web content extraction, comprising:

  • extracting Web content in a Webpage by identifying paragraphs, one or more titles and one or more images in the Web content based on line-break node determination; and

  • outputting the Web content including the identified paragraphs, the one or more titles, and the one or more images.
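
To make the two claimed steps concrete, the following is a minimal illustrative sketch, not the patented implementation. The claim excerpt does not define how a line-break node is determined, so the sketch assumes that certain HTML tags (the hypothetical `LINE_BREAK_TAGS` set) act as line-break nodes, that heading and `title` tags mark titles, and that images are `img` elements with a `src` attribute. The names `ContentExtractor` and `extract_web_content` are likewise invented for illustration.

```python
from html.parser import HTMLParser

# Assumption: these tags count as "line-break nodes". The claim does not list them.
LINE_BREAK_TAGS = {"p", "div", "br", "li", "tr", "h1", "h2", "h3", "h4", "h5", "h6"}
# Assumption: text inside these tags is treated as a title.
TITLE_TAGS = {"title", "h1", "h2", "h3", "h4", "h5", "h6"}
# Text inside these tags is not visible content and is ignored.
SKIP_TAGS = {"script", "style"}


class ContentExtractor(HTMLParser):
    """Splits text at line-break nodes into paragraphs and titles; collects image URLs."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.paragraphs, self.titles, self.images = [], [], []
        self._buffer = []       # text gathered since the last line-break node
        self._title_depth = 0   # > 0 while inside a title tag
        self._skip_depth = 0    # > 0 while inside script/style

    def flush(self):
        # Normalize whitespace, then file the text as a title or a paragraph.
        text = " ".join("".join(self._buffer).split())
        self._buffer = []
        if text:
            (self.titles if self._title_depth > 0 else self.paragraphs).append(text)

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self._skip_depth += 1
        if tag in LINE_BREAK_TAGS or tag in TITLE_TAGS:
            self.flush()  # a line-break node ends the current text run
        if tag in TITLE_TAGS:
            self._title_depth += 1
        if tag == "img":
            src = dict(attrs).get("src")
            if src:
                self.images.append(src)

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1
        if tag in LINE_BREAK_TAGS or tag in TITLE_TAGS:
            self.flush()
        if tag in TITLE_TAGS and self._title_depth > 0:
            self._title_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self._buffer.append(data)


def extract_web_content(html):
    """Step 1: extract paragraphs, titles and images. Step 2: output them together."""
    parser = ContentExtractor()
    parser.feed(html)
    parser.close()
    parser.flush()  # emit any trailing text not followed by a line-break node
    return {
        "paragraphs": parser.paragraphs,
        "titles": parser.titles,
        "images": parser.images,
    }
```

Calling `extract_web_content(page_source)` on an HTML string returns a dictionary with the three kinds of content. A tag-based rule is only one possible reading of "line-break node determination"; a rendering-based approach could, for example, use computed CSS display values instead.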

  • 2 Assignments