×

Sniffing hypertext content to determine type

  • US 8,612,844 B1
  • Filed: 09/09/2005
  • Issued: 12/17/2013
  • Est. Priority Date: 09/09/2005
  • Status: Active Grant
First Claim
Patent Images

1. A method for determining a type of embedded content in a web page, the method comprising:

  • receiving web page content;

    parsing the received web page content;

    determining from the parsing that the web page content specifies embedded content to be retrieved;

    requesting the embedded content;

    receiving the embedded content and a response header;

    analyzing the received embedded content to determine a first type of the embedded content;

    analyzing the received response header to determine a second type of the embedded content; and

    responsive to one of the first type of the embedded content and the second type of the embedded content not being an excluded content type, determining a third type of the embedded content based on the first type of the embedded content and based on the second type of the embedded content, wherein the third type of the embedded content is either the first type of the embedded content or the second type of the embedded content; and

    responsive to the first type of the embedded content and the second type of the embedded content being excluded content types, determining the third type of the embedded content based on a highest score of a plurality of generated scores for a plurality of possible content types, the plurality of possible content types comprising the second type of the embedded content, the first type of the embedded content, and a content type associated with a file extension for the embedded content.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×