×

Systems and methods for parsing user-generated content to prevent attacks

  • US 9,098,722 B2
  • Filed: 03/15/2013
  • Issued: 08/04/2015
  • Est. Priority Date: 03/15/2013
  • Status: Active Grant
First Claim
Patent Images

1. A method for parsing a token stream symbolizing user generated content, using a computer implemented security system, the method comprising:

  • removing tokens, using a processor, from the token stream to generate a sanitized token stream, wherein the removal of tokens is performed by;

    iterating over the token stream while filtering for nodes that are hypertext markup language tags, and cross referencing the tag against a whitelist;

    if the tag is in the whitelist, then iterating through the attributes of the tag and cross referencing the attributes against the whitelist;

    iterating through protocol-based hypertext markup language attributes to identify a valid URL, and cross referencing the valid URL with the whitelist;

    iterating through cascade style sheet selectors within <

    style> and

    <

    link>

    tags and cross referencing the cascade style sheet selector with the whitelist;

    if the cascade style sheet selector is in the whitelist, then iterating through properties for the cascade style sheet selector in <

    style>

    /<

    link>

    tags or as “

    style”

    attributes on a specific hypertext markup language tag, and cross referencing the properties against the whitelist; and

    removing any token which is not found in the whitelist when cross-referenced.

View all claims
  • 10 Assignments
Timeline View
Assignment View
    ×
    ×