SYSTEMS AND METHODS OF UTF-8 PATTERN MATCHING

  • US 20120254210A1
  • Filed: 03/27/2012
  • Published: 10/04/2012
  • Est. Priority Date: 03/28/2011
  • Status: Active Grant
First Claim

1. A method for case insensitive searching of a variable width encoded pattern in a block of text, the method comprising:

    (a) determining, by a device, for each character in a pattern for which to search for a match within a block of text, a corresponding lower case Unicode value, the pattern comprising variable-width encoded characters;

    (b) establishing, by the device, an index table of jump values for the pattern, the index table comprising a hash to each corresponding lower case Unicode value that identifies a number of byte lengths for the corresponding character;

    (c) jumping, by the device responsive to the index table of jump values, a pointer to the block of text to a pivot element in the block of text based on a byte length of the pattern and the byte length of a last character of the pattern; and

    (d) comparing, by the device, a lower case Unicode value of the pivot element to the corresponding lower case Unicode value of the character of the last character of the pattern.
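
The general shape of steps (a) through (d) resembles a Boyer-Moore-Horspool search keyed on lower-case code points. The following is a minimal Python sketch of that general technique, not the claimed method itself. It makes two simplifying assumptions that the claim does not: jump values are counted in characters rather than in bytes, and a character whose lower-case form expands to more than one code point is compared as-is. The names `lower_cp`, `build_jump_table`, and `find_ci` are hypothetical and chosen only for illustration.

```python
def lower_cp(ch: str) -> int:
    """Return the code point of the lower-case form of ch.

    Assumption: if lower-casing expands to several characters
    (for example U+0130), the original code point is used instead.
    """
    low = ch.lower()
    return ord(low) if len(low) == 1 else ord(ch)


def build_jump_table(pattern: str) -> dict:
    """Map each lower-case code point to a Horspool shift, in characters.

    The last pattern character is skipped so that its shift stays at the
    default (the full pattern length) unless it also occurs earlier.
    """
    m = len(pattern)
    return {lower_cp(ch): m - 1 - i for i, ch in enumerate(pattern[:-1])}


def find_ci(text: str, pattern: str) -> int:
    """Return the UTF-8 byte offset of the first case-insensitive match, or -1."""
    m = len(pattern)
    if m == 0:
        return 0
    pat = [lower_cp(ch) for ch in pattern]   # analogous to step (a)
    jumps = build_jump_table(pattern)        # analogous to step (b)
    i = m - 1                                # index of the pivot element
    while i < len(text):
        pivot = lower_cp(text[i])
        # Analogous to step (d): compare the pivot with the last pattern
        # character first, then verify the rest of the window.
        if pivot == pat[-1] and all(
            lower_cp(text[i - m + 1 + k]) == pat[k] for k in range(m - 1)
        ):
            # Convert the character index of the match to a byte offset.
            return len(text[: i - m + 1].encode("utf-8"))
        # Analogous to step (c): jump the pointer using the table.
        i += jumps.get(pivot, m)
    return -1
```

For example, `find_ci("Hello World", "WORLD")` returns `6`. A byte-oriented variant, closer to the wording of the claim, would store shifts as byte counts and would have to account for characters whose upper-case and lower-case forms differ in encoded length; that is left out here to keep the sketch short.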
