×

Recognizer of text-based work

  • US 7,356,188 B2
  • Filed: 04/24/2001
  • Issued: 04/08/2008
  • Est. Priority Date: 04/24/2001
  • Status: Expired due to Fees
First Claim
Patent Images

1. A computer-implemented method for hashing a body of text, the method comprising:

  • obtaining a body of text containing textual content in a computer-readable format, wherein the textual content of the obtained computer-readable formatted body of text is mutable via software tools for manipulation of textual content of bodies of text;

    filtering the textual content of the body of text to remove elements of the textual content, wherein the filtering act produces filtered subtext, which is a subset of the textual content of the body of text;

    formatting the filtered subtext into a defined image-based format, wherein the textual content of the defined image-based formatted filtered subtext is immutable via software tools for manipulation of the textual content of bodies of text;

    deriving a hash value representative of the textual content of the filtered subtext, perceptually distinct filtered subtexts having hash values that are substantially independent of each other, wherein the deriving comprises hashing the image-based formatted, filtered subtext resulting from the formatting,wherein the filtering further comprises removing superfluous elements from the textual content, thereby leaving a remaining textual content and re-arranging the remaining textual content into a canonical format.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×