×

BAG-OF-REPEATS REPRESENTATION OF DOCUMENTS

  • US 20140229160A1
  • Filed: 02/12/2013
  • Published: 08/14/2014
  • Est. Priority Date: 02/12/2013
  • Status: Active Grant
First Claim
Patent Images

1. A system for representing a textual document based on the occurrence of repeats, comprising:

  • a sequence generator which defines a sequence representing words forming a collection of documents;

    a repeat calculator which identifies a set of repeats within the sequence, the set of repeats comprising subsequences of the sequence which each occur more than once;

    a representation generator which generates a representation for at least one document in the collection of documents based on occurrence, in the document, of repeats from the set of repeats; and

    a processor which implements the sequence generator, repeat calculator, and representation generator.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×