×

SEMIOTIC INDEXING OF DIGITAL RESOURCES

  • US 20130013603A1
  • Filed: 05/23/2012
  • Published: 01/10/2013
  • Est. Priority Date: 05/24/2011
  • Status: Active Grant
First Claim
Patent Images

1. A method of classifying a plurality of documents, comprising:

  • providing a first set of classification terms and a second set of classification terms, the second set of classification terms being different from the first set of classification terms;

    generating a first frequency array of a number of occurrences of each term from the first set of classification terms in each document;

    generating a second frequency array of a number of occurrences of each term from the second set of classification terms in each document;

    generating a first similarity matrix from the first frequency array;

    generating a second similarity matrix from the second frequency array;

    determining an entrywise combination of the first similarity matrix and the second similarity matrix; and

    clustering the plurality of documents based on the result of the entrywise combination.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×