×

Construction of trainable semantic vectors and clustering, classification, and searching using trainable semantic vectors

  • US 7,444,356 B2
  • Filed: 04/14/2004
  • Issued: 10/28/2008
  • Est. Priority Date: 01/27/2000
  • Status: Expired due to Term
First Claim
Patent Images

1. A method of classifying new datasets within a predetermined number of categories based on assignment of a plurality of sample datasets to each category, the method comprising the computer-executed steps:

  • constructing a trainable semantic vector for each sample dataset relative to the predetermined categories in a multi-dimensional semantic space;

    constructing a trainable semantic vector for each category based on the trainable semantic vectors for the sample datasets;

    receiving a new dataset;

    constructing a trainable semantic vector for the new dataset;

    determining a distance between the trainable semantic vector for the new dataset and the trainable semantic vector of each category; and

    classifying the new dataset within the category whose trainable semantic vector has the shortest distance to the trainable semantic vector of the new dataset;

    wherein;

    the new data set or each of the sample data sets includes at least one data point;

    each data point corresponds to at least one of a word, a phrase, a sentence, a color, a typography, a punctuation, a picture, and a character string; and

    the trainable semantic vector for each sample data set or the new dataset is constructed by performing the steps of;

    for each data point, identifying a relationship between each data point and predetermined categories corresponding to dimensions in the semantic space;

    determining the significance of each data point with respect to the predetermined categories;

    constructing a trainable semantic vector for each data point, wherein each trainable semantic vector has dimensions equal to the number of predetermined categories and represents the relative strength of its corresponding data point with respect to each of the predetermined categories; and

    combining the trainable semantic vector for each of the at least one data point to form the semantic vector of the sample dataset or the new dataset.

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×