
Cross-language text classification

  • US 9,588,958 B2
  • Filed: 06/28/2012
  • Issued: 03/07/2017
  • Est. Priority Date: 10/10/2006
  • Status: Active Grant
First Claim

1. A method of performing text classification based on language-independent text features, the method comprising:

    performing, by a processor, a first syntactic and semantic analysis of a training natural language text to produce a first plurality of language-independent semantic structures representing a plurality of sentences of the training natural language text;

    producing, based on the first plurality of language-independent semantic structures, a text classifier model;

    performing a second syntactic and semantic analysis of an input natural language text to produce a second plurality of language-independent semantic structures representing a plurality of sentences of the input natural language text;

    extracting, using the second plurality of language-independent semantic structures, a set of features, wherein at least one feature references a semantic class of a language-independent semantic hierarchy comprising a plurality of semantic classes, in which the semantic class exhibits one or more properties inherited from its parent semantic class;

    applying the text classifier model to the set of features to produce a classification spectrum comprising a plurality of weight values, wherein each weight value reflects a degree of association of the input natural language text with a particular category of natural language texts; and

    associating the input natural language text with one or more categories using the classification spectrum.
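
The steps of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the syntactic and semantic analysis is replaced by a toy lexicon lookup, and every name here (`PARENT`, `LEXICON`, `analyze`, `extract_features`, `train`, `classify`, the `threshold` parameter) is a hypothetical chosen for illustration. The overlap-based scoring is likewise an assumed stand-in for whatever classifier model an actual system would use.

```python
from collections import Counter, defaultdict

# Hypothetical toy semantic hierarchy: child class -> parent class.
PARENT = {
    "DOG": "ANIMAL",
    "CAT": "ANIMAL",
    "ANIMAL": "ENTITY",
    "BANK": "ORGANIZATION",
    "LOAN": "FINANCE_CONCEPT",
    "ORGANIZATION": "ENTITY",
    "FINANCE_CONCEPT": "ENTITY",
}

# Hypothetical lexicon standing in for real syntactic and semantic analysis:
# words from different languages map to the same language-independent class.
LEXICON = {
    "dog": "DOG", "hund": "DOG",
    "cat": "CAT", "katze": "CAT",
    "bank": "BANK",
    "loan": "LOAN", "kredit": "LOAN",
}


def analyze(text):
    """Return one list of semantic classes per sentence (toy 'semantic structure')."""
    structures = []
    for sentence in text.lower().split("."):
        classes = [LEXICON[w] for w in sentence.split() if w in LEXICON]
        if classes:
            structures.append(classes)
    return structures


def extract_features(structures):
    """Count each semantic class together with all of its ancestors in the hierarchy."""
    feats = Counter()
    for classes in structures:
        for cls in classes:
            node = cls
            while node is not None:
                feats[node] += 1
                node = PARENT.get(node)
    return feats


def train(labeled_texts):
    """Build a per-category feature profile from (text, category) pairs."""
    model = defaultdict(Counter)
    for text, category in labeled_texts:
        model[category].update(extract_features(analyze(text)))
    return model


def classify(model, text, threshold=0.4):
    """Return (classification spectrum, categories whose weight meets the threshold)."""
    feats = extract_features(analyze(text))
    # Assumed scoring: feature-count overlap between the input and each category profile.
    raw = {
        cat: sum(min(n, weights[f]) for f, n in feats.items())
        for cat, weights in model.items()
    }
    total = sum(raw.values())
    spectrum = {cat: (s / total if total else 0.0) for cat, s in raw.items()}
    categories = [cat for cat, w in spectrum.items() if w >= threshold]
    return spectrum, categories


if __name__ == "__main__":
    model = train([
        ("The dog chased the cat.", "pets"),
        ("The bank approved the loan.", "finance"),
    ])
    print(classify(model, "Der Hund und die Katze."))
```

In this sketch the model is trained on English sentences only, yet the German input is associated with the "pets" category, because both languages resolve to the same semantic classes. The shared root class `ENTITY` still contributes a small weight to "finance", which is why the result is a spectrum of weights rather than a single label.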

  • 5 Assignments