Document classification with weighted supervised n-gram embedding
First Claim
Patent Images
1. A method for document classification, comprising:
- embedding n-grams from an input text in a latent space;
embedding the input text in the latent space based on the embedded n-grams and weighting the n-grams according to a non-linear function
2 Assignments
0 Petitions
Accused Products
Abstract
Methods and systems for document classification include embedding n-grams from an input text in a latent space, embedding the input text in the latent space based on the embedded n-grams and weighting said n-grams according to spatial evidence of the respective n-grams in the input text, classifying the document along one or more axes, and adjusting weights used to weight the n-grams based on the output of the classifying step.
-
Citations
18 Claims
-
1. A method for document classification, comprising:
-
embedding n-grams from an input text in a latent space; embedding the input text in the latent space based on the embedded n-grams and weighting the n-grams according to a non-linear function - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system for document classification, comprising:
-
an n-gram embedding module configured to embed n-grams from an input text in a latent space; a document embedding module configured to embed the input the input text in the latent space based on the embedded n-grams, weighted according to a non-linear function - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
Specification