Systems and methods for classifying electronic documents
First Claim
1. A non-transitory computer readable medium storing computer-executable instructions for creating a plurality of classification rules to classify an electronic document from an electronic media source, the instructions including generating statistical data from one or more training documents, and creating a plurality of classification rules, including creating at least one topic model-based classification rule using the statistical data and creating at least one query-based classification rule using one or more user defined categories and the statistical data.
2 Assignments
0 Petitions
Accused Products
Abstract
A method of classifying an electronic document from an electronic media source includes generating statistical data from one or more training documents and creating a plurality of classification rules, including creating at least one topic model-based classification rule using the statistical data and creating at least one query-based classification rule using one or more user defined categories and the statistical data. The method further includes classifying the electronic document using the at least one topic model-based classification rule. Example systems for classifying an electronic document from an electronic media source are also disclosed.
-
Citations
23 Claims
- 1. A non-transitory computer readable medium storing computer-executable instructions for creating a plurality of classification rules to classify an electronic document from an electronic media source, the instructions including generating statistical data from one or more training documents, and creating a plurality of classification rules, including creating at least one topic model-based classification rule using the statistical data and creating at least one query-based classification rule using one or more user defined categories and the statistical data.
-
9. A computer system for classifying an electronic document from an electronic media source, the computer system comprising a communication network and a computer server in communication with the communication network, the computer server configured to receive the electronic document via the communication network, the computer server having memory and a processor, said memory including one or more training documents and one or more user defined categories, the processor configured to
generate statistical data from the one or more training documents, create a plurality of classification rules, the classification rules including at least one topic model-based classification rule created using the statistical data and at least one query-based classification rule created using the one or more user defined categories and the statistical data, and classify the electronic document using the at least one topic model-based classification rule.
Specification