Systems and methods for classifying electronic documents
First Claim
1. A non-transitory computer readable medium having instructions stored thereon that, when executed by at least one processor, cause the at least one processor to perform operations for creating a plurality of classification rules to classify an electronic document from an electronic media source, the operations comprising:
generating statistical data from one or more training documents, and creating a plurality of classification rules, including creating at least one topic model-based classification rule using the statistical data, the at least one topic model-based classification rule formatted as an XML file; and
creating at least one query-based classification rule using one or more user defined categories and the statistical data, the at least one query-based classification rule formatted as an XML file.
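The operations recited in the claim could be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented implementation: the function names, the XML element and attribute names (`rule`, `topic`, `term`, `query`, `weight`), and the use of per-category term frequencies as the "statistical data" are all hypothetical. In particular, a simple term-frequency table stands in for a real topic model, and the rules are returned as XML strings rather than written to files.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def term_statistics(training_docs):
    """Count term frequencies per category from (category, text) pairs.

    Assumption: the "statistical data" is plain per-category term counts.
    """
    stats = {}
    for category, text in training_docs:
        stats.setdefault(category, Counter()).update(text.lower().split())
    return stats


def topic_rule_xml(stats, top_n=3):
    """Hypothetical topic-model-style rule: top terms per category,
    each weighted by its relative frequency, serialized as XML."""
    root = ET.Element("rule", type="topic-model")
    for category, counts in sorted(stats.items()):
        topic = ET.SubElement(root, "topic", name=category)
        total = sum(counts.values())
        for term, count in counts.most_common(top_n):
            ET.SubElement(topic, "term", weight=f"{count / total:.3f}").text = term
    return ET.tostring(root, encoding="unicode")


def query_rule_xml(stats, user_categories, top_n=2):
    """Hypothetical query-based rule: for each user-defined category,
    an OR-query over that category's most frequent terms, as XML."""
    root = ET.Element("rule", type="query")
    for category in user_categories:
        terms = [t for t, _ in stats.get(category, Counter()).most_common(top_n)]
        ET.SubElement(root, "query", category=category).text = " OR ".join(terms)
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    # Made-up training documents, for illustration only.
    docs = [
        ("sports", "goal goal goal match match team"),
        ("finance", "stock stock stock market market bank"),
    ]
    stats = term_statistics(docs)
    print(topic_rule_xml(stats))
    print(query_rule_xml(stats, ["sports", "finance"]))
```

Both rule types are derived from the same statistics, which mirrors the claim language; the query-based rule additionally depends on the list of user-defined categories passed in.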
Abstract
A method of classifying an electronic document from an electronic media source includes generating statistical data from one or more training documents and creating a plurality of classification rules, including creating at least one topic model-based classification rule using the statistical data and creating at least one query-based classification rule using one or more user defined categories and the statistical data. The method further includes classifying the electronic document using the at least one topic model-based classification rule. Example systems for classifying an electronic document from an electronic media source are also disclosed.
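The classification step mentioned in the abstract might look like the following minimal sketch. It assumes a hypothetical XML layout for a topic model-based rule (topics containing weighted terms) and a simple scoring scheme that sums the weights of matching terms; neither the schema nor the scoring is taken from the patent.

```python
import xml.etree.ElementTree as ET

# Hypothetical topic model-based rule; the schema is an assumption.
RULE_XML = """
<rule type="topic-model">
  <topic name="sports">
    <term weight="0.5">goal</term>
    <term weight="0.3">match</term>
  </topic>
  <topic name="finance">
    <term weight="0.5">stock</term>
    <term weight="0.3">market</term>
  </topic>
</rule>
"""


def load_rule(xml_text):
    """Parse the rule XML into {topic name: {term: weight}}."""
    root = ET.fromstring(xml_text.strip())
    return {
        topic.get("name"): {
            term.text: float(term.get("weight")) for term in topic.findall("term")
        }
        for topic in root.findall("topic")
    }


def classify(document, rule):
    """Score each topic by summing the weights of the document's tokens
    and return (best topic or None, all scores)."""
    tokens = document.lower().split()
    scores = {
        name: sum(weights.get(tok, 0.0) for tok in tokens)
        for name, weights in rule.items()
    }
    best = max(scores, key=scores.get)
    # No matching term at all means the document stays unclassified.
    return (best if scores[best] > 0 else None), scores


if __name__ == "__main__":
    rule = load_rule(RULE_XML)
    label, scores = classify("Stock market rallies after earnings", rule)
    print(label, scores)
```

Keeping the rule in XML means the classifier only needs the rule file at run time, not the training documents it was derived from.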
20 Claims
9. A system comprising:
a memory; and
at least one processor to:
generate statistical data from one or more training documents, and create a plurality of classification rules, including creating at least one topic model-based classification rule using the statistical data, the at least one topic model-based classification rule formatted as an XML file; and
create at least one query-based classification rule using one or more user defined categories and the statistical data, the at least one query-based classification rule formatted as an XML file.
17. A method comprising:
generating, by at least one processor, statistical data from one or more training documents, and creating a plurality of classification rules, including creating at least one topic model-based classification rule using the statistical data, the at least one topic model-based classification rule formatted as an XML file; and
creating, by the at least one processor, at least one query-based classification rule using one or more user defined categories and the statistical data, the at least one query-based classification rule formatted as an XML file.
Specification