System and method for providing default hierarchical training for social indexing
First Claim
1. A computer-implemented method for providing default hierarchical training for social indexing, comprising:
- maintaining articles of digital information for social indexing;
specifying a hierarchically-structured tree of topics, which each comprise a label comprising one or more words;
identifying hard constraints based on the labels comprised in the topic tree and the topic tree'"'"'s hierarchical structure, and defining the hard constraints to include immutable rules comprising at least one of;
requiring that a topic model comprises a single term comprised from a label that is duplicated within the topic tree;
requiring that a topic model includes no term from the label for the topic to which the topic model belongs; and
when the label is duplicated within the topic tree, requiring that a topic model includes no term from the label of a parent topic for the topic to which the topic model belongs;
for each topic in the topic tree, creating a topic model subject to the hard constraints, the topic model comprising a finite state pattern that comprises a pattern evaluable against the articles;
evaluating the topic models for the topic tree against the hard constraints and disfavoring those topic models that violate one or more of the immutable rules; and
identifying for each topic, the topic model, which best satisfies the constraints.
7 Assignments
0 Petitions
Accused Products
Abstract
A system and method for providing default hierarchical training for social indexing is provided. Articles of digital information for social indexing are maintained. A hierarchically-structured tree of topics is specified. Each topic includes a label that includes one or more words. Constraints inherent in the literal structure of the topic tree are identified. For each topic in the topic tree, a topic model that includes at least one term derived from the words in at least one of the labels is created. The topic models for the topic tree are evaluated against the constraints. Those of the topic models, which best satisfy the constraints are identified.
-
Citations
20 Claims
-
1. A computer-implemented method for providing default hierarchical training for social indexing, comprising:
-
maintaining articles of digital information for social indexing; specifying a hierarchically-structured tree of topics, which each comprise a label comprising one or more words; identifying hard constraints based on the labels comprised in the topic tree and the topic tree'"'"'s hierarchical structure, and defining the hard constraints to include immutable rules comprising at least one of; requiring that a topic model comprises a single term comprised from a label that is duplicated within the topic tree; requiring that a topic model includes no term from the label for the topic to which the topic model belongs; and when the label is duplicated within the topic tree, requiring that a topic model includes no term from the label of a parent topic for the topic to which the topic model belongs; for each topic in the topic tree, creating a topic model subject to the hard constraints, the topic model comprising a finite state pattern that comprises a pattern evaluable against the articles; evaluating the topic models for the topic tree against the hard constraints and disfavoring those topic models that violate one or more of the immutable rules; and identifying for each topic, the topic model, which best satisfies the constraints. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer-implemented system for providing default hierarchical training for social indexing, comprising:
-
an electronic database, comprising; articles of digital information maintained for social indexing; and a hierarchically-structured tree of topics, which each comprise a label comprising one or more words; a processor and memory within which code for execution by the processor is stored, further comprising; an electronically-stored rules set identifying hard constraints based on the labels comprised in the topic tree and the topic tree'"'"'s hierarchical structure, wherein the hard constraints are defined to include required immutable rules comprising at least one of; that a topic model comprises a single term comprised from a label that is duplicated within the topic tree; that a topic model includes no term from the label for the topic to which the topic model belongs; and when the label is duplicated within the topic tree, that a topic model includes no term from the label of a parent topic for the topic to which the topic model belongs; a topic builder module that, for each topic in the topic tree, creates a topic model subject to the hard constraints, the topic model comprising a finite state pattern that comprises a pattern evaluable against the articles; and an evaluator module evaluating the topic models for the topic tree against the hard constraints, and disfavoring those topic models that violate one or more of the immutable rules; and a user interface visually identifying for each topic, the topic model, which best satisfies the constraints. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification