Method and system for classifying media content
Abstract
A hardware and software facility classifies media content units using keywords from a structured vocabulary. Metadata associated with each media content unit is segmented into a series of descriptive phrases. The descriptive phrases are mapped to keywords in the structured vocabulary, and the identified keywords are associated with the media content units. Descriptive phrases that are not found in the structured vocabulary are tracked as candidate phrases for later addition to the structured vocabulary. A keyword index to the media content units may be constructed. The index is used to identify, reliably and accurately, the specific media content units that are responsive to search queries.
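The flow described in the abstract can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the facility's actual implementation: the comma/semicolon segmentation rule, the flat `VOCABULARY` set, the sample metadata, and every function name are hypothetical and chosen only for the example.

```python
import re
from collections import Counter, defaultdict

# Hypothetical structured vocabulary, flattened here to a plain set of keywords.
VOCABULARY = {"beach", "sunset", "palm tree"}

def segment(metadata):
    """Split raw metadata into lowercase descriptive phrases.

    Assumed rule for this sketch: phrases are separated by ',' or ';'.
    """
    return [p.strip().lower() for p in re.split(r"[,;]", metadata) if p.strip()]

def classify(units):
    """Map each unit's phrases to keywords and track unmatched phrases."""
    index = defaultdict(set)   # keyword -> ids of media content units
    candidates = Counter()     # phrase missing from the vocabulary -> count
    for unit_id, metadata in units.items():
        for phrase in segment(metadata):
            if phrase in VOCABULARY:
                index[phrase].add(unit_id)
            else:
                candidates[phrase] += 1
    return index, candidates

# Made-up sample data.
units = {
    "img1": "Beach, sunset; surfboard",
    "img2": "palm tree, beach",
    "img3": "Surfboard, sunset",
}
index, candidates = classify(units)
print(sorted(index["beach"]))      # ['img1', 'img2']
print(candidates.most_common(1))   # [('surfboard', 2)]
```

The keyword index answers a query by direct lookup, while the candidate counter surfaces phrases (here "surfboard") that might later be added to the vocabulary.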
55 Claims
1. A method of processing metadata associated with media content units in order to classify the media content units using terms contained in a structured vocabulary, the method comprising:
(a) receiving metadata for association with a media content unit;
(b) programmatically segmenting the received metadata to generate one or more descriptive phrases;
(c) comparing the one or more descriptive phrases with a structured vocabulary of terms in order to identify one or more terms in the structured vocabulary that are related to the one or more descriptive phrases; and
(d) creating an association between the identified one or more terms and the media content unit so that the media content unit is characterized by the one or more terms.
Dependent claims: 2-16
17. A system for processing metadata associated with media content units in order to classify the media content units using terms contained in a structured vocabulary, the system comprising:
(a) a segmentation component for receiving metadata associated with a plurality of media content units and segmenting the metadata to generate one or more descriptive phrases characterizing the plurality of media content units;
(b) a matching component for comparing the one or more descriptive phrases generated by the segmentation component with a structured vocabulary of terms in order to identify one or more terms in the structured vocabulary that are correlated with the one or more descriptive phrases for the plurality of media content units; and
(c) a mapping component that, for the plurality of media content units, provides a relationship between the media content unit and the one or more terms in the structured vocabulary that were identified by the matching component.
Dependent claims: 18-32
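One way to picture the three components recited in claim 17 is as small cooperating classes. The sketch below is illustrative only: the class and method names, the comma-based segmentation rule, and the parent-pointer representation of the structured vocabulary (`PARENTS`) are assumptions, and treating a term's ancestors as "correlated" terms is one possible reading, not something the claim specifies.

```python
# Hypothetical hierarchical vocabulary: term -> parent term (None at the root).
PARENTS = {"animal": None, "dog": "animal", "poodle": "dog"}

class SegmentationComponent:
    def segment(self, metadata):
        # Assumed rule: comma-separated phrases, compared case-insensitively.
        return [p.strip().lower() for p in metadata.split(",") if p.strip()]

class MatchingComponent:
    def __init__(self, parents):
        self.parents = parents

    def match(self, phrases):
        """Return each matched term followed by its ancestors, without duplicates."""
        terms = []
        for phrase in phrases:
            term = phrase if phrase in self.parents else None
            while term is not None:
                if term not in terms:
                    terms.append(term)
                term = self.parents[term]
        return terms

class MappingComponent:
    def __init__(self):
        self.relationships = {}   # unit id -> list of vocabulary terms

    def relate(self, unit_id, terms):
        self.relationships[unit_id] = terms

segmenter = SegmentationComponent()
matcher = MatchingComponent(PARENTS)
mapper = MappingComponent()

# Made-up metadata for a plurality of media content units.
for unit_id, metadata in {"clip7": "Poodle, park", "clip8": "dog"}.items():
    mapper.relate(unit_id, matcher.match(segmenter.segment(metadata)))

print(mapper.relationships)
# {'clip7': ['poodle', 'dog', 'animal'], 'clip8': ['dog', 'animal']}
```

Keeping the components separate means the segmentation rule or the vocabulary structure could be swapped without touching the stored relationships.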
33. A method of processing descriptive text associated with visual content in order to classify the visual content using terms contained in a structured vocabulary, the method comprising:
(a) receiving descriptive text associated with visual content;
(b) programmatically segmenting the received descriptive text to identify one or more descriptive phrases;
(c) comparing the identified one or more descriptive phrases with a structured vocabulary of terms in order to identify one or more terms in the structured vocabulary that are related to the one or more descriptive phrases; and
(d) creating an association between the identified one or more terms and the image so that the visual content is classified by the one or more terms.
Dependent claims: 34-47
48. A computer-readable medium containing instructions for controlling a computer processor in a computer system to classify media content units using terms contained in a structured vocabulary by:
(a) receiving metadata associated with a media content unit;
(b) programmatically segmenting the received metadata to generate one or more descriptive phrases;
(c) comparing the one or more descriptive phrases with a structured vocabulary of terms in order to identify one or more terms in the structured vocabulary that are related to the one or more descriptive phrases; and
(d) creating an association between the identified one or more terms and the media content unit so that the media content unit is classified by the one or more terms.
Dependent claims: 49-51
52. A system for segmenting metadata associated with a plurality of media content units, the system comprising:
a structured keyword vocabulary used to classify media content units; and
a segmentation component for receiving metadata for association with a plurality of media content units and segmenting the metadata to generate one or more descriptive phrases characterizing the plurality of media content units, wherein the segmentation component utilizes an application-specific dictionary that is a subset of the structured keyword vocabulary to segment the received metadata.
Dependent claims: 53
54. A method of segmenting metadata associated with a plurality of media content units, the method comprising:
(a) receiving metadata for association with a plurality of media content units; and
(b) programmatically segmenting the received metadata to generate one or more descriptive phrases associated with the plurality of media content units utilizing an application-specific dictionary, wherein the application-specific dictionary is a subset of a structured keyword vocabulary that is used to classify the plurality of media content units.
Dependent claims: 55
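Claims 52 and 54 recite segmentation driven by an application-specific dictionary that is a subset of the structured keyword vocabulary. The claims do not say how the dictionary is applied; the sketch below assumes a greedy longest-match strategy over whitespace-separated words, which is a common technique swapped in here purely for illustration. The vocabulary contents, the `max_words` limit, and the function name are hypothetical.

```python
# Hypothetical structured keyword vocabulary and an application-specific subset.
VOCABULARY = {"new york", "new york city", "skyline", "night", "taxi", "bridge"}
APP_DICTIONARY = {"new york city", "new york", "skyline", "night"}
assert APP_DICTIONARY <= VOCABULARY   # the dictionary is a subset of the vocabulary

def segment(metadata, dictionary, max_words=4):
    """Greedy longest-match segmentation (an assumed strategy, see lead-in)."""
    words = metadata.lower().split()
    phrases, i = [], 0
    while i < len(words):
        # Try the longest window first, then shrink; a single word is the fallback.
        for n in range(min(max_words, len(words) - i), 0, -1):
            phrase = " ".join(words[i:i + n])
            if n == 1 or phrase in dictionary:
                phrases.append(phrase)
                i += n
                break
    return phrases

print(segment("New York City skyline at night", APP_DICTIONARY))
# ['new york city', 'skyline', 'at', 'night']
```

Because the longest dictionary entry wins, "new york city" is kept as one descriptive phrase rather than being split into "new york" and "city".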
Specification