Hybrid audio-visual categorization system and method
Abstract
Meta-data (tags) for an audiovisual file can be generated by prompting a user to input certain tags (meta-data) descriptive of the audiovisual file, to serve as an initial estimate of the tags, and then revising the initial estimate (notably to expand it and/or render it more precise) based on the assumption that the relationships which hold between the different tags for a set of manually-tagged training examples will also hold for the tags of the input file now being tagged.
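As an illustration of what "relationships which hold between the different tags" of the training examples could look like, the sketch below estimates a simple conditional co-occurrence frequency from manually tagged examples. This is only one assumed way to derive a correlation; the abstract does not specify the technique, and the function name `learn_cooccurrence` and the tag strings are hypothetical.

```python
from collections import Counter
from itertools import permutations


def learn_cooccurrence(training_tags):
    """Estimate P(tag b present | tag a present) from tagged examples.

    Assumption: a plain co-occurrence ratio stands in for the
    correlation functions mentioned in the text.
    """
    single = Counter()  # number of examples carrying each tag
    pair = Counter()    # number of examples carrying both a and b
    for tags in training_tags:
        unique = set(tags)
        single.update(unique)
        pair.update(permutations(sorted(unique), 2))
    return {(a, b): n / single[a] for (a, b), n in pair.items()}


# Hypothetical manually tagged training examples.
training = [
    {"genre:rock", "mood:energetic"},
    {"genre:rock", "mood:energetic"},
    {"genre:rock", "mood:calm"},
    {"genre:jazz", "mood:calm"},
]
cooccurrence = learn_cooccurrence(training)
```

Here `cooccurrence[("genre:rock", "mood:energetic")]` is the fraction of rock examples that were also tagged energetic; a tagger could use such values as the strength of a rule when revising a user's initial tags.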
22 Claims
1. A method of producing a set of tags for an input audiovisual file, the set of tags indicating values of attributes of an audiovisual work of defined type represented by said audiovisual file, the method comprising the steps of:
issuing, to a user, a prompt for manual assignment of tags to said input audiovisual file;
inputting, as an initial estimate of the values of the respective attributes of the audiovisual work represented by said audiovisual file, data representative of the tags assigned to said input audiovisual file by the user in response to said prompt;
automatically applying a set of one or more correlation functions to the attribute-value estimates of said initial estimate, to produce a set of revised estimates;
assigning a respective confidence measure to each attribute value of said revised estimates; and
outputting the final result of the applying step as the set of tags for said input audiovisual file;
wherein the correlation functions applied in said applying step are functions embodying the correlations holding between known attribute-values of a set of training examples, said training examples being audiovisual works of said defined type corresponding to manually-tagged audiovisual files, and
wherein the correlation-function application step is applied iteratively, said correlation functions are applied only to attribute-values estimates associated with a confidence measure whose value exceeds a threshold, and said confidence measure is set to a maximum value for attribute values provided by the user as the initial estimate to ensure that said attribute values provided by the user as the initial estimate are not changed.
Dependent claims: 2, 3, 4, 5, 6, 7
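The iterative step recited in claim 1 can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the rule table, the multiplication of confidence by rule strength, the threshold of 0.6, and all names (`refine_tags`, `USER_CONFIDENCE`, `rule_table`) are hypothetical.

```python
USER_CONFIDENCE = 1.0  # maximum confidence; pins user-supplied values


def refine_tags(user_tags, rule_table, threshold=0.6, max_rounds=10):
    """Iteratively expand user tags using correlation rules.

    user_tags:  dict mapping attribute -> value given by the user.
    rule_table: dict mapping (attribute, value) to a list of
                (target_attribute, target_value, strength) tuples,
                with strength in [0, 1] (assumed rule format).
    Returns a dict mapping attribute -> (value, confidence).
    """
    estimates = {a: (v, USER_CONFIDENCE) for a, v in user_tags.items()}
    for _ in range(max_rounds):
        changed = False
        for attr, (value, conf) in list(estimates.items()):
            if conf <= threshold:
                continue  # only estimates above the threshold feed the rules
            for target, target_value, strength in rule_table.get((attr, value), []):
                if target in user_tags:
                    continue  # user-provided values are never changed
                new_conf = conf * strength  # assumed confidence update
                current = estimates.get(target)
                if current is not None and current[1] >= new_conf:
                    continue  # keep the more confident existing estimate
                estimates[target] = (target_value, new_conf)
                changed = True
        if not changed:
            break  # no estimate was revised in this round
    return estimates


# Hypothetical rules, as might be derived from manually tagged examples.
rule_table = {
    ("genre", "rock"): [("mood", "energetic", 0.8), ("genre", "jazz", 0.9)],
    ("mood", "energetic"): [("tempo", "fast", 0.9)],
    ("tempo", "fast"): [("instrument", "guitar", 0.7)],
    ("instrument", "guitar"): [("era", "1970s", 0.9)],
}
result = refine_tags({"genre": "rock"}, rule_table)
```

In this run the user's `genre` value stays at confidence 1.0 and is not overwritten by the rule that would suggest `jazz`. The `instrument` estimate ends below the threshold, so it is kept in the output but does not trigger the rule that would add `era`.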
8. A non-transitory computer readable storage medium storing a computer program having a set of instructions which, when executed by a computer apparatus, cause the computer apparatus to perform a method of producing a set of tags for an input audiovisual file, the set of tags indicating values of attributes of an audiovisual work of defined type represented by said audiovisual file, the method comprising the steps of:
issuing, to a user, a prompt for manual assignment of tags to said input audiovisual file;
inputting, as an initial estimate of the values of the respective attributes of the audiovisual work represented by said audiovisual file, data representative of the tags assigned to said input audiovisual file by the user in response to said prompt;
automatically applying a set of one or more correlation functions to the attribute-value estimates of said initial estimate, to produce a set of revised estimates;
assigning a respective confidence measure to each attribute value of said revised estimates; and
outputting the final result of the applying step as the set of tags for said input audiovisual file;
wherein the correlation functions applied in said applying step are functions embodying the correlations holding between known attribute-values of a set of training examples, said training examples being audiovisual works of said defined type corresponding to manually-tagged audiovisual files, and
wherein the correlation-function application step is applied iteratively, said correlation functions are applied only to attribute-values estimates associated with a confidence measure whose value exceeds a threshold, and said confidence measure is set to a maximum value for attribute values provided by the user as the initial estimate to ensure that said attribute values provided by the user as the initial estimate are not changed.
Dependent claims: 9, 10, 11, 12, 13, 14, 17
15. An audiovisual-file-tagging system configured to produce a set of tags for an input audiovisual file, the set of tags indicating values of attributes of an audiovisual work of defined type represented by said audiovisual file, the system comprising:
a prompt-issuing unit configured to issue, to a user, a prompt for manual assignment of tags to said input audiovisual file;
an initial estimate unit configured to input, as an initial estimate of the values of the respective attributes of the audiovisual work represented by said audiovisual file, data representative of the tags assigned to said input audiovisual file by the user in response to said prompt;
a correlation function application unit having a processor configured to automatically apply a set of one or more correlation functions to the attribute-value estimates of said initial estimate, to produce a set of revised estimates, said processor being further configured to assign a respective confidence measure to each attribute value of said revised estimates; and
a final result outputting unit configured to output the final result of the applying step as the set of tags for said input audiovisual file;
wherein the correlation functions applied by said correlation function application unit are functions embodying the correlations holding between known attribute-values of a set of training examples, said training examples being audiovisual works of said defined type corresponding to manually-tagged audiovisual files, and
wherein said processor is further configured to apply the correlation function application step iteratively, to apply said correlation functions only to attribute-values estimates associated with a confidence measure whose value exceeds a threshold, and to set said confidence measure to a maximum value for attribute values provided by the user as the initial estimate to ensure that said attribute values provided by the user as the initial estimate are not changed.
Dependent claims: 16, 18, 19, 20, 21, 22
Specification