System, apparatus and method for determining correct metadata from community-submitted data
First Claim
Patent Images
1. A method of processing a plurality of metadata used to characterize content, comprising:
- accessing a first plurality of raw metadata;
processing the first plurality of raw metadata to produce a first canonical form, wherein;
based on the processing, multiple raw metadata of the first plurality of raw metadata are mapped to the same first canonical form; and
each of the multiple raw metadata mapped to the same first canonical form has an associated metric value, wherein the metric value comprises one or more of a frequency of occurrence metric, a network popularity metric, or a social networking reliability metric;
comparing the associated metric value of each of the multiple raw metadata to identify the raw metadata having the highest metric value as the preferred metadata form;
generating an additional mapping that maps the first canonical form back to the preferred metadata form;
receiving a request for a data processing operation that includes a first raw metadata of the multiple raw metadata that maps to the same first canonical form, wherein the first raw metadata is different from the preferred metadata form; and
utilizing the preferred metadata form as an input in the data processing operation instead of the first raw metadata based on the mappings.
10 Assignments
0 Petitions
Accused Products
Abstract
A system, apparatus, and method for processing or correcting metadata used to characterize content such as images, video, books, or music, where that metadata may be provided by a community of users or other source. The metadata may be searched as part of a process of identifying and accessing content of interest to a user or of sharing content among users of a network. The metadata is typically a string or strings of characters that is submitted by a community, so that the accuracy of specific data cannot be guaranteed and consistent formats and unambiguous descriptions may not be used by all members of the community.
-
Citations
20 Claims
-
1. A method of processing a plurality of metadata used to characterize content, comprising:
-
accessing a first plurality of raw metadata; processing the first plurality of raw metadata to produce a first canonical form, wherein; based on the processing, multiple raw metadata of the first plurality of raw metadata are mapped to the same first canonical form; and each of the multiple raw metadata mapped to the same first canonical form has an associated metric value, wherein the metric value comprises one or more of a frequency of occurrence metric, a network popularity metric, or a social networking reliability metric; comparing the associated metric value of each of the multiple raw metadata to identify the raw metadata having the highest metric value as the preferred metadata form; generating an additional mapping that maps the first canonical form back to the preferred metadata form; receiving a request for a data processing operation that includes a first raw metadata of the multiple raw metadata that maps to the same first canonical form, wherein the first raw metadata is different from the preferred metadata form; and utilizing the preferred metadata form as an input in the data processing operation instead of the first raw metadata based on the mappings. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A method of searching for an item of content, the item of content being associated with one or more elements of metadata, comprising:
-
receiving raw metadata associated with the item of content; processing the received raw metadata to produce a canonical form, wherein; based on the processing of the raw metadata and other raw metadata, multiple raw metadata are mapped to the same canonical form; and each of the multiple raw metadata mapped to the same canonical form has an associated metric value, wherein the metric value comprises one or more of a frequency of occurrence metric, a network popularity metric, or a social networking reliability metric; comparing the associated metric value of each of the multiple raw metadata to identify the raw metadata having the highest metric value as the preferred metadata form; generating an additional mapping that maps the same canonical form back to the preferred metadata form; receiving a request for a search process that includes a first raw metadata of the multiple raw metadata, wherein the first raw metadata is different from the preferred metadata form; and utilizing the preferred metadata form as an input in the search process instead of using the first raw metadata based on the mappings. - View Dependent Claims (14, 15, 16)
-
-
17. An apparatus for processing a plurality of raw metadata used to characterize content, comprising:
-
a processor configured to execute a set of instructions; a data store configured to store the raw metadata; and an instruction store configured to store the set of instructions, wherein when executed by the processor, the set of instructions implements a process to; access the plurality of raw metadata; process the plurality of metadata to produce a canonical form, wherein; based on the process, multiple raw metadata of the plurality of raw metadata are mapped to the same canonical form; and each of the multiple raw metadata mapped to the same canonical form has an associated metric value, wherein the metric value comprises one or more of a frequency of occurrence metric, a network popularity metric, or a social networking reliability metric; compare the associated metric value of each of the multiple raw metadata to identify the raw metadata having the highest metric value as as the preferred metadata form; generate an additional mapping that maps the same canonical form back to the preferred metadata form; receive a request for a data processing operation that includes a first raw metadata of the multiple raw metadata that maps to the same canonical form, wherein the first raw metadata is different from the preferred metadata form; and utilize the preferred metadata form as an input in the data processing operation instead of the first raw metadata based on the mappings. - View Dependent Claims (18, 19, 20)
-
Specification