Fuzzy database retrieval
Abstract
A method for querying metadata associated with media, such as multimedia and/or streaming media, on a computer network includes separating the metadata into keywords. The keywords are compared with valid keywords. A score is calculated in accordance with the degree of similarity between the keywords and valid keywords. If the degree of similarity is above a threshold, the metadata is qualified as valid metadata. Valid metadata is available for comparison and correction of other (invalid) metadata.
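The flow described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the tokenizer, the Jaccard overlap used as the similarity measure, the default threshold of 0.5, and the function names `keywords`, `similarity`, and `qualify` are all assumptions introduced here, since the abstract does not specify them.

```python
import re


def keywords(text):
    """Split a metadata string into a set of lowercase keywords."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def similarity(candidate, valid):
    """Jaccard overlap of two keyword sets (an assumed similarity measure)."""
    union = candidate | valid
    if not union:
        return 0.0
    return len(candidate & valid) / len(union)


def qualify(metadata, valid_entries, threshold=0.5):
    """Score metadata against known-valid entries.

    Returns (best_score, is_valid); the metadata qualifies as valid only
    when its best score is above the (assumed) threshold.
    """
    candidate = keywords(metadata)
    best = max(
        (similarity(candidate, keywords(entry)) for entry in valid_entries),
        default=0.0,
    )
    return best, best > threshold


valid_entries = ["Kind of Blue Miles Davis"]
good = qualify("Miles Davis - Kind of Blue", valid_entries)
bad = qualify("random noise", valid_entries)
```

Under these assumptions, metadata that qualifies could then be appended to `valid_entries` so that it is available when checking other metadata, as the abstract's last sentence suggests.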
20 Claims
1. A computer-implemented method performed by a computer system to assess the quality of metadata associated with a media stream on a communications network, the method comprising:
extracting metadata associated with the media stream;
parsing the extracted metadata into metadata fields, the extracted metadata in each metadata field comprising a set of keywords, the set of keywords comprising at least one keyword;
providing a valid database comprising accurate metadata represented as a plurality of database records, each of the database records comprising a plurality of record fields, each of the record fields comprising a set of keywords, the set of keywords comprising at least one keyword, the accurate metadata being free of inaccurate metadata;
comparing the contents of the metadata fields to the contents of each of the record fields to identify a matching record field that contains all of the keywords contained in the metadata fields;
determining a similarity score based on the degree of similarity between the metadata field and the matching record field, the similarity score being indicative of the quality of the extracted metadata; and
reorganizing the metadata fields and repeating the comparing and scoring,
wherein code implementing the method is stored in a memory of the computer system for execution by a processor of the computer system.
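One way to picture the comparing, scoring, and reorganizing steps of claim 1 is the sketch below. It rests on several assumptions that the claim itself does not state: "reorganizing" is interpreted as trying a different split of the raw metadata into two fields, the score is the fraction of the matching record field covered by the metadata field, and the names `field_score` and `assess` and the sample database are hypothetical.

```python
def field_score(field_keywords, record):
    """Score one metadata field against every field of one database record.

    A record field matches only if it contains all keywords of the metadata
    field; the score is the fraction of that record field's keywords covered.
    """
    best = 0.0
    for value in record.values():
        record_keywords = set(value.lower().split())
        if field_keywords and field_keywords <= record_keywords:
            best = max(best, len(field_keywords) / len(record_keywords))
    return best


def assess(raw_metadata, database):
    """Return (best_score, best_fields) over all two-field splits of the input.

    Each split is one "reorganization" (an assumed interpretation); the
    comparing and scoring steps are repeated for every split.
    """
    tokens = raw_metadata.lower().split()
    best_score, best_fields = 0.0, None
    for cut in range(1, len(tokens)):
        fields = [tokens[:cut], tokens[cut:]]
        for record in database:
            scores = [field_score(set(f), record) for f in fields]
            score = sum(scores) / len(scores)
            if score > best_score:
                best_score = score
                best_fields = [" ".join(f) for f in fields]
    return best_score, best_fields


# Hypothetical "valid database" with a single accurate record.
database = [{"artist": "Miles Davis", "title": "Kind of Blue"}]
score, fields = assess("Miles Davis Kind of Blue", database)
```

In this sketch a poor split such as "miles" / "davis kind of blue" scores low because the second field is not contained in any record field, while the split that lines up with the record's fields scores highest.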
6. A computer-readable storage medium whose contents cause a computing system to:
extract metadata associated with a media file;
parse the metadata into a plurality of metadata fields, the metadata in each of the metadata fields comprising a set of keywords, the set of keywords comprising at least one keyword;
provide a valid database comprising a plurality of database records, each of the database records comprising a plurality of record fields, each of the record fields comprising a set of keywords, the set of keywords comprising at least one keyword;
for each of the plurality of metadata fields, compare the contents of the at least one metadata field to the contents of each of a plurality of record fields to identify a matching record field that contains all of the keywords contained in the metadata field, the plurality of record fields being of a plurality of database records in a ground truth database comprising accurate metadata and not inaccurate metadata;
determine a similarity score based on the degree of similarity between each of the plurality of metadata fields and its matching record field, the similarity score being indicative of the quality of the metadata and usable as a work flow process of a search engine; and
perform prefix rule evaluation on the plurality of metadata fields and repeating comparing and scoring.
17. A system to assess the quality of metadata associated with a media stream on a communications network, the system comprising:
at least one processor configured to:
extract metadata associated with the media stream;
parse the extracted metadata into metadata fields, the extracted metadata in the metadata field comprising a set of keywords, the set of keywords comprising at least one keyword;
provide a valid database comprising accurate metadata represented as a plurality of database records, each of the database records comprising a plurality of record fields, each of the record fields comprising a set of keywords, the set of keywords comprising at least one keyword, the accurate metadata not including inaccurate metadata;
compare the contents of the metadata fields to the contents of each of the record fields to identify a matching record field that contains all of the keywords contained in the metadata fields;
determine a similarity score based on the degree of similarity between the metadata field and the matching record field, the similarity score being indicative of the quality of the extracted metadata; and
reorganize the metadata fields and repeat the comparing and determining operations.
Specification