Matching media file metadata to standardized metadata
Abstract
Providing standardized metadata associated with media content responsive to input metadata. The invention extracts one or more tokens from the input metadata. Each of the extracted tokens represents a portion of the input metadata. The invention creates a token group comprising a plurality of selected tokens. The invention searches the database of standardized metadata using a multi-pass approach using the token group and the extracted tokens to identify the standardized metadata associated with the input metadata.
79 Citations
37 Claims
1. A method for providing metadata associated with media content and stored in a database on a first computing device, said method comprising:
- receiving, by the first computing device from a second computing device storing a media file storing media content, a request comprising an item of initial metadata identified by the media file, wherein said request is in response to selecting of the media file via a media player by a user of the second computing device;
- extracting one or more tokens from the item of initial metadata in the received request, each of said extracted tokens representing a portion of the item of initial metadata;
- assigning a weight to each of the extracted tokens, said assigned weight corresponding to a frequency of occurrence of the extracted token in standardized metadata stored in the database;
- selecting a plurality of the extracted tokens to create a token group;
- searching the database on the first computing device for additional metadata associated with the media content using the created token group, wherein the token group comprises a text string and searching the database comprises searching the database using a substring of less than the entire text string, said searching comprising generating a list of items of media content ordered according to the assigned weights; and
- providing the additional metadata associated with the media content from the database on the first computing device to the second computing device wherein the additional metadata provided to the first computing device comprises the generated list of items of media content.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18
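The steps recited in claim 1 (tokenize, weight by frequency, group, search on a substring, rank) can be illustrated with a minimal Python sketch. Nothing named here comes from the patent: the `CATALOG` list is a hypothetical stand-in for the database of standardized metadata, the weight is assumed to be the inverse of a token's frequency in that catalog (the claim only says the weight corresponds to the frequency), and the substring search is assumed to be a prefix match on each token in the group.

```python
import re
from collections import Counter

# Hypothetical stand-in for the database of standardized metadata.
CATALOG = [
    "Abbey Road The Beatles",
    "Let It Be The Beatles",
    "The Dark Side of the Moon Pink Floyd",
]


def tokenize(text):
    """Split a metadata string into lowercase alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def catalog_frequencies(catalog):
    """Count how often each token occurs across the whole catalog."""
    counts = Counter()
    for entry in catalog:
        counts.update(tokenize(entry))
    return counts


def assign_weights(tokens, counts):
    """Assumption: weight is inverse frequency; unknown tokens get 0."""
    return {t: (1.0 / counts[t] if counts[t] else 0.0) for t in tokens}


def make_token_group(weights, size=2):
    """Join the `size` highest-weighted tokens into one text string."""
    chosen = sorted(weights, key=lambda t: (-weights[t], t))[:size]
    return " ".join(chosen)


def search(token_group, weights, catalog, prefix_len=4):
    """Match each group token by a prefix (a substring shorter than the
    group string) and order the hits by the summed token weights."""
    scored = []
    for entry in catalog:
        entry_tokens = tokenize(entry)
        score = sum(
            weights[t]
            for t in token_group.split()
            if any(e.startswith(t[:prefix_len]) for e in entry_tokens)
        )
        if score > 0:
            scored.append((score, entry))
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [entry for _, entry in scored]


counts = catalog_frequencies(CATALOG)
weights = assign_weights(tokenize("abbey rd - beatles"), counts)
group = make_token_group(weights)
results = search(group, weights, CATALOG)
```

With this sample data the token group is `abbey beatles`, and the Abbey Road entry ranks first because both of its matching tokens contribute weight.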
19. A method for providing standardized metadata associated with media content and stored in a database on a first computing device, said method comprising:
- selecting a media file storing media content via a media player, said media file stored on a second computing device, wherein said selecting is performed by a user of the second computing device;
- receiving, from the second computing device storing the media file having the media content, a request comprising metadata associated with said media content, said metadata including text strings corresponding to an album title, an artist name, and a track title, wherein said request is in response to selecting of the media file via a media player by a user of the second computing device;
- extracting one or more tokens from the text strings, each of said extracted tokens representing a substring of the text strings;
- assigning a weight to each of the extracted tokens, said assigned weight corresponding to a frequency of occurrence of the external token in the standardized metadata stored in the database;
- selecting a plurality of the extracted tokens from the album title text string and the artist name text string to create a first token group;
- searching the database on the first computing device for additional metadata associated with the media content using the first token group, said searching comprising generating a list of items of media content ordered according to the assigned weights;
- modifying one or more of the extracted tokens to create various keyword groups;
- searching the database using the various keyword groups to yield a plurality of results;
- selecting a plurality of the extracted tokens from the album title text string and the track title text string to create a second token group;
- searching the database for additional metadata associated with the media content using the second token group to yield another plurality of results;
- selecting a plurality of the extracted tokens from the artist name text string and the track title text string to create a third token group;
- searching the database for additional metadata associated with the media content using the third token group to produce the additional metadata relating to a single item of media content; and
- providing the additional metadata associated with the media content and the single item of media content from the database on the first computing device to the second computing device.
Dependent claims: 20
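Claim 19 describes three search passes, each built from a different pair of fields (album and artist, album and track title, artist and track title), ending in a single item. The sketch below is one possible reading, not the patented procedure: the `Track` record, the sample `CATALOG`, and the voting rule that combines the passes are all assumptions, and the keyword-group variation step is omitted.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Track:
    album: str
    artist: str
    title: str


# Hypothetical standardized records.
CATALOG = [
    Track("Abbey Road", "The Beatles", "Come Together"),
    Track("Abbey Road", "The Beatles", "Something"),
    Track("Let It Be", "The Beatles", "Get Back"),
    Track("Love", "The Beatles", "Come Together"),
]


def tokens(text):
    """Lowercase whitespace-separated tokens of a text string."""
    return set(text.lower().split())


def matches(record_fields, query_fields):
    """True when every query field shares a token with the record field."""
    return all(tokens(r) & tokens(q) for r, q in zip(record_fields, query_fields))


def multi_pass_lookup(album, artist, title, catalog):
    """Run three passes over the catalog and return the single best record."""
    passes = [
        lambda t: matches((t.album, t.artist), (album, artist)),  # first group
        lambda t: matches((t.album, t.title), (album, title)),    # second group
        lambda t: matches((t.artist, t.title), (artist, title)),  # third group
    ]
    votes = Counter()
    for accept in passes:
        for track in catalog:
            if accept(track):
                votes[track] += 1
    ranked = votes.most_common()
    # Assumption: report a single item only when one record clearly leads.
    if not ranked or (len(ranked) > 1 and ranked[0][1] == ranked[1][1]):
        return None
    return ranked[0][0]


best = multi_pass_lookup("Abbey Rd", "Beatles", "Come Together", CATALOG)
```

Here the imperfect album string "Abbey Rd" still matches in the first two passes through the shared token "abbey", and the third pass separates the Abbey Road track from the other "Come Together" entry.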
21. One or more computer-readable storage medium having computer-executable components for providing metadata associated with media content and stored in a database on a first computing device, said components comprising:
- a user interface component of a media player for enabling a user of a second computing device to select a media file stored on the second computing device, said media file storing media content;
- an interface component for receiving, from the second computing device, a request comprising an item of initial metadata identified by the selected media file stored on said second computing device, said media file storing media content, wherein said request is in response to the user selecting the media file;
- a tokenizer component for extracting one or more tokens from the item of initial metadata in the request received by the interface component, each of said extracted tokens representing a portion of the item of initial metadata, said tokenizer component further selecting a plurality of the extracted tokens to create a token group;
- a weighting component for assigning a weight to each of the extracted tokens, said assigned weight corresponding to a frequency of occurrence of the extracted token in the metadata stored in the database; and
- a query component for searching the database on the first computing device for additional metadata associated with the media content using the token group created by the tokenizer component and for providing the additional metadata from the database on the first computing device to the second computing device, wherein searching the database comprises generating a list of items of media content ordered according to the assigned weights of the extracted tokens, wherein the second computing device updates metadata associated with the media file stored on the second computing device with the additional metadata from the database, wherein the request comprises a plurality of items of initial metadata each corresponding to one of a plurality of categories, and wherein the query component searches the database based on the items corresponding to one of the categories, wherein the query component further performs additional searching based on the items corresponding to at least one of the other categories, and wherein the query component produces additional metadata relating to a single item of media content and further identifies the additional metadata and the single item of media content to the second computing device.
Dependent claims: 22, 23, 24
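Claim 21 adds that the second computing device updates the metadata associated with the media file using the additional metadata it receives. A minimal sketch of that client-side step follows. The function name, the tag keys, and the rule that empty values never overwrite a local tag are assumptions made for illustration.

```python
def apply_standardized_metadata(local_tags, additional):
    """Return a copy of local_tags updated with standardized values.

    Assumption: an empty standardized value never erases a local tag.
    """
    updated = dict(local_tags)
    for key, value in additional.items():
        if value:
            updated[key] = value
    return updated


local = {"artist": "beatles", "album": "Abbey Rd", "comment": "ripped 2004"}
received = {"artist": "The Beatles", "album": "Abbey Road", "genre": "Rock", "year": ""}
merged = apply_standardized_metadata(local, received)
```

Returning a new dictionary rather than mutating `local_tags` keeps the original tags available if the user rejects the proposed update.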
25. A system for receiving a request from a first computing device, said request comprising an item of initial metadata identified by a media file stored on said first computing device, said media file storing media content, said system comprising:
- a user interface component of a media player for enabling a user of the first computing device to select the media file stored on the first computing device from which the item of initial metadata is identified, wherein the request is in response to the user selecting the media file;
- a tokenizer component for extracting one or more tokens from the item of initial metadata in the received request, each of said extracted tokens representing a portion of the item of initial metadata, said tokenizer component further selecting a plurality of the extracted tokens to create a token group;
- a database on a second computing device storing additional metadata relating to the media content;
- a weighting component for assigning a weight to each of the extracted tokens, said assigned weight corresponding to a frequency of occurrence of the extracted token in standardized metadata stored in the database; and
- a query component for searching the database for the additional metadata associated with the media content using the created token group and for providing the additional metadata from the database on the second computing device to the first computing device, wherein searching the database comprises generating a list of items of media content ordered according to the assigned weights of the extracted tokens, wherein the request comprises a plurality of items of initial metadata each corresponding to one of a plurality of categories, and wherein the query component searches the database based on the items corresponding to one of the categories, and wherein the query component further performs additional searching based on the items corresponding to at least one of the other categories.
Dependent claims: 26, 27, 28, 29
30. A computer-readable storage medium of a first computing device having stored thereon a data structure describing an item of media content, said data structure comprising:
- a plurality of metadata items including a metadata item associated with the media content;
- one or more tokens, each of said tokens representing a portion of the metadata item;
- a weight associated with each text string, said weight representing a frequency of occurrence of the text string in the metadata item associated with the media content; and
- a token group including one or more of the tokens, wherein a second computing device stores the media content, and wherein the metadata item is provided to the second computing device from the first computing device responsive to a request from the second computing device as a function of comparing a token group identified in the request with the token group of said data structure, said token group identified in the request being associated with the media content stored by the second computing device, said media content stored in a media file selected by a user of the second computing device via a media player, wherein comparing a token group identified in the request with the token group of said data structure comprises generating a list of items of media content ordered according to the assigned weights, and wherein the request is in response to the user selecting the media file, wherein each of the tokens comprises a text string including a plurality of words, and further comprising a weight associated with each word of the text string, said weight representing a frequency of occurrence of the word of the text string in the metadata item.
Dependent claims: 31, 32
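The data structure of claim 30 holds metadata items, multi-word tokens, a weight per text string, a weight per word, and a token group. One hypothetical layout is sketched below. The `MediaRecord` class, its field names, and the choice of album and artist for the token group are assumptions, and frequency is taken to be a raw count over the record's own field values.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class MediaRecord:
    """Hypothetical layout for the claimed data structure."""
    metadata_items: dict                                # e.g. {"album": ...}
    tokens: list = field(default_factory=list)          # multi-word text strings
    token_weights: dict = field(default_factory=dict)   # text string -> count
    word_weights: dict = field(default_factory=dict)    # word -> count
    token_group: list = field(default_factory=list)     # selected tokens


def build_record(metadata_items, group_fields=("album", "artist")):
    """Derive tokens, weights, and a token group from raw metadata items."""
    values = [v.lower() for v in metadata_items.values()]
    tokens = list(dict.fromkeys(values))  # de-duplicate, keep first-seen order
    return MediaRecord(
        metadata_items=dict(metadata_items),
        tokens=tokens,
        token_weights={t: values.count(t) for t in tokens},
        word_weights=dict(Counter(" ".join(values).split())),
        token_group=[metadata_items[f].lower() for f in group_fields
                     if f in metadata_items],
    )


record = build_record(
    {"album": "Let It Be", "artist": "The Beatles", "title": "Let It Be"}
)
```

In this example the text string "let it be" appears in two fields, so it carries a weight of 2, while each of its words is weighted separately in `word_weights`.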
33. A method of processing data via a first computing device for storage on a computer-readable storage medium, of the first computing device said method comprising:
- receiving one or more metadata items at the first computing device from a metadata source, said metadata items being related to media content;
- extracting one or more tokens from the received metadata items, each of said extracted tokens representing a portion of one or more of the received metadata items;
- assigning a weight to each of the extracted tokens based on the frequency of occurrence of the extracted token in the metadata items;
- ranking each of the extracted tokens for each of the metadata items based on the assigned weight;
- selecting a plurality of the extracted tokens to create a token group; and
- storing the received metadata items, the extracted tokens, and the created token group on the computer-readable medium of the first computing device to associate the media content with the stored metadata items, tokens, and token group, wherein a second computing device stores the media content, said media content stored in a media file selected by a user of the second computing device via a media player, and wherein at least one of the metadata items is provided from the first computing device to the second computing device responsive to a request from the second computing device as a function of comparing a token group identified in the request with the token group stored on the computer-readable medium of the first computing device, said token group identified in the request being associated with the media content stored by the second computing device, wherein the created token group comprises a text string and wherein comparing a token group identified in the request with the token group stored on the computer-readable medium of the first computing device comprises comparing a token group identified in the request with the token group stored on the computer-readable medium of the first computing device using a substring of less than the entire text string, and wherein the request is in response to the user selecting the media file.
Dependent claims: 34, 35, 36, 37
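Claim 33 covers the storage side: tokens are extracted from incoming metadata items, weighted, ranked, grouped, and stored, and a later request is compared against the stored token group using only part of its text string. The sketch below illustrates that flow under several assumptions that are not in the claim: the weight is the number of received items containing the token, less frequent tokens rank first, and the comparison uses roughly the first two thirds of the stored group string. The list-of-dicts `store` is a hypothetical stand-in for the computer-readable medium.

```python
import re
from collections import Counter


def extract_tokens(item):
    """Lowercase alphanumeric tokens, de-duplicated in order of appearance."""
    return list(dict.fromkeys(re.findall(r"[a-z0-9]+", item.lower())))


def ingest(metadata_items, store, group_size=2):
    """Tokenize, weight, rank, group, and store each metadata item."""
    # Weight: number of received items that contain the token.
    counts = Counter(t for item in metadata_items for t in extract_tokens(item))
    for item in metadata_items:
        # Assumption: less frequent (more distinctive) tokens rank first;
        # the stable sort keeps input order among equally weighted tokens.
        ranked = sorted(extract_tokens(item), key=lambda t: counts[t])
        store.append({
            "item": item,
            "tokens": ranked,
            "weights": {t: counts[t] for t in ranked},
            "token_group": " ".join(ranked[:group_size]),
        })


def lookup(request_group, store):
    """Return items whose stored token group matches on a leading substring."""
    hits = []
    for record in store:
        group = record["token_group"]
        if not group:
            continue
        probe = group[: max(1, (2 * len(group)) // 3)]  # partial text string
        if probe in request_group.lower():
            hits.append(record["item"])
    return hits


store = []
ingest(
    ["Abbey Road The Beatles", "Let It Be The Beatles", "Revolver The Beatles"],
    store,
)
found = lookup("abbey rd", store)
```

Because only the leading part of the stored group "abbey road" is compared, the abbreviated request "abbey rd" still finds the Abbey Road item.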
Specification