Media content search engine incorporating text content and user log mining
First Claim
1. One or more computer-readable media having stored thereon a plurality of instructions that, when executed by one or more processors of a computer, causes the one or more processors to perform the following acts:
- receiving search criteria;
generating a query vector based on text features of the search criteria;
identifying media content pieces to be rendered by comparing the query vector to text feature vectors associated with a plurality of media content pieces;
receiving user feedback regarding the relevancy of the identified media content pieces;
modifying the query vector based on the user feedback;
modifying one or more of the text feature vectors associated with the plurality of media content pieces based on the user feedback; and
identifying new media content pieces to be rendered by comparing the modified query vector to the text feature vectors, including the one or more modified text feature vectors, associated with the plurality of media content pieces.
2 Assignments
0 Petitions
Accused Products
Abstract
Text features corresponding to pieces of media content (e.g., images, audio, multimedia content, etc.) are extracted from media content sources. One or more text features (e.g., one or more words) for a piece of media content are extracted from text associated with the piece of media content and text feature vectors generated therefrom and used during subsequent searching. Additional low-level feature vectors may also be extracted from the piece of media content and used during the subsequent searching. Relevance feedback can also be received from a user(s) identifying the relevance of pieces of media content rendered to the user in response to his or her search request. The relevance feedback is logged and can be used in determining how to respond to subsequent search requests, such as by modifying feature vectors (e.g., text feature vectors) corresponding to the pieces of media content for which relevance feedback is received.
173 Citations
54 Claims
-
1. One or more computer-readable media having stored thereon a plurality of instructions that, when executed by one or more processors of a computer, causes the one or more processors to perform the following acts:
-
receiving search criteria;
generating a query vector based on text features of the search criteria;
identifying media content pieces to be rendered by comparing the query vector to text feature vectors associated with a plurality of media content pieces;
receiving user feedback regarding the relevancy of the identified media content pieces;
modifying the query vector based on the user feedback;
modifying one or more of the text feature vectors associated with the plurality of media content pieces based on the user feedback; and
identifying new media content pieces to be rendered by comparing the modified query vector to the text feature vectors, including the one or more modified text feature vectors, associated with the plurality of media content pieces. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method comprising:
-
identifying a media content source;
collecting one or more pieces of media content and associated text from the media content source;
extracting, for a piece of media content, one or more text features from the associated text; and
making the one or more text features available for searching. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39)
-
-
27. A method comprising:
-
receiving search criteria;
generating a query vector based on the search criteria;
comparing the query vector to a feature vector corresponding to a piece of media content and having been generated based on text associated with the piece of media content; and
determining, based at least in part on a result of the comparing, whether to render the piece of media content to a user.
-
-
40. A system comprising:
-
a crawler module coupled to access a media content source and collect a plurality of media content pieces and associated text from the media content source;
a feature extraction module coupled to extract one or more text features from one of the media content pieces; and
a media content indexing module coupled to generate a text feature vector, based on the extracted one or more text features, corresponding to the one media content piece.
-
-
41. A system comprising:
-
a query generator to generate a query vector based on received search criteria; and
a matching module coupled to, receive the query vector and compare the query vector to a plurality of feature vectors corresponding to a plurality of pieces of media content, wherein each of the plurality of feature vectors has been generated based on text associated with one of the plurality of pieces of media content, and identify one or more of the plurality of pieces of media content to return for rendering to a user based on the comparison of the query vector to the plurality of feature vectors.
-
-
42. A method comprising:
-
receiving search criteria;
identifying, based at least in part on the search criteria, a piece of media content to be rendered;
receiving user feedback regarding the relevancy of the rendered piece of media content;
weighting for another piece of media content, based on the user feedback, both a result of comparing the high-level query vector to a high-level feature vector of the other piece of media content and a result of comparing the low-level query vector to a low-level feature vector of the other piece of media content; and
combining the weighted result to determine whether to identify the other piece of media content for rendering. - View Dependent Claims (43, 44, 45, 46, 47, 49, 50)
-
-
48. A method comprising:
-
receiving user feedback regarding the relevancy of one or more pieces of rendered media content; and
for each of the one or more pieces of media content, modifying, based on the user feedback, a feature vector corresponding to the piece of media content, and making the modified feature vector available for subsequent searching of the one or more pieces of media content.
-
-
51. One or more computer-readable media having stored thereon a plurality of instructions that, when executed by one or more processors of a computer, causes the one or more processors to perform acts including:
-
identifying a piece of media content to render to a user based at least in part on comparing a query vector corresponding to search criteria of the user and a feature vector corresponding to the piece of media content;
receiving user feedback regarding the relevancy of the piece of media content;
modifying the query vector based on the received user feedback; and
modifying the feature vector based on the received user feedback in an off-line log mining process. - View Dependent Claims (52, 53, 54)
-
Specification