Extracting audiovisual features from content elements on online documents
First Claim
1. A system to extract audiovisual features from online document elements, comprising:
- a recognition engine that executes on a data processing system having one or more processors that;
receives, from a client device, a request for content to insert into an online document, the online document including a first audiovisual content element loaded into a first content slot and a second content slot, the second content slot separate from the first content slot on the online document, the first audiovisual content element originating from a source different from the content to be inserted into the second content slot and including image data, the request for content related to a search query including the first audiovisual content element and a characteristic of the second content slot;
retrieves, responsive to receipt of the request for content, a plurality of candidate audiovisual content elements from a content provider database based on the characteristic of the second content slot, the second audiovisual content element including image data;
extracts an image feature from the first audiovisual content element by applying an image feature detection to the image data of the first audiovisual content element;
identifies a text label corresponding the first audiovisual content element from a metadata field of the online document;
extracts an image feature from each candidate audiovisual content element by applying the image feature detection to the image data of the candidate audiovisual content element;
identifies a keyword of each candidate audiovisual content element, the keyword associated with the candidate audiovisual content element from on a previous search query and a corresponding interaction event;
determines an image feature match between the image feature of the first audiovisual content element and the image feature of each candidate audiovisual content element;
determines a keyword match between the text label of the first audiovisual content element from the metadata field of the online document and the keyword of the second audiovisual content from on the previous search query and the corresponding interaction event;
selects, from the plurality of candidate audiovisual content elements, a second audiovisual content element for display by the client device on the online document based on the image feature match and the keyword match; and
the data processing system that transmits, responsive to the selection of the second audiovisual content element, via a network interface, the second audiovisual content element to the client device for insertion by the client device into the second content slot of the online document to be presented on the online document with the first audiovisual content element loaded into the first content slot.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for extracting audiovisual features from online document elements are described herein. A computing device can identify a first audiovisual content element on an online document and can retrieve a second audiovisual content element from a content provider database. The computing device can extract an image, video, or audio feature from the first and the second audiovisual content elements by applying image feature, video frame feature, or audio fingerprint detection. The computing device can determine a match between the features extracted from the first and the second audiovisual content elements. The computing device can select the second audiovisual content element for display on the online document based on the match. The computing device can transmit the second audiovisual content element for insertion in a content slot of the online document.
165 Citations
16 Claims
-
1. A system to extract audiovisual features from online document elements, comprising:
-
a recognition engine that executes on a data processing system having one or more processors that; receives, from a client device, a request for content to insert into an online document, the online document including a first audiovisual content element loaded into a first content slot and a second content slot, the second content slot separate from the first content slot on the online document, the first audiovisual content element originating from a source different from the content to be inserted into the second content slot and including image data, the request for content related to a search query including the first audiovisual content element and a characteristic of the second content slot; retrieves, responsive to receipt of the request for content, a plurality of candidate audiovisual content elements from a content provider database based on the characteristic of the second content slot, the second audiovisual content element including image data; extracts an image feature from the first audiovisual content element by applying an image feature detection to the image data of the first audiovisual content element; identifies a text label corresponding the first audiovisual content element from a metadata field of the online document; extracts an image feature from each candidate audiovisual content element by applying the image feature detection to the image data of the candidate audiovisual content element; identifies a keyword of each candidate audiovisual content element, the keyword associated with the candidate audiovisual content element from on a previous search query and a corresponding interaction event; determines an image feature match between the image feature of the first audiovisual content element and the image feature of each candidate audiovisual content element; determines a keyword match between the text label of the first audiovisual content element from the metadata field of the online document and the keyword of the second audiovisual content from on the previous search query and the corresponding interaction event; selects, from the plurality of candidate audiovisual content elements, a second audiovisual content element for display by the client device on the online document based on the image feature match and the keyword match; and the data processing system that transmits, responsive to the selection of the second audiovisual content element, via a network interface, the second audiovisual content element to the client device for insertion by the client device into the second content slot of the online document to be presented on the online document with the first audiovisual content element loaded into the first content slot. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method of extracting audiovisual features from online document elements, comprising:
-
receiving, by a recognition engine executing on a data processing system having one or more processors, from a client device, a request for content to insert into an online document, the online document including a first audiovisual content element loaded into a first content slot and a second content slot, the second content slot separate from the first content slot on the online document, the first audiovisual content element originating from a source different from the content to be inserted into the second content slot and including image data, the request for content related to a search query including the first audiovisual content element and a characteristic of the second content slot; retrieving, by the recognition engine, responsive to receipt of the request for content, a plurality of candidate audiovisual content elements from a content provider database based on the characteristic of the second content slot, the second audiovisual content element including image data; extracting, by the recognition engine, an image feature from the first audiovisual content element by applying an image feature detection to the image data of the first audiovisual content element; identifying, by the recognition engine, a text label corresponding the first audiovisual content element from a metadata field of the online document; extracting, by the recognition engine, an image feature from each candidate audiovisual content element by applying the image feature detection to the image data of the candidate audiovisual content element; identifying, by the recognition engine, a keyword of each candidate audiovisual content element, the keyword associated with the candidate audiovisual content element from on a previous search query and a corresponding interaction event determining, by the recognition engine, an image feature match between the image feature of the first audiovisual content element and the image feature of each candidate audiovisual content element; determining, by the recognition engine, a keyword match between the text label of the first audiovisual content element from the metadata field of the online document and the keyword of the second audiovisual content from on the previous search query and the corresponding interaction event; selecting, by the recognition engine, from the plurality of candidate audiovisual content elements, a second audiovisual content element for display by the client device on the online document based on the image feature match and the keyword match; and transmitting, by the data processing system, responsive to the selection of the second audiovisual content element, via a network interface, the second audiovisual content element to the client device for insertion by the client device into the second content slot of the online document to be presented on the online document with the first audiovisual content element loaded into the first content slot. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
Specification