Multimodal transmission of packetized data
First Claim
1. A system of multi-modal transmission of packetized data in a voice activated data packet based computer network environment, comprising:
a natural language processor component executed by a data processing system to receive, via an interface of the data processing system, data packets comprising an input audio signal detected by a sensor of a client computing device;
the natural language processor component to parse the input audio signal to identify a request and a trigger keyword corresponding to the request;
a direct action application programming interface of the data processing system to generate, based on at least one of the request and the trigger keyword, one or more action data structures;
a content selector component executed by the data processing system to receive at least one of the request and the trigger keyword identified by the natural language processor component and to select, based on at least one of the request and the trigger keyword, one or more content items via a real-time content selection process;
an interface management component of the data processing system to:
poll a plurality of interfaces to identify a first candidate interface and a second candidate interface within a threshold distance of the client computing device;
determine a first device resource utilization value for the first candidate interface and a second device resource utilization value for the second candidate interface;
select, based on a comparison of the first device resource utilization value and the second device resource utilization value, the first candidate interface as a selected interface to present a first action data structure of the one or more action data structures and a first content item of the one or more content items;
provide the first action data structure and the first content item for delivery in a modality compatible with the selected interface selected based on the comparison of the first device resource utilization value and the second device resource utilization value; and
transmit the first action data structure and the first content item in the modality to deliver the content item for presentation via the selected interface.
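The interface management steps recited above (poll, determine utilization, compare, select) can be illustrated with a minimal sketch. Everything named here is hypothetical: the `CandidateInterface` fields, the 0.0 to 1.0 utilization scale, the distance in meters, and the policy of choosing the least-utilized candidate are assumptions for illustration. The claim itself only requires that the selection be based on a comparison of the two resource utilization values.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CandidateInterface:
    # All fields are hypothetical; the claim does not define a data layout.
    name: str
    modality: str        # e.g. "audio", "text", "image"
    distance_m: float    # distance from the client computing device
    utilization: float   # resource utilization value, 0.0 (idle) to 1.0 (saturated)


def poll_candidates(interfaces, threshold_m):
    """Keep only interfaces within the threshold distance of the client device."""
    return [i for i in interfaces if i.distance_m <= threshold_m]


def select_interface(candidates):
    """Compare utilization values and return the least-utilized candidate.

    Choosing the lowest value is an assumed policy, not one stated in the claim.
    """
    if not candidates:
        raise ValueError("no candidate interface within the threshold distance")
    return min(candidates, key=lambda i: i.utilization)


# Made-up example data: three interfaces, one of them out of range.
interfaces = [
    CandidateInterface("smartwatch", "text", 1.0, 0.80),
    CandidateInterface("phone", "image", 2.5, 0.35),
    CandidateInterface("tv", "image", 40.0, 0.05),
]
candidates = poll_candidates(interfaces, threshold_m=10.0)
selected = select_interface(candidates)
print(selected.name, selected.modality)
```

In this sketch the television has the lowest utilization but is filtered out by the distance threshold before the comparison takes place, so the comparison runs only over the two in-range candidates.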
Abstract
A system of multi-modal transmission of packetized data in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. Based on the input audio signal, a direct action application programming interface can generate a first action data structure, and a content selector component can select a content item. An interface management component can identify first and second candidate interfaces, and respective resource utilization values. The interface management component can select, based on the resource utilization values, the first candidate interface to present the content item. The interface management component can provide the first action data structure to the client computing device for rendering as audio output, and can transmit the content item converted for a first modality to deliver the content item for rendering from the selected interface.
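The split delivery described in the abstract, where the action data structure goes to the client computing device as audio output and the content item is converted for the modality of the selected interface, might be sketched as follows. The payload shapes, the `convert_for_modality` and `plan_delivery` helpers, and the three modality names are all hypothetical; the abstract does not specify any wire format.

```python
from dataclasses import dataclass


@dataclass
class ContentItem:
    # Hypothetical minimal content item.
    title: str
    body: str


def convert_for_modality(item, modality):
    """Return a payload shaped for the target modality (assumed formats)."""
    if modality == "audio":
        return {"modality": "audio", "speech_text": f"{item.title}. {item.body}"}
    if modality == "text":
        return {"modality": "text", "text": f"{item.title}: {item.body}"}
    if modality == "image":
        return {"modality": "image", "caption": item.title, "alt_text": item.body}
    raise ValueError(f"unsupported modality: {modality}")


def plan_delivery(action_text, item, client_device, selected_interface, modality):
    """Build two transmissions: audio to the client, converted content to the interface."""
    return [
        {"target": client_device,
         "payload": {"modality": "audio", "speech_text": action_text}},
        {"target": selected_interface,
         "payload": convert_for_modality(item, modality)},
    ]


# Made-up example values.
plan = plan_delivery(
    action_text="Your taxi is booked",
    item=ContentItem("Ride share", "Save on your next trip"),
    client_device="speaker",
    selected_interface="phone",
    modality="text",
)
for transmission in plan:
    print(transmission["target"], transmission["payload"]["modality"])
```

Keeping the conversion in one function means an additional modality would need only one more branch, while the delivery plan stays unchanged.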
203 Citations
20 Claims
1. A system of multi-modal transmission of packetized data in a voice activated data packet based computer network environment (independent claim; full text given under First Claim above).
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13
14. A method of multi-modal transmission of packetized data in a voice activated data packet based computer network environment, comprising:
receiving, by a data processing system comprising one or more processors and memory, data packets comprising an input audio signal detected by a sensor of a client computing device;
parsing, by the data processing system, the input audio signal to identify a request and a trigger keyword corresponding to the request;
generating, by the data processing system, based on at least one of the request and the trigger keyword, one or more action data structures;
receiving, by the data processing system, at least one of the request and the trigger keyword identified by a natural language processor;
selecting, by the data processing system based on at least one of the request and the trigger keyword, one or more content items via a real-time content selection process;
polling, by the data processing system, a plurality of interfaces to identify a first candidate interface and a second candidate interface within a threshold distance of the client computing device;
determining, by the data processing system, a first device resource utilization value for the first candidate interface and a second device resource utilization value for the second candidate interface;
selecting, by the data processing system, based on a comparison of the first device resource utilization value and the second device resource utilization value, the first candidate interface as a selected interface to present a first action data structure of the one or more action data structures and a first content item of the one or more content items;
providing, by the data processing system, the first action data structure and the first content item for delivery in a modality compatible with the selected interface selected based on the comparison of the first device resource utilization value and the second device resource utilization value; and
transmitting, by the data processing system, the first action data structure and the first content item in the modality to deliver the content item for presentation via the selected interface.
Dependent claims: 15, 16, 17, 18, 19, 20
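The parsing and generating steps of the method can be illustrated with a small sketch. It assumes the input audio signal has already been transcribed to text, which the claim does not state, and it uses a hypothetical keyword table, a hypothetical `ActionDataStructure` layout, and simple word matching in place of a real natural language processor.

```python
import re
from dataclasses import dataclass, field

# Hypothetical table mapping trigger keywords to action types.
TRIGGER_KEYWORDS = {"go": "transport", "order": "purchase", "play": "media"}


@dataclass
class ActionDataStructure:
    # Assumed layout; the claim does not define the fields.
    action_type: str
    trigger_keyword: str
    request: str
    parameters: dict = field(default_factory=dict)


def parse_transcript(transcript):
    """Return (request, trigger_keyword); the keyword is None if nothing matches."""
    request = transcript.strip().lower()
    for word in re.findall(r"[a-z']+", request):
        if word in TRIGGER_KEYWORDS:
            return request, word
    return request, None


def generate_action(request, trigger_keyword):
    """Build an action data structure from the request and trigger keyword."""
    if trigger_keyword is None:
        return None
    return ActionDataStructure(
        action_type=TRIGGER_KEYWORDS[trigger_keyword],
        trigger_keyword=trigger_keyword,
        request=request,
    )


# Made-up example utterance.
request, keyword = parse_transcript("OK, I would like to go to the beach")
action = generate_action(request, keyword)
print(keyword, action.action_type)
```

The resulting structure would then feed the polling, comparison, and transmission steps that follow in the claim.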
Specification