Method and system for voice based media search
First Claim
1. A method comprising:
- receiving voice input data at a media device;
determining whether the voice input corresponds to one of a plurality of commands;
in response to determining that the voice input corresponds to one of the plurality of commands, executing the command;
in response to determining that the voice input does not correspond to one of the plurality of commands, sending the voice input data to a speech-to-text service;
receiving, by the media device from the speech-to-text service, a textual representation of at least a portion of the voice input data;
generating a signature based on at least a portion of the textual representation;
locating a particular data entry among a set of data entries by searching the set of data entries for a data entry matching the signature generated based on the at least a portion of the textual representation, each data entry of the set of data entries specifying a mapping between a given signature and one or more media device actions;
updating the set of data entries by storing the mapping between the signature and the at least a portion of the textual representation;
in response to locating the particular data entry among the set of data entries based on the generated signature, performing one or more particular media device actions associated with the particular data entry, the one or more particular media device actions including sending a media content query to a media search service;
receiving, by the media device, one or more content item listings based on the media content query; and
generating for display at least a portion of the one or more content item listings.
7 Assignments
0 Petitions
Accused Products
Abstract
Voice-based input is used to operate a media device and/or to search for media content. Voice input is received by a media device via one or more audio input devices and is translated into a textual representation of the voice input. The textual representation of the voice input is used to search one or more cache mappings between input commands and one or more associated device actions and/or media content queries. One or more natural language processing techniques may be applied to the translated text and the resulting text may be transmitted as a query to a media search service. A media search service returns results comprising one or more content item listings and the results may be presented on a display to a user.
-
Citations
20 Claims
-
1. A method comprising:
-
receiving voice input data at a media device; determining whether the voice input corresponds to one of a plurality of commands; in response to determining that the voice input corresponds to one of the plurality of commands, executing the command; in response to determining that the voice input does not correspond to one of the plurality of commands, sending the voice input data to a speech-to-text service; receiving, by the media device from the speech-to-text service, a textual representation of at least a portion of the voice input data; generating a signature based on at least a portion of the textual representation; locating a particular data entry among a set of data entries by searching the set of data entries for a data entry matching the signature generated based on the at least a portion of the textual representation, each data entry of the set of data entries specifying a mapping between a given signature and one or more media device actions; updating the set of data entries by storing the mapping between the signature and the at least a portion of the textual representation; in response to locating the particular data entry among the set of data entries based on the generated signature, performing one or more particular media device actions associated with the particular data entry, the one or more particular media device actions including sending a media content query to a media search service; receiving, by the media device, one or more content item listings based on the media content query; and generating for display at least a portion of the one or more content item listings. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system comprising:
-
control circuitry configured to; receive voice input data at a media device; determine whether the voice input corresponds to one of a plurality of commands; in response to the determination that the voice input corresponds to one of the plurality of commands, execute the command; in response the determination that the voice input does not correspond to one of the plurality of commands, send the voice input data to a speech-to-text service; receive, by the media device from the speech-to-text service, a textual representation of at least a portion of the voice input data; generate a signature based on at least a portion of the textual representation; locate a particular data entry among a set of data entries by searching the set of data entries for a data entry matching the signature generated based on the at least a portion of the textual representation, each data entry of the set of data entries specifying a mapping between a given signature and one or more media device actions; update the set of data entries by storing the mapping between the signature and the at least a portion of the textual representation; in response to the location of the particular data entry among the set of data entries based on the generated signature, perform one or more particular media device actions associated with the particular data entry, the one or more particular media device actions including sending a media content query to a media search service; receive, by the media device, one or more content item listings based on the media content query; and generate for display at least a portion of the one or more content item listings. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification