Speech recognition for localized content
First Claim
1. A computer-implemented method, the method comprising:
receiving channel data that represents a plurality of terrestrial over-the-air broadcast television channels detectable by a first device associated with a user profile, the plurality of terrestrial over-the-air broadcast television channels including a first broadcast channel;
determining, using the channel data and a program guide data source, program data including:
a list of program titles that are broadcast on the plurality of terrestrial over-the-air broadcast television channels, and
a list of broadcast time periods corresponding to each of the program titles;
receiving first audio input data from a second device associated with the user profile;
performing speech recognition on the first audio input data to determine text data;
determining, using the user profile, that the second device is associated with the first device;
configuring, using information included in the program data, a knowledge database associated with the user profile, wherein the knowledge database is used by a natural language understanding (NLU) component to interpret the text data;
determining, using the text data, the NLU component, and the knowledge database, that the text data corresponds to a command to operate the first device to output a program received using the first broadcast channel; and
sending, to the first device, an instruction to tune to the first broadcast channel and output the program.
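The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the `Program` record, the `configure_knowledge_base` and `interpret` helpers, the substring match standing in for NLU, and the instruction dictionary are all hypothetical names and simplifications chosen for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Program:
    """One program guide entry (hypothetical shape, not from the patent)."""
    title: str
    channel: str      # e.g. "7.1"
    start_hour: int   # inclusive, 24-hour clock
    end_hour: int     # exclusive


def configure_knowledge_base(programs):
    """Map lower-cased program titles to their guide entries."""
    kb = {}
    for p in programs:
        kb.setdefault(p.title.lower(), []).append(p)
    return kb


def interpret(text, kb, hour):
    """Return a tune instruction if the text names a program airing at `hour`.

    A plain substring match stands in for the NLU component here.
    """
    lowered = text.lower()
    for title, entries in kb.items():
        if title in lowered:
            for p in entries:
                if p.start_hour <= hour < p.end_hour:
                    return {"action": "tune", "channel": p.channel, "program": p.title}
    return None


if __name__ == "__main__":
    guide = [
        Program("Evening News", "7.1", 18, 19),
        Program("Nature Hour", "9.1", 20, 21),
    ]
    kb = configure_knowledge_base(guide)
    print(interpret("play the evening news", kb, hour=18))
```

In this sketch the text produced by speech recognition is matched only against titles that the first device can actually receive, which is the effect the claim attributes to configuring the knowledge database from the program data.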
Abstract
A system capable of generating and storing program data associated with localized content, such as over-the-air television broadcasts. The system may update a knowledge base with the program data to improve natural language understanding (NLU) processing associated with the localized content. For example, a voice enabled device may receive a voice command and the system may perform NLU processing to interpret the voice command, determine content based on the program data and send an instruction to an output device to output the content. The program data may include information about localized content such as channels detected by the output device and/or content broadcast on the channels, including information about an output device, a channel number, call sign, station name, signal strength, etc. associated with a detected channel, and/or a program name, genre, actors, directors, producers, etc. associated with content broadcast on the channel or stored in a local storage device.
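As a rough illustration of how the program data described in the abstract might feed a knowledge base, the Python sketch below groups channel and program attributes into per-slot vocabularies. The `build_gazetteer` function, the dictionary keys, and the sample values are assumptions made for this example and do not come from the patent.

```python
from collections import defaultdict


def build_gazetteer(channels, programs):
    """Collect entity values per slot type from detected channels and their programs.

    `channels` and `programs` are lists of plain dicts with hypothetical keys.
    """
    gazetteer = defaultdict(set)
    for ch in channels:
        gazetteer["channel_number"].add(ch["number"])
        gazetteer["call_sign"].add(ch["call_sign"].lower())
        gazetteer["station_name"].add(ch["station_name"].lower())
    for pr in programs:
        gazetteer["program_name"].add(pr["name"].lower())
        gazetteer["genre"].add(pr["genre"].lower())
    return dict(gazetteer)


if __name__ == "__main__":
    # Placeholder values, not real stations or programs.
    channels = [{"number": "7.1", "call_sign": "KAAA", "station_name": "Channel Seven"}]
    programs = [{"name": "Evening News", "genre": "News"}]
    print(build_gazetteer(channels, programs))
```

An NLU component could consult such per-slot vocabularies when deciding whether a word in a voice command is a channel number, a call sign, or a program name.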
24 Claims
1. A computer-implemented method, the method comprising:
receiving channel data that represents a plurality of terrestrial over-the-air broadcast television channels detectable by a first device associated with a user profile, the plurality of terrestrial over-the-air broadcast television channels including a first broadcast channel;
determining, using the channel data and a program guide data source, program data including:
a list of program titles that are broadcast on the plurality of terrestrial over-the-air broadcast television channels, and
a list of broadcast time periods corresponding to each of the program titles;
receiving first audio input data from a second device associated with the user profile;
performing speech recognition on the first audio input data to determine text data;
determining, using the user profile, that the second device is associated with the first device;
configuring, using information included in the program data, a knowledge database associated with the user profile, wherein the knowledge database is used by a natural language understanding (NLU) component to interpret the text data;
determining, using the text data, the NLU component, and the knowledge database, that the text data corresponds to a command to operate the first device to output a program received using the first broadcast channel; and
sending, to the first device, an instruction to tune to the first broadcast channel and output the program.
Dependent claims: 2, 3, 4
5. A computer-implemented method, comprising:
receiving first data indicating a plurality of content channels accessible via a first device associated with a user profile, the plurality of content channels accessible by detecting terrestrial over-the-air broadcasts and including a content channel;
determining, based on the first data, second data including information associated with the plurality of content channels;
associating the second data with the user profile;
operating a speech processing component based at least in part on the second data, wherein the speech processing component is associated with the user profile;
receiving audio data representing an utterance;
performing speech recognition on the audio data to determine text data;
determining that the audio data is associated with the first device;
performing natural language understanding, using the text data and the speech processing component, to determine that the text data corresponds to a command to operate the first device to output a first content item being broadcast on the content channel; and
sending a message to the first device to tune to the content channel.
Dependent claims: 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16
17. A system, comprising:
at least one processor;
memory including instructions operable to be executed by the at least one processor to perform a set of actions to cause the system to:
receive first data indicating a plurality of content channels accessible via a first device associated with a user profile, the plurality of content channels accessible by detecting over-the-air broadcasts and including a content channel;
determine, based on the first data, second data including information associated with the plurality of content channels;
associate the second data with the user profile;
receive audio data representing an utterance;
perform speech recognition on the audio data to determine text data;
determine that the audio data is associated with the first device;
perform natural language understanding, using the text data and the user profile, to determine that the text data corresponds to a command to operate the first device to output a first content item being broadcast on the content channel, wherein determining that the text data corresponds to the command further comprises:
determining that the text data includes a first word,
identifying the first word within the user profile,
determining that the first word is associated with the first content item,
determining a confidence score that the text data is associated with the first content item, wherein the confidence score is determined based at least in part on the first word being associated with the first content item, and
determining that the confidence score is above a threshold; and
send a message to the first device to tune to the content channel.
Dependent claims: 18, 19, 20, 21, 22, 23
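The word-matching and confidence-threshold steps of claim 17 can be sketched in Python as follows. The claim does not specify how the confidence score is computed, so the fraction-of-matching-words formula, the default threshold of 0.3, and the names `score_content_items` and `resolve` are assumptions for illustration only.

```python
def score_content_items(text, profile_index):
    """Score each content item by the fraction of words in `text` associated with it.

    `profile_index` maps a word to the set of content items it is associated
    with in the user profile (hypothetical structure).
    """
    words = text.lower().split()
    hits = {}
    for word in words:
        for item in profile_index.get(word, ()):
            hits[item] = hits.get(item, 0) + 1
    total = len(words) or 1  # avoid division by zero on empty input
    return {item: count / total for item, count in hits.items()}


def resolve(text, profile_index, threshold=0.3):
    """Return the best-scoring content item if its score is above the threshold."""
    scores = score_content_items(text, profile_index)
    if not scores:
        return None
    best = max(scores, key=scores.get)
    return best if scores[best] > threshold else None


if __name__ == "__main__":
    index = {
        "evening": {"Evening News"},
        "news": {"Evening News", "Morning News"},
    }
    print(resolve("play evening news", index))
```

With this scoring rule, a command that shares only one common word with several content items tends to fall below the threshold, so no tune message would be sent for it.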
24. A computer-implemented method, comprising:
receiving, from a first device associated with a user profile, first data indicating a plurality of content channels accessible via the first device by detecting over-the-air broadcasts, the plurality of content channels including a content channel;
receiving, from a second device, second data including information associated with a second plurality of content channels, the second plurality of content channels including the plurality of content channels;
determining, based on the first data, third data including information associated with the plurality of content channels, wherein the third data corresponds to a portion of the second data that is associated with the plurality of content channels;
associating the third data with the user profile;
operating a speech processing component based at least in part on the third data, wherein the speech processing component is associated with the user profile;
receiving audio data representing an utterance;
performing speech recognition on the audio data to determine text data;
determining that the audio data is associated with the first device;
performing natural language understanding, using the text data and the speech processing component, to determine that the text data corresponds to a command to operate the first device to output a first content item being broadcast on the content channel; and
sending a message to the first device to tune to the content channel.
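Claim 24 derives the third data as the portion of a broader guide (the second data) that covers only the channels the first device detects. A minimal Python sketch of that selection step, assuming the guide is keyed by channel identifier, might look like this; `select_guide_subset` and the sample values are hypothetical.

```python
def select_guide_subset(detected_channels, regional_guide):
    """Keep only guide entries for channels the first device reported as detectable.

    `regional_guide` maps a channel identifier to its listings (assumed shape).
    """
    detected = set(detected_channels)
    return {
        channel: listings
        for channel, listings in regional_guide.items()
        if channel in detected
    }


if __name__ == "__main__":
    # Placeholder guide data, not real listings.
    regional_guide = {
        "7.1": ["Evening News"],
        "9.1": ["Nature Hour"],
        "11.1": ["Late Movie"],
    }
    print(select_guide_subset(["7.1", "9.1"], regional_guide))
```

Associating only this subset with the user profile keeps the speech processing component from matching programs on channels the device cannot tune to.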
Specification