Identifying media content
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving (i) audio data that encodes a spoken natural language query, and (ii) environmental audio data, obtaining a transcription of the spoken natural language query, determining a particular content type associated with one or more keywords in the transcription, providing at least a portion of the environmental audio data to a content recognition engine, and identifying a content item that has been output by the content recognition engine, and that matches the particular content type.
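The flow in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the keyword table, the `ContentItem` type, the `score` field, and `stub_recognition_engine` are all hypothetical, and the transcription is assumed to have been produced already by a separate speech recognizer.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional

# Hypothetical keyword table; the source does not list specific keywords.
KEYWORD_TO_CONTENT_TYPE = {
    "movie": "movie",
    "film": "movie",
    "song": "music",
    "track": "music",
}


@dataclass(frozen=True)
class ContentItem:
    title: str
    content_type: str
    score: float  # hypothetical ranking score, not from the source


def content_type_for(transcription: str) -> Optional[str]:
    """Return the content type of the first known keyword, if any."""
    for word in transcription.lower().split():
        content_type = KEYWORD_TO_CONTENT_TYPE.get(word.strip("?.,!"))
        if content_type is not None:
            return content_type
    return None


def stub_recognition_engine(environmental_audio: bytes) -> List[ContentItem]:
    """Stand-in for a content recognition engine; returns canned candidates."""
    return [
        ContentItem("Example Theme Song", "music", 0.9),
        ContentItem("Example Movie", "movie", 0.8),
    ]


def identify_content(
    transcription: str,
    environmental_audio: bytes,
    engine: Callable[[bytes], List[ContentItem]] = stub_recognition_engine,
) -> Optional[ContentItem]:
    """Pick the engine output that matches the content type named in the query."""
    content_type = content_type_for(transcription)
    if content_type is None:
        return None
    candidates = engine(environmental_audio)
    matching = [c for c in candidates if c.content_type == content_type]
    # Among matching candidates, keep the best-scoring one (an assumed tie-break).
    return max(matching, key=lambda c: c.score, default=None)
```

With the canned candidates above, a query containing "movie" selects the movie item even though the music item has the higher score, because the content type filter is applied first.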
15 Claims
1. A computer-implemented method comprising:
receiving, by one or more processors, audio data that encodes (i) a spoken natural language query, and (ii) music;
determining, by the one or more processors, that one or more keywords in a transcription of the spoken natural language query are associated with a movie content type; and
based on determining that the one or more keywords in the transcription of the spoken natural query are associated with the movie content type, identifying, by the one or more processors, a movie content item that is recognized using the music.
Dependent claims: 2, 3, 4, 5

6. A computer-readable medium storing software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising:
receiving, by one or more processors, (i) an image or a video, and (ii) audio data that encodes a spoken natural language query;
determining, by the one or more processors, that one or more keywords in a transcription of the spoken natural language query are associated with a music content type; and
based on determining that the one or more keywords in the transcription of the spoken natural query are associated with the music content type, identifying, by the one or more processors, a music content item that is recognized using the image or the video.
Dependent claims: 7, 8, 9, 10

11. A system comprising:
one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising;
receiving, by one or more processors, audio data that encodes (i) a spoken natural language query and (ii) music;
determining, by the one or more processors, that one or more keywords in a transcription of the spoken natural language query are associated with a movie content type; and
based on determining that the one or more keywords in the transcription of the spoken natural query are associated with the movie content type, identifying, by the one or more processors, a movie content item that is recognized using the music.
Dependent claims: 12, 13, 14, 15
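
In the independent claims, the requested content type can differ from the kind of media that was captured: claims 1 and 11 identify a movie from music, and claim 6 identifies music from an image or a video. One way to picture this is a catalog that links a recognized media fingerprint to content items of several types. The sketch below is only an assumption for illustration; the claims do not say how the association is stored, and the `CATALOG` table, its fingerprint identifiers, and `identify_by_type` are hypothetical.

```python
from typing import Optional

# Hypothetical catalog: recognized media fingerprint -> content items by type.
CATALOG = {
    # Music captured in the audio data, linked to the movie it appears in.
    "soundtrack-fingerprint-1": {
        "movie": "Example Movie",
        "music": "Example Theme Song",
    },
    # An image (for example, album art), linked to a music item.
    "album-cover-fingerprint-1": {
        "music": "Example Album",
    },
}


def identify_by_type(fingerprint_id: str, content_type: str) -> Optional[str]:
    """Return the content item of the requested type for a recognized fingerprint."""
    return CATALOG.get(fingerprint_id, {}).get(content_type)
```

Here a lookup with the soundtrack fingerprint and the "movie" content type yields the movie title, while the album cover fingerprint has no movie entry and yields nothing for that type.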
Specification