Selecting from multiple content sources
Abstract
A speech-based system may be configured to receive and act upon spoken requests from a user. In some cases, a spoken request may ask the system to play content such as music without specifying from which of multiple available content sources the music is to be obtained. In response to such a request, the system analyzes feature scores for each of the content sources. The feature scores indicate usage characteristics of the different sources by a current user or groups of users. The feature scores for a particular source may be averaged or otherwise combined to create a composite score, and the source having the highest composite score is selected as the source of the requested content.
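
The selection rule described in the abstract can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the patented implementation: the source names, the feature names, and the score values are hypothetical, and a plain arithmetic mean stands in for the "averaged or otherwise combined" step.

```python
from statistics import mean


def composite_score(feature_scores):
    """Combine one source's feature scores by plain averaging (an assumed combiner)."""
    return mean(feature_scores.values())


def select_source(scores_by_source):
    """Return the name of the source whose composite score is highest."""
    return max(scores_by_source, key=lambda name: composite_score(scores_by_source[name]))


# Hypothetical feature scores, each in the range 0 to 1.
feature_scores_by_source = {
    "personal_library": {"current_user_usage": 0.9, "household_usage": 0.5},
    "streaming_service": {"current_user_usage": 0.4, "household_usage": 0.8},
    "internet_radio": {"current_user_usage": 0.1, "household_usage": 0.3},
}

selected = select_source(feature_scores_by_source)
print(selected)
```

With these made-up numbers, the personal library has the highest average and is chosen. A weighted mean could replace the plain mean without changing the rest of the sketch.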
19 Claims
1. A system, comprising:
one or more processors;
one or more computer-readable media storing computer-executable instructions that, when executed on the one or more processors, cause the one or more processors to perform acts comprising:
receiving an audio signal corresponding to a spoken request that expresses a play content intent, wherein the spoken request indicates a semantic role and an entity;
analyzing the audio signal to detect the semantic role and the entity;
identifying multiple content collections that have matching content for which the semantic role is filled by the entity;
identifying feature scores of each content collection, wherein an individual feature score of a particular content collection indicates an amount of usage of the content collection by one or more users;
calculating a composite score for each content collection based at least in part on the feature scores of the content collection;
selecting one of the multiple content collections by analyzing the calculated composite scores of the content collections;
receiving the matching content from the selected content collection; and
playing the matching content.
Dependent claims: 2, 3, 4, 5.

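
The step of identifying content collections "that have matching content for which the semantic role is filled by the entity" can be illustrated as follows. This is only a sketch: it assumes that speech recognition and language understanding have already produced the intent, semantic role, and entity, and it assumes each collection exposes a catalog of items keyed by role names such as "artist". The class, function, and collection names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ParsedRequest:
    """Assumed output of the audio analysis step."""
    intent: str          # e.g. "play_content"
    semantic_role: str   # e.g. "artist"
    entity: str          # e.g. "The Beatles"


def has_match(catalog, request):
    """True if some catalog item has the semantic role filled by the entity."""
    wanted = request.entity.casefold()
    return any(
        item.get(request.semantic_role, "").casefold() == wanted
        for item in catalog
    )


def matching_collections(collections, request):
    """Names of the collections that hold at least one matching item."""
    return [name for name, catalog in collections.items() if has_match(catalog, request)]


# Hypothetical catalogs for three collections.
collections = {
    "personal_library": [{"artist": "The Beatles", "album": "Abbey Road"}],
    "streaming_service": [
        {"artist": "the beatles", "album": "Revolver"},
        {"artist": "Miles Davis", "album": "Kind of Blue"},
    ],
    "internet_radio": [{"artist": "Miles Davis", "album": "Sketches of Spain"}],
}

request = ParsedRequest(intent="play_content", semantic_role="artist", entity="The Beatles")
candidates = matching_collections(collections, request)
print(candidates)
```

Only the collections returned here would go on to be scored. The case-insensitive comparison is a simplification chosen for the example; a real system would likely need fuzzier entity resolution.
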
6. A method comprising:
receiving, by one or more processors, an audio signal corresponding to a natural language request that expresses an intent, wherein the request indicates an entity that fills a semantic role of the intent;
analyzing, by the one or more processors, the audio signal to detect the semantic role and the entity;
identifying, by the one or more processors, multiple content sources that have matching content for which the semantic role is filled by the entity;
identifying, by the one or more processors, at least one feature score for each of the content sources, wherein the at least one feature score characterizes usage of the content sources by one or more users;
selecting, by the one or more processors, one of the content sources based at least in part on the at least one feature score; and
receiving, by the one or more processors, the matching content from the selected content source.
Dependent claims: 7, 8, 9, 10, 11, 12.

13. A method comprising:
receiving, by one or more processors, an audio signal corresponding to a natural language request from a current user, wherein the natural language request indicates requested content;
identifying, by the one or more processors, multiple sources that have the requested content;
identifying, by the one or more processors, at least one feature score for each of the sources, wherein the feature scores characterize at least one of (a) usage of the sources by one or more users; or (b) usage of the sources for providing the requested content; and
selecting, by the one or more processors, one of the sources based at least in part on the at least one feature score.
Dependent claims: 14, 15, 16, 17, 18, 19.

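
Claim 13 distinguishes two kinds of feature score: (a) usage of the sources by one or more users, and (b) usage of the sources for providing the requested content. The sketch below shows one way such scores might be derived from a playback log. The log format, the field names, and the choice to normalize each score as a share of matching events are all assumptions made for illustration; the claims do not prescribe how the scores are computed.

```python
from collections import Counter


def usage_share(events, predicate):
    """Per-source share of the events that satisfy the predicate."""
    counts = Counter(e["source"] for e in events if predicate(e))
    total = sum(counts.values())
    return {src: n / total for src, n in counts.items()} if total else {}


def feature_scores(events, sources, current_user, requested_artist):
    """Build (a) user-usage and (b) content-usage scores for each source."""
    by_user = usage_share(events, lambda e: e["user"] == current_user)
    for_content = usage_share(events, lambda e: e["artist"] == requested_artist)
    return {
        src: {
            "user_usage": by_user.get(src, 0.0),         # kind (a)
            "content_usage": for_content.get(src, 0.0),  # kind (b)
        }
        for src in sources
    }


# Hypothetical playback log.
events = [
    {"user": "alice", "source": "streaming_service", "artist": "The Beatles"},
    {"user": "alice", "source": "streaming_service", "artist": "Miles Davis"},
    {"user": "alice", "source": "personal_library", "artist": "The Beatles"},
    {"user": "bob", "source": "personal_library", "artist": "The Beatles"},
]

scores = feature_scores(
    events,
    sources=["personal_library", "streaming_service"],
    current_user="alice",
    requested_artist="The Beatles",
)
print(scores)
```

In this made-up log the current user favors the streaming service overall, while the requested artist has been played more often from the personal library, so the two kinds of score point in different directions and the selection depends on how they are combined.
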
Specification