Latency reduction for content playback
First Claim
1. A method, comprising:
receiving from a first user device, at an electronic device, first audio data representing a first utterance;
determining a first customer identifier associated with the first user device;
determining, using the first customer identifier, a user account on the electronic device, wherein the user account is associated with the first user device;
generating first text data representing the first audio data by executing speech-to-text functionality on the first audio data;
determining, using the first text data, that a first intent of the first utterance is for a song to be played;
determining a download history for the user account, the download history indicating content that has been downloaded from the electronic device by one or more devices associated with the user account;
determining, based on the download history, that first song audio data representing the song was previously downloaded to the first user device from the electronic device;
determining a first user device profile associated with the user account, the first user device profile being associated with the first user device and indicating content items that are currently stored by the first user device;
determining, from the first user device profile, that the first song audio data is stored in memory by the first user device;
generating a first instruction to cause the first user device to play the first song audio data;
sending the first instruction to the first user device;
receiving, at the electronic device, second audio data representing a second utterance that requests additional music to be played, the second audio data being received from the first user device;
generating second text data representing the second audio data by executing the speech-to-text functionality on the second audio data;
determining, using the second text data, that a second intent of the second utterance is for a new song to be played;
determining, based on the download history, that second song audio data representing the new song is not stored within the memory;
determining, based on the download history, that a second user device associated with the user account had previously downloaded the second song audio data;
determining that the first user device and the second user device are capable of communicating directly with each other using a direct communications link;
generating a second instruction that causes the first user device to request that the second user device send the second song audio data to the first user device using the direct communications link; and
sending the second instruction to the first user device.
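The decision flow recited in claim 1 can be sketched in code. This is a minimal illustration under stated assumptions, not the patented implementation: `DeviceProfile`, `UserAccount`, `resolve_playback`, and the instruction dictionaries are hypothetical names introduced here. The final `stream_from_backend` fallback is also an assumption, since claim 1 does not recite what happens when no peer device holds the content.

```python
from dataclasses import dataclass, field


@dataclass
class DeviceProfile:
    """Hypothetical per-device record kept by the backend."""
    device_id: str
    stored_content: set = field(default_factory=set)  # items currently in device memory
    direct_peers: set = field(default_factory=set)    # devices reachable over a direct link


@dataclass
class UserAccount:
    """Hypothetical account record: download history plus device profiles."""
    download_history: dict = field(default_factory=dict)  # device_id -> set of content ids
    profiles: dict = field(default_factory=dict)          # device_id -> DeviceProfile


def resolve_playback(account, requesting_id, content_id):
    """Return an instruction dict for the requesting device."""
    profile = account.profiles[requesting_id]

    # Case 1: the item was downloaded to this device and is still stored locally.
    was_downloaded = content_id in account.download_history.get(requesting_id, set())
    if was_downloaded and content_id in profile.stored_content:
        return {"action": "play_local", "device": requesting_id, "content": content_id}

    # Case 2: another device on the account downloaded it and a direct link exists.
    for peer_id, history in account.download_history.items():
        if peer_id == requesting_id or content_id not in history:
            continue
        if peer_id in profile.direct_peers:
            return {
                "action": "request_from_peer",
                "device": requesting_id,
                "peer": peer_id,
                "content": content_id,
            }

    # Case 3 (assumption, not recited in claim 1): fall back to the backend.
    return {"action": "stream_from_backend", "device": requesting_id, "content": content_id}
```

In this sketch the download history answers "which device ever fetched the item", while the device profile answers "is it still in memory", mirroring the two separate determinations in the claim.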
Abstract
Methods and devices for determining whether a local version of content is stored on an electronic device associated with a user account on a backend system are described herein. In a non-limiting embodiment, the backend system may track and monitor the content stored on the electronic device using the associated user account. If an individual speaks an utterance requesting a particular content item, the backend system may determine, prior to sending the content to the electronic device, whether a local version is stored within the electronic device's memory. If so, the backend system may instruct the electronic device to output the local version, thereby reducing the amount of bandwidth consumed. The backend system may further be capable of predictively generating and then caching certain audio data to the electronic device. For instance, frequent utterances may be tracked, and likely responses to those utterances may be generated prior to the utterance being spoken so that the response is available substantially instantaneously.
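The predictive caching mentioned at the end of the abstract might be approximated by counting utterances and generating a response ahead of time once an utterance becomes frequent. The sketch below is only illustrative: `PredictiveCache`, the `threshold` parameter, and the `generate_response` callback are hypothetical, and the source does not say how frequency is measured or how cached responses are refreshed.

```python
from collections import Counter


class PredictiveCache:
    """Hypothetical tracker that pre-generates responses for frequent utterances."""

    def __init__(self, generate_response, threshold=3):
        self._generate = generate_response  # callback: normalized text -> response data
        self._threshold = threshold         # assumed cutoff for "frequent"
        self._counts = Counter()
        self._cache = {}

    @staticmethod
    def _normalize(utterance_text):
        # Treat utterances that differ only in case or padding as the same request.
        return utterance_text.strip().lower()

    def record(self, utterance_text):
        """Count an utterance; generate and cache its response once it is frequent."""
        key = self._normalize(utterance_text)
        self._counts[key] += 1
        if self._counts[key] >= self._threshold and key not in self._cache:
            self._cache[key] = self._generate(key)

    def lookup(self, utterance_text):
        """Return the cached response, or None if nothing was generated yet."""
        return self._cache.get(self._normalize(utterance_text))
```

A later occurrence of the same utterance can then be served from `lookup` without generating the response again.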
181 Citations
17 Claims
4. A method, comprising:
receiving, from a first device, first audio data representing a first utterance;
determining a user account associated with the first device;
determining, based on first text data representing the first audio data, that a first intent of the first utterance is for first content to be output;
determining, for the user account, content information associated with at least the first device;
determining, based on the content information, that a first local version of the first content is stored on the first device;
generating a first instruction for the first local version to be output by the first device;
sending the first instruction to the first device;
receiving, from the first device, second audio data representing a second utterance;
determining, based on second text data representing the second audio data, that a second intent of the second utterance is for second content to be output;
determining that a second device is also associated with the user account;
determining, based on the content information, that a second local version of the second content is stored on the second device; and
determining that the second device and the first device are capable of communicating using at least one short-range communications protocol.
11. An electronic device, comprising:
communications circuitry operable to communicate with at least a first device;
memory; and
at least one processor operable to:
receive, from a first device, first audio data representing a first utterance;
determine a user account associated with the first device;
determine, based on first text data representing the first audio data, that a first intent of the first utterance is for first content to be output;
determine that a first local version of the first content is stored on the first device;
generate second text data representing a first response;
generate second audio data representing the second text data;
generate a first instruction for the first local version to be output by the first device;
send, using the communications circuitry, the first instruction and the second audio data to the first device such that the first local version is output after the second audio data;
receive, from the first device, second audio data representing a second utterance;
generate second text data from the second audio data by applying speech-to-text processing to the second audio data;
determine, based on the second text data, that a second intent of the second utterance is for second content to be output by the first device;
determine, from content information associated with at least the first device, that the first device does not include a second local version of the second content;
determine that there are no additional devices associated with the user account that are capable to send content to the first device using a short-range communications protocol;
generate a link between the first device and a remote device storing a third local version of the second content; and
send, using the communications circuitry, the link to the remote device such that the second content is output to the first device.
Specification