Asynchronous audio messaging
First Claim
1. A system comprising:
- one or more processors; and
one or more memories coupled to the one or more processors, the one or more memories including instructions that upon execution cause the system to;
receive a first audio signal that includes a representation of words spoken by a user;
generate a first text by performing speech recognition on the first audio signal;
analyze the first text to determine that the first audio signal includes a message to be sent to a recipient;
truncate the first text and the first audio signal to create a second text and a second audio signal as the message to be transmitted to the recipient, wherein truncation to create the second audio signal includes removing a representation of a first portion of the words spoken by the user while retaining a representation of a second portion of the words spoken by the user as the second audio signal;
determine a user profile associated with the user, wherein the user profile is one of a plurality of user profiles;
determine, from plurality of recipients different from the user, the recipient of the message based at least in part on the user profile;
determine an electronic address associated with the recipient of the message;
determine an expiration time associated with the second audio signal based at least in part on an analysis of the first audio signal;
transmit the second audio signal and the second text to the electronic address; and
transmit the expiration time to the electronic address as metadata associated with the second audio signal to update at least a portion of the second audio signal based at least in part on the expiration time of the second audio signal.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems, devices, and techniques may provide asynchronous audio messaging. Asynchronous audio messaging may enable a user to quickly and easily create and transmit a message to a recipient. The user may simply record a message for a recipient. The message may include an indication of the recipient of the message, an action (e.g., to send a message, etc.) and/or other types of information. A messaging module may modify the message to create a modified version of the message and then generate an additional version of the modified message in a different media type. The modified message and the addition version of the modified message may be transmitted to the recipient. In some embodiments, the messaging module may transmit other information such as location information, an expiration, or other information derived from the message to enhance the message.
19 Citations
22 Claims
-
1. A system comprising:
-
one or more processors; and one or more memories coupled to the one or more processors, the one or more memories including instructions that upon execution cause the system to; receive a first audio signal that includes a representation of words spoken by a user; generate a first text by performing speech recognition on the first audio signal; analyze the first text to determine that the first audio signal includes a message to be sent to a recipient; truncate the first text and the first audio signal to create a second text and a second audio signal as the message to be transmitted to the recipient, wherein truncation to create the second audio signal includes removing a representation of a first portion of the words spoken by the user while retaining a representation of a second portion of the words spoken by the user as the second audio signal; determine a user profile associated with the user, wherein the user profile is one of a plurality of user profiles; determine, from plurality of recipients different from the user, the recipient of the message based at least in part on the user profile; determine an electronic address associated with the recipient of the message; determine an expiration time associated with the second audio signal based at least in part on an analysis of the first audio signal; transmit the second audio signal and the second text to the electronic address; and transmit the expiration time to the electronic address as metadata associated with the second audio signal to update at least a portion of the second audio signal based at least in part on the expiration time of the second audio signal. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A computer-implemented method comprising:
- under control of one or more computing devices executing instructions,
receiving a first audio signal, at least a portion of the first audio signal representing words spoken by a user; performing at least one of automatic speech recognition or natural language understanding on the at least a portion of the first audio signal to determine first text based at least in part on the words spoken by the user; creating a second audio signal based at least in part on the first audio signal, at least a portion of the second audio signal representing audio corresponding to at least a portion of the words spoken by the user from the first audio signal; creating a message associated with the second audio signal, the message including second text corresponding to the at least the portion of the words spoken by the user; determining a user profile associated with the user, the user profile being one of a plurality of user profiles associated with a device located in a user environment; determining, from a plurality of recipients different from the user, a recipient based at least in part on the user profile; determining an expiration time associated with the second audio signal based at least in part on an analysis of the first audio signal; transmitting the second audio signal and the message to an address associated with the recipient; and transmitting the expiration time to the address as metadata associated with the second audio signal. - View Dependent Claims (7, 8, 9, 10, 11, 17, 18, 20, 21)
- under control of one or more computing devices executing instructions,
-
12. One or more non-transitory computer-readable media storing computer-executable instructions that, when executed by one or more processors, cause the one or more processors to:
-
receive a first audio signal that includes a representation of words spoken by a user; perform at least one of automatic speech recognition or natural language understanding on at least a portion of the first audio signal to determine first text based at least in part on the representation of the words spoken by the user; create a second audio signal based at least in part on the first audio signal, the second audio signal including a representation of audio corresponding to at least a portion of the words spoken by the user from the first audio signal; create a message associated with the second audio signal, the message including second text corresponding to the representation of the at least the portion of the words spoken by the user; determine a user profile associated with the user, wherein the user profile is one of a plurality of user profiles; determine, from a plurality of recipients different from the user, a recipient, wherein the recipient is based at least in part on the user profile; determine an address associated with the recipient; determine an expiration time of the second audio signal based at least in part on an analysis of the first audio signal; transmit the second audio signal and the message to the address associated with the recipient; and transmit the expiration time to the address as metadata associated with the second audio signal. - View Dependent Claims (13, 14, 15, 16, 19, 22)
-
Specification