Digital assistant providing whispered speech
First Claim
1. An electronic device, comprising:
one or more processors;
memory; and
one or more programs stored in memory, the one or more programs including instructions for:
receiving a speech input from a user;
determining, based on the speech input, that a whispered speech response is to be provided;
upon determining that a whispered speech response is to be provided, generating the whispered speech response, wherein generating the whispered speech response comprises:
generating text based on the speech input;
performing natural language processing of the text;
generating an intermediate speech based on a result of the natural language processing;
obtaining a residual signal based on a linear prediction analysis of the intermediate speech;
modifying the residual signal; and
obtaining the whispered speech response based on a linear prediction synthesis of the modified residual signal; and
providing the whispered speech response to the user.
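The last three generating steps of the claim (linear prediction analysis, residual modification, linear prediction synthesis) can be illustrated with a short sketch. This is not the patented implementation. The claim only says the residual is "modified"; replacing it with energy-matched white noise is an assumption made here for illustration, as are the function names, the filter order, and the choice to process the whole signal as one frame (a practical system would likely work frame by frame with overlap).

```python
import numpy as np

def lpc_coefficients(frame, order):
    """Autocorrelation-method linear prediction.

    Returns a[0..order] with a[0] = 1, the coefficients of the
    prediction-error filter A(z) = 1 + a1*z^-1 + ... + ap*z^-p.
    """
    n = len(frame)
    r = np.correlate(frame, frame, mode="full")[n - 1 : n + order]
    # Toeplitz autocorrelation matrix, lightly regularized so that a
    # silent frame does not produce a singular system.
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    R = R + (1e-9 * r[0] + 1e-12) * np.eye(order)
    a = np.linalg.solve(R, -r[1 : order + 1])
    return np.concatenate(([1.0], a))

def lp_residual(frame, a):
    """Analysis: e[n] = sum_k a[k] * x[n-k] (zero initial state)."""
    return np.convolve(frame, a)[: len(frame)]

def lp_synthesis(residual, a):
    """Synthesis: y[n] = e[n] - sum_{k>=1} a[k] * y[n-k]."""
    p = len(a) - 1
    y = np.zeros(len(residual))
    for n in range(len(residual)):
        acc = residual[n]
        for k in range(1, min(p, n) + 1):
            acc -= a[k] * y[n - k]
        y[n] = acc
    return y

def whisperize_frame(frame, order=12, rng=None):
    """Hypothetical residual modification: swap the (possibly voiced)
    residual for white noise with the same RMS, then resynthesize
    through the unchanged spectral envelope."""
    rng = np.random.default_rng() if rng is None else rng
    a = lpc_coefficients(frame, order)
    residual = lp_residual(frame, a)
    noise = rng.standard_normal(len(residual))
    rms = np.sqrt(np.mean(residual ** 2))
    noise *= rms / (np.sqrt(np.mean(noise ** 2)) + 1e-12)
    return lp_synthesis(noise, a)
```

In this sketch, passing the unmodified residual back through lp_synthesis reproduces the intermediate speech, so any whisper-like quality comes only from how the residual is changed; the spectral envelope captured by the prediction coefficients is left untouched.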
Abstract
Systems and processes for detecting and/or providing a whispered speech response are provided. In one example process, a speech input is received from a user, and it is determined, based on the speech input, that a whispered speech response is to be provided. Upon determining that a whispered speech response is to be provided, the whispered speech response is generated and provided to the user.
4370 Citations
54 Claims
1. An electronic device, comprising:
one or more processors;
memory; and
one or more programs stored in memory, the one or more programs including instructions for:
receiving a speech input from a user;
determining, based on the speech input, that a whispered speech response is to be provided;
upon determining that a whispered speech response is to be provided, generating the whispered speech response, wherein generating the whispered speech response comprises:
generating text based on the speech input;
performing natural language processing of the text;
generating an intermediate speech based on a result of the natural language processing;
obtaining a residual signal based on a linear prediction analysis of the intermediate speech;
modifying the residual signal; and
obtaining the whispered speech response based on a linear prediction synthesis of the modified residual signal; and
providing the whispered speech response to the user.
Dependent claims: 2-22
23. A non-transitory computer-readable storage medium storing one or more programs, the one or more programs comprising instructions, which when executed by one or more processors of an electronic device, cause the electronic device to:
receive a speech input from a user;
determine, based on the speech input, that a whispered speech response is to be provided;
upon determining that a whispered speech response is to be provided, generate the whispered speech response, wherein generating the whispered speech response comprises:
generating text based on the speech input;
performing natural language processing of the text;
generating an intermediate speech based on a result of the natural language processing;
obtaining a residual signal based on a linear prediction analysis of the intermediate speech;
modifying the residual signal; and
obtaining the whispered speech response based on a linear prediction synthesis of the modified residual signal; and
provide the whispered speech response to the user.
Dependent claims: 24-38
39. A method for operating a digital assistant, comprising:
at a user device with one or more processors and memory:
receiving a speech input from a user;
determining, based on the speech input, that a whispered speech response is to be provided;
upon determining that a whispered speech response is to be provided, generating the whispered speech response, wherein generating the whispered speech response comprises:
generating text based on the speech input;
performing natural language processing of the text;
generating an intermediate speech based on a result of the natural language processing;
obtaining a residual signal based on a linear prediction analysis of the intermediate speech;
modifying the residual signal; and
obtaining the whispered speech response based on a linear prediction synthesis of the modified residual signal; and
providing the whispered speech response to the user.
Dependent claims: 40-54
Specification