Personalized Speech Synthesis for Voice Actions
First Claim
1. A computer-implemented method comprising:
- receiving audio data encoding a voice query from a user;
obtaining, by an automated speech recognizer, a transcription of the voice query from the audio data, wherein the transcription includes a particular term;
determining, from the audio data, custom pronunciation data that reflects a user'"'"'s pronunciation for the particular term;
generating a spoken acknowledgment of the voice query, wherein the spoken acknowledgment includes the particular term, and wherein, when output, the particular term is spoken in accordance with the user'"'"'s pronunciation for the particular term based at least on the custom pronunciation data that was determined from the audio data;
providing the spoken acknowledgment for output; and
providing the voice query for execution.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for presenting notifications in an enterprise system. In one aspect, a method include actions of obtaining a template that defines (i) trigger criteria for presenting a notification type and (ii) content rules for determining content to include in a notification of the notification type. Additional actions include accessing enterprise resources of an enterprise, the enterprise resources including data describing entities related to the enterprise and relationships among the entities. Further actions include, accessing user information specific to a user and determining that the trigger criteria is satisfied by the enterprise resources and the user information. Additional actions include generating a particular notification of the notification type based at least on the content rules and providing the particular notification to the user.
29 Citations
22 Claims
-
1. A computer-implemented method comprising:
-
receiving audio data encoding a voice query from a user; obtaining, by an automated speech recognizer, a transcription of the voice query from the audio data, wherein the transcription includes a particular term; determining, from the audio data, custom pronunciation data that reflects a user'"'"'s pronunciation for the particular term; generating a spoken acknowledgment of the voice query, wherein the spoken acknowledgment includes the particular term, and wherein, when output, the particular term is spoken in accordance with the user'"'"'s pronunciation for the particular term based at least on the custom pronunciation data that was determined from the audio data; providing the spoken acknowledgment for output; and providing the voice query for execution. - View Dependent Claims (2, 3, 4, 5, 6, 7, 12, 21, 22)
-
-
8. A system comprising:
-
one or more computers; and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; receiving audio data encoding a voice query from a user; obtaining, by an automated speech recognizer, a transcription of the voice query from the audio data, wherein the transcription includes a particular term; determining, from the audio data, custom pronunciation data that reflects a user'"'"'s pronunciation for the particular term; generating a spoken acknowledgment of the voice query, wherein the spoken acknowledgment includes the particular term, and wherein, when output, the particular term is spoken in accordance with the user'"'"'s pronunciation for the particular term based at least on the custom pronunciation data that was determined from the audio data; providing the spoken acknowledgment for output; and providing the voice query for execution. - View Dependent Claims (9, 10, 11, 13, 14)
-
-
15. A non-transitory computer-readable medium storing instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising:
-
receiving audio data encoding a voice query from a user; obtaining, by an automated speech recognizer, a transcription of the voice query from the audio data, wherein the transcription includes a particular term; determining, from the audio data, custom pronunciation data that reflects a user'"'"'s pronunciation for the particular term; generating a spoken acknowledgment of the voice query, wherein the spoken acknowledgment includes the particular term, and wherein, when output, the particular term is spoken in accordance with the user'"'"'s pronunciation for the particular term based at least on the custom pronunciation data that was determined from the audio data; providing the spoken acknowledgment for output; and providing the voice query for execution. - View Dependent Claims (16, 17, 20)
-
-
18. (canceled)
-
19. (canceled)
Specification