Automated speech pronunciation attribution
First Claim
1. A computer-implemented method comprising:
- receiving, by a digital assistant device that stores multiple user profiles that are each associated with a respective one of multiple users, a voice command of a particular one of the multiple users, wherein the voice command includes a particular term that, among the multiple users, is pronounced uniquely by the particular one of the multiple users, and wherein each user profile stored by the digital assistant device specifies pronunciation data for terms that the respective user pronounces uniquely;
matching the voice command to a particular user profile among the multiple stored user profiles that are stored by the digital assistant;
generating, by the digital assistant device, an acknowledgment of the voice command, wherein the acknowledgement includes the particular term and pronunciation data that was stored in the matched, particular user profile and that reflects the unique pronunciation of the particular term by the particular one of the multiple users; and
providing, for output by a speech synthesizer of the digital assistant device, a spoken representation of the acknowledgment, wherein the spoken representation of the acknowledgement of the voice command includes the particular term as uniquely pronounced by the particular one of the multiple users.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus for determining candidate user profiles as being associated with a shared device, and identifying, from the candidate user profiles, candidate pronunciation attributes associated with at least one of the candidate user profiles determined to be associated with the shared device. The methods, systems, and apparatus are also for receiving, at the shared device, a spoken utterance; determining a received pronunciation attribute based on received audio data corresponding to the spoken utterance; comparing the received pronunciation attribute to at least one of the candidate pronunciation attributes; and selecting a particular pronunciation attribute from the candidate pronunciation attributes based on a result of the comparison of the received pronunciation attribute to at least one of the candidate pronunciation attributes. With the methods, systems, and apparatus, the particular pronunciation attribute, selected from the candidate pronunciation attributes, is provided for outputting audio associated with the spoken utterance.
-
Citations
17 Claims
-
1. A computer-implemented method comprising:
-
receiving, by a digital assistant device that stores multiple user profiles that are each associated with a respective one of multiple users, a voice command of a particular one of the multiple users, wherein the voice command includes a particular term that, among the multiple users, is pronounced uniquely by the particular one of the multiple users, and wherein each user profile stored by the digital assistant device specifies pronunciation data for terms that the respective user pronounces uniquely; matching the voice command to a particular user profile among the multiple stored user profiles that are stored by the digital assistant; generating, by the digital assistant device, an acknowledgment of the voice command, wherein the acknowledgement includes the particular term and pronunciation data that was stored in the matched, particular user profile and that reflects the unique pronunciation of the particular term by the particular one of the multiple users; and providing, for output by a speech synthesizer of the digital assistant device, a spoken representation of the acknowledgment, wherein the spoken representation of the acknowledgement of the voice command includes the particular term as uniquely pronounced by the particular one of the multiple users. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A system comprising one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising:
-
receiving, by a digital assistant device that stores multiple user profiles that are each associated with a respective one of multiple users, a voice command of a particular one of the multiple users, wherein the voice command includes a particular term that, among the multiple users, is pronounced uniquely by the particular one of the multiple users, and wherein each user profile stored by the digital assistant device specifies pronunciation data for terms that the respective user pronounces uniquely; matching the voice command to a particular user profile among the multiple stored user profiles that are stored by the digital assistant; generating, by the digital assistant device, an acknowledgment of the voice command, wherein the acknowledgement includes the particular term and pronunciation data that was stored in the matched, particular user profile and that reflects the unique pronunciation of the particular term by the particular one of the multiple users; and providing, for output by a speech synthesizer of the digital assistant device, a spoken representation of the acknowledgment, wherein the spoken representation of the acknowledgement of the voice command includes the particular term as uniquely pronounced by the particular one of the multiple users. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A computer-readable storage device storing instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising:
-
receiving, by a digital assistant device that stores multiple user profiles that are each associated with a respective one of multiple users, a voice command of a particular one of the multiple users, wherein the voice command includes a particular term that, among the multiple users, is pronounced uniquely by the particular one of the multiple users, and wherein each user profile stored by the digital assistant device specifies pronunciation data for terms that the respective user pronounces uniquely; matching the voice command to a particular user profile among the multiple stored user profiles that are stored by the digital assistant; generating, by the digital assistant device, an acknowledgment of the voice command, wherein the acknowledgement includes the particular term and pronunciation data that was stored in the matched, particular user profile and that reflects the unique pronunciation of the particular term by the particular one of the multiple users; and providing, for output by a speech synthesizer of the digital assistant device, a spoken representation of the acknowledgment, wherein the spoken representation of the acknowledgement of the voice command includes the particular term as uniquely pronounced by the particular one of the multiple users. - View Dependent Claims (14, 15, 16, 17)
-
Specification