Training punctuation models
First Claim
1. A computer-implemented method comprising:
receiving audio data encoding an utterance of a user;
obtaining, by an automated speech recognizer, a transcription of the utterance of the user;
determining, by one or more processors, that one or more punctuation rules dictate that a punctuation mark may optionally be placed at a particular location within the transcription;
identifying, by the one or more processors, text that (i) was written by the user, (ii) is similar to the transcription, and (iii) includes a location that corresponds to the particular location within the transcription;
determining, by the one or more processors, whether the user placed the punctuation mark at the corresponding location within the text that was written by the user, and that is similar to the transcription; and
determining, by the one or more processors, whether to place or not place the punctuation mark at the particular location within the transcription of the utterance, according to the one or more punctuation rules, based on determining whether the user placed or did not place the punctuation mark at the corresponding location within the text that was written by the user, and that is similar to the transcription;
creating an electronic communication using the punctuated transcription, wherein the electronic communication is a type of electronic communication selected from the group consisting of an electronic mail message, a SMS text message, a social media message or post, or a word processing document; and
transmitting, by a client computing device, the electronic communication to a communication recipient.
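As an illustration only, the decision steps recited in claim 1 can be sketched for a single optional mark: the serial comma before "and" in a three-item list. The rule, the regular expression, and the function names (`find_optional_comma`, `user_placed_comma`, `punctuate`) are hypothetical and are not taken from the patent. The sketch also assumes that the similar user-written text has already been identified and that any mandatory commas are already present in the transcription.

```python
import re

# Hypothetical punctuation rule: in "A, B and C" the comma before "and" is optional.
# The pattern matches "<word>, <word> and <word>" with or without the serial comma.
SERIAL = re.compile(r"(\w+), (\w+)(,?) and (\w+)")

def find_optional_comma(text):
    """Return the index just after the second list item, or None if the rule does not apply."""
    m = SERIAL.search(text)
    return m.end(2) if m else None

def user_placed_comma(user_text):
    """Return True/False if the user's text has/lacks the serial comma, None if there is no such location."""
    m = SERIAL.search(user_text)
    if m is None:
        return None
    return m.group(3) == ","

def punctuate(transcription, user_text):
    """Place or omit the optional comma in the transcription, following the user's own text."""
    pos = find_optional_comma(transcription)
    placed = user_placed_comma(user_text)
    if pos is None or placed is None:
        return transcription  # no optional location, or no evidence from the user
    has_comma = transcription[pos:pos + 1] == ","
    if placed and not has_comma:
        return transcription[:pos] + "," + transcription[pos:]
    if not placed and has_comma:
        return transcription[:pos] + transcription[pos + 1:]
    return transcription

print(punctuate("bring eggs, milk and bread", "I like red, white, and blue"))
```

When the user's text gives no evidence either way, the sketch leaves the transcription unchanged rather than guessing.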
2 Assignments
0 Petitions
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for customizing the punctuation style of a transcription. A method includes receiving an utterance from a user, obtaining an unpunctuated transcription of the utterance, identifying an instance within the unpunctuated transcription where a punctuation mark may be placed, identifying, using data associated with the user, one or more past instances that are similar to the identified instance, punctuating the unpunctuated transcription based at least on the one or more past instances, and presenting the punctuated transcription to the user.
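The abstract's step of punctuating "based at least on the one or more past instances" could be approximated as a vote over similar past instances. The following is a minimal sketch under stated assumptions: the word-level similarity measure (Python's `difflib.SequenceMatcher`), the 0.6 threshold, the majority-vote rule, and the names `similarity` and `decide_from_history` are all illustrative choices, not details disclosed in the source.

```python
from difflib import SequenceMatcher

def similarity(a, b):
    """Return a ratio in [0, 1] comparing two strings as lowercase word sequences."""
    return SequenceMatcher(None, a.lower().split(), b.lower().split()).ratio()

def decide_from_history(context, past_instances, threshold=0.6):
    """Decide whether to place an optional mark.

    past_instances is a list of (past_context, placed) pairs, where placed records
    whether the user put the mark at the corresponding location in past text.
    Returns True to place the mark, False to omit it, or None when no past
    instance is similar enough to the current context.
    """
    votes = [placed for past_context, placed in past_instances
             if similarity(context, past_context) >= threshold]
    if not votes:
        return None
    # Strict majority places the mark; a tie omits it (an arbitrary choice).
    return sum(votes) * 2 > len(votes)

# Hypothetical history: did the user write a comma after the greeting word?
history = [
    ("hi mary how are you", True),
    ("hi bob how are you", True),
    ("see you at the meeting tomorrow", False),
]
print(decide_from_history("hi john how are you", history))
```

Returning `None` when nothing in the history is similar lets a caller fall back to a default punctuation style instead of forcing a decision.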
45 Citations
17 Claims
1. A computer-implemented method comprising:
receiving audio data encoding an utterance of a user;
obtaining, by an automated speech recognizer, a transcription of the utterance of the user;
determining, by one or more processors, that one or more punctuation rules dictate that a punctuation mark may optionally be placed at a particular location within the transcription;
identifying, by the one or more processors, text that (i) was written by the user, (ii) is similar to the transcription, and (iii) includes a location that corresponds to the particular location within the transcription;
determining, by the one or more processors, whether the user placed the punctuation mark at the corresponding location within the text that was written by the user, and that is similar to the transcription; and
determining, by the one or more processors, whether to place or not place the punctuation mark at the particular location within the transcription of the utterance, according to the one or more punctuation rules, based on determining whether the user placed or did not place the punctuation mark at the corresponding location within the text that was written by the user, and that is similar to the transcription;
creating an electronic communication using the punctuated transcription, wherein the electronic communication is a type of electronic communication selected from the group consisting of an electronic mail message, a SMS text message, a social media message or post, or a word processing document; and
transmitting, by a client computing device, the electronic communication to a communication recipient.
Dependent claims: 2, 3, 4, 5, 6
7. A non-transitory computer storage medium encoded with a computer program, the program comprising instructions that when executed by a data processing apparatus cause the data processing apparatus to perform operations comprising:
receiving audio data encoding an utterance of a user;
obtaining, by an automated speech recognizer, a transcription of the utterance of the user;
determining, by one or more processors, that one or more punctuation rules dictate that a punctuation mark may optionally be placed at a particular location within the transcription;
identifying, by the one or more processors, text that (i) was written by the user, (ii) is similar to the transcription, and (iii) includes a location that corresponds to the particular location within the transcription;
determining, by the one or more processors, whether the user placed the punctuation mark at the corresponding location within the text that was written by the user, and that is similar to the transcription;
determining, by the one or more processors, whether to place or not place the punctuation mark at the particular location within the transcription of the utterance, according to the one or more punctuation rules, based on determining whether the user placed or did not place the punctuation mark at the corresponding location within the text that was written by the user, and that is similar to the transcription;
creating an electronic communication using the punctuated transcription, wherein the electronic communication is a type of electronic communication selected from the group consisting of an electronic mail message, a SMS text message, a social media message or post, or a word processing document; and
transmitting, by a client computing device, the electronic communication to a communication recipient.
Dependent claims: 8, 9, 10, 11
12. A system comprising:
one or more computers and one or more non-transitory storage devices storing instructions that are operable, when executed by one or more computers, to cause the one or more computers to perform operations comprising:
receiving audio data encoding an utterance of a user;
obtaining, by an automated speech recognizer, a transcription of the utterance of the user;
determining, by one or more processors, that one or more punctuation rules dictate that a punctuation mark may optionally be placed at a particular location within the transcription;
identifying, by the one or more processors, text that (i) was written by the user, (ii) is similar to the transcription, and (iii) includes a location that corresponds to the particular location within the transcription;
determining, by the one or more processors, whether the user placed the punctuation mark at the corresponding location within the text that was written by the user, and that is similar to the transcription; and
determining, by the one or more processors, whether to place or not place the punctuation mark at the particular location within the transcription of the utterance, according to the one or more punctuation rules, based on determining whether the user placed or did not place the punctuation mark at the corresponding location within the text that was written by the user, and that is similar to the transcription;
creating an electronic communication using the punctuated transcription, wherein the electronic communication is a type of electronic communication selected from the group consisting of an electronic mail message, a SMS text message, a social media message or post, or a word processing document; and
transmitting, by a client computing device, the electronic communication to a communication recipient.
Dependent claims: 13, 14, 15, 16, 17
Specification