LEARNING PERSONALIZED ENTITY PRONUNCIATIONS

  • US 20170221475A1
  • Filed: 02/03/2016
  • Published: 08/03/2017
  • Est. Priority Date: 02/03/2016
  • Status: Active Grant
First Claim

1. A method comprising:

    receiving audio data corresponding to an utterance that includes a voice command trigger term and an entity name that is a proper noun;

    generating, by an automated speech recognizer, an initial transcription that (i) corresponds to a portion of the audio data that is associated with the entity name that is a proper noun, and (ii) includes a transcription of a mispronounced term that is associated with a pronunciation of a term that is not a proper noun;

    in response to the generation of the initial transcription that includes a transcription of a mispronounced term that is associated with a pronunciation of a term that is not a proper noun, prompting a user for feedback, wherein prompting the user for feedback comprises:

    providing, for output, a representation of the initial transcription that (i) corresponds to the portion of the audio data that is associated with the entity name that is a proper noun, and (ii) includes the transcription of the mispronounced term that is associated with a pronunciation of a term that is not a proper noun;

    receiving a corrected transcription in which a manually selected term that is a proper noun is substituted for the transcription of the mispronounced term that is associated with a pronunciation of a term that is not a proper noun;

    in response to receiving the corrected transcription in which a manually selected term that is a proper noun is substituted for the transcription of the mispronounced term that is associated with a pronunciation of a term that is not a proper noun, obtaining a phonetic representation that is associated with the portion of the received audio data that is associated with the entity name that is a proper noun;

    updating a pronunciation dictionary to associate (i) the obtained phonetic representation that is associated with the portion of the received audio data that is associated with the entity name that is a proper noun with (ii) the entity name from the utterance that is a proper noun;

    receiving a subsequent utterance that includes the entity name; and

    transcribing the subsequent utterance based at least in part on the updated pronunciation dictionary.
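
The flow recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: the names `DEFAULT_LEXICON`, `PronunciationDictionary`, `transcribe_entity`, and `handle_correction` are hypothetical, the phoneme strings and the example names ("fortune", "Fuchun") are made up for illustration, and the "recognizer" is a plain dictionary lookup standing in for an automated speech recognizer.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional

# Hypothetical default lexicon: phoneme string -> common (non-proper-noun) word.
# The phoneme strings are illustrative, not output from a real recognizer.
DEFAULT_LEXICON: Dict[str, str] = {
    "f ao r ch ah n": "fortune",
    "k ao l": "call",
}


@dataclass
class PronunciationDictionary:
    """Per-user mapping from a phonetic representation to an entity name."""
    entries: Dict[str, str] = field(default_factory=dict)

    def update(self, phonemes: str, entity_name: str) -> None:
        # Associate the phonetic representation with the corrected entity name.
        self.entries[phonemes] = entity_name

    def lookup(self, phonemes: str) -> Optional[str]:
        return self.entries.get(phonemes)


def transcribe_entity(phonemes: str, personal: PronunciationDictionary) -> str:
    """Stand-in recognizer: prefer personalized entries, then the default lexicon."""
    return personal.lookup(phonemes) or DEFAULT_LEXICON.get(phonemes, "<unk>")


def handle_correction(phonemes: str, initial: str, corrected: str,
                      personal: PronunciationDictionary) -> None:
    """If the user substituted a different term, learn that pronunciation."""
    if corrected != initial:
        personal.update(phonemes, corrected)


# Usage: first utterance, user feedback, then a subsequent utterance.
personal = PronunciationDictionary()
heard = "f ao r ch ah n"                        # entity-name portion of the audio
initial = transcribe_entity(heard, personal)    # recognizer picks the common word
handle_correction(heard, initial, "Fuchun", personal)  # user selects the proper noun
later = transcribe_entity(heard, personal)      # now resolved via the updated dictionary
print(initial, "->", later)
```

Keying the personalized dictionary on the phonetic representation, and consulting it before the default lexicon, is one simple way to let a learned entity pronunciation take precedence in later transcriptions; a real system would more likely bias a recognizer's scoring than use an exact-match lookup.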
