Inverse text normalization for automatic speech recognition

  • US 10,592,604 B2
  • Filed: 06/29/2018
  • Issued: 03/17/2020
  • Est. Priority Date: 03/12/2018
  • Status: Active Grant
  • ×
    • Pin Icon | RPX Insight
    • Pin
First Claim
Patent Images

1. A non-transitory computer-readable storage medium storing one or more programs configured to be executed by one or more processors of an electronic device, the one or more programs including instructions for:

  • receiving speech input;

    generating a spoken-form text representation of the speech input, the spoken-form text representation comprising a token sequence;

    determining a feature representation for the spoken-form text representation;

    determining, based on the feature representation, a sequence of labels assigned to the token sequence, the sequence of labels specifying a plurality of edit operations to perform on the token sequence, wherein each edit operation of the plurality of edit operations corresponds to one of a plurality of predetermined types of edit operations;

    generating a written-form text representation of the speech input by applying the plurality of edit operations to the token sequence in accordance with the sequence of labels; and

    performing, using the generated written-form text representation, a task responsive to the speech input.

View all claims