×

Semiotic class normalization

  • US 10,210,153 B1
  • Filed: 12/05/2017
  • Issued: 02/19/2019
  • Est. Priority Date: 05/26/2016
  • Status: Active Grant
First Claim
Patent Images

1. A system, comprising:

  • a data processing apparatus; and

    a non-transitory computer readable storage medium in data communication with the data processing apparatus storing instructions executable by the data processing apparatus and that upon such execution causes the data processing apparatus to perform operations comprising;

    building a semiotic class text normalization system, the building comprising;

    identifying multiple possible verbalizations for a string, wherein the string includes one or more instances of members of one or more semiotic classes;

    generating, for each possible verbalization for the string, a verbalization score according to a scoring function, wherein;

    the scoring function comprises a scoring model that is trained using written expressions of instances of members of semiotic classes and corresponding spoken words for each written expression; and

    the written expressions of instances of members of semiotic classes are generated from the spoken words by providing the spoken words as inputs to an inverse of a verbalization transducer; and

    selecting one of the possible verbalizations as a selected verbalization for the string based on the respective verbalization scores.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×