×

Method and system for adding punctuation to voice files

  • US 9,442,910 B2
  • Filed: 03/19/2014
  • Issued: 09/13/2016
  • Est. Priority Date: 05/24/2013
  • Status: Active Grant
First Claim
Patent Images

1. An improved method for adding punctuations to a voice file, comprising:

  • executing by a processor, program codes stored in a memory to configure a computing device to add punctuations to a voice file, comprising performing the following steps;

    utilizing silence or pause duration detection to divide the voice file into a plurality of speech segments for processing, wherein respective speech segments form respective sentences within the voice file, and each respective sentence of the voice file comprising a plurality of features units, wherein each feature unit comprises a single term or multi-terms expression having semantic features corresponding to the single term or multi-terms expression;

    identifying the plurality of features units that appear in the voice file according to every term or expression, and according to the semantic features corresponding to the every single term or multi-terms expression that form each of the plurality of speech segments, the semantic features comprising a word attribute and a composition within each respective sentence and wherein identifying the plurality of feature units is based on taking the respective location of each term as the current reference location, determine a single term whose relative location relationship with the current reference location comprises the semantic features of the single term feature or expression template according to the single term feature template and further continuing the identifying for multi-terms expression comprising the term based on each of the identified feature units;

    assigning a corresponding weight to each punctuation mode which is associated to the single term or multi-terms expression in each respective identified feature unit, wherein a punctuation mode being either no punctuation used or a particular punctuation being used in the single term or multi-terms expression;

    using a linguistic model to determine a maximum sum of weight as ultimate punctuation modes for the respective speech segments which form the respective sentences within the voice file, wherein a sum of weight is determined by summing all corresponding weights on occurrences of each of various possible punctuation modes in the voice file and according to all the respective identified feature units, wherein the linguistic model is built upon the semantic features of parsed out various single terms or multi-terms expressions from a body text of a spoken sentence according to a language library;

    adding respective punctuations to form respective punctuated sentences within the voice file based on the determined maximum sum of weight of the various punctuation modes; and

    transcribing the voice file with the added respective punctuations to output the punctuated sentences as text.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×