METHOD AND SYSTEM FOR ADDING PUNCTUATION TO VOICE FILES
First Claim
1. A method for adding punctuations to a voice file, comprising:
- utilizing silence or pause duration detection to divide a voice file into a plurality of speech segments for processing, the voice file comprising a plurality of features units;
identifying all features units that appear in the voice file according to every term or expression and semantics features of the every term or expression that form each of the plurality of speech segments;
using a linguistic model to determine a sum of weight of various punctuation modes in the voice file according to all the feature units, wherein the linguistic model is built upon semantics features of various parsed out terms or expressions from a body text of a spoken sentence according to a language library; and
adding punctuations to the voice file based on the determined sum of weight of the various punctuation modes.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and system for adding punctuation to a voice file is disclosed. The method includes: utilizing silence or pause duration detection to divide a voice file into a plurality of speech segments for processing, the voice file includes a plurality of features units; identifying all features units that appear in the voice file according to every term or expression and semantics features of the every term or expression that form each of the plurality of speech segments; using a linguistic model to determine a sum of weight of various punctuation modes in the voice file according to all the feature units, the linguistic model is built upon semantics features of various parsed out terms or expressions from a body text of a spoken sentence according to a language library; and adding punctuations to the voice file based on the determined sum of weight of the various punctuation modes.
-
Citations
20 Claims
-
1. A method for adding punctuations to a voice file, comprising:
-
utilizing silence or pause duration detection to divide a voice file into a plurality of speech segments for processing, the voice file comprising a plurality of features units; identifying all features units that appear in the voice file according to every term or expression and semantics features of the every term or expression that form each of the plurality of speech segments; using a linguistic model to determine a sum of weight of various punctuation modes in the voice file according to all the feature units, wherein the linguistic model is built upon semantics features of various parsed out terms or expressions from a body text of a spoken sentence according to a language library; and adding punctuations to the voice file based on the determined sum of weight of the various punctuation modes. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system for adding punctuation to a voice file comprises at least a processor working in conjunction with a memory and a plurality of modules, wherein the modules comprise at least:
-
a silence or pause duration detection module, an identification module and a punctuation addition module, wherein; the silence or pause duration detection module divides a voice file to be processed into a plurality of speech segments to be processed based on silence or pause detection; the identification module identifies all features units that appear in the voice file according to every term or expression and semantics features of the every term or expression that form each of the plurality of speech segments; the punctuation addition module uses a linguistic model to determine a sum of weight of various punctuation modes in the voice file according to all the feature units, wherein the linguistic model is built upon semantics features of various parsed out terms or expressions from a body text of a spoken sentence according to a language library; and
adding punctuations to the voice file based on the determined sum of weight of the various punctuation modes. - View Dependent Claims (9, 10, 11, 12, 13)
-
-
14. A non-transitory computer-readable medium having stored thereon, a computer program having at least one code section being executable by a machine for causing the machine to perform steps comprising:
-
utilizing silence or pause duration detection to divide a voice file into a plurality of speech segments for processing, the voice file comprising a plurality of features units; identifying all features units that appear in the voice file according to every term or expression and semantics features of the every term or expression that form each of the plurality of speech segments; using a linguistic model to determine a sum of weight of various punctuation modes in the voice file according to all the feature units, wherein the linguistic model is built upon semantics features of various parsed out terms or expressions from a body text of a spoken sentence according to a language library; and adding punctuations to the voice file based on the determined sum of weight of the various punctuation modes. - View Dependent Claims (15, 16, 17, 18, 19, 20)
-
Specification