Method and system for endpoint automatic detection of audio record
First Claim
1. A method for detecting an endpoint of an audio record, comprising presetting a mute duration threshold as a first time threshold;
- wherein the method further comprises;
obtaining an audio record text;
determining an acoustic model for a text endpoint of the audio record text; and
obtaining each frame of audio record data in turn starting from an audio record start frame of the audio record data;
determining a characteristics acoustic model of a decoding optimal path for an obtained current frame of the audio record data; and
determining that the characteristics acoustic model of the decoding optimal path for the current frame of the audio record data is the same as the acoustic model for the endpoint;
updating the mute duration threshold to a second time threshold, wherein the second time threshold is smaller than the first time threshold.
2 Assignments
0 Petitions
Accused Products
Abstract
A method and system for endpoint automatic detection of audio record is provided. The method comprises the following steps: acquiring a audio record text and affirming the text endpoint acoustic model for the audio record text; starting acquiring the audio record data of each frame in turn from the audio record start frame in the audio record data; affirming the characteristics acoustic model of the decoding optimal path for the acquired current frame of the audio record data; comparing the characteristics acoustic model of the decoding optimal path acquired from the current frame of the audio record data with the endpoint acoustic model to determine if they are the same; if yes, updating a mute duration threshold with a second time threshold, wherein the second time threshold is less than a first time threshold. This method can improve the recognizing efficiency of the audio record endpoint.
22 Citations
14 Claims
-
1. A method for detecting an endpoint of an audio record, comprising presetting a mute duration threshold as a first time threshold;
- wherein the method further comprises;
obtaining an audio record text;
determining an acoustic model for a text endpoint of the audio record text; and
obtaining each frame of audio record data in turn starting from an audio record start frame of the audio record data;determining a characteristics acoustic model of a decoding optimal path for an obtained current frame of the audio record data; and determining that the characteristics acoustic model of the decoding optimal path for the current frame of the audio record data is the same as the acoustic model for the endpoint; updating the mute duration threshold to a second time threshold, wherein the second time threshold is smaller than the first time threshold. - View Dependent Claims (2, 3, 4, 5, 6, 7)
- wherein the method further comprises;
-
8. A system for detecting an endpoint of an audio record, wherein a mute duration threshold is preset as a first time threshold;
- and the system further comprises;
a first determining unit adapted to obtain an audio record text and determine an acoustic model for a text endpoint of the audio record text; a first obtaining unit adapted to obtain each frame of audio record data in turn starting from an audio record start frame of the audio record data; a second determining unit adapted to determine a characteristics acoustic model of a decoding optimal path for an obtained current frame of the audio record data; and a threshold determining unit adapted to update the mute duration threshold to the second time threshold if it is determined that the characteristics acoustic model of the decoding optimal path for the current frame of the audio record data is the same as an acoustic model for the endpoint, wherein the second time threshold is smaller than the first time threshold. - View Dependent Claims (9, 10, 11, 12, 13, 14)
- and the system further comprises;
Specification