System and Method of Text Zoning
First Claim
Patent Images
1. A method of zoning a transcription of audio data, the method comprising:
- separating the transcription of audio data into a plurality of utterances;
identifying utterances of the plurality of utterances that are shorter than a predetermined minimum threshold as meaning units;
calculating a probability that each word in an utterance of the plurality of utterances which is longer than the predetermined minimum threshold is a meaning unit boundary;
splitting the utterance longer than the predetermined minimum threshold into two new utterances at a word with a maximum calculated probability; and
identifying at least one of the two utterances that is shorter than a maximum utterance threshold as a meaning unit.
1 Assignment
0 Petitions
Accused Products
Abstract
A method of zoning a transcription of audio data includes separating the transcription of audio data into a plurality of utterances. A that each word in an utterances is a meaning unit boundary is calculated. The utterance is split into two new utterances at a work with a maximum calculated probability. At least one of the two new utterances that is shorter than a maximum utterance threshold is identified as a meaning unit.
223 Citations
20 Claims
-
1. A method of zoning a transcription of audio data, the method comprising:
-
separating the transcription of audio data into a plurality of utterances; identifying utterances of the plurality of utterances that are shorter than a predetermined minimum threshold as meaning units; calculating a probability that each word in an utterance of the plurality of utterances which is longer than the predetermined minimum threshold is a meaning unit boundary; splitting the utterance longer than the predetermined minimum threshold into two new utterances at a word with a maximum calculated probability; and identifying at least one of the two utterances that is shorter than a maximum utterance threshold as a meaning unit. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A method of zoning, a transcription of audio data, the method comprising:
-
separating the transcription of audio data into a plurality of utterances identifying utterances of the plurality of utterances that are shorter than a predetermined minimum threshold as meaning units; selecting utterances of the plurality of utterances that are longer than the predetermined minimum threshold for subdivision; splitting the selected utterances into widows, each window being twice a maximum utterance threshold; calculating a probability that each word in the plurality of windows is a meaning unit boundary based upon at least a linguistic model applied to each of the plurality of windows; splitting the selected utterances which are longer than the predetermined minimum threshold into two new utterances at a word with a maximum calculated probability; and identifying at least one of the two new utterances that is shorter than a maximum utterance threshold as a meaning unit. - View Dependent Claims (18, 19, 20)
-
Specification