Method and system for segmenting phonemes from voice signals

US 8,849,662 B2
Filed: 12/28/2006
Issued: 09/30/2014
Est. Priority Date: 12/28/2005
Status: Active Grant

First Claim

Patent Images

1. A method for segmenting phonemes from voice signals, comprising:

extracting peak information from input voice signals, the peak information including first peak information corresponding to a plurality of first order peaks of the input voice signals and second peak information corresponding to a plurality of second order peaks for the plurality of first order peaks;

determining a length of a frame for calculating peak statistics;

forming a histogram showing a density distribution of the second order peaks with respect to the determined frame length;

calculating the peak statistics using the histogram;

determining two neighboring maxima of the histogram using the calculated peak statistics per each frame; and

determining a valley between the two neighboring maxima as a boundary between phonemes to perform a phoneme segmentation;

wherein the method further comprises;

extracting the peak information from voice signals on a time domain;

defining a peak order with respect to the extracted peak information;

comparing a peak measurement value of the defined peak order with a predetermined critical peak measurement value; and

determining a present peak order as a final peak order, which is used to extract the second peak information, when the peak measurement value is greater than the critical peak measurement value.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A method and a system for segmenting phonemes from voice signals. A method for accurately segmenting phonemes, in which a histogram showing a peak distribution corresponding to an order is formed by using a high order concept, and a boundary indicating a starting point and an ending point of each phoneme is determined by calculating a peak statistic based on the histogram. The phoneme segmentation method can remarkably reduce an amount of calculation, and has an advantage of being applied to sound signal systems which perform sound coding, sound recognition, sound synthesizing, sound reinforcement, etc.

23 Citations

4 Claims

1. A method for segmenting phonemes from voice signals, comprising:
- extracting peak information from input voice signals, the peak information including first peak information corresponding to a plurality of first order peaks of the input voice signals and second peak information corresponding to a plurality of second order peaks for the plurality of first order peaks;
  
  determining a length of a frame for calculating peak statistics;
  
  forming a histogram showing a density distribution of the second order peaks with respect to the determined frame length;
  
  calculating the peak statistics using the histogram;
  
  determining two neighboring maxima of the histogram using the calculated peak statistics per each frame; and
  
  determining a valley between the two neighboring maxima as a boundary between phonemes to perform a phoneme segmentation;
  
  wherein the method further comprises;
  
  extracting the peak information from voice signals on a time domain;
  
  defining a peak order with respect to the extracted peak information;
  
  comparing a peak measurement value of the defined peak order with a predetermined critical peak measurement value; and
  
  determining a present peak order as a final peak order, which is used to extract the second peak information, when the peak measurement value is greater than the critical peak measurement value.
- View Dependent Claims (2)
- - 2. The method as claimed in claim 1, further comprising repeating the determination of the present peak order unless the peak measurement value of the defined peak order is greater than the critical peak measurement value after the present peak order is allowed to increase, to define a novel peak order when the peak measurement value is below the critical peak measurement value, and a peak measurement value of the novel peak order is compared with the critical peak measurement value.

3. A system for segmenting phonemes from voice signals, comprising:
- a peak information extractor for extracting peak information from input voice signals, the peak information including first peak information corresponding to a plurality of first order peaks of the input voice signals and second peak information corresponding to a plurality of second order peaks from among the plurality of first order peaks;
  
  a peak statistic calculator for determining a length of a frame for calculating peak statistics and calculating the peak statistics using a histogram;
  
  a boundary determination unit for determining two neighboring maxima of the histogram using the calculated peak statistics per each frame, and determining a valley between the two neighboring maxima as a boundary between the phonemes in order to segment the phonemes;
  
  a frame length determination unit for determining a length of a frame to calculate the peak statistics; and
  
  a histogram forming unit for forming the histogram showing a density distribution of the second order peaks with respect to the determined frame length;
  
  wherein the system further comprises;
  
  a peak order determination unit for extracting peak information on voice signals on a time domain, defining a peak order with respect to the extracted peak information, comparing a peak measurement value of the defined peak order with a predetermined critical peak measurement value, and determining a present peak order as a final peak order, which is used to extract the second peak information, when the peak measurement value is greater than the critical peak measurement value.
- View Dependent Claims (4)
- - 4. The system as claimed in claim 3, wherein the peak order determination unit increases the present peak order so as to define a novel peak order when the peak measurement value is below the critical peak measurement value, and then compares a peak measurement value of the novel peak order with the critical peak measure value, to determine the peak order when the novel peak measurement value is smaller than the critical peak measurement value.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Samsung Electronics Co. Ltd.
Original Assignee
Samsung Electronics Co. Ltd.
Inventors
Kim, Hyun-Soo
Primary Examiner(s)
Desir, Pierre-Louis
Assistant Examiner(s)
KOVACEK, DAVID M

Application Number

US11/646,911
Publication Number

US 20070150277A1
Time in Patent Office

2,833 Days
Field of Search

704/1, 704/7, 704/10, 704/231, 704233-234, 704243-251, 704/254, 704/276, 704E17001-E17011, 704E15001-E15002, 704E15004-E15009, 704E15014-E15016, 704E1502-E15026, 704E11001-E11007
US Class Current

704/243
CPC Class Codes

G10L 15/04 Segmentation; Word boundary...

G10L 2015/025 Phonemes, fenemes or fenone...

Method and system for segmenting phonemes from voice signals

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

23 Citations

4 Claims

Specification

Solutions

Use Cases

Quick Links

Method and system for segmenting phonemes from voice signals

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

23 Citations

4 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links