Identification of the presence of speech in digital audio data

  • US 20050192795A1
  • Filed: 02/24/2005
  • Published: 09/01/2005
  • Est. Priority Date: 02/26/2004
  • Status: Active Grant
First Claim

1. Method for determining speech related audio data within a record of digital audio data, the method comprising steps for extracting audio features from the record of digital audio data, classifying the record of digital audio data based on the extracted audio features and with respect to one or more predetermined audio classes, and marking at least a part of the record of digital audio data classified as speech, characterised in that the extraction of at least one audio feature comprises the following steps:

  • partitioning the record of digital audio data into adjoining frames,
  • for each frame defining a window being formed by a sequence of adjoining frames containing the frame under consideration,
  • determining for the frame under consideration and at least one further frame of the window a spectral-emphasis-value which is related to the frequency distribution contained in the digital audio data of the respective frame, and
  • assigning a presence-of-speech indicator value to the frame under consideration based on an evaluation of the differences between the spectral-emphasis-values determined for the frame under consideration and the at least one further frame of the window.
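
The feature-extraction steps of the claim can be illustrated with a minimal Python sketch. The claim does not say how the spectral-emphasis-value is computed or how the differences are evaluated, so two assumptions are made here purely for illustration: the spectral-emphasis-value is taken to be the power-weighted mean frequency (spectral centroid) of a frame, and the evaluation is the mean absolute difference between the frame's value and the values of the other frames in its window. All function names and parameters (`partition_frames`, `spectral_emphasis`, `speech_indicator`, `frame_indicators`, `half_window`) are hypothetical and do not come from the patent.

```python
import cmath
import math


def partition_frames(samples, frame_len):
    """Split samples into adjoining, non-overlapping frames (remainder dropped)."""
    count = len(samples) // frame_len
    return [samples[i * frame_len:(i + 1) * frame_len] for i in range(count)]


def spectral_emphasis(frame, sample_rate):
    """ASSUMPTION: spectral centroid in Hz, computed with a naive DFT."""
    n = len(frame)
    weighted = 0.0
    total = 0.0
    for k in range(1, n // 2 + 1):  # positive-frequency bins, DC excluded
        coeff = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(frame))
        power = abs(coeff) ** 2
        weighted += power * (k * sample_rate / n)
        total += power
    return weighted / total if total > 0.0 else 0.0


def speech_indicator(emphasis, index, half_window):
    """ASSUMPTION: mean absolute difference to the other frames of the window."""
    lo = max(0, index - half_window)
    hi = min(len(emphasis), index + half_window + 1)
    others = [emphasis[j] for j in range(lo, hi) if j != index]
    if not others:
        return 0.0
    return sum(abs(emphasis[index] - v) for v in others) / len(others)


def frame_indicators(samples, sample_rate, frame_len, half_window):
    """Run all steps: frames -> emphasis values -> per-frame indicator values."""
    frames = partition_frames(samples, frame_len)
    emphasis = [spectral_emphasis(f, sample_rate) for f in frames]
    return [speech_indicator(emphasis, i, half_window)
            for i in range(len(emphasis))]


def tone(freq, n, sample_rate):
    """Synthetic sine test signal (illustration only)."""
    return [math.sin(2 * math.pi * freq * t / sample_rate) for t in range(n)]


SAMPLE_RATE = 8000
FRAME_LEN = 64

# A steady tone: the spectral emphasis barely changes from frame to frame.
steady = tone(1000, 8 * FRAME_LEN, SAMPLE_RATE)

# A signal whose dominant frequency alternates between frames.
varying = []
for i in range(8):
    varying += tone(500 if i % 2 == 0 else 2000, FRAME_LEN, SAMPLE_RATE)

steady_ind = frame_indicators(steady, SAMPLE_RATE, FRAME_LEN, half_window=2)
varying_ind = frame_indicators(varying, SAMPLE_RATE, FRAME_LEN, half_window=2)
```

Under these assumptions the steady tone yields indicator values close to zero, while the signal with alternating spectral content yields clearly larger values. A classifier as described in the preamble of the claim could then threshold or otherwise use such a feature; that stage is not sketched here.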
