Method and device for detecting speech patterns and errors when practicing fluency shaping techniques
First Claim
1. A method for detecting errors when practicing fluency shaping exercises, comprising:
- receiving a set of initial energy levels;
setting each threshold of a set of thresholds to a respective predetermined initial value;
receiving a voice production of a user practicing a fluency shaping exercise;
analyzing the received voice production to compute a set of energy levels composing the voice production;
detecting at least one speech-related error based on the computed set of energy levels, the set of initial energy levels, and the set of a thresholds, wherein the detection of the at least one speech-related error is with respect to the fluency shaping exercise being practiced by the user;
wherein the set of initial energy levels includes at least one of;
a normal speech energy level, a silence energy level, and a calibration energy level,upon detection of the at least one speech-related error, generating visual feedback indicating the at least one detected speech-related error with respect to the received voice production, andperforming an audio calibration process for a computing device of the user to set the normal speech energy level, the silence energy level, and the calibration energy level, wherein the voice production is captured on the computing device of the user,wherein processing the received voice production further comprises;
sampling the received voice production to create voice samples;
buffering the voice samples to create voice chunks;
converting the voice chunks from a time domain to a frequency domain;
extracting spectrum features from each of the frequency domain voice chunks, wherein the spectrum features include at least dominant frequencies, wherein each dominant frequency corresponds to a voice chunk;
computing, for each voice chunk, the energy level of the corresponding dominant frequency; and
determining, for each voice chunk, an energy level of the voice chunk based on the energy level of the corresponding dominant frequency.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and device for detecting errors when practicing fluency shaping exercises, are presented. The method includes receiving a set of initial energy levels; setting a set of thresholds to their respective initial values; receiving a voice production of a user practicing a fluency shaping exercise; analyzing the received voice production to compute a set of energy levels composing the voice production; detecting based on the computed set of energy levels, the set of initial energy levels, and the set of a threshold of at least one speech-related error, wherein the detection of the at least one speech-related error is respective of the fluency shaping exercise being practiced by the user; and upon detection of the at least one speech-related error, generating a feedback indicating the at least one detected speech-related error.
31 Citations
37 Claims
-
1. A method for detecting errors when practicing fluency shaping exercises, comprising:
-
receiving a set of initial energy levels; setting each threshold of a set of thresholds to a respective predetermined initial value; receiving a voice production of a user practicing a fluency shaping exercise; analyzing the received voice production to compute a set of energy levels composing the voice production; detecting at least one speech-related error based on the computed set of energy levels, the set of initial energy levels, and the set of a thresholds, wherein the detection of the at least one speech-related error is with respect to the fluency shaping exercise being practiced by the user;
wherein the set of initial energy levels includes at least one of;
a normal speech energy level, a silence energy level, and a calibration energy level,upon detection of the at least one speech-related error, generating visual feedback indicating the at least one detected speech-related error with respect to the received voice production, and performing an audio calibration process for a computing device of the user to set the normal speech energy level, the silence energy level, and the calibration energy level, wherein the voice production is captured on the computing device of the user, wherein processing the received voice production further comprises;
sampling the received voice production to create voice samples;
buffering the voice samples to create voice chunks;
converting the voice chunks from a time domain to a frequency domain;
extracting spectrum features from each of the frequency domain voice chunks, wherein the spectrum features include at least dominant frequencies, wherein each dominant frequency corresponds to a voice chunk;
computing, for each voice chunk, the energy level of the corresponding dominant frequency; and
determining, for each voice chunk, an energy level of the voice chunk based on the energy level of the corresponding dominant frequency. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A device for detecting errors when practicing of fluency shaping exercises, comprising:
-
a processing unit; and a memory, the memory containing instructions that, when executed by the processing unit, configures the device to; receive a set of initial energy levels, wherein the set of initial energy levels include at least one of;
a normal speech energy level, a silence energy level, and a calibration energy level;set each threshold of a set of thresholds to a respective predetermined initial value; receive a voice production of a user practicing a fluency shaping exercise; analyze the received voice production to compute a set of energy levels composing the voice production; detect at least one speech-related error based on the computed set of energy levels, the set of initial energy levels, and the set of thresholds, wherein the detection of the least one speech-related error is with respect to the fluency shaping exercise being practiced by the user; and upon detection of at least one speech-relate error, generate a-visual feedback indicating the least one detected speech-related error with respect to the received voice production, wherein the device is further configured to;
perform an audio calibration process for a computing device of user to set the normal speech energy level, the silence energy level, and the calibration energy level, wherein the voice production is captured on a computing device of the user, andwherein the device is further configured to;
sample the received voice production to create voice samples;
buffer the voice samples to create voice chunks;
convert the voice chunks from a time domain to a frequency domain;
extract spectrum features from each of the frequency domain voice chunks, wherein the spectrum features include at least dominant frequencies, wherein each dominant frequency corresponds to a voice chunk;
compute, for each voice chunk, the energy level of the corresponding dominant frequency; and
determine, for each voice chunk, an energy level of the voice chunk based on the energy level of the corresponding dominant frequency. - View Dependent Claims (21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37)
-
Specification