SYSTEM AND METHOD FOR DETECTION AND ANALYSIS OF SPEECH
First Claim
1. A method comprising:
- capturing an audio recording from a language environment of a key child,segmenting the audio recording into a plurality of segments;
identifying a segment ID for each of the plurality of segments, the segment ID identifying a source for audio in the segment;
identifying a plurality of key child segments from the plurality of segments, each of the plurality of key child segments having the key child as the segment ID;
estimating key child segment characteristics based in part on at least one of the plurality of key child segments, wherein the key child segment characteristics are estimated independent of content of the plurality of key child segments;
determining at least one metric associated with the language environment using the key child segment characteristics; and
outputting the at least one metric.
2 Assignments
0 Petitions
Accused Products
Abstract
Certain aspects and embodiments of the present invention are directed to systems and methods for monitoring and analyzing the language environment and the development of a key child. A key child'"'"'s language environment and language development can be monitored without placing artificial limitations on the key child'"'"'s activities or requiring a third party observer. The language environment can be analyzed to identify words, vocalizations, or other noises directed to or spoken by the key child, independent of content. The analysis can include the number of responses between the child and another, such as an adult and the number of words spoken by the child and/or another, independent of content of the speech. One or more metrics can be determined based on the analysis and provided to assist in improving the language environment and/or tracking language development of the key child.
102 Citations
30 Claims
-
1. A method comprising:
-
capturing an audio recording from a language environment of a key child, segmenting the audio recording into a plurality of segments; identifying a segment ID for each of the plurality of segments, the segment ID identifying a source for audio in the segment; identifying a plurality of key child segments from the plurality of segments, each of the plurality of key child segments having the key child as the segment ID; estimating key child segment characteristics based in part on at least one of the plurality of key child segments, wherein the key child segment characteristics are estimated independent of content of the plurality of key child segments; determining at least one metric associated with the language environment using the key child segment characteristics; and outputting the at least one metric. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A method comprising:
-
capturing an audio recording from a language environment of a key child, segmenting the audio recording into a plurality of segments and identifying a segment ID for at least one of the plurality of segments using a Minimum Duration Gaussian Mixture Model (MD-GMM), the segment ID identifying a key child; estimating key child segment characteristics based in part on the at least one of the plurality of segments, wherein the key child segment characteristics are estimated independent of content of the plurality of segments; determining at least one metric associated with the language environment using the key child segment characteristics; and outputting the at least one metric. - View Dependent Claims (13, 14, 15, 16)
-
-
17. A system comprising:
-
a recorder adapted to capture audio recordings from a language environment of a key child and provide the audio recordings to a processor-based device; and the processor-based device comprising an application having an audio engine adapted to segment the audio recording into a plurality of segments and identify a segment ID for each of the plurality of segments, wherein at least one of the plurality of segments is associated with a key child segment ID, the audio engine being further adapted to; estimate key child segment characteristics based in part on the at least one of the plurality of segments, wherein the audio engine estimates key child segment characteristics independent of content of the at least one of the plurality of segments; determine at least one metric associated with the language environment using the key child segment characteristics; and output the at least one metric to an output device. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30)
-
Specification