Systems and Methods for Estimating Age of a Child Based on Speech
First Claim
1. A system comprising:
- a microphone configured to receive an input speech from an individual;
an analog-to-digital (A/D) converter configured to convert the input speech from an analog form to a digital form and generate a digitized speech;
a memory storing an executable code and an age estimation database including a plurality of age determinant formant-based feature vectors;
a hardware processor executing the executable code to;
receive the digitized speech from the A/D converter;
identify a plurality of boundaries between a plurality of phonemes in the digitized speech;
extract a plurality of formant-based feature vectors from one or more phonemes of the plurality of phonemes delineated by the plurality of boundaries, based on at least one of a formant position, a formant bandwidth, and a formant dispersion, wherein the formant dispersion is a geometric mean of the formant spacing;
compare the plurality of formant-based feature vectors with the age determinant formant-based feature vectors of the age estimation database;
estimate the age of the individual when the comparison finds a match in the age estimation database; and
communicate an age-appropriate response to the individual based on the estimated age of the individual.
1 Assignment
0 Petitions
Accused Products
Abstract
There is provided a system comprising a microphone, configured to receive an input speech from an individual, an analog-to-digital (A/D) converter to convert the input speech to digital form and generate a digitized speech, a memory storing an executable code and an age estimation database, a hardware processor executing the executable code to receive the digitized speech, identify a plurality of boundaries in the digitized speech delineating a plurality of phonemes in the digitized speech, extract a plurality of formant-based feature vectors from each phoneme in the digitized speech based on at least one of a formant position, a formant bandwidth, and a formant dispersion, compare the plurality of formant-based feature vectors with age determinant formant-based feature vectors of the age estimation database, determine the age of the individual when the comparison finds a match in the age estimation database, and communicate an age-appropriate response to the individual.
-
Citations
20 Claims
-
1. A system comprising:
-
a microphone configured to receive an input speech from an individual; an analog-to-digital (A/D) converter configured to convert the input speech from an analog form to a digital form and generate a digitized speech; a memory storing an executable code and an age estimation database including a plurality of age determinant formant-based feature vectors; a hardware processor executing the executable code to; receive the digitized speech from the A/D converter; identify a plurality of boundaries between a plurality of phonemes in the digitized speech; extract a plurality of formant-based feature vectors from one or more phonemes of the plurality of phonemes delineated by the plurality of boundaries, based on at least one of a formant position, a formant bandwidth, and a formant dispersion, wherein the formant dispersion is a geometric mean of the formant spacing; compare the plurality of formant-based feature vectors with the age determinant formant-based feature vectors of the age estimation database; estimate the age of the individual when the comparison finds a match in the age estimation database; and communicate an age-appropriate response to the individual based on the estimated age of the individual. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 17, 18)
-
-
9. A method for use with a system having a microphone, an analog-to-digital (A/D) converter, a memory storing an executable code, and a hardware processor, the method comprising:
-
receiving, using the hardware processor, a digitized speech from the A/D converter; identifying, using the hardware processor, a plurality of boundaries between a plurality of phonemes in the digitized speech; extracting, using the hardware processor, a plurality of formant-based feature vectors from one or more phonemes of the plurality of phonemes delineated by the plurality of boundaries, based on at least one of a formant position, a formant bandwidth, and a formant dispersion, wherein the formant dispersion is a geometric mean of the formant spacing; comparing, using the hardware processor, the plurality of formant-based feature vectors with the age determinant formant-based feature vectors of the age estimation database; estimating, using the hardware processor, the age of the individual when the comparison finds a match in the age estimation database; and communicating, using the hardware processor, an age-appropriate response to the individual based on the estimated age of the individual. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16, 19, 20)
-
Specification