Method and device for searching according to speech based on artificial intelligence
First Claim
Patent Images
1. A method for searching according to a speech based on artificial intelligence, comprising:
- acquiring, by at least one computing device, sample speeches for training a preset classifier;
removing, by the at least one computing device, a silent speech from the sample speeches by performing a speech activity detection on the sample speeches, to obtain training speeches;
extracting, by the at least one computing device, acoustic features of each training speech; and
training, by the at least one computing device, the preset classifier by inputting the acoustic features of the each training speech into the preset classifier, to obtain a target classifier;
identifying, by at least one computing device, an input speech of a user to determine whether the input speech is a child speech;
filtrating, by the at least one computing device, a searched result obtained according to the input speech to obtain a filtrated searched result, if the input speech is the child speech; and
feeding, by the at least one computing device, the filtrated searched result to the user,wherein removing, by the at least one computing device, the silent speech from the sample speeches by performing a speech activity detection on the sample speeches, to obtain training speeches comprises;
dividing, by the at least one computing device, each sample speech into frames by a preset first step size, and removing, by the at least one computing device, the silent speech from each frame of the each sample speech by performing the speech activity detection on the each frame of the each sample speech, to obtain the each training speech;
wherein extracting, by the at least one computing device, the acoustic features of each training speech comprises;
dividing, by the at least one computing device, the each training speech by a preset second step size; and
extracting, by the at least one computing device, by a preset third step size, the acoustic features of the each training speech after dividing by the preset second step size.
2 Assignments
0 Petitions
Accused Products
Abstract
A method and a device for searching according to a speech based on artificial intelligence are provided. The method includes: identifying an input speech of a user to determine whether the input speech is a child speech; filtrating a searched result obtained according to the input speech to obtain a filtrated searched result, if the input speech is the child speech; and feeding the filtrated searched result back to the user.
-
Citations
15 Claims
-
1. A method for searching according to a speech based on artificial intelligence, comprising:
-
acquiring, by at least one computing device, sample speeches for training a preset classifier; removing, by the at least one computing device, a silent speech from the sample speeches by performing a speech activity detection on the sample speeches, to obtain training speeches; extracting, by the at least one computing device, acoustic features of each training speech; and training, by the at least one computing device, the preset classifier by inputting the acoustic features of the each training speech into the preset classifier, to obtain a target classifier; identifying, by at least one computing device, an input speech of a user to determine whether the input speech is a child speech; filtrating, by the at least one computing device, a searched result obtained according to the input speech to obtain a filtrated searched result, if the input speech is the child speech; and feeding, by the at least one computing device, the filtrated searched result to the user, wherein removing, by the at least one computing device, the silent speech from the sample speeches by performing a speech activity detection on the sample speeches, to obtain training speeches comprises; dividing, by the at least one computing device, each sample speech into frames by a preset first step size, and removing, by the at least one computing device, the silent speech from each frame of the each sample speech by performing the speech activity detection on the each frame of the each sample speech, to obtain the each training speech; wherein extracting, by the at least one computing device, the acoustic features of each training speech comprises; dividing, by the at least one computing device, the each training speech by a preset second step size; and extracting, by the at least one computing device, by a preset third step size, the acoustic features of the each training speech after dividing by the preset second step size. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A device for searching according to a speech based on artificial intelligence, comprising:
-
a processor; and a memory, configured to store instructions executable by the processor, wherein the processor is configured to; acquire sample speeches for training a preset classifier; remove a silent speech from the sample speeches by performing a speech activity detection on the sample speeches, to obtain training speeches; extract acoustic features of each training speech; and train the preset classifier by inputting the acoustic features of the each training speech into the preset classifier, to obtain a target classifier; identify an input speech of a user to determine whether the input speech is a child speech; filtrate a searched result obtained according to the input speech to obtain a filtrated searched result, if the input speech is the child speech; and feed the filtrated searched result to the user, wherein the processor is configured to remove a silent speech from the sample speeches by performing a speech activity detection on the sample speeches, to obtain training speeches by acts of; dividing each sample speech into frames by a preset first step size, and removing the silent speech from each frame of the each sample speech by performing the speech activity detection on the each frame of the each sample speech, to obtain the each training speech; and the processor is configured to extract the acoustic features of each training speech by acts of; dividing the each training speech by a preset second step size; and extracting by a preset third step size, the acoustic features of the each training speech after dividing by the preset second step size. - View Dependent Claims (7, 8, 9, 10)
-
-
11. A non-transitory computer readable storage medium comprising instructions, wherein the instructions are executed by a processor of a device to perform:
-
acquiring sample speeches for training a preset classifier; removing a silent speech from the sample speeches by performing a speech activity detection on the sample speeches, to obtain training speeches; extracting acoustic features of each training speech; and training the preset classifier by inputting the acoustic features of the each training speech into the preset classifier, to obtain a target classifier; identifying an input speech of a user to determine whether the input speech is a child speech; filtrating a searched result obtained according to the input speech to obtain a filtrated searched result, if the input speech is the child speech; and feeding the filtrated searched result to the user, wherein removing a silent speech from the sample speeches by performing a speech activity detection on the sample speeches, to obtain training speeches comprises; dividing each sample speech into frames by a preset first step size, and removing the silent speech from each frame of the each sample speech by performing the speech activity detection on the each frame of the each sample speech, to obtain the each training speech; wherein extracting the acoustic features of each training speech comprises; dividing the each training speech by a preset second step size; and extracting by a preset third step size, the acoustic features of the each training speech after dividing by the preset second step size. - View Dependent Claims (12, 13, 14, 15)
-
Specification