Method for optimizing loads of speech/user recognition system
First Claim
1. A method for optimizing a load of a speech/user recognition system, wherein said speech/user recognition system comprises a server end, a client end and a network, and the method is achieved by performing N stages of computations for a speech feature of a speech, where N is a positive integer, and an i is selected from 1 to N for representing the ith stage speech feature, comprising steps of:
(a) providing a computation time for computing a respective stage i of the speech feature at the client end, wherein a factor Ta(i) is for a computation time of computing the ith stage speech feature at the client end with respect to the input time;
(b) providing a computation time for computing a respective stage i of the speech feature at the server end, wherein a factor Tb(i) is for a computation time of computing the ith stage speech feature at the server end with respect to the input time;
(c) providing a load c of the server end and a load d of the network;
(d) deciding an n in the range from 1 to N for minimizing a recognition time Toutput of the speech;
(e) inputting the speech for being recognized with a time Tinput;
(f) performing a computation from the first stage speech feature to the nth stage speech feature of the speech at the client end, while performing a computation from the (n+1)th stage speech feature to the Nth stage speech feature of the speech at the server end; and
(g) repeating steps (e) to (f).
Abstract
A method for optimizing a load of a speech/user recognition system is provided. The speech/user recognition system comprises a server end, a client end and a network, and the method is achieved by performing N stages of computations for speech features of a speech, where N is a positive integer, and an i is selected from 1 to N for representing the ith stage speech features, comprising steps of: (a) providing a real time factor Ta(i) for computing a respective stage i of the speech features at the client end, where Ta(i) is an average computation time of computing the ith stage speech features at the client end with respect to one second of input speech; (b) providing a real time factor Tb(i) for computing a respective stage i of the speech features at the server end, where Tb(i) is an average computation time of computing the ith stage speech features at the server end with respect to one second of input speech; (c) providing a load c of the server end and a load d of the network; (d) deciding an n in the range from 1 to N for minimizing a recognition time Toutput of the speech; (e) inputting the speech with time Tinput for being recognized; (f) performing a computation from the first stage speech features to the nth stage speech features of the speech at the client end, while performing a computation from the (n+1)th stage speech features to the Nth stage speech features of the speech at the server end; and (g) repeating steps (e) to (f).
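Step (d) can be illustrated with a minimal Python sketch. The abstract does not give a formula for Toutput, so the cost model below is purely an assumption made for illustration: client time is Tinput times the sum of Ta(1..n), server time is Tinput times the sum of Tb(n+1..N) scaled by (1 + c), and the network adds a delay of (1 + d) times a hypothetical per-stage transfer cost `transfer[n - 1]`. The function names and the `transfer` parameter are not part of the source.

```python
from typing import Sequence


def estimate_output_time(
    n: int,
    t_input: float,
    ta: Sequence[float],
    tb: Sequence[float],
    c: float,
    d: float,
    transfer: Sequence[float],
) -> float:
    """Estimate recognition time when stages 1..n run at the client end.

    Assumed cost model (not taken from the source):
      client  = t_input * sum(Ta(1..n))
      server  = t_input * (1 + c) * sum(Tb(n+1..N))
      network = (1 + d) * transfer[n - 1]
    """
    client = t_input * sum(ta[:n])
    server = t_input * (1.0 + c) * sum(tb[n:])
    network = (1.0 + d) * transfer[n - 1]
    return client + network + server


def choose_split(
    t_input: float,
    ta: Sequence[float],
    tb: Sequence[float],
    c: float,
    d: float,
    transfer: Sequence[float],
) -> int:
    """Return the n in 1..N with the smallest estimated recognition time."""
    return min(
        range(1, len(ta) + 1),
        key=lambda n: estimate_output_time(n, t_input, ta, tb, c, d, transfer),
    )


if __name__ == "__main__":
    # Hypothetical real time factors for N = 3 stages.
    ta = [0.05, 0.10, 0.40]        # client end, per second of input speech
    tb = [0.01, 0.02, 0.08]        # server end, per second of input speech
    transfer = [0.50, 0.10, 0.02]  # assumed seconds to send the stage-n output
    print(choose_split(2.0, ta, tb, c=0.0, d=0.0, transfer=transfer))
    print(choose_split(2.0, ta, tb, c=9.0, d=0.0, transfer=transfer))
```

Under this assumed model, a lightly loaded server favors an early split, while a heavily loaded server pushes more stages onto the client; a different cost model would change the chosen n.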
32 Citations
53 Claims
1. A method for optimizing a load of a speech/user recognition system, wherein said speech/user recognition system comprises a server end, a client end and a network, and the method is achieved by performing N stages of computations for a speech feature of a speech, where N is a positive integer, and an i is selected from 1 to N for representing the ith stage speech feature, comprising steps of:
(a) providing a computation time for computing a respective stage i of the speech feature at the client end, wherein a factor Ta(i) is for a computation time of computing the ith stage speech feature at the client end with respect to the input time;
(b) providing a computation time for computing a respective stage i of the speech feature at the server end, wherein a factor Tb(i) is for a computation time of computing the ith stage speech feature at the server end with respect to the input time;
(c) providing a load c of the server end and a load d of the network;
(d) deciding an n in the range from 1 to N for minimizing a recognition time Toutput of the speech;
(e) inputting the speech for being recognized with a time Tinput;
(f) performing a computation from the first stage speech feature to the nth stage speech feature of the speech at the client end, while performing a computation from the (n+1)th stage speech feature to the Nth stage speech feature of the speech at the server end; and
(g) repeating steps (e) to (f). - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 35)
21. A method for optimizing a recording frame-synchronized speech feature computation comprising a server end, a client end and a network, and the method is achieved by performing N stages of computations for a speech feature of a speech having N′ frames, where N and N′ are positive integers, where an i is selected from the range from 1 to N for representing the ith stage speech feature, and an n′ is selected from the range from 1 to N′ for representing the n′th frame, comprising steps of:
(a) providing a specific n in the range from 1 to N;
(b) inputting said speech for an input time (Tinput), wherein a computation from the first stage speech feature to the nth stage speech feature of each frame of the speech is performed at the client end, and a computation from the (n+1)th stage speech feature to the Nth stage speech feature of each frame of the speech is performed at the server end;
(c) after the step (b) is carried out, when a computation of the n′ frames is achieved and a speech feature computation of the n1th stage of the (n′+1)th frame is achieved, modifying the n in a specific manner according to the n1 to minimize a computation time for recognizing the speech; and
(d) performing a computation from the first stage speech feature to the nth stage speech feature of the respective remaining frames at the client end according to the modified n in step (c), while performing a computation from the (n+1)th stage speech feature to the Nth stage speech feature of the respective remaining frames at the server end. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34)
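Steps (c) and (d) of claim 21 can be sketched as follows. The claim leaves the "specific manner" of modifying n unspecified, so the rule below is only one hypothetical possibility: it picks the n that minimizes an assumed estimate of the remaining work, counting the partially computed frame (stages 1..n1 already done at the client end) plus the untouched frames. The names `remaining_time` and `modify_split`, the per-frame stage costs `ta` and `tb`, and the (1 + c) server scaling are all assumptions, not taken from the claim.

```python
from typing import Sequence


def remaining_time(
    n: int,
    n1: int,
    untouched_frames: int,
    ta: Sequence[float],
    tb: Sequence[float],
    c: float,
) -> float:
    """Assumed estimate of the time left if the split is changed to n.

    The partially computed frame already has stages 1..n1 done at the
    client end, so the client only continues it up to max(n, n1).
    Server work is scaled by (1 + c) for a server load c (an assumption).
    """
    split = max(n, n1)  # stages already computed are never redone
    client = sum(ta[n1:split]) + untouched_frames * sum(ta[:n])
    server = sum(tb[split:]) + untouched_frames * sum(tb[n:])
    return client + (1.0 + c) * server


def modify_split(
    n1: int,
    untouched_frames: int,
    ta: Sequence[float],
    tb: Sequence[float],
    c: float,
) -> int:
    """Return the modified n in 1..N minimizing the assumed remaining time."""
    return min(
        range(1, len(ta) + 1),
        key=lambda n: remaining_time(n, n1, untouched_frames, ta, tb, c),
    )


if __name__ == "__main__":
    # Hypothetical per-frame stage costs (milliseconds) for N = 3 stages.
    ta = [1.0, 2.0, 4.0]  # client end
    tb = [0.5, 1.0, 2.0]  # server end
    # Frame n'+1 has reached stage n1 = 2; ten frames are still untouched.
    print(modify_split(2, 10, ta, tb, c=0.0))
    print(modify_split(2, 10, ta, tb, c=3.0))
```

The remaining frames would then be processed with the returned n as in step (d): stages 1..n at the client end and stages n+1..N at the server end.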
36. A method for optimizing a load of a speech/user recognition system comprising a server end, a client end and a network, wherein a recognition is achieved by performing plural stages of computations to speech features of a speech having an inputting time, comprising steps of:
(a) providing a real time factor Ta(i) for computing a respective stage i speech feature at the client end;
(b) providing a real time factor Tb(i) for computing a respective stage i speech feature at the server end;
(c) providing a load of the server end and a load of the network;
(d) obtaining a specific amount according to the load of the server end and the load of the network to minimize a computation time for recognizing said speech; and
(e) determining the computations at the client end and the server end according to the specific amount, and then performing the plural stages of computations for the speech features of the speech. - View Dependent Claims (37, 38, 39, 40, 41, 42, 43)
44. A method for optimizing a recording frame-synchronized speech feature computation comprising a server end, a client end and a network, wherein a recognition of a speech is achieved by performing plural stages of computations for speech features of the speech having plural frames, comprising steps of:
(a) providing a specific amount;
(b) inputting the speech for an input time;
(c) after the step (b) is carried out, when a part of the plural frames has not been computed and only part of the computations of the plural stages has been performed for the speech feature of a first frame of the frames not yet computed, modifying the specific amount in a specific manner to minimize a computation time for recognizing the speech; and
(d) distributing the respective loads of the server end and the client end according to the modified specific amount in the step (c) and then performing computations for the frames having not been computed to achieve the recognition. - View Dependent Claims (45, 46, 47, 48, 49, 50, 51, 52, 53)
Specification