REAL-TIME VOICE RECOGNITION APPARATUS EQUIPPED WITH ASIC CHIP AND SMARTPHONE
First Claim
1. A real-time voice recognition apparatus comprising an ASIC chip and a smartphone, the apparatus comprising:
- an ASIC chip configured to receive a first digital audio speech as an input, generate a feature vector, and output a phoneme score from the feature vector; and
a smartphone connected to the ASIC chip through a data port and configured to receive the phoneme score from the ASIC chip as an input, output word text, and process and output the word text according to user needs,wherein the ASIC chip comprises;
an input signal selector configured to select one of a second digital audio speech and a third digital audio speech in response to an input selection control signal (SEL) and output the selected audio speech as the first digital audio speech (AIN);
a feature extractor configured to receive the first digital audio speech and one or more first control signals (C1) as an input and output the feature vector;
an acoustic model processor configured to receive the feature vector and one or more second control signals (C2) as an input and output the phoneme score;
a connection device configured to receive the phoneme score from the acoustic model processor as an input, output the received phoneme score to the smartphone, receive data and a first DC voltage (DC1) as an input from the smartphone, and output the second digital audio speech, the input selection control signal (SEL), the first control signal (C1), the second control signal (C2) and the second DC voltage (DC2); and
a DC-DC converter configured to receive the second DC voltage (DC2) as an input and output a third DC voltage (DC3).
1 Assignment
0 Petitions
Accused Products
Abstract
The present invention relates to a real-time voice recognition apparatus equipped with an application-specific integrated circuits (ASIC) chip and a smartphone, capable, by using one smartphone and one ASIC chip and without using a cloud computer, of assuring personal privacy, and, due to a short delay time, enabling real-time conversion of voice input signals into text for output. When one DRAM chip is optionally added to the real-time voice recognition apparatus, the number of neural network layers is increased thereby significantly improving accuracy of conversion of voice input signals into text.
5 Citations
29 Claims
-
1. A real-time voice recognition apparatus comprising an ASIC chip and a smartphone, the apparatus comprising:
-
an ASIC chip configured to receive a first digital audio speech as an input, generate a feature vector, and output a phoneme score from the feature vector; and a smartphone connected to the ASIC chip through a data port and configured to receive the phoneme score from the ASIC chip as an input, output word text, and process and output the word text according to user needs, wherein the ASIC chip comprises; an input signal selector configured to select one of a second digital audio speech and a third digital audio speech in response to an input selection control signal (SEL) and output the selected audio speech as the first digital audio speech (AIN); a feature extractor configured to receive the first digital audio speech and one or more first control signals (C1) as an input and output the feature vector; an acoustic model processor configured to receive the feature vector and one or more second control signals (C2) as an input and output the phoneme score; a connection device configured to receive the phoneme score from the acoustic model processor as an input, output the received phoneme score to the smartphone, receive data and a first DC voltage (DC1) as an input from the smartphone, and output the second digital audio speech, the input selection control signal (SEL), the first control signal (C1), the second control signal (C2) and the second DC voltage (DC2); and a DC-DC converter configured to receive the second DC voltage (DC2) as an input and output a third DC voltage (DC3). - View Dependent Claims (3, 4, 5, 7, 9, 10, 12, 13)
-
-
2. (canceled)
-
6. (canceled)
-
8. The real-time voice recognition apparatus of claim wherein the acoustic model processor uses a recurrent neural network (RNN) algorithm of a long short term memory (LSTM) or gated recurrent unit (GRU) method.
-
11. (canceled)
-
14. (canceled)
-
15. A real-time voice recognition apparatus comprising an ASIC chip and a smartphone, the apparatus comprising:
-
a DRAM chip configured to output first parameter data (Dp1) and receive second parameter data (Dp2) as an input; an ASIC chip configured to receive a first digital audio speech and the first parameter data (Dp1) as an input and output a phoneme score (PS) and the second parameter data (Dp2); and a smartphone connected to the ASIC chip through a data port and configured to receive the phoneme score as an input from the ASIC chip, output word text, and process and output the word text according to user needs, wherein the ASIC chip comprises; an input signal selector configured to select one of a second digital audio speech and a third digital audio speech in response to an input selection control signal (SEL) and output the selected audio speech as the first digital audio speech (AIN); a feature extractor configured to receive the first digital audio speech and one or more first control signals (C1) as an input and output the feature vector; an acoustic model processor configured to receive the feature vector, one or more second control signals (C2) and a third parameter data (Dp3) as an input and output the phoneme score; a DRAM controller configured to receive the first parameter data (Dp1), a fourth parameter data (Dp4), and one or more sixth control signals (C6) as an input and output the second parameter data and the third parameter data; a connection device configured to receive the phoneme score as an input from the acoustic model processor, output the received phoneme score to the smartphone, receive data and a first DC voltage (DC1) as an input from the smartphone, and output the second digital audio speech (A2), the fourth parameter data (Dp4), the input selection control signal (SEL), the first control signal (C1), the second control signal (C2), the sixth control signal (C6) and the second DC voltage (DC2); and a DC-DC converter configured to receive the second DC voltage (DC2) as an input and output a third DC voltage (DC3). - View Dependent Claims (17, 18, 20, 21, 23, 24, 25, 27, 28)
-
-
16. (canceled)
-
19. (canceled)
-
22. (canceled)
-
26. (canceled)
-
29. (canceled)
Specification