SPEECH RECOGNITION METHOD AND APPARATUS IN ENVIRONMENT INCLUDING PLURALITY OF APPARATUSES
First Claim
1. A speech recognition method, performed by a speech recognition apparatus, for performing speech recognition in a space in which a plurality of speech recognition apparatuses are present, the speech recognition method comprising:
extracting a speech signal of a speaker from an input audio signal;
obtaining a first speaker recognition score indicating a similarity between the speech signal and a speech signal of a registration speaker; and
outputting a speech recognition result with respect to the speech signal based on a second speaker recognition score obtained from another speech recognition apparatus among the plurality of speech recognition apparatuses and based on the first speaker recognition score.
Abstract
Provided are an artificial intelligence (AI) system that uses a machine learning algorithm, such as deep learning, and an application of the AI system. A speech recognition method, performed by a speech recognition apparatus, of performing speech recognition in a space in which a plurality of speech recognition apparatuses are present includes: extracting a speech signal of a speaker from an input audio signal; obtaining a first speaker recognition score indicating a similarity between the speech signal and a speech signal of a registration speaker; and outputting a speech recognition result with respect to the speech signal based on the first speaker recognition score and on a second speaker recognition score obtained from another speech recognition apparatus among the plurality of speech recognition apparatuses.
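The three steps summarized in the abstract can be illustrated with a minimal sketch. This text does not state how the similarity is computed or how the two scores are combined, so everything specific here is an assumption made for illustration: the score is taken to be a cosine similarity between feature vectors, the apparatus outputs a result only when its own score meets a threshold and is at least as high as the score reported by the other apparatus, and the names `speaker_recognition_score`, `should_output_result`, and `threshold` are hypothetical.

```python
import math
from typing import Sequence


def speaker_recognition_score(utterance: Sequence[float],
                              enrolled: Sequence[float]) -> float:
    """Cosine similarity between two feature vectors.

    Assumption: the scoring rule is not specified in this text;
    cosine similarity stands in for it here.
    """
    if len(utterance) != len(enrolled):
        raise ValueError("feature vectors must have the same length")
    dot = sum(a * b for a, b in zip(utterance, enrolled))
    norm = (math.sqrt(sum(a * a for a in utterance))
            * math.sqrt(sum(b * b for b in enrolled)))
    return dot / norm if norm else 0.0


def should_output_result(first_score: float,
                         second_score: float,
                         threshold: float = 0.5) -> bool:
    """Decide whether this apparatus outputs the recognition result.

    Assumption: output only when the local (first) score meets the
    threshold and is not lower than the score obtained from the other
    apparatus (second). Ties go to the local apparatus.
    """
    return first_score >= threshold and first_score >= second_score


# Local score computed on this apparatus; the second score would be
# received from the other apparatus over some connection.
first = speaker_recognition_score([1.0, 0.0], [1.0, 0.0])
print(should_output_result(first, second_score=0.7))
```

Under these assumptions, an apparatus that hears the registered speaker more clearly than its neighbor answers, and the neighbor stays silent.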
20 Claims
1. A speech recognition method, performed by a speech recognition apparatus, for performing speech recognition in a space in which a plurality of speech recognition apparatuses are present, the speech recognition method comprising:
extracting a speech signal of a speaker from an input audio signal;
obtaining a first speaker recognition score indicating a similarity between the speech signal and a speech signal of a registration speaker; and
outputting a speech recognition result with respect to the speech signal based on a second speaker recognition score obtained from another speech recognition apparatus among the plurality of speech recognition apparatuses and based on the first speaker recognition score.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A speech recognition apparatus among a plurality of speech recognition apparatuses located in a same space, the speech recognition apparatus comprising:
a receiver configured to receive an input audio signal;
a processor configured to control the speech recognition apparatus to: extract a speech signal of a speaker from the input audio signal and obtain a first speaker recognition score indicating a similarity between the speech signal and a speech signal of a registration speaker; and
an outputter comprising output circuitry configured to output a speech recognition result with respect to the speech signal,
wherein the processor is further configured to control the outputter to output the speech recognition result with respect to the speech signal based on a second speaker recognition score obtained from another speech recognition apparatus among the plurality of speech recognition apparatuses and on the first speaker recognition score.
Dependent claims: 11, 12, 13
14. A speech recognition method, performed by a device connected to a plurality of speech recognition apparatuses located in a same space, of performing speech recognition, the speech recognition method comprising:
obtaining a first speaker recognition score indicating a similarity between a speech signal received by a first speech recognition apparatus and a speech signal of a registration speaker;
obtaining a second speaker recognition score indicating a similarity between a speech signal received by a second speech recognition apparatus and the speech signal of the registration speaker;
determining an apparatus closer to the speaker among the first speech recognition apparatus and the second speech recognition apparatus based on the first speaker recognition score and the second speaker recognition score; and
outputting a speech recognition result with respect to a first speech signal to the first speech recognition apparatus based on the apparatus closer to the speaker being determined as the first speech recognition apparatus.
Dependent claims: 15, 16
17. A device connected to a plurality of speech recognition apparatuses located in a same space, the device comprising:
a communicator comprising communication circuitry configured to receive a speech signal from each of a first speech recognition apparatus and a second speech recognition apparatus; and
a processor configured to control the device to obtain a first speaker recognition score indicating a similarity between a speech signal received by the first speech recognition apparatus and a speech signal of a registration speaker, obtain a second speaker recognition score indicating a similarity between a speech signal received by the second speech recognition apparatus and the speech signal of the registration speaker, and determine an apparatus closer to a speaker among the first speech recognition apparatus and the second speech recognition apparatus based on the first speaker recognition score and the second speaker recognition score,
wherein the processor is further configured to control the communicator to output a speech recognition result with respect to a first speech signal to the first speech recognition apparatus based on the apparatus closer to the speaker being determined as the first speech recognition apparatus.
Dependent claims: 18, 19
20. A speech recognition system comprising:
a plurality of speech recognition apparatuses located in a same space; and
a device connected to the plurality of speech recognition apparatuses,
wherein, among the plurality of speech recognition apparatuses, a first speech recognition apparatus is configured to: receive a first speech signal with respect to an utterance of a speaker and transmit the first speech signal to the device,
wherein, among the plurality of speech recognition apparatuses, a second speech recognition apparatus is configured to: receive a second speech signal with respect to the same utterance of the speaker and transmit the second speech signal to the device, and
wherein the device is configured to:
obtain a first speaker recognition score indicating a similarity between the first speech signal and a speech signal of a registration speaker,
obtain a second speaker recognition score indicating a similarity between the second speech signal and the speech signal of the registration speaker,
determine an apparatus closer to the speaker among the first speech recognition apparatus and the second speech recognition apparatus based on the first speaker recognition score and the second speaker recognition score, and
output a speech recognition result with respect to a first speech signal to the first speech recognition apparatus based on the apparatus closer to the speaker being determined as the first speech recognition apparatus.
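The device-side arrangement recited in claims 14, 17, and 20 can be sketched as a small simulation. The claims say only that the closer apparatus is determined "based on" the two scores; treating the higher score as the closer apparatus is an assumption, as are the cosine-similarity scoring, the toy feature vectors, the fixed interference pattern used to mimic a more distant microphone, and the hypothetical names `Device`, `closer_apparatus`, and `received`.

```python
import math
from dataclasses import dataclass
from typing import Dict, List, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Assumed scoring rule: cosine similarity of two feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = (math.sqrt(sum(x * x for x in a))
            * math.sqrt(sum(y * y for y in b)))
    return dot / norm if norm else 0.0


@dataclass
class Device:
    """Hypothetical hub connected to the speech recognition apparatuses."""
    enrolled: Sequence[float]  # features of the registration speaker

    def score(self, signal: Sequence[float]) -> float:
        """Speaker recognition score for one received speech signal."""
        return cosine_similarity(signal, self.enrolled)

    def closer_apparatus(self, signals: Dict[str, Sequence[float]]) -> str:
        """Assumption: the apparatus with the highest score is the closer one.

        max() keeps the first entry on a tie, so the first apparatus wins ties.
        """
        scores = {name: self.score(sig) for name, sig in signals.items()}
        return max(scores, key=lambda name: scores[name])


def received(clean: Sequence[float],
             interference: Sequence[float],
             level: float) -> List[float]:
    """Toy model of the same utterance picked up with more or less interference."""
    return [c + level * n for c, n in zip(clean, interference)]


enrolled = [1.0, 2.0, 3.0]
interference = [1.0, -1.0, 0.5]
signals = {
    "first_apparatus": received(enrolled, interference, 0.1),   # near the speaker
    "second_apparatus": received(enrolled, interference, 1.0),  # farther away
}

device = Device(enrolled)
target = device.closer_apparatus(signals)
print(target)  # the recognition result would be sent to this apparatus
```

In this toy setup the first apparatus receives the less distorted signal, scores higher against the registration speaker, and is therefore selected as the destination for the recognition result.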
Specification