User attribute derivation and update for network/peer assisted speech coding
First Claim
1. A communication terminal, comprising:
- a speech capture module that obtains a speech signal associated with a user;
a speech analysis module that processes the speech signal associated with the user to generate user attribute information;
a network interface module that transmits the user attribute information to a network for the purpose of making the user attribute information available to at least one other communication terminal for use in configuring a configurable speech codec of the at least one other communication terminal to operate in a speaker-dependent manner;
decomposition logic configured to decompose the speech signal into a speaker-independent signal and a speaker-dependent signal;
a first encoder configured to encode the speaker-independent signal without using the user attribute information and provide the encoded speaker-independent signal for transmission to the at least one other communication terminal; and
a second encoder configured to encode the speaker dependent signal using the user attribute information and provide the encoded speaker-dependent signal for transmission to the at least one other communication terminal.
7 Assignments
0 Petitions
Accused Products
Abstract
Systems, methods and apparatuses are described for deriving and updating user attribute information about users of a communications system. A communications network is then used to transfer the user attribute information to communication terminals, which use the user attribute information to configure a speech codec to operate in a speaker-dependent manner during a communication session, thereby improving speech coding efficiency. In a network-assisted model, the user attribute information is stored on the communications network and selectively transmitted to the communication terminals while in a peer-assisted model, the user attribute information is derived by and transferred between communication terminals.
63 Citations
44 Claims
-
1. A communication terminal, comprising:
-
a speech capture module that obtains a speech signal associated with a user; a speech analysis module that processes the speech signal associated with the user to generate user attribute information; a network interface module that transmits the user attribute information to a network for the purpose of making the user attribute information available to at least one other communication terminal for use in configuring a configurable speech codec of the at least one other communication terminal to operate in a speaker-dependent manner; decomposition logic configured to decompose the speech signal into a speaker-independent signal and a speaker-dependent signal; a first encoder configured to encode the speaker-independent signal without using the user attribute information and provide the encoded speaker-independent signal for transmission to the at least one other communication terminal; and a second encoder configured to encode the speaker dependent signal using the user attribute information and provide the encoded speaker-dependent signal for transmission to the at least one other communication terminal. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A method performed by a communication terminal, comprising:
-
obtaining a speech signal associated with a user; processing the speech signal associated with the user to generate user attribute information; transmitting the user attribute information to a network for the purpose of making the user attribute information available to at least one other communication terminal for use in configuring a configurable speech codec of the at least one other communication terminal to operate in a speaker-dependent manner; decomposing the speech signal into a speaker-independent signal and a speaker-dependent signal; encoding, by a first encoder, the speaker-independent signal without using the user attribute information and providing the encoded speaker-independent signal to the at least one other communication terminal; and encoding, by a second encoder, the speaker dependent signal using the user attribute information and providing the encoded speaker-dependent signal to the at least one other communication terminal. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24)
-
-
25. A server, comprising:
-
a speech capture module that obtains a speech signal associated with a user that is transmitted by a communication terminal over a network; a speech analysis module that processes the speech signal associated with the user to generate user attribute information; and a user attribute storage module that makes the user attribute information available to at least one other communication terminal that connects to the network for use in configuring a configurable speech codec of the at least one other communication terminal to operate in a speaker-dependent manner and makes the user attribute information available to the communication terminal for use in encoding the speech signal using the user attribute information for transmission to the at least one other communication terminal, wherein configuring the configurable speech codec of the at least one other communication terminal to operate in the speaker-dependent manner comprises decoding, by a first decoder of the at least one other communication terminal, a speaker-independent signal without using the user attribute information and decoding, by a second decoder of the at least one other communication terminal, speaker-dependent signal using the user attribute information. - View Dependent Claims (26, 27, 28, 29, 30, 31, 32, 33, 34)
-
-
35. A method implemented by a server, comprising:
-
obtaining a speech signal associated with a user that is transmitted by a communication terminal over a network; processing the speech signal associated with the user to generate user attribute information; making the user attribute information available to at least one other communication terminal that connects to the network for use in configuring a configurable speech codec of the at least one other communication terminal to operate in a speaker-dependent manner, wherein configuring the configurable speech codec of the at least one other communication terminal to operate in the speaker-dependent manner comprises decoding, by a first decoder of the at least one other communication terminal, a speaker-independent signal without using the user attribute information and decoding, by a second decoder of the at least one other communication terminal, speaker-dependent signal using the user attribute information; and providing the user attribute information to the communication terminal for use in encoding the speech signal using the user attribute information for transmission to the at least one other communication terminal. - View Dependent Claims (36, 37, 38, 39, 40, 41, 42, 43, 44)
-
Specification