Q-learning-based multi-rate transmission control (MRTC) scheme for RRC in WCDMA systems
First Claim
1. A method for radio resource control (RRC) in the WCDMA system, comprising the following iterative steps:
- obtaining a system state x and an action A(x) using an incoming request, a mean of interference Im and a variance of interference Iv;
obtaining Q values based on x and A(x) by using Im, Iv,a change of mean Δ
Im and a change of variance A Iv;
allocating an optimal action A*(x) using the Q values; and
updating the Q values using a state y and a cost function based on x and A*(x) when a new request with the state y arrives.
1 Assignment
0 Petitions
Accused Products
Abstract
The present invention provides a method using Q-learning rule to accurately estimate the transmission cost for the multi-rate transmission control (MRTC) for RRC (radio resource control) in the WCDMA network. This method comprises the following iterative steps: first obtaining a system state and an action using an incoming request, a mean of interference and a variance of interference, then obtaining Q values based on the system state and the action. Afterwards, allocating an optimal action with the Q values, and when a new request with a new state arrives, going back to the first step and updating the Q values using the new state and a cost function based on the system state and the action.
-
Citations
32 Claims
-
1. A method for radio resource control (RRC) in the WCDMA system, comprising the following iterative steps:
-
obtaining a system state x and an action A(x) using an incoming request, a mean of interference Im and a variance of interference Iv;
obtaining Q values based on x and A(x) by using Im, Iv,a change of mean Δ
Im and a change of variance A Iv;
allocating an optimal action A*(x) using the Q values; and
updating the Q values using a state y and a cost function based on x and A*(x) when a new request with the state y arrives. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 14, 15)
-
-
10. A method for radio resource control in the WCDMA system, comprising the following iterative steps:
-
obtaining a system state x and an action A(x) using an incoming request, a mean of interference Im and a variance of interference Iv;
obtaining Q values using x and A(x);
allocating an optimal action A*(x) using the Q values; and
updating the Q values using a state y and a cost function based on x and A*(x) when a new request with the state y arrives. - View Dependent Claims (11, 12, 13, 16, 17)
-
-
18. A method for radio resource control in the WCDMA system, comprising the following iterative steps:
-
obtaining a system state x and an action A(x) using an incoming request, a mean of interference Im and a variance of interference Iv;
obtaining x and A(x) as a function of Im+Δ
Im and Iv+Δ
Iv, wherein Δ
Im and Δ
Iv representing the change of Im and Iv respectively;
performing back-propagation learning rule to the function of Im+A Im and Iv+A Iv;
allocating an optimal action A*(x) using the Q values; and
updating the Q values using a state y and a cost function based on x and A*(x) when a new request with the state y arrives. - View Dependent Claims (19, 20, 21, 22, 23, 31, 32)
-
-
24. A circuit for radio resource control in the WCDMA system, comprising:
-
a state construction circuit for obtaining a system state x and an action A(x) using an incoming request, a mean of interference Im and a variance of interference Iv;
a Q-function computation circuit for obtaining Q values based on x and A(x) by using Im, Iv,a change of mean Δ
Im and a change of variance A Iv;
a rate allocation circuit for allocating an optimal action A*(x) with the Q values; and
a Q-learning rule circuit for updating the Q values using state y and a cost function based on x and A*(x) when a new request with a state y arrives. - View Dependent Claims (25, 26, 27, 28, 29, 30)
-
Specification