I Introduction
Future 5G communication systems demand higher data rate, larger bandwidth and higher spectral efficiency. With huge unlicence spectrum, millimeter wave communication from 30GHz to 300GHz is considered a promising solution to the lack of spectrum in current communication systems [2]. Thanks to the short wavelength in mmWave system, compared with GHz radio system, more antennas can be integrated into the same space to get higher directional gain to compensate the path loss [3]. Millimeter wave communication has been widely used for longdistance pointtopoint scenario in satellite and terrestrial applications and is being studied and developed for commercial cellular systems.
In conventional MIMO systems, one antenna is corresponding to one RF chain (including amplifiers, mixers, ADC/DACs). But in massive mmWave MIMO systems, this may cause unbearable power consumption and hardware cost. Hence hybrid beamforming techniques are introduced into the mmWave massive MIMO systems [4]. Hybrid beamforming (HBF) contains two stages: the analog beamforming (ABF) in the RF domain mainly aimed to reduce the number of RF chains and the digital beamforming (DBF) in the baseband domain functioning as in the conventional MIMO systems. The former is usually implemented with phase shifter network with constant amplitude constraint.
There are many works coming up with the HBF algorithms for both the pointtopoint and multiuser mmWave MIMO systems. In the pointtopoint scenario, paper [5] proposed an OMP based algorithm to jointly design the analog and digital beamforming matrix, which is widely considered as the performance comparison of the subsequent algorithms [6, 7, 8]. In [9] a twostage asymptotic optimal hybrid beamforming algorithms based on SVD of the channel matrix was proposed considering the number of RF chains is equal to the number of data streams. The above HBF algorithms are based on the assumption that perfect CSI is available at both the transmitter and the receiver. With no CSI knowledge, channel estimation should be carried out through the ways of dividing the whole space into beam grids and then applying compressive sensing algorithms [10, 11] or the offgrid ways applying spatial spectrum method[12, 13], gradient descent method [14] and so on [15].
In the multiuser scenario, many twostage HBF algorithms are proposed. To design the analog beamformer, apart from the methods with the requirement of CSI mentioned above [16]
, beam training methods are also widely adopted. Beam training methods are intended to find the best performing candidate vectors by searching among the codebook with different strategies such as maximizing the achievable sum rate, maximizing the data rate of per user
[17], or their combination [18]. The exhaustive searching performs best but suffers huge computational complexity thus is impractical to be applied. The peruser selection method [17] reduces the computational complexity but the performance is inferior to exhaustive searching. A twostep selection method is proposed [18] to make a tradeoff between the computational complexity and the performance. The DBF is often intended to reduce interuser interference by, for instance, zeroforcing (ZF) algorithm and block diagonalization (BD) algorithm[19, 20, 21]. It has been proved that applying ZF precoder can achieve linear growth of the sum rate with respect to the number of antennas and users when they go to infinity[22], which is approximate to dirty paper coding (DPC). So ZF precoder is widely used as the digital precoder[17, 23, 24, 16] due to its low complexity and asymptotic optimality.In RF domain of mmWave massive MIMO systems, the heavy usage of phase shifters (from a few hundred to several thousand) in HBF structure makes the system cost sensitive with the unit price of phase shifters and usually the unit price of mass production determines the component performance. Constrained by cost, volume and production process, it is unavoidable to induce the phaseshifting error and gain error of phase shifters. The rms value of phaseshifting error of the stateoftheart 360degreecoverage passive or active phase shifters working at the mmWave frequency is about to and the rms value of gain error is about dB to dB [25, 26, 27]
. It is very critical for manufacture to get the relation between the system performance of mmWave massive MIMO systems and the component performance of imperfect phase shifters for mass production. However, to the best of author’s knowledge, there is no work analyzing the influence of the phaseshifting error and gain error of phase shifters. Different from the uniformly distributed quantization error of digitally controlled phase shifters considered in
[9, 17], the phaseshifting error and gain error follow Gaussian distribution, exist in both analog and digital phase shifters and are random and unknown so that they are hard to be compensated.
In our previous work, we have studied the performance loss with imperfect phase shifters in the pointtopoint mmWave MIMO systems [1]. Our analytic procedure therein is applicable for all kinds of hybrid beamforming methods, including, but not limited to the aforementioned two. In this paper, we focus on multiuser scenario in the mmWave massive MIMO systems and investigate the performance loss with phaseshifting error and gain error. We only consider the fullyconnected hybrid beamforming structure for lack of space and the similar analytic method can be applied to other structures. The main contributions of this paper are listed as follows:

In the downlink multiuser mmWave massive MIMO systems, we derive the upper bound of the achievable sum rate with the phaseshifting error and gain error. We assume that the base station has all the perfect CSI and each user has its own CSI. We use the ABF design method based on the SVD of channel [16] as an example and similar analysis can be applied to other ABF design methods. We use ZF precoder as the digital precoder to analyze the achievable sum rate with imperfect phase shifters. Both the theoretical derivation and numerical simulation show that when the number of antennas at the BS is far more than the number of users, the upper bound can be very tight. With the phaseshifting error of and gain error of dB, the sum rate has nearly degradation when SNR is dB, which is not negligible. As SNR going larger, the achievable sum rate will be limited to a performance ceiling.

To address the aforementioned problem, we proposed a fast and accurate estimation method of the angleofdeparture (AOD) at the downlink transmission and then design the hybrid beamformer. The existing hybrid beamforming methods demand large training overhead to enhance the performance and suffer huge computational complexity thus cannot be carried out in the downlink. Limited to this, many works estimate the CSI in the uplink and design the downlink hybrid beamformer based on the uplink CSI with the assumption of channel reciprocity. But in practice the channel reciprocity is not satisfied. In our proposed downlink channel estimation algorithm, neither iteration procedure nor searching among the codebook is required thus the training overhead can be cut down greatly, as well as the computational complexity. Then the stage of estimating the equivalent channel is added, based on which the digital precoder can be designed. With the proposed channel estimation and HBF design algorithm, the ceiling of the sum rate in the high SNR regime can be removed and the training overhead can be reduced in the meantime. Our proposed algorithm performs well when the channel paths are few and the channel Rician Kfactor is large. We further make a tradeoff between the performance and the training overhead so that the algorithm can work well in more kinds of propagation environment.
The remainder of the paper is organized as follows. In section II, the system model as well as the channel model is characterized. In section III, we study the loss of the achievable sum rate in multiuser scenario with imperfect phase shifters. Then we proposed an algorithm for the channel estimation and hybrid beamforming design to compensate the performance loss and reduce the training overhead in section IV. Simulation results are shown in section V to validate the theorem in the previous section and compare the performance of the existing algorithms with our proposed one. Finally the conclusion is presented in section VI. The proof of the theorem is detailed in the appendix.
Notations: In this paper, we use boldface letters to denote matrices(upper case) and vectors(lower case). We use , , , , to denote transpose, Hermitian transpose, inverse, trace and determinant of matrix , respectively. denotes the set of complexvalued matrix with dimension ; denotes the Frobenius norm of matrix; denotes the statistical expectation; denotes the complex Gaussian random vector with mean and covariance ;
denotes the real Gaussian random variable with mean
and variance
; denotes the real part of the complex number .Ii System Model
Iia System Model
In this section we consider the downlink multiuser mmWave massive MIMO systems. The base station (BS) with antennas communicates to users simultaneously . At the BS, the number of RF chains is , which is equal to the number of users . At the user side, each user is equipped with antennas and one RF chain.
At the BS, before transmitted, the symbol vector where is precoded by the hybrid precoder where is the analog precoder and denotes the digital precoder in the baseband domain. The analog precoding matrix is usually implemented with phase shifters so the elements are limited to the constant amplitude constraint . Then the transmitted signal vector is denoted as .
The received signal vector at the th user is given as below
(1) 
where indicates the average transmitted power for each user, denotes the channel state information from the BS to the th user, which will be modeled later and is the noise vector with independent and identically distributed (i.i.d.) elements. It is assumed that the BS has the perfect CSI of all the users and each user has its own CSI. The factor is the normalization factor of transmission power. At the user side, only analog combiner is adopted for there is only one RF chain, which is denoted as and . After analog combining, the received signal of the th user can be written as
(2) 
In this paper, the fullyconnected hybrid beamforming is considered where one RF chain is connected with all the antennas through the phase shifter network. The phase shifters can be analog with continuous phase or digital with bit resolution discrete phase. With the phase shifter network, the analog precoding matrix can be expressed as
(3) 
where and for digital bit resolution phase shifters, for analog phase shifters. The analog combiner is the same.
When there exist phaseshifting error and gain error, the elements of analog beamforming matrix will become with the gain error and the phaseshifting error . And the analog precoding matrix, the analog combining vector, and the hybrid precoding matrix are denoted by , separately. Since the phaseshifting error and gain error are unknown to the BS, they can’t be compensated in the baseband domain and the practical digital beamforming matrix has no relation with the phaseshifting error and gain error.
IiB Channel Model
In the multiuser scenario, the user equipment may have very few antennas thus only the LOS path is dominant and the scattering paths can be negligible in the longdistance propagation or nonnegligible but very weak relatively in the shortdistance propagation referring to [17]. Therefore, we take the Rician fading channel with a large Rician Kfactor into account. The channel of the th user can be expressed as
(4) 
where is the Rician Kfactor of the th user and we set , to simplify the analysis. is the channel matrix of LOS path and is the NLOS component. Adopting the uniform linear array (ULA) structure, we have
(5)  
where is the path coefficient of the th scattering path corresponding to the th user. The angles , represent the downlink angleofdeparture (AOD) and angleofarrival (AOA) of the LOS path and , stand for the downlink AOD and AOA of the th scattering path. and represent the antenna array response vectors at the BS and user side. The dimensional antenna array response vector at angle , for ULA structure is expressed as
(6) 
where is the wavelength and is the antenna spacing. In this paper, it is assumed that the transmitter and receiver are equipped with linear arrays with .
Iii Performance loss in Multiuser scenario
In this section we consider the performance loss due to the phaseshifting error and gain error in the downlink multiuser mmWave MIMO system. We use the asymptotically optimal hybrid beamformer for the multiuser scenario. Firstly the SVD factorizes each user’s channel matrix as and the diagonal elements of is in a descending order. The analog precoder at the BS is and the analog combiner at each user is where
is the quantization of the angle in the parentheses according to the resolution of the phase shifters. When the number of users is large, more than one users share a very similar AOD with a high probability to some extent and these users may cause large interference to each other. In this case, the user scheduling is indispensable and was ignored in most of the existing research. The work in
[28] proposed a greedy angle division multiple access (ADMA) user scheduling algorithm to divide the users into different scheduling groups. In our work we use a simple way that the users within a AOD range of rad will be silent in the current transmission cycle except only one user. And the analog beamforming vectors corresponding to the silent users will be set as zero vectors. We use to denote the index set of the nonsilent users and its cardinality is (). The analog beamforming matrix at the BS can be written as where stands for the th element in the set , .As to the digital precoder at the BS, the dirty paper coding (DPC) is optimal but too difficult to implement. The low complexity linear precoding algorithms such as zeroforcing (ZF) are preferred whose performance converge to the optimal DPC in the massive MIMO systems. The digital ZF precoder at the BS is given by
With the ZF precoder, the multiuser interference can be eliminated and the SINR of the th nonsilent user can be expressed as
(9) 
Then the achievable sum rate is
(10) 
In the sequel, the phaseshifting error and gain error of phase shifters is taken into account and the equivalent channel under these error can be expressed as
(11) 
where .
We use and to define the digital ZF precoder based on and respectively. Note that is the equivalent channel in practice but unknown due to the phaseshifting error and the gain error of phase shifters. What we know is consisting of perfect phase shifters and we can only design the digital precoder according to , instead of . Thus the received signal at the th user is given by
(12) 
From (12), we can see the multiuser interference can’t be eliminated due to the mismatch of digital precoder and the practical equivalent channel . In this case, the SINR of the th user can be written as
(13) 
where we define for the sake of simplifying the notation.
Then the achievable sum rate under phaseshifting error and gain error of phase shifters is
(14) 
where the normalization factor .
However, it is hard to derive the explicit result of (14). In [17], an approximate result of the achievable sum rate (14) is given by
(15) 
By calculating the second derivative of the function , we find that when , the function is convex and the formula (15) is the lower bound of (14) exactly. In massive MIMO systems, formula (15) can be very approximate to (14) as shown in Fig. 1 in the section V thus formula (15) can describe the effect of phaseshifting error and gain error upon the achievable sum rate well. Then we calculate the theoretical value of the expectation of in theorem 1.
Theorem 1.
In multiuser scenario, the lower bound of can be expressed as
(16) 
When or , the bound can be very tight.
Proof: Refer to Appendix A.
Thus we can derive the lower bound of the loss of achievable sum rate based on the expression
(17) 
In the low SNR regime where , the loss of the achievable sum rate due to the phaseshifting error and gain error can be neglected. In the high SNR regime, we further calculate the performance ceiling of the achievable sum rate which is presented in the following corollary.
Corollary 1.
In the high SNR regime, the performance ceiling of the achievable sum rate with imperfect phase shifters is
(18) 
where ‘LB’ represents the lower bound of given in (16). Formula (18) shows that the achievable sum rate is limited to a ceiling only determined by the variance of phaseshifting error and gain error of phase shifters.
Proof: As the SNR going infinity, thus we have . is the probability of the case that the number of the nonsilent users is . It is hard to calculate the explicit expression of but we can obtain the value of by MonteCarlo simulation. As an example, when and , we have , , , . And for simplicity we can just use
(19) 
as an approximation of the performance ceiling.
Remark 1.
In this paper, we focus on the ZF digital precoder at the BS and analyze the performance loss with imperfect phase shifters. Other digital precoders like regularized ZF (RZF) which can further improve the performance in the low SNR regime can be analyzed by the similar procedure. The RZF digital precoder can be expressed as [29]
(20) 
where the transmission power normalization factor . It is easy to prove that is equal applying RZF and ZF digital precoder with the same analytic procedure. Thus the loss of the achievable sum rate caused by phaseshifting error and gain error is nearly the same with RZF or ZF precoder. While thus the achievable sum rate will be improved with RZF precoder especially in the low SNR regime.
Remark 2.
When the number of users communicating to the BS simultaneously is large, the user scheduling scheme will not be optimal of course. Some hybrid beamforming algorithms will achieve higher sum rate such as the exhaustive search among large enough amount of candidate analog beamforming vectors with the strategy of maximizing the sum rate. In these cases, the analytic procedure is also applicable, as well as the derived lower bound. In fact as long as the digital precoder is the linear precoder such as maximum ratio transmission (MRT), ZF, RZF, the similar analytic procedure can be used to different analog beamforming algorithms. For other digital precoder design such as block diagonalization (BD), we will leave it to the future research.
The reason of the performance loss is mainly twofold. One is that the main lobe width of the directional beam will be larger and the side lobe level will be higher due to the distorted analog beamforming vectors with the imperfect phase shifters, which indicates that the beam power will dissipate more and the directional gain will be lower. The other is the ZF digital precoder cannot eliminate the interuser interference since they lack the perfect knowledge of the practical equivalent channel which is the combination of the analog beamforming matrices at the BS and users and the propagation channel. The latter is easy to deal with by estimating the practical equivalent channel. However the former is hard to calibrate because it is impractical to get the exact knowledge of each imperfect phase shifter for the huge complexity. Fortunately, the former one causes far less harm than the latter one and we just use the imperfect phase shifted array as the analog beamforming vectors. In the next section, we will propose a channel estimation and hybrid beamforming method to resist the phaseshifting error and gain error.
Iv Channel estimation and hybrid beamforming design
The massive MIMO systems face the challenge of the huge training overhead of channel estimation in the downlink as well as the heavy feedback overhead. Furthermore, the user equipment will bear the heavy burden of high computational complexity to estimate the downlink channel, which is not a wise choice for power saving. To deal with it, most of the existing approaches prefer TDD mode with the assumption of channel reciprocity thus the channel can be estimated in the uplink and the downlink precoding matrix can be designed according to the uplink CSI. However, it is worth mentioning that although the propagation channel is reciprocal, the transceiver RF chains are not and we need calibration methods to deal with the nonreciprocity, which incurs extra overhead or hardware cost. In this section, we find a way to estimate the channel in the downlink with low training overhead and limited feedback. The user equipment can also enjoy the low computational complexity for saving power. No reciprocity calibration is required and both TDD and FDD modes can be applied. What’s more, we can design the hybrid beamformer to remove the performance ceiling due to the imperfect phase shifter.
Considering hybrid mmWave MIMO systems under Rice channel model, the main idea of our proposed algorithm is that the analog precoder is intended to steer the beam at the LOS path direction and the power of other scattering paths can be collected partially by digital precoder when the LOS path is far stronger than the scattering paths. Thus to design the analog precoder at the BS, the knowledge of the downlink AOD corresponding to the LOS path is enough, instead of the whole knowledge of channel. Once we have the analog precoder, the BS can send training signals to the users to get the knowledge of the equivalent channel, which has much fewer elements and lower dimension compared with the propagation channel and contains the contribution of scattering paths. Then the digital precoder can be designed based on the equivalent channel. The whole process consists of four stages as shown in Algorithm 1.
At the first stage, it is intended to estimate the downlink AOD of the LOS path at the user side. The DFT interpolation method is applied here to estimate the AOD by directly interpolating the complexvalued DFT coefficients. The BS sends the pilot matrix
using time slots and the analog precoding matrix is set as(21) 
where is the DFT matrix and the to rows of
are one padded. Of course, the elements of
are distorted by the imperfect phase shifters but for simplicity of notation, we just drop the subscript E below.The received signal at the th user is expressed as
(22) 
where and are constants and not involved in the AOD estimation. is the equivalent noise including the contribution of training signals going through scattering paths. Each user uses only one omnidirectional antenna to receive the pilot signals and the analog combining vector can be written as . is the noisy complexvalue DFT coefficients and the DFT size is . Our goal is to estimate the downlink AOD of the LOS path . Quite a few algorithms of DFT interpolation have been published [30, 31, 32] and there is a comparison of them in [33]. Finally we choose Jacobsen’s algorithm for its better performance in low SNR regime.
In Jacobsen’s algorithm, we first find the maximum in the absolute value of the elements of and its index is set as . The indices of its two neighbors are and , respectively. Note that can be at one end of and in these cases one of its neighbors is the other end of . Then the AOD can be estimated by the DFT interpolation of them three complexvalue DFT coefficients
(23) 
where is the th element of .
Then it comes to the second stage of the user scheduling and analog beamformer design. The users feed back the estimated AOD to the BS. And if there are some AODs between which the difference is smaller than the beam width , they are scheduled as silent ones except one and the analog beamforming vectors corresponding to the silent users are zero vectors. The analog beamforming vector corresponding to the th nonsilent user can be set as .
Considering the multipleantenna user equipment, it needs an extra stage to design the analog beamformer at the users. Restricted by the volume and power consumption, the amount of the antennas at the users is far less than the BS thus we can search for the best analog beamforming vector in the beamsteering codebook exhaustedly as [17] and it won’t take much overhead. Of course, unavoidably, the analog beamforming vectors will be distorted due to the phaseshifting error and gain error of phase shifters and we leave it to the digital beamformer.
At the third stage, we estimate the equivalent channel in the downlink. The BS sends orthogonal length pilot sequences to the users with , which is derived at the second stage but distorted by the imperfect phase shifters, as the analog precoding matrix and as the analog combining vector. For simplicity, the pilot matrix is set as . At the th user, the received sequence is
(24) 
and we use as the estimation of , which incurs the noise, inevitably.
In [17], they assume that the channel is reciprocal and estimate the equivalent channel in the uplink. But as we mentioned above, channel reciprocity can not be guaranteed and the synchronization among the users may cause extra overhead, thus we estimate the equivalent channel in the downlink and then feed back it. However, in some of other works, once they get the knowledge of propagation channel and design the hybrid beamformer according to it, they just use (8) as the equivalent channel [16] and they are very sensitive to the phaseshifting error and gain error. Although the estimation of equivalent channel may induce additional noise, it is beneficial to weaken the harm of the imperfect phase shifters and therefore enhance the performance as shown in the simulations. So there may be a tradeoff intuitively that in the low SNR regime where the harm of the noise is more serious than the imperfect phase shifters, the estimation of equivalent channel is more harm than good and can be omitted.
At the last stage, the estimated equivalent channel is fed back to the BS and the digital beamforming matrix can be set as . In some systems with limited feedback requirement, the equivalent channel can be chosen among a codebook and only the index should be fed back[34].
Next, we will compare our proposed algorithm with others in the following aspects. And we will show that our proposed algorithm outperforms some of the other algorithms, especially in [17] conditionally with less training overhead and hardware cost. We also propose a way to further enhance the performance of our algorithm.
Iva Resolution of phase shifters
Referring to [17], to guarantee the analog beamforming performance, the analog beamforming codebook at the BS should contain candidate vectors at least. If the beam steering vectors are used as candidate vectors, as [5, 17], the digital phase shifters should have the resolution of bits. In the massive MIMO systems, the BS is equipped with hundreds of antennas and the heavy use of the high resolution phase shifters may result in huge power consumption and hardware cost. Thus it is expected to use low complexity and powersaving phase shifters with few quantization bits. Alternatively, as proven in [9], if we get the optimal full digital beamformer at first, then extract the phase of the elements, and finally quantize them to obtain the phase of phase shifters, only or bits phase shifters are good enough no matter how many the antennas are. However, in [9], it should do SVD to the channel matrix to obtain and the computational complexity of SVD is too high for the hardware DSP. What’s more, the whole knowledge of channel matrix consisting of both the path angles and the path coefficients is hard to obtain. In our algorithm, we only need to estimate the AOD of the LOS path and can be obtained thus we can use low resolution phase shifters to approximate the phase of the elements of .
IvB The number of users supported
Referring to [33], the Jacobsen’s estimator asymptotically holds
where is the DFT size, is the spectral density of the noise. This reveals that with larger DFT size, the accuracy of the estimation can be improved qualitatively fixing the power of noise. And in our algorithm, it implies that with more users, the accuracy of the estimation is enhanced. To achieve the same performance with [17] when the users are few, we can use several training cycles to enlarge the DFT size. The whole analog beamforming matrix in the AOD estimation stage containing training cycles can be expressed as
And at the th cycle, we use the to columns of as the analog beamforming matrix, . At the user side, the AOD estimation will be operated among the received symbols thus the DFT size is and we can make a tradeoff between the performance and the training overhead.
IvC The number of channel paths and Rice Kfactor
Considering Rice channel in mmWave systems, as we mentioned above, the analog beamforming vector is designed with the intention to steer the transmitting beam at the LOS path. So at the AOD estimation stage, the training signal going through the scattering paths will be regarded as noise at the user side. With more channel paths, and smaller Rician Kfactor, the power of the noise is relatively larger thus the estimation will be more inaccurate. The algorithm in [17] is robust in different propagation environment with different number of channel paths and Rician Kfactor. To enhance the accuracy of estimation in this case, we can also use several training cycles to enlarge the DFT size thus to resist noise.
IvD Computational complexity and training overhead
As mentioned above, the computational complexity of exhaustedly search in the codebook will increase linearly with at the BS in the [17], as well as the training overhead. However, in our proposed algorithm, there is no need of the search process and the complexity of Jacobsen’s estimation method increases linearly with the number of the users, as well as the training overhead. What’s more, it must take several guard time for the stability of the phase shifters after changing their phases and the training time slots will be much longer using the algorithm in [17]. In our algorithm, in the AOD estimation stage, the phase of the phase shifters need not be changed, which will save the guard time as well as the power. In the cases where we should take several cycles to get better performance, the training overhead is multiplying the number of cycles by the number of users, which is also far less than the number of antennas. Of course the guard time is also required.
Remark 3.
In the AOD estimation stage, if the analog beamforming matrix is
the accuracy of AOD estimation will be greatly improved with the existence of phaseshifting error and gain error. However the elements ‘0’ can not be realized by phase shifters. Considering the hardware structure in [35] where each phase shifter is followed by a switch, we can realize the elements ‘0’ by turning off the switch. This may cause extra hardware cost but it can use less training cycles and less phase shifters to achieve the same performance, which may consume less power.
V Simulations
In this section we will show our simulation results about the degradation of the performance with phaseshifting error and gain error in multiuser scenario. And the performance of our proposed algorithm will also be shown later, as well as the comparison with others.
Va Performance loss due to phaseshifting error and gain error
In the downlink multiuser scenario, we set the number of the antennas at the BS as and each user is equipped with antennas. Both the BS and the users have ULA antennas. The channel between the BS and each user is assumed to be Rice model with paths totally and the Rician Kfactor is . The channel knowledge is assumed to be perfect both at the BS and the user side. The number of the total users is . Each user is served by a single RF chain thus the number of RF chains at the BS equals
. As to the hybrid beamforming process, the analog beamforming matrix is set based on the SVD of channels by extracting the phase of the elements of the right and left singular vector corresponding to largest singular value, and then quantize them with
bits phase shifters. The digital precoder at the BS is ZF precoder based on the equivalent channel in (8). The transmit signaltonoise ratio is defined as .As we mentioned before, the upper bound of achievable sum rate with imperfect phase shifters can be derived according to (15), which is an approximation, exactly lower bound, of the practical achievable sum rate shown in (14). In Figure 1, we can see that there is only a very small gap between (15) and (14). Therefore in the following figures, we use (15) as the simulation result of achievable sum rate with imperfect phase shifters.
Then we investigate the effect of phaseshifting error and gain error jointly in Figure 2. The number of the users is . We can see that when and rad, the loss ratio of the achievable sum rate is nearly when dB. With RZF digital precoder, the achievable sum rate can be slightly improved in the low SNR regime while in the high SNR regime, there is no difference between the performance of ZF and RZF digital precoder and they are all limited to the performance ceiling shown by the green dotted lines derived by formula (19). The theoretical upper bound shown by the black lines according to formula (15) and (16) is very tight and it is suitable for the RZF digital precoder, too.
In Figure 3, we fix the SNR as dB and change the number of users from to . With larger amount of users, the orthogonality among users can not be guaranteed and the interuser interference could be more serious. Thus the theoretical upper bound shown by black lines is tighter when the user is less for the orthogonality can be satisfied. Applying our user scheduling method mentioned in section IV, some of the users may be silent and the achievable sum rate does not increase linearly with the number of users. And the gain of achievable sum rate with more users further drops when there exist the phaseshifting error and gain error.
VB Performance of our proposed channel estimation and hybrid beamforming algorithm
In this section we will compare the performance of several existing algorithms with our proposed one with respect of different parameters. In the following three figures, the curves in blue and green both use the SVD based HBF design algorithm with the assumption of full knowledge of CSI. The analog beamforming vectors are obtained by doing SVD to the channel as we mentioned in section III, and the ZF digital precoder is based on the equivalent channel in (8). This algorithm is similar to [16] and the only difference is that they proposed a low complexity algorithm to design therein. However, the curves in blue show the achievable sum rate with perfect phase shifters, which can be regarded as the benchmark while the curves in green represent the performance with imperfect phase shifters. The curves in red are corresponding to the HBF design algorithm in [17], where the analog beamforming vectors are selected from the beamsteering codebook and the digital precoding matrix is ZF precoder based on the estimated equivalent channel. The curves in black show the performance of our proposed channel AOD estimation and hybrid beamforming design algorithm. The number of antennas at the users is and the resolution of the phase shifters is at the BS and at the user side.
In Figure 4, it is interesting to see that with the same number of users , the performance of our proposed algorithm with is a little worse than that with and the performance gap between the algorithm in [17] and ours is also larger when . When , , our proposed algorithm outperforms the algorithm in [17] in the low regime. The reason is that with relatively larger DFT size, the accuracy of AOD estimation can be improved. While in the high regime, the algorithm in [17] turns the tide. It can be explained that in [17] the probability of the number of nonsilent users is different from ours. In [17], if more than one users share the same analog precoding vector, which is selected among the total candidate vectors, they are scheduled as silent users except one. By MonteCarlo simulation, the average of the nonsilent users is when , and when , . However, in our proposed algorithm, the analog precoding vectors are designed based on the AOD of the strongest path and the AODs are uniformly distributed in . By MonteCarlo simulation, the average of the nonsilent users is when , and when , . Thus we can draw the conclusion that the average number of nonsilent users using the algorithm in [17] is larger than our proposed algorithm and the SVD based HBF algorithm, which will result in higher achievable sum rate.
In Figure 5, we can see that the performance in [16] is very sensitive to the gain error and phaseshifting error. In the low SNR regime, it performs best while in the high SNR regime, the performance will be limited to the ceiling, which indicates the necessity of the estimation of the equivalent channel. Both of the other two algorithms can remove the performance ceiling. The red curves corresponding to different level of phaseshifting error and gain error all coincide, which indicates that the algorithm in [17] is the most robust with phaseshifting error and gain error. When and rad, the accuracy of AOD estimation drops seriously and the performance of our algorithm fall behind greatly while when and rad, the influence of phaseshifting error and gain error can be neglected. However there is a performance gap between the red and black curves even with perfect phase shifters because the accuracy of AOD estimation is limited due to the training signals going through the scattering paths, which can be regarded as part of the equivalent noise. Then we focus on the condition that and rad and use multiple training cycles, as shown by the dash line with training cycles and the dot dash line with training cycles, to enhance the performance. We can see that with training cycles, the performance of our proposed algorithm is almost the same as the algorithm in [17] when SNR is less than dB. And with training cycles, which takes the same training overhead as the algorithm in [17], our proposed algorithm performs better.
Then we fix SNR as dB and look into the influence of the Rician Kfactor upon the achievable sum rate. As shown in Figure 6, the algorithm in [17] is robust in different propagation environment. And when and rad, our proposed algorithm using only one training cycle suffers a serious performance loss. With more channel paths and smaller Rician Kfactor the performance further drops. Considering the training overhead, the algorithm in [17] should search the codebook exhaustedly and the training time slots in need are to form the analog beamforming matrix while our proposed algorithm demands only time slots. Then we make a tradeoff between the performance and the overhead. With two training cycles, the performance can be greatly improved and there is only a small performance gap between the algorithm in [17] and ours. With eight training cycles, the performance is further enhanced and better than the algorithm in [17].
To sum up, thanks to the few propagation paths and the strong LOS path in multiuser mmWave communication systems, the performance of our proposed algorithm is not worse than the algorithm in [17], sometimes even better. And the performance ceiling due to the imperfect phase shifters can also be removed. Meanwhile the training overhead can be greatly saved.
Vi Conclusion
In this paper, we looked into the loss of achievable sum rate in multiuser scenario caused by phaseshifting error and gain error of phase shifters. We derived the upper bound of the achievable sum rate and the expression is validated via simulations. The theoretical and simulation results indicate that the degradation of the performance is serious with the existence of phaseshifting error and gain error and will be limited to the ceiling in the high SNR regime. Some compensation algorithms may be required to deal with phaseshifting error and gain error. Then we proposed a downlink channel estimation and hybrid beamforming algorithm, which can enjoy good performance with much lower training overhead. The tradeoff between the performance and training overhead is further made to adjust the algorithm to more communication conditions.
Appendix A
Proof of the Theorem 1
Proof: The derivation of the lower bound is based on the assumption that the resolution of phase shifters is high enough and . Firstly we do SVD to the channel between the BS and th user, and the data stream is desired to be transmitted through the strongest channel path. We have
(25) 
where is the largest singular value of and is the left and right singular vector corresponding to . are the linear combinations of the antenna array response vectors of AOA and AOD [5]. When the resolution of phase shifters is high enough, we have and . When , the right singular vector corresponding to the largest singular value of each user is orthogonal to each other thus .
Based on these assumption, the equivalent channel is approximate to a diagonal matrix and can be expressed as .
And the ZF precoder at the BS is given as .
When there exist the phaseshifting error and gain error, we define and and . With and , we have