Reliability and User-Plane Latency Analysis of mmWave Massive MIMO for Grant-Free URLLC Applications

5G cellular networks are designed to support a new range of applications not supported by previous standards. Among these, ultra-reliable low-latency communication (URLLC) applications are arguably the most challenging. URLLC service requires the user equipment (UE) to be able to transmit its data under strict latency constraints with high reliability. To address these requirements, new technologies, such as mini-slots, semi-persistent scheduling and grant-free access were introduced in 5G standards. In this work, we formulate a spatiotemporal mathematical model to evaluate the user-plane latency and reliability performance of millimetre wave (mmWave) massive multiple-input multiple-output (MIMO) URLLC with reactive and K-repetition hybrid automatic repeat request (HARQ) protocols. We derive closed-form approximate expressions for the latent access failure probability and validate them using numerical simulations. The results show that, under certain conditions, mmWave massive MIMO can reduce the failure probability by a factor of 32. Moreover, we identify that beyond a certain number of antennas there is no significant improvement in reliability. Finally, we conclude that mmWave massive MIMO alone is not enough to provide the performance guarantees required by the most stringent URLLC applications.



There are no comments yet.


page 1


Cell-free Massive MIMO with Short Packets

In this paper, we adapt to cell-free Massive MIMO (multiple-input multip...

Channel Hardening in Massive MIMO - A Measurement Based Analysis

Wireless-controlled robots, cars and other critical applications are in ...

Impact of Interference Subtraction on Grant-Free Multiple Access with Massive MIMO

The design of highly scalable multiple access schemes is a main challeng...

Ultra-Reliable Communication in 5G mmWave Networks: A Risk-Sensitive Approach

In this letter, we investigate the problem of providing gigabit wireless...

Achieving Energy-Efficient Uplink URLLC with MIMO-Aided Grant-Free Access

The optimal design of the energy-efficient multiple-input multiple-outpu...

Enhancing Favorable Propagation in Cell-Free Massive MIMO Through Spatial User Grouping

Cell-Free (CF) Massive multiple-input multiple-output(MIMO) is a distrib...

Balance Queueing and Retransmission: Latency-Optimal Massive MIMO Design

One fundamental challenge in 5G URLLC is how to optimize massive MIMO co...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.

I Introduction

The 3rd generation partnership project (3GPP) has identified three distinct use cases for 5G new radio (NR) and beyond cellular networks based on their different connectivity requirements: enhanced mobile broadband (eMBB), massive machine-type communication (mMTC) and ultra reliable low-latency communication (URLLC) [series2015imt]. Since the inception of the idea of 5G NR, it has been argued that its main revolution is a change of paradigm from a smartphone-centric network to a network capable of satisfying the requirements of diverse services, such as machine-to-machine and vehicle-to-vehicle communications [WirelessFuture2021]. The URLLC scenario targets applications that require high reliability and low latency, such as augmented reality (AR), virtual reality (VR), vehicle-to-everything (V2X), critical internet of things (cIoT), industrial automation and healthcare. According to the use cases defined in [3gpp.38.913], the main key performance indicator (KPI) to be satisfied in URLLC applications is the latent access failure probability, which incorporates the reliability and latency requirements needed in such applications. The requirements for URLLC applications vary from transmission reliability to transmit bytes of data with a user-plane latency of less than ms to a reliability to transmit bytes with a user-plane latency of between and ms, depending on the application [3gpp.38.913].

lte and prior networks were not designed with such constraints in mind. Scheduling in long-term evolution (LTE) follows a grant-based approach, where the user equipment (UE) must request resources in a -step random access (RA) procedure before transmitting data [Vilgelm2018]. In the best-case scenario, it takes at least ms for a UE to start transmitting its payload.

Therefore, new mechanisms were introduced into the 5G NR specification to support the latency requirements of URLLC applications. Firstly, a flexible numerology was proposed, introducing the concept of a mini-slot that last as little as ms [3gpp.38.912], in contrast to the ms minimum slot duration on LTE, enabling fine-grained scheduling of network resources [Zaidi2018]. Secondly, the introduction of semi-persistent scheduling (SPS) of grants [3gpp.38.213, Karadag2019], where some of the networkś resource blocks are periodically reserved for URLLC applications, thereby avoiding the grant request procedure. Despite the efforts, not all URLLC applications have a periodic traffic pattern and are therefore unable to benefit greatly from SPS. Additionally, some services require low latency and reliable transmission to transmit small sporadic packets. With that in mind, both the standards committee [3gpp.1704222] and researchers have put a lot of effort to investigate grant-free transmission, where the UEs transmit their payload directly in the RA channel. This culminated with the introduction of the -step RA procedure introduced in Release 16 [3gpp.1704222]. The -step RA procedure follows a grant-free approach, where instead of waiting for a dedicated channel to be assigned by the network, it transmits its data directly into the RA channel and waits for feedback from the network [Kim2021].

Moreover, massive multiple input multiple output (MIMO) is a fundamental part of 5G NR [Lu2014, Larsson2014, Bjornson2018]. It provides performance gains by improving diversity against fading and, along with advanced signal processing techniques, can provide directivity to transmission/reception, mitigating interference between spatially uncorrelated UEs [Marzetta2016]. The performance enhancements provided by MIMO are essential to ensure the reliability and the low latency required by URLLC applications. In conjunction with massive MIMO comes millimeter wave (mmWave) transmission. Due to its small wavelength, mmWave antennas can be packed into massive arrays, making it a key enabler of massive MIMO systems which attracted a significant interest on the topic [Rappaport2013, Rangan2014, Andrews2016, Sattar2017, Sattar2019a, Sattar2019b]. However, mmWave propagation comes with its own challenges due to the severe propagation loss experienced by electromagnetic signals in this frequency range.

In this paper, we develop a spatiotemporal analytical model to evaluate the performance of mmWave massive MIMO communication systems for URLLC

applications. We use tools from stochastic geometry and probability theory to evaluate and compare system performance metrics by deriving closed-form approximate expressions for its latent access failure probability under different

hybrid automatic repeat request (HARQ) protocols.

I-a Related Work

In [Gao2019], the authors propose a queueing model to compare the throughput performance of packet-based (grant-free) and connection-based (grant-based) random access. They conclude that packet-based systems with sensing can achieve greater throughput than connection-based one for small packet transmissions. In [Liu2021, Evangelista2021], the optimization of grant-free access networks is investigated. The former considers the dynamic optimization of HARQ and scheduling parameters with non-orthogonal multiple access (NOMA), while the former considers the distributed link adaptation problem. Both papers formulate the respective optimization tasks as

multiagent reinforcement learning

(MARL) problems. The probability of success and the area spectral efficiency of a grant-free sparse code multiple access (SCMA) system is evaluated in [Lai2021, Evangelista2019b], in an mMTC context, using stochastic geometry. However, none of the works consider the temporal aspects of the system, which are crucial to analyze the latency and reliability of URLLC service. In [Ding2019], the probability of success of grant-free RA with massive MIMO in the sub-6 GHz band is investigated, and analytical expressions are derived for conjugate and zero-forcing beamforming. Despite its contribution, the authors do not evaluate the systemś temporal behavior, which is fundamental to characterize URLLC service’s performance. Moreover, due to its distinct propagation characteristics, this model is unsuitable for mmWave frequency bands.

The authors in [Gharbieh2018] evaluate the scalability of scheduled uplink (grant-based) and random access (grant-free) transmissions in massive internet of things (IoT) networks, although they frame the problem through a revolutionary spatiotemporal framework, fusing stochastic geometry and queueing theory. They conclude that grant-free transmission offers lower latency, however, it does not scale well to a massive number of devices. In our work, we show that using massive MIMO base stations is a viable solution to address the scalability issues of grant-free transmission without sacrificing its latency, rendering it particularly suitable for URLLC applications. In [Jiang2018], the authors use a similar spatiotemporal model to characterize the performance of different RA schemes with respect to the probability of a successful preamble transmission in a grant-based massive IoT system. They conclude that a backoff scheme performs close to optimally in diverse traffic conditions. In [Jacobsen2017], the authors perform system-level simulations of a grant-free URLLC network under different HARQ configurations, and compare it to a baseline grant-based system. They conclude that grant-free systems provide significantly lower latency at the reliability level. The same scenario is evaluated in [Liu2020], however, the authors characterize system performance analytically, using a stochastic geometry-based spatiotemporal model. This paper identifies the suitability of each HARQ scheme for different network loads and received power levels.

Stochastic geometry has become the de facto tool for analyzing large networks [Lu2021, Hmamouche2021, Jiang2018b], and has been successfully used to investigate the performance of MIMO systems for a while now [Tanbourgi2015, Nguyen2013, Adhikary2015, Lee2014]. In [Afify2016], a unified stochastic geometric mathematical model for MIMO cellular networks with retransmission is proposed. In [Ding2017], a stochastic geometry-based analytical model for the performance of downlink mmWave NOMA systems is developed. The authors propose two random beamforming methods that are able to reduce system overhead while providing performance gains for BSs with a large number of antennas.

We seek to answer the following main questions that are to the best of our knowledge missing from the current literature:

  • How do we formulate a tractable spatiotemporal model to investigate the reliability and latency of URLLC applications powered by BSs equipped with massive antenna arrays operating on mmWave frequencies?

  • What closed-form analytical expressions can we derive for the latent access failure probability in this scenario?

  • What are the performance gains obtained from increasing the number of antennas at the BS, and what are the limitations?

I-B Contributions

This paper makes three major contributions:

  • We formulate a mathematical model to evaluate the performance of mmWave massive MIMO on uplink grant-free URLLC networks with HARQ. This model uses stochastic geometry to capture the spatial configuration of the UEs and the BSs, a mmWave channel model, and probability theory to obtain the temporal characteristics necessary to evaluate the performance of URLLC applications.

  • We derive closed-form approximate expressions for the latent access failure probability using reactive and -repetition HARQ schemes. To the best of our knowledge, no previous works has presented closed-form analytic expressions for this key performance measure of URLLC applications in a mmWave massive MIMO communication system.

  • We analyze the system performance for an extensive range of scenarios, identifying the gains and limitations provided by using the mmWave spectrum together with a massive number of antennas at the BS, and identify the scenarios that benefit the most from these two technologies.

I-C Notation and Organization

Italic Roman and Greek letters denote deterministic and random variables, while bold letters denote deterministic and random vectors. The capital Greek letter

denotes a point process and represents a point belonging to said process. The notation , where , is the counting process associated with [Haenggi2012]. Notice that we overload the meaning of so that it can signify a point process, a counting measure or a set depending on the context. is the binomial coefficient of choose .

The uniform, complex normal and binomial distributions are represented by

, and , respectively. The vector is the Hermitian transpose of vector . The function denotes the probability of the event within parentheses. The notation denotes the indicator function, which is equal to one whenever the event within curly braces is true and zero otherwise.

This work is divided into five sections and an appendix. In Section I, we introduce the contents of the manuscript and contextualize within the relevant literature. In Section II, we present a mathematical model to characterize the performance of the grant-free mmWave massive MIMO system in URLLC applications. In Section III, we derive the latent access failure probability of the proposed system using reactive and -repetition HARQ protocols. In Section IV, we show the results of the system simulation. We use these results to validate the analytical derivations, investigate the system’s performance for an extensive range of parameters and finish it by interpreting the results in the context of URLLC applications. In Section V, we summarize our findings and present our conclusions. Finally, in the appendix, we show the detailed proofs of the lemmas and theorems required by the derivations in the paper.

Ii System Model

Fig. 1: Comparison of the transmission procedure in grant-free and grant-based systems.

In most cellular applications, uplink transmissions use a dedicated resource (frequency, time or a MIMO spatial layer) previously assigned by the network to transmit their data payload. Thus, when an UE receives new data, it must send a request for the network to schedule a resource. With dedicated resources, each UE can utilize the wireless channel to its full capacity, thus maintaining good quality of service (QoS). TIn 5G NR networks, the schedule request consists of four steps, illustrated in Figure 1:

  • The UE randomly selects one of the available preambles and transmits it on the physical random access channel (PRACH).

  • The BS transmits a random access request (RAR), acknowledging receipt of the preamble and time-alignment commands.

  • The UE and BS exchange contention resolution messages (messages 3 and 4) that are used to identify possible collisions arising from two different devices transmitting the same preamble.

  • If the grant request is successful, the UE transmits its payload on the physical uplink shared channel (PUSCH).

This grant-based scheme is efficient for applications that need to use the channel multiple times to transmit large amounts of data (e.g., video streaming) or data that’s being continuously generated (e.g., voice). However, in some URLLC applications, UEs sporadically generate data that need to be transmitted reliably and with low latency, such as cIoT and sensors for industrial automation. In such scenarios, the time spent on the schedule request renders grant-based schemes inefficient. A more suitable alternative is to transmit the data directly on the PRACH and thereby avoidall the overhead involved in requesting a grant, as illustrated in Fig. 1. Nonetheless, with grant-free transmission comes the possibility of collisions whenever two UEs randomly select the same preambles. Therefore, HARQ is used to ensure the reliability and robustness of grant-free transmission. harq consists of using feedback information from the BS so the UE can retransmit packets that were not successfully received. Despite this, it can be quite challenging to scale grant-free networks because wireless resources are finite and expensive. To this end, massive MIMO and beamforming can be applied to reduce the interference of spatially uncorrelated UEs and thereby increasing the reliability of the system.

In this section, we discuss the spatial model of the network, the mmWave channel model, the BS receiver beamforming procedure and the different HARQ schemes used.

Ii-a Physical Layer Model

Stochastic geometry and the theory of random point processes has proven to be able to accurately model the spatial distribution of modern cellular network deployments [Lu2015]. Therefore, we consider a cell of radius consisting of a BS, equipped with antennas, located at the origin. We model the spatial location of the single-antenna UEs according to a homogeneous Poisson point process (HPPP) [Haenggi2012], denoted by with intensity . Furthermore, the distance between the -th UE, , and the BS is given by . Both the distance from the UE to the BS and its normalized angle from the BS

are uniformly distributed random variables

[Haenggi2012], and , respectively.

Due to path loss attenuation, the signal received from UEs located further from the BS is “drowned” by the signal from closer users transmitting with the same power, also known as the near-far problem. Uplink power control is fundamental to deal with this issue. We consider that the UEs utilize path loss inversion power control [Elsawy2014], with received power threshold , where each user controls its transmit power such that the average received power at its associated BS is , by selecting their transmit powers as , where is the path loss exponent. We assume that there are subcarriers reserved for grant-free URLLC transmissions and orthogonal preambles. Thus, at each transmission time interval (TTI), the active UEs select a subcarrier and preamble randomly from the available subcarriers and available preambles. Moreover, we assume that at , one packet arrives to the transmitting queue of each UE. Therefore, the HPPP of active users on a specific subcarrier is obtained by thinning [Haenggi2012] and its effective intensity at is given by


Massive MIMO technology and mmWave frequencies are intrinsically connected. Even though one does not imply the other, they complement each other really well. The former requires large antenna arrays, and the size of such arrays is proportional to the targeted wavelength. Moreover, mmWave antennas must be really small to operate in such large frequencies, therefore, a larger number of them are necessary to gather enough energy. In this work, we consider that the BS is equipped with a massive uniform linear array (ULA) containing antennas operating at mmWave frequencies, while the UEs possess a single antenna. The channel vector between user and the bs is given by


where is the complex gain on the -th path and is the normalized direction of the -th path. We assume that the complex gains of different paths are independent. and denote the path loss exponent of the line-of-sight (LOS) and non-line-of-sight (NLOS) paths, respectively. The vector


denotes the phase of the signal received by each antenna. Due to high penetration losses suffered by mmWave signals, the LOS path has a dominant effect on channel gain, being dB larger than the NLOS in some cases [Ding2017, Lee2014]. Hence, we can safely approximate as


for mathematical tractability. Additionally, to avoid cluttering the notation, we drop the subscripts denoting different paths and distinguishing LOS and NLOS variables.

Due to the dominant effect of the LOS link, the channel model also needs to consider a blockage model to determine the probability that the LOS path between the UE the BS is obstructed. To model the effects of blockage, we adopt the model proposed in [Thornburg2016]. This model is obtained by assuming that the obstructing building and structures form an HPPP with random width, length and orientation. So, let be the set of los UEs; then, the probability that user has a LOS link is given by


where is directly proportional to the density, and the average width and length of obstructing structures. This model nicely captures the exponentially vanishing probability of having a LOS link the further you move away from the BS, and can be easily fitted to real urban scenarios.

Signal Model

At each TTI, the active users transmit an information signal such that . Therefore, the vector of the signal received at the BS is given by


where is a circularly symmetric complex Gaussian random variable representing additive white Gaussian noise (AWGN).

To successfully recover the data transmitted by a given user, the BS

must be able to accurately estimate its channel response.

Definition 1 (Preamble Collision).

Preamble collision event, denoted by , happens when two or more devices transmit the same preamble on the same subcarrier.

We assume that the BS is able to perfectly estimate the UE channel response whenever there is no preamble collision. Then, the BS performs conjugate beamforming to separate the intended user’s signal from those of the other interfering UEs by multiplying the received signal by the Hermitian transpose of the intended user channel response. Therefore the recovered signal of the intended user, the -th user, is

where is the event when user does not experience preamble collision and

is a linear combination of the noise vector, which is a Gaussian distributed random variable. Therefore, the

signal-to-interference-plus-noise-ratio (SINR) experienced by user is given by


where is the probability that user has a LOS link and does not suffer from preamble collision, and is the interference from the other UEs. Moreover, the beamforming gain, , can be expressed as [Lee2015]


where is the Fejer kernel [Marsden1993], with .

Fig. 2: Fejer kernel value for normalized angles of arrival varying from to .

A useful property of the Fejer kernel is that [Marsden1993]


meaning that for an asymptotically large value of , the interference for the signals not aligned with beam angle goes to zero. Fig. 2 illustrates this property by plotting the Fejer kernel for increasing values of .

Ii-B HARQ Schemes

HARQ protocols determine how transmitters and receivers exchange information about successful packet reception, by transmitting an acknowledgement (ACK) signal, and how UEs retransmit in the event of failure, which is signaled by the transmission of a negative acknowledgement (NACK) signal. They are especially important to ensure reliability in grant-free transmission. The HARQ protocol used also impacts the overall latency of the system. Hence, in this paper, we investigate the performance of the massive MIMO URLLC network under two distinct HARQ protocols.

With respect to transmissions latency, the HARQ protocols investigated have a few aspects in common. First, the UE spends TTIs to process a newly arrived packet. As soon as the packet is processed, it spends TTIs transmitting it. Upon receipt of the packet, the BS spends TTIs to process it and TTIs to send feedback and for it to reach the UE. Once the UE receives the feedback signal, it takes TTIs to process it. We consider that the transmit and feedback time already take into account the propagation delay between the transmitter and receiver. Without loss of generality, we assume that TTI. Another concept shared between different HARQ protocols is the round-trip time (RTT), which consists of the time it takes from the start of a transmission by the UE to the end of processing of the feedback signal, either ACK or NACK, the UE received from the BS.

Fig. 3: An illustration of a couple of reactive HARQ protocol round trips.
Fig. 4: An illustration of a couple of -repetition HARQ protocol round trips.

Ii-B1 Reactive Scheme

The reactive HARQ protocol is the more straight-forward one of the two considered in this paper. The UE attempts to transmit one packet and waits for feedback from the BS. Once the feedback is processed, it either attempts to retransmit the same packet if it got a NACK signal or sits idle until a new packet arrives. This protocol is illustrated in Fig. 3, which shows the processing times and signal exchange between the UE and the BS. Under the assumptions considered in this paper, the reactive RTT is given by


From (11), the user-plane latency of the -th HARQ round-trip is


Ii-B2 -Repetition Scheme

To increase the reliability and robustness of each transmission attempt, the -repetition HARQ protocol repeats the same packet times on each attempt. Therefore, the only way a transmission attempt fails is if each of the transmissions fail, which translates into an increased reliability of the overall system. However, feedback on the transmission attempt is sent only after the last repetition is processed by the BS. So, there is a tradeoff between enhancing the reliability of each transmission and increasing the latency of a transmition attempt. Fig. 4 shows two -repetition round-trip transmissions, where the first transmission fails and the second is successful. The RTT of the -repetition HARQ protocol is


Therefore, the total latency of -repetition transmissions is given by


Iii System Analysis

The main requirement of URLLC applications is to reliably keep the user-plane latency below an application-dependent latency constraint. We begin this section by unambiguously defining what we mean by reliably and user-plane latency.

Definition 2 (User-Plane Latency).

User-plane latency is the time spent between the arrival of a packet to the UE’s queue and the successful processing of an ACK signal received from the BS.

Definition 3 (Latent Access Failure Probability Requirement).

Latent access failure probability , where is the user-plane latency and is the latency constraint, is the probability that the UE data cannot be successfully decoded.

Therefore, the QoS requirement of URLLC applications can be stated as


where is the latency constraint and is the minimum reliability, and both are application-dependent. Thus, to satisfy the QoS requirement, the probability that an UE cannot transmit its data before must be bounded by . Typically, varies between and ms and varies between and depending on the URLLC application.

Let be the maximum number of retransmissions under the latency constraint . Moreover, notice that some of the UEs will transmit successfully earlier than others, and if the UE’s transmission queue stays idle, the interference levels in distinct retransmissions are different. Therefore, the latent access failure probability is a function of the fraction of active users at the -th retransmission (), the probability that the -th retransmission is successful (( and the maximum number of retransmissions (), as given by [Liu2020]


where is


Given the expressions for , the latent access failure probability is obtained by iteratively computing (17) and (16).

In the rest of this section, we derive closed-form expressions for under the reactive and -repetition HARQ protocols, denoted by and , respectively. To do so, we use stochastic geometric analysis to obtain the probability of success of a randomly chosen user , herein the typical user. From Slivnyak’s theorem [Baccelli2009], the performance of the typical user in an HPPP is representative of the average user’s performance.

Iii-a Reactive Harq

The maximum number of HARQ transmissions following the reactive HARQ protocol with the delay constraint is given by


The first step in deriving an expression for the latent access failure probability is to obtain the probability that the -th reactive retransmission is successful ().

Let be the set of users interfering with the typical user’s transmission on the -th RTT. Notice that due to the exponentially decreasing probability of a LOS link with the increase in distance, is a non-HPPP with density . The mean measure of , the average number of points in a given area, is obtained as

where is a -dimensional ball with radius that is centered at the origin. Now, let be a random variable denoting the number of users that interfere with the typical user on the -th retransmission. From (III-A), the probability that there are interferers in the cell with radius R is derived as

Lemma 1.

If , the probability that the -th reactive HARQ retransmission of the typical user conditioned on the events that the typical user does not experience preamble collision, has a LOS link and is affected by interferers can be approximated as

where is the number of interferers within the primary lobe of the beam directed at the typical user.


See Appendix A. ∎

After deriving the expressions for the probability of having users interfere with retransmission in (III-A) and the conditional probability of success obtained in Lemma 1, the success probability can be obtained as follows:

Theorem 1.

The probability that the -th reactive harq retransmission is successfully decoded is

where is the probability that there are interferers in the cell and is given by (III-A). The probability of no preamble collision is given by


And finally, the probability that the typical user has a LOS link to the BS is


The proof is straight forward if the conditional probability obtained in Lemma 1 is averaged out. ∎

From the results of Theorem 1, the latent access failure probability can be easily obtained by iteratively computing


Iii-B -Repetition Harq

In the -repetition HARQ system, the RTT lasts from when the UE transmits the first repetition until it receives the ACK/NACK feedback signal. Thus, under delay constraint , the maximum number of retransmissions is


Under the -repetition HARQ, the same data is repeated times for every transmission attempt, and after the BS receives all the repetitions, it sends either an ACK or a NACK signal depending whether any of the repetitions sent in the transmission could be successfully decoded. Additionally, the UE selects a new random subcarrier and preamble for the transmission of each distinct repetition. To obtain a closed-form expression for the latent access failure probability, we follow the same steps as were taken for the reactive HARQ derivation.

Lemma 2.

If , the probability that the -th -repetition HARQ retransmission of the typical user, conditioned on the event that the typical user does not experience preamble collision, has a LOS link and is affected by interferers can be approximated as (2), shown at the top of next page,


where the double subscript indicates the -th repetition of the -th HARQ retransmission attempt.


See Appendix B. ∎

With the result from Lemma 2, the probability that the -th retransmission attempt is successful can be obtained by averaging (2) over the conditional random variables.

Theorem 2.

The probability that the -th -repetition harq retransmission is successfully decoded is given by (28), shown at the top of next page,


where the closed-form expression for is derived on Lemma 2.

Given the analytical expression for the probability that the -th -repetition HARQ retransmission is successfully received by the BS in Theorem 2 and the fact that the probability that a randomly selected UE is active can be computed from (17), the latent access failure probability is derived as


Iv Numerical Results and Discussion

In this section, we report the results of Monte-Carlo simulations of the system model described in Section II. We use the simulation results to: a) validate the closed-form analytical approximations derived in Section III b) characterize the performance of the two HARQ protocols in the mmWave massive MIMO scenario and c) discuss the insights provided by the analytical results.

At the beginning of each simulation instance, the users’ locations are generated according to an HPPP inside a cell with radius km. At every TTI:

  • The channel gain between the UEs and the BS located at the origin is generated as an exponential random variable with unit mean.

  • All active UEs are determined to have either a LOS or NLOS link according to the probability in (24), with .

  • All active UEs select a random subcarrier from one of the subcarriers available.

  • All active UEs select a random preamble from one of the preambles available.

  • The BS checks all UEs with LOS links on every subcarrier for preamble collision.

  • The BS computes the dot product between the signal received and the conjugate beam for all the UEs whose preambles have not collided. If the resulting SINR is greater than dB, the transmission is successful, otherwise it fails.

  • The BS sends an ACK feedback signal to the UEs whose transmission was successful and a NACK feedback signal to those whose transmission attempt failed. As the main goal of this work is to characterize grant-free uplink performance, we assume that the feedback sent through the downlink channel is error free.

  • All UEs move to a new location.

In accordance with 3GPP standards [3gpp.38.101, 3gpp.38.211], we consider a TTI mini-slot having a duration of ms and a subcarrier spacing of kHz, which is a configuration compatible with 5G NR frequency range 2 (FR2) operation, located in the mmWave spectrum. We consider a noise figure of dBm/Hz, a path loss exponent of and a received power threshold of .

Fig. 5: CCDF of the latent access failure probability for UE/ for the reactive and -repetition HARQ protocols with . The plots in the figure show the results for , and antennas.
Fig. 6: CCDF of the latent access failure probability for UE/ for the reactive and -repetition HARQ protocols with . The plots in the figure show the results for , and antennas.

Figs. 5 and 6 show the

complementary cumulative distribution function

(CCDF) of the latent access probability for a user density of UE/ and UE/, respectively. The three plots in each figure display the performance for , and antennas, from left to right. The behavior of the performance curves, where the latent access failure probability remains constant for a period of time and then drops on the following TTI, is due to the transmission propagation time on the uplink and the feedback, and the processing times. Fig. 5 depicts the performance with a moderate UE density scenario and shows that the reactive HARQ protocol is the best option for strict delay constraints, with TTIs ( ms), as there is no time for any of the -repetition configurations to finish their first round-trip. When the first and second round-trips for are completed, it has the best performance in TTIs and TTI intervals. From this point on, the best performance is dominated by and , with the best configuration being the one that has more completed round-trips in under TTIs. A similar trend occurs with a higher user density as shown in Fig. 6.

Tables I and II show the reduction in the latent access failure probability upon increasing the number of antennas from to . There is little improvement for a delay constraint of ms in either scenario. In applications with a moderate UE density and a delay constraint of ms or more, we notice an average improvement of around across both HARQ protocols investigated, while in applications with a higher user density, the failure probability is reduced by as much as times for repetitions and a delay constraint greater or equal to ms.

HARQ ms ms ms
1.25 1.59 1.982
1 2.18 1.98
1 2.49 -
Reactive 1.12 1.42 1.64
TABLE I: Latent access failure probability reduction in increasing from to antennas when UE/
HARQ ms ms ms
1.54 6.81 13.69
3.25 17.13 -
1.75 32.51 32.51
Reactive 1.40 2.60 6.18
TABLE II: Latent access failure probability reduction in increasing from to antennas when UE/

Nonetheless, notice from Figs. 5 and 6 that increasing the number of antennas from to does not change the latency performance significantly. Additionally, the increase in the number of antennas has a larger impact on performance for the higher user density scenario shown in Fig. 6 than for the moderate density one in Fig. 5. Later in this section, we discuss why this happens and how to possibly address it.

As the approximation used to derive the results in Section III relies on , there is a gap between the analytical and simulation results when as, in this regime, the value of , and consequently the interference, outside the main lobe are no longer negligible in comparison to the gain on the main lobe.

Fig. 7: The probability that an UE fails to transmit its packet under ms for an user density ranging from UE/. The plots show the results for , and antennas, respectively.
Fig. 8: The probability that an UE fails to transmit its packet under ms for an user density ranging from UE/. The plots show the results for , and antennas, respectively.

Figs. 7 and 8 show how system reliability, i.e., in the probability of transmission failure under the latency constraint , scales with an increase in user density for a latency constraint of ms and ms, respectively. In both figures the user density ranges from to UE/. Fig. 7 shows that that the combination of mmWave and massive MIMO is not enough to satisfy the QoS requirement of URLLC applications with the stricter delay constraint (failure probability below ). Also, in this latency range, the benefit from increasing the number of BS antennas is rather small. For applications with less stringent delay constraints, shown in Fig. 8, the -repetition HARQ protocol with and is able to support the URLLC QoS requirements. Table III depicts the highest UE density that can be supported by each HARQ and MIMO configuration. When , increasing the number of BS antennas from to increases the supported user density by and increasing the number from to increases it by only . For , those values are and , respectively. It is worth noting that as long as the QoS constraints are satisfied, it is desirable to use the HARQ configuration with the least number of repetitions as possible in order to save UE power.

HARQ antennas antennas antennas
- - -
1800 UE/ 2000 UE/ 2150 UE/
2800 UE/ 3200 UE/ 3350 UE/
Reactive - - -
TABLE III: User density supported (failure probability below ) by each configuration.
Fig. 9: The impact of the number of antennas on the latent access failure probability. The leftmost plot shows results for delay constraint ms, while the rightmost for ms.

In Fig. 9, we show the impact of increasing the number of antennas on the failure probability for a delay constraint of ms on the left and ms on the right. From this figure, we can conclude that increasing the number of antennas beyond for the configuration under consideration ( km and ) has a decreasing impact on latency performance. Moreover, the -repetition HARQ protocol benefits more from an increased number of antennas than the reactive HARQ protocol does. Also, the plot on the right shows that the latency performance of applications with a moderate latency constraint ( ms) and higher user densities ( UE/) is greatly improved by changing from traditional MIMO to massive MIMO. This is explained by the capability of producing narrower receiver beams on systems with a higher number of antennas. Nevertheless, as of a certain point, the probability of having a LOS link becomes the dominant bottleneck in reducing the latent access failure probability. As the LOS link probability is unaffected by the number of antennas, another measure must be taken to further reduce the latency. In the system model formulated in this paper, one way to achieve this would be to increase the BS deployment density, effectively decreasing the radius of the cells.

V Conclusions

In this work, we formulated a model to analyze the latency and reliability of mmWave massive MIMO URLLC applications using reactive and -repetition HARQ protocols. We used stochastic geometric spatiotemporal tools to derive closed-form approximations of the system’s latent access failure probability. We validated the analytical results using Monte-Carlo simulations, identifying the limitations of our analytical results. Also, we investigated how the system’s performance is impacted by the application’s latency constraint, the density of UEs served by the system, and the number of antennas in the BS. We concluded that:

  • Other than for extremely strict delay constraints ( ms), the -repetition HARQ protocol is a better choice.

  • Increasing the number of BS antennas from to BS antennas can reduce the latent access failure probability by a factor of 32 for the cell configuration analyzed in the manuscript.

  • Massive MIMO’s interference reduction capability significantly improves the reliability of systems with high user density and moderately improves the performance of systems with low user density.

  • The increase in reliability from increasing the number of BS antennas beyond is greatly reduced in the configuration investigated in Section IV, as the probability of having a LOS link between the UE and the BS becomes the main bottleneck.

  • Under the configurations investigated in this manuscript, the system can support a UE density as high as UE/ for a URLLC application with latency and reliability constraints of ms and , respectively.

Overall, we can conclude that it is possible to increase the reliability of URLLC applications by using mmWave massive MIMO, and when this technique is combined with selecting reasonable configuration parameters, these two techniques together can improve reliability under a strict latency constraint ( ms) and can satisfy URLLC QoS requirements under a less strict latency constraint ( ms).

Appendix A Proof of Lemma 1

The probability of success in (1) can be expanded to


where , is the interference on the typical user’s transmission and is the Laplace transform of the interference conditioned on the user not experiencing preamble collision, them having a LOS path, and there being interferers in the cell.

The Laplace transform of the interference can be derived as (31), shown at the top of the next page.


Unfortunately, obtaining a closed-form expression for the expectation in (31) is not mathematically tractable. Therefore, we first obtain a suitable approximation to the Fejer kernel. Due to the Fejer kernel property in (10), as the number of antennas increases most of the energy is concentrated on the main lobe as shown in Fig. 2. Hence, we choose to approximate it as


The quadratic approximation in (32) renders the derivation of the expectation in (31) tractable. Furthermore, it ensures that whenever , i.e., the contributions of the signals arriving from directions outside of the main lobe to the interference is zero, and that . Hence,


where is obtained from . While step comes from the fact that the interferers’ angles are uniformly distributed, given that there are interferers in the cell, the number of interferers within the typical user’s main lobe direction follows a binomial distribution with . Thus, the conditional success probability can be obtained by summing the marginal distribution weighted by ’s probability mass function (PMF). This completes the proof.

Appendix B Proof of Lemma 2

In order for a -repetition HARQ transmission attempt to be successful at least one of the repetitions must be successfully decoded. Therefore, the probability that the -th retransmission attempt is successful, conditioned on no preamble collisions, a LOS path and interferers, can be obtained as the complement probability that all repetitions fail , as derived in (34), shown at the top of the next page,


where step follows from the fact that the set of interferers is different from one repetition to the next, as a new subcarrier is randomly selected for every repetition by each UE, making the SINRs on distinct repetitions mutually independent. Also, as the SINR of every repetition is affected by an interferer process having the same intensity, the probability of success of each repetition is equal, which justifies step . Finally, step is obtained from the binomial expansion of the power term. If we work from (34) and follow the same steps derived in Appendix A, we obtain (2), which completes the proof.