Attaining the vision of Internet-of-Things (IoT) will require the ubiquitous deployment of an enormous number of sensors (e.g., tens of billions) in our society [1, 2]. The brute-force approach of “transmit-then-compute” is obviously impractical for this large-scale sensor network as the massive radio access would result in excessive network latency and low efficiency in spectrum utilization. The situation is exacerbated at high mobility where ultra-fast data aggregation from many sensors is desired. This is the case when sensors and/or the access-point (AP) are mounted on ground vehicles or unmanned aerial vehicles (UAV) for ubiquitous city surveillance in the smart-city application [see Fig. 1(a)], or for wild-area monitoring to avoid natural disasters [see Fig. 1(b)]. Motivated by the need of ultra-fast data aggregation, an intelligent solution, known as over-the-air computation (AirComp), is proposed recently that exploits the signal-superposition property of a multi-access channel (MAC) to compute a class of so called nomographic functions of distributed sensing data via concurrent sensor transmissions (see Fig. 1), thereby integrating computation and communication [3, 4]. Examples of such functions are show in Table I. Unlike rate-centric wireless systems where simultaneous transmissions result in interference, the computation accuracy for a sensor network with AirComp may grow with the number of simultaneous sensors due to the sensing-noise averaging. In this paper, we aim to advance the area of AirComp by developing multiple-input-multiple output (MIMO) AirComp for next-generation multi-antenna multi-modal sensor networks. The technology supports high-mobility multi-modal (HMM) sensing by enabling multi-function computation via spatial multiplexing and accurate reception of the results by exploiting spatial diversity.
I-a Over-the-Air Computation for Sensor Networks
The idea of AirComp can be tracked back to the pioneer work studying functional computation in sensor networks .111AirComp also appeared the same time as a key operation in the scheme named physical-layer network coding in . In , structured codes (e.g., lattice codes) are designed for reliably computing at an AP a function of distributed sensing values transmitted over a MAC. The significance of the work lies in its counter-intuitive finding that “interference” can be harnessed to help computing. Subsequently, it was proved that the simple analog transmission without coding, where transmitted signals are scaled versions of sensing values, can achieve the minimum functional distortion achievable by any scheme . On the other hand, coding can be still useful for other settings such as sensing correlated Gaussian sources . The satisfactory performance (with optimality in certain cases) of simple analog AirComp have led to an active area focusing on designing and implementing techniques for receiving a desired function of concurrent signals, namely a targeted coherent combination of the signal waveforms [8, 9, 10, 11, 12, 13, 14, 3, 15, 16]. In particular, considering analog AirComp, power control at sensors was optimized in [8, 9], the computation rate (defined as the number of functional values computed per time slot) analyzed in 
, and the effect of channel estimation error characterized in.
The implementation of AirComp faces several practical issues. One is the synchronization of all active sensors required for coherent combining at the AP. To cope with synchronization errors, a scheme was proposed in [12, 13] where the sensing value is modulated at each sensor as the power of transmitted signal and furthermore a random phase rotation is applied to the signal. The design transforms functional computation at the receiver to power detection while synchronization error appears as random noise. An alternative solution, called AirShare, was developed in  for synchronizing sensors by broadcasting a reference-clock signal and its effectiveness was demonstrated using a prototype. In addition, by applying appropriate data pre/post-processing, later work extended and implemented AirComp to compute a variety of functions besides the linear ones (as summarized in Table I) [15, 16].
It is worth mentioning that the coding techniques designed for AirComp in computing-centric sensor networks  inspired researchers to adopt relevant principles and ideas in designing new schemes for rate-centric communication networks [17, 18, 19, 20, 21]. For relay assisted networks, the application of AirComp at relay nodes for decoding and forwarding linear functions of the transmitted messages led to the invention of the well known compute-and-forward relaying schemes [17, 18, 19]. Building on lattice coding, a novel so called integer-forcing linear receiver was proposed for spatial multiplexing in a multiple-input-multiple-output (MIMO) system that attempts to create an effective channel matrix with integer coefficients to facilitate lattice decoding. The key operation, integer forcing, is similar to AirComp and computes a desired set of linear functions with integer coefficients [20, 21]. In parallel with the above research, extensive progresses were made in the area of physical layer network coding where the celebrated network coding schemes invented for wired networks were extended to wireless networks with AirComp relays (see survey in ).
Sensing devices targeting emerging applications such as smart cities are sophisticated. Each typically contains multiple multi-modal sensors monitoring different environmental parameters (e.g., pressure, light, humidity, and temperature) . In particular, a smartphone recruited for crowd-sensing typically contains seven or more sensors such as inertial, GPS, and light sensors . In view of prior work, the existing solutions focus only single-function AirComp assuming single-antenna sensors having uni-modal sensing capabilities. However, next-generation wireless networks equipped with large-scale arrays will make it possible to simultaneously compute multiple functions of multi-modal sensing data over-the-air. This inspires the current work on developing the technology of AirComp for a MIMO MAC, which can simultaneously spatial multiplex multi-function computation and suppress computation errors by exploiting spatial-diversity gain. Thereby, the data-fusion latency in sensor networks can be substantially reduced, meeting the ultra-low latency requirement in next-generation networks especially when high-mobility support is needed .
I-B MIMO Beamforming: Multi-Access versus AirComp
Beamforming design for multiuser MIMO systems is a classic topic that has been extensively studied and there exists a rich relevant literature [25, 26, 27]. In terms of network topology, the multi-antenna multi-modal sensor network we consider is equivalent to a MIMO multi-access communication network where a single AP supports simultaneous uplink transmissions of multiple users. For the communication network, the designs of multiuser MIMO beamforming at the AP can be largely grouped into capacity-achieving nonlinear designs based on successive interference cancellation (SIC)  and low-complexity linear designs (minimum mean-squared error (MMSE) or zero-forcing) [26, 27, 28]. All of the designs share the same objective of decoupling multiuser signals by interference suppression and spatial multiplexing of data streams for each user. In contrast, AirComp receive beamforming in the sensor network has a different objective of minimizing the total distortion in the received values of multiple functions combining multi-modal data simultaneously sent by a set of sensors. Due to the difference in objective between communication and sensing, the known designs for the former are inapplicable for the latter. On the other hand, existing AirComp literature considers only uni-function computation targeting uni-modal sensing as discussed earlier. This makes receive beamforming for multi-function AirComp for multi-modal sensing an uncharted problem to be tackled in this work.
It is worth mentioning that the discovery of uplink-downlink duality is a breakthrough in multiuser MIMO communication. The duality reveals similar structures in optimal beamformers for the MACs and broadcast channels that exist under various performance criteria ranging from capacity maximization [29, 30] to MMSE . This allows beamforming designs derived for the MACs to be applied to the broadcast channels where the optimal beamforming design was largely an open problem prior to the finding of the duality. Inspired by this finding, we address a similar question in the current work: What is the downlink dual of the (uplink) AirComp beamforming for sensor networks?
I-C Contributions and Organization
We consider a multi-antenna multi-modal sensor network where a multi-antenna AP performing fusion of data transmitted by a cluster of multi-antenna multi-modal sensors. By measuring multiple time-varying parameters of the environment, each sensor generates multiple data streams. In each time slot, a sensor transmits a set of multi-modal data values in the analog way, namely by amplitude modulation [12, 13], over multiple antennas. The transmissions of all sensors are simultaneous. The AP attempts to jointly receive multiple nomographic functions (such as those in Table I) of distributed sensor data by AirComp and spatial multiplexing. The AirComp of a nomographic function is implemented by three cascaded operations: 1) pre-processing at sensors, 2) weighted summation of preprocessed outputs by simultaneous transmissions, and 3) post-processing at the AP [12, 13, 3]. In the current scenario, transmit and receive beamforming are applied to spatially multiplex multi-function AirComp as well as exploit spatial diversity to minimize the distortion of function values caused by noise, which is measured by sum mean-squared error (MSE) over functions.
While the traditional uni-function AirComp is a simple technique, the proposed multi-function version is challenging with the optimization of receive beamforming proved to be NP-hard. Specifically, for uni-function AirComp, channel inversion at each sensor yields a desired weighted sum of preprocessed data at the AP, giving the desired function value after post-processing . For multi-function AirComp, channel inversion remains optimal as shown in this work and is implemented by zero-forcing beamforming. Nevertheless, receive beamforming for multi-function AirComp, referred to as MIMO-AirComp equalization, is a new design problem that finds no relevant result in the AirComp literature. The equalizer optimization is non-convex but can be relaxed as a semidefinite programming (SDP) problem and thus solved using an iterative interior point algorithm. The standard approach does not yield any insight into the optimal equalizer structure and more importantly does not lead to an efficient channel-feedback design for acquiring the equalizer at the AP. To address these issues, we impose an orthogonality constraint on the AirComp equalizer, which is a technique for approximate beamformer design and limited channel feedback as widely applied in the literature [32, 33, 34, 35]. Concretely, this allows a close-to-optimal equalizer to be derived in closed-form using tools from differential-geometry, revealing an interesting geometry structure in the design. Moreover, the closed-form solution leads to an efficient channel feedback design that exploits the AirComp architecture for direct equalizer acquisition at the AP from simultaneous feedback by all sensors.
The main contributions of this work are summarized as follows.
Multi-Function AirComp Beamforming: As mentioned, while zero-forcing transmit beamforming is found to be optimal, the receive-beamformer optimization for sum-MSE minimization under transmission-power constraints can be proved to be NP-hard. By tightening the constraints, the resultant approximate problem is found to involve optimization on a Grassmann manifold, which can be interpreted as the space of subspaces. This allows the application of differential geometry to solve the approximate problem. The derived solution shows the normalized receive beamformer to be the weighted centroid of a cluster of points on the manifold
, where each point represents the eigen-subspace of an individual MIMO channel and the corresponding weight its smallest eigenvalue. In addition, the optimal beamformer norm is also derived in closed-form. Such a beamformer design allowing efficient computation is verified by simulation to be close-to-optimal.
AirComp-Multicasting Duality: As a by-product of our investigation, for the special case of AirComp with single-antenna (uni-modal) sensors, the problem of receive-beamforming optimization is discovered to have the identical form as the classic problem of multicast transmit-beamforming, thereby establishing a novel AirComp-multicasting duality. The latter problem is known to be NP-hard and typically solved using the semidefinite relaxation (SDR) method. The significance of our finding lies in allowing the solution method for AirComp beamforming to be transferable to multicast-beamforming. This yields a new solution method for the latter with complexity much lower than the existing SDR approach as the network scales up.
AirComp Channel Feedback: Last, we solve the AirComp feedback problem: How to efficiently acquire the derived AirComp beamformer at the AP, which depends on global channel-state information (CSI), by sensor distributed transmissions based on local CSI? Given channel reciprocity, it is discovered that the AirComp system architecture can be also used for efficient feedback. The resultant number of feedback rounds is independent of the sensor population, overcoming the drawback of traditional channel training. Novel feedback techniques are designed for sequential feedback of the normalized AirComp beamformer and beamformer norm based on their derived expressions, where each feedback round involves concurrent transmissions by all sensors. Essentially, the two techniques implement AirComp of two functions, namely the weighted centroid of a set of matrices and the maximum of a set of scalar values. They are hence exclusively for multi-function AirComp and may not be applicable for traditional multiuser MIMO communication systems where multiuser feedback requires orthogonal channels and focuses on precoder quantization [36, 37].
The remainder of the paper is organized as follows. Section II introduces the AirComp system model. Section III presents the problem formulation for the enabling beamforming design and channel feedback. The proposed beamforming design is presented in Section IV, and the duality between the uplink AirComp and downlink multicasting is discussed in Section V. Section VI proposes an efficient channel-feedback scheme that can be implemented by AirComp. Simulation results are provided in Section VII, followed by concluding remarks in Section VIII.
Ii System Model
We consider a wireless sensor network consisting of multi-modal sensors and a single AP as illustrated in Fig.2. All nodes are equipped with antenna arrays. Specifically, antennas are deployed at each sensor and at the AP. Each multi-modal sensor records the values of heterogeneous time-varying parameters of the environment, e.g., temperature, pollution, and humidity. The data from the record of the -th parameter is referred to as type-
data. For an arbitrary time slot, the measurement vector of the-th sensor, grouping sample values, is denoted by , where is the measurement of the parameter at sensor . Instead of collecting the whole data set, the AP aims at computing functions of corresponding types of data, denoted by , to support ultra-fast HMM sensing. The class of functions that are computable by AirComp are called nomographic functions as defined below.
Definition 1 (Nomographic Function ).
The function is said to be nomographic, if there exist preprocessing functions along with a post-processing function such that it can be represented in the form:
Some common nomographic functions are listed in Table I. Based on the nested form of (1), the AirComp of a nomographic function can be implemented in the sensor network by three operations as illustrated in Fig. 2: 1) preprocessing at each sensor specified by where operates on the type- data at sensor , 2) summation of preprocessed data realized by multi-access, and 3) post-processing at the AP. Considering the computation of the geometric mean of type- data as an example, the pre-processing , and the post-processing . Let denote the multi-modal symbol vector after preprocessing and
the AirComputed function values. For ease of transmission-power control and without loss of generality, the symbols are assumed to be normalized to have unit variance, i.e.,, where the normalization factor for each data type is uniform for all sensors and can be inverted at the AP. Given the one-to-one mapping between and according to (1), we refer to as the target-function vector for ease of exposition.
Ii-a AirComp Phase
Assuming symbol-level synchronization222This can be achieved by broadcasting a common reference clock from the AP to sensors using the AirShare solution developed in ., all users transmit their symbol vectors simultaneously using their arrays. The distortion of the received vector with respect to the target-function vector due to MIMO channels and noise is suppressed using transmit and receive beamforming. In other words, the joint beamforming attempts to attain coherent combining of symbol vectors at the AP. Let denote the receive beamforming matrix and the transmit beamforming matrix at sensor . Then the symbol vector received by the AP after receive beamforming is given as
where represents the MIMO channel matrix for the link from the sensor to the AP, and is the additive white Gaussian noise (AWGN) vector with independent and identically distributed (i.i.d.) elements. The distortion of with respect to , which quantifies the AirComp performance, is measured by the MSE defined as follows:
Ii-B Channel Feedback Phase
Consider the existence of channel reciprocity and assume that perfect local CSI is available at all sensors. One can infer from (4) that computing the MMSE receive beamformer requires global CSI, namely . As mentioned, the naive approach of estimating the global CSI would incur long latency and large overhead when the number of sensors is large, thus is impractical for the ultra-fast HMM sensing applications. An intelligent channel-feedback design is proposed in the sequel to allow beamformer acquisition via concurrent transmissions by all sensors. Let denote the signal matrix transmitted by sensor where is the signal length in symbol. Given typical high transmission power for channel training and feedback, the feedback observation at the AP can be assumed to be noiseless and thus be represented by an matrix as follows:
As is clear in the sequel, with proper design of , can serve as a sufficient surrogate of the global CSI in beamformer computation at the AP.
Iii Problem Formulation
Iii-a AirComp Beamforming Problem
Consider the joint optimization of the transmit and receive beamformers under the MMSE criterion and the transmission-power constraints. It is assumed that the average transmission power of each sensor cannot exceed a given positive value . Since the transmitted symbols have unit variance, the power constraints are given as
Following a common approach in the MIMO beamforming literature (see e.g., [32, 33, 34, 35]), the receive beamformer is constrained to be orthonormal matrix. As mentioned, the constraint can lead to a closed-form suboptimal solution with only marginal performance loss and furthermore facilitate efficient channel feedback design as presented in Section VI. As pointed out in , for most communication objectives, it is the subspace spanned by the beamformer but not the exact beamformer that has a crucial effect on the system performance, justifying the said constraint. Furthermore, under the MMSE criterion, a positive scaling factor , called denoising factor, is included in for regulating the tradeoff between noise reduction and transmission-power control. To be specific, reducing suppresses noise but requires larger transmission power to maintain the MSE of computed function values. Mathematically, we can write with
being a tall unitary matrix and thus. Then given the MSE in (4), the MMSE beamforming problem can be formulated as:
The problem is solved in Section IV.
Iii-B AirComp Channel Feedback Problem
We propose the use of the AirComp architecture in Fig. 2 to realize the mentioned receive-beamformer acquisition by concurrent transmissions by all sensors. Let denote the derived beamformer solution to problem P1 and and be the feedback counterparts of the AirComp operations and (see Fig. 2). The key design constraint is that the transmitted signal in (5) must be a function of local CSI only, denoted as . Then it follows that
and the problem of AirComp feedback design reduces to the design of the functions and . The solution is presented in Section VI.
Iv Multi-Function AirComp: Beamforming
In this section, the AirComp beamforming problem in Problem P1 is solved. While zero-forcing transmit beamforming can be proved to be optimal, the receive beamforming optimization is found to be NP-hard. An approximate problem is obtained by tightening the power constraints. This problem allows a practical solution approach based on differential geometry. The solution reveals that the optimal receive beamformer can be approximated by the weighted centroids of a cluster of points on a Grassmann manifold, each corresponding to the subspace of an individual MIMO channel. To facilitate exposition, some basic definitions and principles of Grassmann manifolds are provided in Appendix -A.
Problem P1 is difficult to solve due to its non-convexity. The lack of convexity arises from the coupling between transmit and receive beamformers in the objective function, and the orthogonality constraint on the receive beamformer. To simplify the problem, zero-forcing transmit beamforming conditioned on a receive beamformer is first shown to be optimal as follows.
Given a receive beamformer , the MSE objective stated in (4) is minimized by the following zero-forcing transmit beamformers:
Proof: See Appendix A-A.
We note that the power constraint imposed on the precoder will be enforced in the sequel via regulating the norm of the equalizer , or equivalently, the denoising factor .
Remark 1 (Number of AirComputable Functions).
Note that to ensure matrix is invertible, it requires . This implies that, the number of functions that can be simultaneously computed by the proposed multi-function AirComp is limited by . The result is due to the underpinning limit of MIMO spatial multiplexing: the maximum number of spatial streams is .
where is defined earlier as the normalized receive beamformer. Though Problem P3 has a simpler form than P1, it remains non-convex due to the non-convex orthonormal constraint of the receive beamformer. In fact, Problem P3 is found in the next section to be NP-hard via proving its equivalence to the NP-hard multicast beamforming problem. To develop a tractable approximation of the problem, a reasonable modification of the power constraints is derived. To this end, a useful inequality is obtained as shown below.
Tightening the power constraints in Problem P3 using Lemma 2 gives the approximate problem:
Since the feasible set of Problem P4 is smaller than that of P3, the solution to P4 is a feasible solution though potentially a suboptimal one to P3. To solve Problem P4 using differential geometry, an equivalent form containing subspace distances between the receive beamformer and individual MIMO channels is obtained as follows.
Problem P5 is not yet in a ready form admitting the differential-geometry solution approach and requires an additional approximation. For this purpose, the objective function is bounded below.
The objective function in Problem P5 can be bounded as follows:
where we define which is a constant independent of the control variable .
The proof is straightforward and omitted for brevity. Approximating the objective function in P5 by either the lower or the upper bound in Lemma 4 both lead to the same approximate problem which is given by:
Remark 2 (Beamformer Geometric Interpretation).
Problem P6 allows a geometric interpretation of the desired receive beamformer . In fact, the problem is to find a weighted centroid of a set of points (each being a subspace) on a Grassmann manifold with the squared projection 2-norm as the distance metric, where the weights are . This reveals that the receive beamformer makes the best-effort to be aligned with all the MIMO channel matrices with the alignment (subspace) distances adjusted by corresponding channel gains as specified by the smallest channel eigenvalues.
Problem P6 can be approximately solved by a closed-form solution that can be efficiently computed without resorting to an iterative algorithm. Particularly, the closed-form solution can be derived by replacing the projection 2-norm distance with the projection F-norm (see Appendix -A). Note that for a small principal angle  and are exactly equivalent in the case of according to (27). Thus, the problem P6 can be approximated as
which still seeks a weighted centroid of channel subspaces as before but based on a different subspace distance metric. According to the definition in (26), can be computed in a matrix form by
Then, substituting it to the objective function in Problem P7, the problem can be equivalently written as
Problem P7 remains non-convex due to 1) maximimizing a convex objective function and 2) the orthogonality constraints on the variable . Nevertheless, by intelligently constructing an equivalent unconstrained problem, we are able to derive a closed-form solution for Problem P7 (see the following Lemma 5) via analyzing the stationary points of the unconstrained problem. For ease of exposition, define a matrix , called effective CSI, as follows:
As shown shortly in Lemma 5, the normalized AirComp receive beamformer depends on the global CSI via the effective CSI. In other words, is sufficient for computing .
V AirComp-Multicasting Duality
In this section, consider the case of single-antenna uni-modal sensors. The AirComp receive-beamforming problem for uplink sensing-data collection is shown to be equivalent to the well known problem of transmit beamforming for downlink multicasting (see Fig. 3). This establishes the AirComp-multicasting duality, allowing the low-complexity beamforming design in the preceding section to be transferable to solve the NP-hard multicast beamforming problem.
V-a Review of the Multicast Beamforming Problem
Consider the scenario that multiple single-antenna users request the same data stream from a multi-antenna AP equipped as shown in Fig. 3(a). Assuming global CSI is available at AP, the problem of multicast beamforming is to minimize the total transmission power subject to a set of signal-to-noise-ratio (SNR) constraints specifying the users’ quality-of-service requirements. Mathematically, the problem can be formulated as follows:
where denotes the multicast beamforming vector, , and are the channel vector, noise variance and target SNR of user , respectively. The problem can be proved to be NP-hard . Nevertheless, it is known that a close-to-optimal solution can be efficiently computed using the well known SDR technique. The key idea of SDR is to recast the problem as an equivalent rank-one constrained SDP by denoting as follows:
Then SDR drops the rank-one constraint and solves the relaxed SDP. Finally, a rank-one approximate solution of the original problem is retrieved by a Gaussian randomization strategy based on the solution of the relaxed SDP (or simply the principal eigenvector of it). For more details on the SDR algorithms, readers are referred to the key references in the area[39, 40].
V-B Duality between AirComp and Multicast Beamforming
For the special case of a sensor network with single-antenna sensors () and a multi-antenna AP. The receive beamforming reduces to a vector denoted by . Its design problem for MMSE AirComp can be directly simplified from problem P5 (which is equivalent to the original P3 when ) to the following form:
where represents the orientation of the channel vector, i.e., .
Remark 3 (AirComp-Multicasting Duality).
Comparing (15) and (18) reveals that the beamforming problems for the uplink AirComp and downlink multicasting share the same mathematical form. This establishes the AirComp-multicasting duality that is analogous to the famous uplink-downlink duality for multiuser MIMO communication . Intuitively, the AirComp-multicasting duality is a result of the fact that both the AirComp and multicast beamformers must make the same best effort to be aligned with multiple vector channels (see Fig. 3) though for different objectives: one is to minimize the distortion in the computed function value and the other maximize the minimum SNR.
The AirComp beamforming technique designed in the last section is based on computing the weighted centroid on the Grassmann manifold. The duality allows the technique to be applied to multicast beamforming. Compared with the classic SDR method discussed in the preceding subsection, the AirComp beamforming has the following two main advantages.
(Efficient CSI Feedback): The AirComp beamforming solution requires only the effective CSI in (12) and thus enables the efficient “one-shot” channel feedback scheme presented in the next section. In contrast, the SDR solution requires global CSI and thus requires all users to feed back their local CSI. This results in excessive channel training overhead when the number of users is large.
(Low Computation Complexity): As shown in Lemma 5, the weighted centroid solution requires only one-shot computation and has a relatively low complexity of arising from the principal eigenvector calculation. However, the SDR requires first solving a SDP of dimension , by an iterative interior point method, resulting in the complexity of where denotes the solution accuracy . The complexity becomes overwhelming when the numbers of the receive antennas and/or users are large. This makes the AirComp solution preferable in practice. The low-complexity advantage of the proposed solution is also verified by simulation in Section VII.
Vi Multi-Function AirComp: Channel Feedback
In this section, the AirComp feedback problem stated in Problem P2 is solved by feedback technique design. Using the optimization results in Section IV, two novel techniques are designed in the following subsections for sequential feedback of two components of the AirComp receive beamformer, namely the normalized beamformer and the denoising factor. Essentially, the two techniques realize the AirComp of two functions: 1) the weighted centroid of a set of matrices and 2) the maximum of a set of scalars, which can be thus implemented using the AirComp architecture in Fig. 2.
Vi-a Feedback of Normalized Beamformer
Assume that the feedback channel is noiseless due to high transmission power for channel feedback. The AirComp feedback scheme for acquisition of the normalized beamformer in (13) is derived as follows. As indicated by Lemma 5, the normalized beamformer can be directly computed from effective CSI matrix in (12). The key step for solving the AirComp feedback problem in Problem P2 is to enforce the equality . Then given and Let denote the compact SVD of , it is easy to verify using Problem P2 that designing the feedback signals as gives the desired equality . Then the normalized receive beamformer can be computed as the principal eigenvectors of the received signal . The AirComp feedback design for acquiring is summarized as follows.
Normalized Beamformer Feedback:
where denote the dominant left eigenvectors of . It is important to note that scaling the feedback signal by a constant so as to meet a transmission power constraint has no effect on the received normalized beamformer .
The above AirComp feedback technique inherits the advantage of AirComp by turning interference from multiple access into useful signals for functional computing. In contrast with the traditional method of channel training, increasing the number of simultaneous sensors may even be beneficial to the computation accuracy via sensing-noise averaging.
Vi-B Feedback of Denoising Factor
Following (14), the denoising factor is the maximum of a set of scalars called feedback values and defined as:
Since the maximum is not a nomographic function, it is not directly AirComputable. However, an intelligent feedback technique is presented shortly that shows the possibility of denoising-factor acquisition by AirComp over a fixed number of feedback rounds. To begin with, it is assumed that the normalized beamformer is acquired at the AP using the technique in the preceding sub-section and then broadcasts to all sensors. This allows each sensor to apply zero-forcing transmit beamforming for inverting the corresponding channel matrix. Specifically, the transmit beamformer at sensor is given as . Such beamforming creates a effective set of parallel MACs such that the signal vectors transmitted by sensors are summed at the AP. In other words, with denoting the signal vector for sensor , the receive signal vector at the AP is . Furthermore, we assume that the denoising factors lie in a fixed finite range . Given the parallel MACs and the assumption, the algorithm for denoising-factor feedback with feedback rounds is described as follows.
Algorithm for Denoising-Factor Feedback:
(Initialization): Set the feedback counter and intialize the feedback-quantization range with , and .
(Feedback Quantization): A quantizer codebook with values, denoted by , with , is generated by uniformly partitioning the range . Thus, the maximum quantization error is bounded by half of each partition interval denoted as : . Quantizing the feedback value in (20) at sensor gives the codebook index .
(Concurrent Feedback): Each sensor transmits a signal vector comprising a single at the location specified by the corresponding codebook index and ’s at other locations. Specifically, the signal vector for sensor is
Then all sensors transmit their signal vectors simultaneously over the said effective parallel MACs. The AP finds the largest index of a nonzero element in the received signal vector , denoted as the . Thereby, using the codebook , it can be inferred at the AP that the quantized value of the denoising factor is and the exact value lies in the range .
(Refining Quantization Range): To improve the quantization resolution in the next feedback round, the AP refines the quantization range as
Next, increase the counter by setting and go back to 2) if or otherwise stop the feedback process.
From the above algorithm, the key result on the feedback accuracy follows.
Given feedback rounds, the AP receives a quantized version of , denoted as , with the quantization error bounded as
Proposition 1 implies that the feedback error reduces exponentially with the number of feedback rounds. Specifically, adding a feedback round improves the feedback resolution by bit. Given a target resolution with the maximum quantization error , the required number of feedback rounds by the proposed feedback scheme is given by
As an example, for some practical settings of , , , , the required number of rounds (feedback slots) is according to (23). This is much smaller than the number of sensors in a dense network which determines the feedback rounds if the traditional method of channel training is adopted.
Remark 4 (Comparison with State-of-the-Art).
The state-of-the-art algorithm for AirComp of a maximum function was proposed in  based on a different principle from the current design. In , the maximum of a set of distributed feedback values is progressively computed at the AP by sequential detection of the bits in the binary representation of the desired value via scheduling transmitting sensors by broadcasting a threshold. As the result, each feedback round increases the feedback resolution by a single bit and the algorithm cannot be straighforwardly extended to exploit spatial multiplexing. In contrast, by exploiting spatial channels for implementing uniform quantization, the proposed feedback algorithm achieves multi-bit resolution improvement for each feedback round as mentioned earlier.
Vi-C Comparison with Conventional Channel Training
For conventional multiuser channel training, sensors take turns to transmit pilot signals to AP for uplink channel training to avoid collision (see e.g., [26, 27]). The pilot signal for each sensor should be a or larger matrix for estimating a channel matrix. Thus it takes at least symbol slots to complete the channel training process for a network comprising sensors. In contrast, the proposed AirComp feedback technique for normalized beamformer feedback involves simultaneous transmissions of all , each of size , which thus requires only symbol slots. This together with the slots (typically ) for the feedback of the denoising factor yields the total feedback slots of independent of the network size . Consider a typical dense sensor network with , and , it takes only slots for the proposed AirComp feedback scheme in contrast to slots required by the conventional channel training. Thus AirComp feedback achieves -time of feedback overhead reduction in this example.
Vii Simulation Results
In this section, the performance of the proposed multi-function AirComp is evaluated by simulation. The simulation parameters are set as follows unless specified otherwise. The number of multi-modal sensors is , the AP array size at AP , the sensor array size and the number of computed functions are equal to . Each MIMO channel are assumed to be i.i.d. Rician fading , modelled as i.i.d. complex Gaussian random variables with non-zero mean
, modelled as i.i.d. complex Gaussian random variables with non-zero meanand variance . In addition, the average transmit-SNR constraint, defined as , is set to be dB.
Vii-A Baseline Beamforming Schemes
Given that the optimization of AirComp beamforming is a NP-hard problem, for the purpose of comparison, we consider two baseline AirComp beamforming schemes designed based on the classic approaches, namely antenna selection and eigenmode beamforming. Both schemes assume zero-forcing transmit beamforming in (7) and their difference lies in the receive beamformers. Define the sum-channel matrix . To enhance the receive SNRs, the antenna-selection scheme selects the receive antennas observing the largest channel gains in the sum channel . Consequently, the effective channel matrix after beamforming consists of rows of with largest vector norms. On the other hand, to select the strongest eigenmodes of for AirComp, the normalized eigenmode receive beamformer consists of the dominant left eigenvectors of . The denoising factor of each type of beamforming design is computed following (14) with modified accordingly.
Vii-B Performance of Multiple-Function AirComp
In Fig 4, the MSE performance of the proposed multi-function AirComp beamforming is compared with that of two baseline schemes introduced in the preceding subsections. A varying number of functions , size of receive array , and also number of sensors are considered in Fig. 4(a) - 4(c), respectively. Several key observations can be made as follows. First, for all schemes, the MSE is a increasing function of and but an increasing function of . This coincides with our intuition that, higher computation throughput is at a cost of declining accuracy, and more connected sensors makes it harder to design one common receive beamformer to equalize all different users’ MIMO channels. Nevertheless, deploying more receive antennas compensate for the performance degradation by exploiting diversity gain. Second, under various parameter settings, the proposed scheme outperforms the other two baseline schemes, showing the effectiveness of the new design approach based on optimization on the Grassmann manifold. Furthermore, the performance gain of the proposed design is larger in the regime of large and , further confirming the effectiveness of the proposed design for multi-modal sensing and dense networks. Last, one can observe that the performance between different schemes converge as grows. This suggests that the large diversity gain enhances the receive SNRs such that the optimzation of AirComp beamforming is less critical and simple designs suffice.
Vii-C Comparison with the SDR Method
The discovered AirComp-multicasting duality leads to the availability of two methods, the proposed weighted centroid and the SDR methods, for designing beamforming in either type of systems. Their performance and complexity are compared by simulation as follows. Consider single-antenna uni-modal sensors as in Section V.
The comparison of the MSE performance and computation time between the proposed weighted centroid and the SDR solutions is provided in Fig. 5 for the varying receive array size and number of sensors . The computation time is measured using MATLAB. It is observed that the weighted centroid solution can achieve comparable performance as the SDR solution, which is optimal with a high probability for the NP-hard multicast beamforming problem as shown in
. The computation time is measured using MATLAB. It is observed that the weighted centroid solution can achieve comparable performance as the SDR solution, which is optimal with a high probability for the NP-hard multicast beamforming problem as shown in. On the other hand, the former achieves dramatic computation time reduction with respect to the latter, ranging from 100x to 1000x in the considered ranges of and . The simulation results support our previous analysis in Section V that the complexity of the proposed solution is independent of the network size and furthermore insensitive to the variation of the array size while the complexity of the SDR solution is . Thereby, the proposed solution features low complexity and is preferred in the large scale sensor networks (or large scale multicast networks) or when the AP is equipped with a large scale array.
Viii Concluding Remarks
In this paper, we have proposed the framework of multi-function AirComp for MIMO HMM sensor networks. In particular, we have developed an approach for designing receive beamforming using tools from differential geometry. This approach achieves dramatic complexity reduction than the state-of-the-art SDR approach while maintaining comparable performance. Furthermore, building on the AirComp system architecture, intelligent channel feedback techniques have been designed for enabling AirComp beamforming. Unlike the traditional method of channel training, the techniques prevent feedback overhead from escalating with the number of sensors and thus are highly efficient for dense HMM sensor networks. Last, the discovery of AirComp-multicasting duality allows the low-complexity beamforming design to be transferable to multi-antenna multicast systems, which traditionally relies on the computation-intensive SDR method for beamforming optimzation. The work points to the promising new research area of MIMO AirComp where many interesting research issues warrant further investigation such as sensor scheduling, broadband AirComp, AirComp for multi-AP cooperative sensor networks, and AirComp for supporting distributed learning and inference.
-A Preliminaries on Grassmann Manifold
-A1 Stiefel and Grassmann Manifolds
The Stiefel manifold is the set of all -by- tall orthonormal matrices for , denoted by . Mathematically, . On the other hand, the Grassmann manifold is a set of all -dimensional subspaces in , denoted by . Thereby a Grassmann manifold can be seen as the quotient space of . To be specific, a point on the Grassmann manifold corresponds to a class of -by- orthonormal matrices on the Stiefel manifold that span the same column subspace defined by the point. Choosing an arbitrary matrix from this class and using it as a generator, the class, denoted as , can be mathematically written as where denotes the group of unitary matrices. This leads to a relation between the Grassmannian and the Stiefel : .
-A2 Distance Metrics on Grassmann manifold
Algorithms on Grassmann manifold often involves the calculation of the distance between points on the manifold. There exist many different distance metrics and all of them are derived from a key notion called geodesic. Roughly speaking, a geodesic is the unique curve linking two points on a manifold that has the shortest length among all. The length of the geodesic is called the geodesic distance (or arc length). Mathematically, given , their geodesic distance, denoted as is calculated by where are called the principal angles, measuring the minimal angles among any two sets of orthonormal bases spanning the two subspaces. An efficient way to compute the principal angles is to perform SVD on , i.e.,
where the singular values in (24) yields the cosines of the principal angles. Based on these angles, a rich set of subspace-distance metrics can be defined. Two particular metrics of relevance in this paper are :
where is the vector formed by , and matrices and follow those in (24). For , we have the following inequalities relating different distance metrics:
Appendix A Supplementary Proof for the Derived Key Results
A-A Proof of Lemma 1
Given the MSE objective provided in (4), it is easy to note that both the first and the second terms within, i.e., and are positive semidefinite matrix with non-negative eigenvalues. As a result, for any given equalizer , we have the following inequality:
It is easy to verify that setting to have the zero-forcing structure in (7) enforces
and thus achieves the equality in (28), which completes the proof.
A-B Proof of Lemma 2
Utilizing the fact that, for any square matrix, the inequality holds, it is straightforward to show
Note that is a tall unitary matrix, thus matrix and share the same eigen-spectrum due to the well-known unitary invariant property. Therefore, it is easy to see that the upper bound (29) becomes exact when the channel is well-conditioned.
Then given the compact eigenvalue decomposition , we have the following
where the second equality uses the fact that, for arbitrary two matrices and of the same size, and have the same eigen-spectrum, while the last inequality is due to [41, Corollary 11], and it is easy to verify that the equality holds when is a scaled identity matrix, namely
is a scaled identity matrix, namelywhich also implies the channel is well-conditioned.
A-C Proof of Lemma 3
Note that the set of power constraints in P4 can be rewritten as one single constraint by:
It is easy to note that the minimum in P4 is achieved when the above constraint is active (the equality holds). Therefore, one can move the constraint in (33) to the objective function and have the following equivalent min-max problem:
The problem in (34) can be further simplified by max-min the inverse of the objective function and dropping the constant term which leads to the following form
The objective function in (35) is related to the projection 2-norm Grassmannian metric via
A-D Proof of Lemma 5
In this proof, we first construct an equivalent problem of P7 by modifying the objective function leveraging the orthogonality constraint, giving the problem . Then, we show that solving a relaxed version of without the orthogonality constraint, denoted by , yields a solution enforcing the constraint still. Therefore, one can conclude that problem P7 and share the same optimal solution, while the unconstrained problem can be solved easily by checking all stationary points of the objective. The detailed derivation is presented below.
After some simple algebra manipulation exploiting the linearity of the trace operation, the objective function in (P7) can be further simplified as
where is the effective CSI that has been defined in (12). Then problem (P7) reduces to
Starting with problem , let’s first define an alternative objective function given by
It is straightforward to verify that the following problem is equivalent to problem .
Now, let’s relax the orthogonality constraint, and consider the following unconstrained problem:
Since the function is a smooth function with gradient defined everywhere, the solution to the problem should be a stationary point of , i.e., . It follows that
From (42), one can note that is one of the stationary points.
To seek other stationary points that , we left multiply both sides of the equality in (42) with