I Introduction
Nonorthogonal multiple access (NOMA) has received much hype due to its promise to effectively utilize the wireless spectrum. NOMA works by allowing users to share the same temporal/ spatial resources while the receiving side carries out successive interference cancellation (SIC) [1, 2]
. On the other hand, cooperative communications can help by improving the system capacity, extend the coverage area and achieve a higher degree of freedom with single antenna nodes. Thus, the idea of user cooperation in NOMA has attracted much interest due to its applications in 5G and has given birth to an important research topic called the cooperative NOMA. It was first proposed in
[3] wherein, a user with the stronger channel decodes the message and then assist by relaying the message to the far NOMA user.Despite substantial improvements in terms of spectral efficiency, the research work on energy efficient cooperative NOMA schemes is still at infancy stage. To that end, simultaneous wireless information and power transfer (SWIPT) has drawn much research interest due to the ability of RF signals to transfer information and energy at the receiver [4]. Thus, applications of SWIPT in NOMA have been studied from the perspective of outage performance, cooperation, and energy harvesting (EH) efficiency [5]. However, owing to the dual function of RF signal and broadcast nature of NOMA, the transmission from source to destination can be eavesdropped by a malicious user. More specifically, the EH receivers can intercept the confidential information being exchanged between legitimate users. In order to provide security to the lowpowered devices, physical layer security (PLS) has been introduced as an alternative to computation heavy cryptographic techniques [6]. PLS techniques can improve the secrecy performance of wireless networks by means of cooperative relaying, jamming and multipleantenna beamforming.
In [7], the authors proposed energy and spectral efficient protocol by combining NOMA with SWIPT. They showed that the proposed scheme does not jeopardize the diversity gain of the edge users while enabling the cellcenter users to selfpower themselves. In [4], Diamantoulakis et al. consider downlink and uplink multiple access protocols for SWIPT systems. In particular, they investigate the performance in the downlink for NOMA and time division multiple access (TDMA), while for uplink conditions they consider NOMA with time sharing. These works were extended for multipleinputsingleoutput (MISO) NOMA for hybrid time switching and powersplitting SWIPT architecture in [8]. They also derived tight closedform expressions of the outage probability and demonstrated the superiority of cooperative NOMA over conventional NOMA and OMA systems. Besides these developments, few investigations have been conducted for improving the secrecy performance of NOMA using SWIPT. The authors of [9] maximized the secrecy sum rate by optimizing the allocated power. Closedform expression of optimal powersplitting ratio was derived and it was shown that the proposed method outperforms the uniform power allocation methodology. In [10], Zhou et al. proposed a cooperative jamming technique for energy harvesting multipleinputsingleoutput (MISO)NOMA cognitive radio systems. The authors claimed that the proposed scheme for NOMA outperforms the conventional orthogonal multiple access (OMA) scheme in terms of power efficiency.
Of late, deep learning has emerged as a key technique for improving the performance of wireless networks. Deep learning is a part of machine learning consisting of multiple hidden layers
[11]. More specifically, in contrast to shallow machine learning methods, deep learning has multiple intermediate layers of neurons between input and output layers. At each hidden layer, the weighted sum of the previous layers are updated and an activation function is applied
[12]. The authors of [13] first proposed the idea that deep learning is an important and powerful tool for handling nonlinear and complex problems. Some other works considered deep learning for the physical layer, multipleinputmultipleoutput (MIMO) systems, and channel coding [14, 15]. This positive trend also attracted muchneeded attention to multiple access schemes. Thus, the authors of [16] optimized the sparse code multiple access (SCMA) scheme using deep learning. To do so, they developed a strategy for selecting the codebook which minimizes the bit error rate (BER) while using the minimum amount of computation time. Another important study that integrates orthogonal frequency division multiplexing (OFDM) and deep learning was conducted by the authors of [17]. It was shown that the deep learning approach performs best for signal detection and channel estimation. More recently, the authors of
[18]used long shortterm memory (a branch of supervised deep learning) for data detection in uplink NOMA. They showed that the deep learning based NOMA scheme is more reliable as compared to conventional harddecision optimization solutions.
So far, it has become evident that the work on secrecy performance of energy harvesting cooperative NOMA systems is very limited. Moreover, the work on deep learning approaches for physical layer security of NOMA is nonexistent. Therefore, in order to advance this promising field of wireless communications, we consider a scenario where energy harvesting cooperative NOMA users communicate in the presence of an energy harvesting eavesdropper. We derive the analytical expression of intercept probability of DF energy harvesting cooperative NOMA system which, according to the authors’ best knowledge, has not been derived in the literature. We also optimize the secrecy performance of the system by using deep learning for finding the optimal value of the power allocation factor. The results of deep learning approaches are then compared with benchmark iterative search algorithms. It has been shown that the deep learning based NOMA scheme is robust and computationally lightweight.
The remainder of the paper is organized as follows. Section II provides details of the system model. In Section III, the analytical results for intercept probability are provided. In Section IV, deep learning based neural network model is discussed. Section V provides numerical results and their relevant discussion. In Section VI, some concluding remarks are provided.
Ii System Model
Let us consider a cooperative relaying system consisting of a source (S), and two destinations ( and ) in the presence of an eavesdropper (E) as shown in Figure 1. The nodes , and E are able to decode information and harvest energy from the received RF signal. It is assumed that , and E have the channel state information (CSI) of their corresponding links, whereas, S being the source has the CSI of all the nodes. The channel gains from S , S , S E and E are assumed to be Rayleigh distributed and given as , , , , respectively.
The transmission takes place in two time slots. In the first phase, S transmits the superimposed message to and , where and are the data symbol and power allocation coefficient of th destination and denotes the total transmit power. It is assumed that , hence the power allocation factor should satisfy , where . The nodes , and E are assumed to use the powersplitting receiver architecture for ID and EH. According to powersplitting architecture, the received power is split into two power streams by a powersplitting factor for EH and for ID, where . The received signal at , and E during the first time slot can be written as
(1)  
(2)  
(3) 
where , , denote powersplitting factor at , and E during first phase. Also, , , represent the additive white Gaussian noise (AWGN) with zero mean and variance. The node first decodes its own symbol by treating as interference. After obtaining , cancels its own signal by using successive interference cancellation (SIC) to get . The received signal to interference and noise ratio (SINR) and signaltonoise ratio (SNR) for symbols and can be, respectively, given as
(4)  
(5) 
The far user, , treats the as interference. Then the received SINR at can be written as
(6) 
It is assumed that the link between the source and near user is secure and the eavesdropper tries to decode the information signal of the far user. In order to decode , the eavesdropper treats as noise. Hence, the SINR at E can be expressed as
(7) 
In the second phase, transmits the decoded symbol to with power . Assuming that can perfectly decode and use all harvested energy during first phase to transmit to , the received SNR at can be given as
(8) 
Similarly, the received SNR at E during the second phase can be given as
(9) 
Iii Intercept Probability
In this section, we derive the analytical expression of intercept probability for the considered case. An intercept event occurs when the achievable secrecy rate falls below 0 [6]. The achievable secrecy rate is the difference between the rates of the main and wiretap links i.e. .
Since we consider DF protocol at , therefore, the SNR will be determined by the bottleneck link between S and and to is given as while the achievable rate for the link S is given as . The eavesdropper is assumed to select the best messages received during first and second phase given as . Therefore, the achievable rate for wiretap links can be given as . Now the intercept probability can be given as
(10) 
Assuming and , we can rewrite 10 as
(11) 
The Cumulative Distribution Function (CDF) can be found as
(12) 
In the above equation, we obtain
(13) 
where and . The CDF of depends on the event that near user has successfully decoded the symbol of the far user. In this case, the CDF of can be expressed as
(14) 
Assuming and we get
(15) 
where is the complementary CDF, , and .
(16) 
(17) 
Now we find which is given as
(18) 
where and . Now by differentiating we obtain
(19) 
Iv Deep Learning Based Optimization
In this section, we are going to present a deep learning based resource allocation scheme for optimizing the achievable secrecy rate of the far user. We employ neural networks to learn the relationship between inputs and outputs and predict the optimal power allocation factor that maximizes the achievable secrecy rate. To do so, we carefully train our multilayer artificial neural network, whereby, each layer consists of multiple neurons as illustrated in Figure 2
. We show in the numerical results section that the computational efficiency of artificial neural networks is one of the highlights of deep learning models. After the model has been trained on a set of inputs, the testing (i.e., the realtime running phase) involves only nonlinear transformations and vector multiplications without compromising the performance.
Iva Problem Formulation
We now try to optimize the secrecy performance of the far user due to it being most vulnerable to eavesdropping attack. Considering the worstcase scenario^{1}^{1}1We consider that the CSI of the eavesdropper is not available at the source due to which the achievable rate of eavesdropper cannot be calculated., the legitimate receivers have no option but to maximize their own achievable rate. By this approach, maximizing the achievable rate would result in maximizing the secrecy rate as well since the secrecy rate is the difference between the rate of legitimate link and the rate of wiretap link. Under this condition, the optimization problem of the achievable rate becomes equivalent to
(20) 
However, the second term (i.e., ) in (20) does not contain power allocation factor . Thus, the optimization problem can be reformulated as .
IvB Deep Learning Network Setup
Our neural network consists of multiple hidden layers and a single input and output layer. The main reason for using multiple hidden layers is to avoid underfitting of test data while maintaining a sufficient level of complexity. Moreover, by utilizing multiple hidden layers, the complex interplay of inputs and outputs can be understood by the network during the learning phase. In our case, the inputs are the channel realizations and the outputs are the power allocation factors of the far user. For each channel realization, we take samples from the Rayleigh distribution while fixing all the other parameters. These values are generated for training and validation datasets that are fed into the network during the training phase.
At each hidden layer, we use rectified linear unit (ReLU) activation function. Mathematically, the ReLU activation function is represented as
(21) 
where denotes the output of the activation function and is the input of the function. We have used mean square error as the cost function and apply the minibatch algorithm on the training data samples for calculating the gradients.
V Numerical Results
This section provides numerical results and relevant discussion. It is worth mentioning that the analytical and simulation results have been generated using MATLAB, whereas, the deep learning optimization is performed in Python 3.6.7. Unless stated otherwise, the parameters used for generation of plots are as follows: ,
dB, decay rate = 0.9, training samples = 30000, test samples= 6000, and epochs=100.
Figure 3 illustrates the intercept probability against different values of transmit SNR. It can be seen that the intercept probability decreases with an increase in the values of . As anticipated, the values of also have a prominent impact on the intercept probability. Strictly speaking, the intercept probability generally increases with a reduction in the values of . However, the separation between the curves of grows as the value of transmitting SNR increases. This shows that a low energy harvesting efficiency is more harmful at higher values of SNR, giving rise to a higher intercept probability. In addition to this, we note that at higher values of , the intercept probability significantly increases. This is partly because of the low power allocation to the near cooperative user. At lower values of the power allocation factor, decoding the message of the far user becomes difficult for the near cooperating user. It can also be seen that the simulation results closely follow the analytical result which validates the derived expression.
To further highlight the impact of a powersplitting factor, Figure 4 shows the intercept probability as a function of . It can be observed that the intercept probability generally increases with an increase in the value of the powersplitting factor. This trend can be attributed to the low amount of energy reserved for information decoding which makes it difficult to maintain cooperation among near and far users. Evidently, the increasing values of causes intercept probability to decreases. However, the power allocation factors for near and far users have shown different trends as the value of changes. Precisely, we note that at lower values of powersplitting factors, the separation between the curves of and is quite large. But, as the values of the powersplitting factor increase, the difference between the curves becomes smaller. This shows that the impact of power allocation factors of NOMA reduces at higher values of a powersplitting factor.
In Figure 5
, we have demonstrated the optimization results for the deep learning approach. Here, “OptimalIterative” refers to the optimal results achieved through iterative search scheme, “DL” denotes the results for deep learning approach, and “Random” represents the results for random power allocation factor generated using uniform distribution. Figure
5(a) shows the results for achievable secrecy rate against the increasing values of the powersplitting factor. As shown in this plot, the larger values of the powersplitting factor reduce the achievable secrecy capacity as more power is reserved for energy harvesting. It can be seen that the deep learning approach strictly follows the optimal results, while always achieving the accuracy of more than 90%. By contrast, the random power allocation achieves a very low secrecy capacity. To further highlight the robustness of the deep learning approach, Figure 5(b) plots the results for computation time against different values of . For this case, the deep learning approach performs significantly better than that of iterative search. This shows that once trained, the deep learning models can provide a lightweight solution to achieve optimal results.(a) 
(b) 
(a) 
(b) 
Figure 6 emphasizes the suitable number of layers for the neural network but plotting achievable secrecy rate and computation time for 05 hidden layers. It can be seen in Figure 6(a) that the separation between deep learning and iterative approach slightly increases. We attribute this increase to overfitting of data during the training phase. This results in causing the neural network to memorize the training set. However, when new testing data is presented, the network fails to generalize the results. This increase in a number of hidden layers also affects the computation time, as shown in Figure 6(b). Specifically, we observe that the computation time of deep learning approach considerably increases as compared to Figure 5(b). This increase in computation time is due to the increase in the number of hidden layers and in the total number of neurons in the neural network.
Vi Conclusion
This study provides secrecy analysis and deep learning optimization of SWIPTbased cooperative NOMA systems. We derive the analytical expression of the intercept probability when near user act as a cooperative node in the presence of an eavesdropper. We have shown that the impact of power allocation factors of NOMA reduces at higher values of the powersplitting factor. Moreover, we have shown that deep learning approach is more robust and computationally efficient as compared to conventional iterative search approach. In the future, we aim to use deep learning for optimizing the secrecy performance of cooperative NOMA systems under colluding eavesdroppers.
Acknowledgment
This work is partially supported by the National Key R & D Plan (2017YFC0803403), the National Natural Science Foundation of China (61371188) and the Fundamental Research Funds of Shandong University (2018GN051).
References
 [1] W. U. Khan, Z. Yu, S. Yu, G. A. S. Sidhu, and J. Liu, “Efficient power allocation in downlink multicell multiuser noma networks,” IET Communications, vol. 13, no. 4, pp. 396–402, 2019.
 [2] F. Jameel, S. Wyne, S. J. Nawaz, Z. Chang, and T. Ristaniemi, “Outage Analysis of RelayAided NonOrthogonal Multiple Access with Partial Relay Selection,” in 2018 IEEE Globecom Workshops (GC Wkshps), Dec 2018, pp. 1–6.
 [3] Z. Ding, M. Peng, and H. V. Poor, “Cooperative NonOrthogonal Multiple Access in 5G Systems,” IEEE Communications Letters, vol. 19, no. 8, pp. 1462–1465, Aug 2015.
 [4] P. D. Diamantoulakis, K. N. Pappi, G. K. Karagiannidis, H. Xing, and A. Nallanathan, “Joint downlink/uplink design for wireless powered networks with interference,” IEEE Access, vol. 5, pp. 1534–1547, 2017.
 [5] Z. Ding, H. Dai, and H. V. Poor, “Relay selection for cooperative NOMA,” IEEE Wireless Communications Letters, vol. 5, no. 4, pp. 416–419, 2016.
 [6] F. Jameel, S. Wyne, and Z. Ding, “Secure Communications in ThreeStep TwoWay Energy Harvesting DF Relaying,” IEEE Communications Letters, vol. 22, no. 2, pp. 308–311, Feb 2018.
 [7] Y. Liu, Z. Ding, M. Elkashlan, and H. V. Poor, “Cooperative Nonorthogonal Multiple Access With Simultaneous Wireless Information and Power Transfer,” IEEE Journal on Selected Areas in Communications, vol. 34, no. 4, pp. 938–953, 2016.
 [8] T. N. Do, D. B. da Costa, T. Q. Duong, and B. An, “Improving the performance of celledge users in MISONOMA systems using TAS and SWIPTbased cooperative transmissions,” IEEE Transactions on Green Communications and Networking, vol. 2, no. 1, pp. 49–62, 2018.
 [9] G. He, L. Li, X. Li, W. Chen, L. L. Yang, and Z. Han, “Secrecy sum rate maximization in NOMA systems with wireless information and power transfer,” in 2017 9th International Conference on Wireless Communications and Signal Processing (WCSP), Oct 2017, pp. 1–6.
 [10] F. Zhou, Z. Chu, H. Sun, R. Q. Hu, and L. Hanzo, “Artificial Noise Aided Secure Cognitive Beamforming for Cooperative MISONOMA Using SWIPT,” IEEE Journal on Selected Areas in Communications, pp. 1–1, 2018.
 [11] Y. LeCun, Y. Bengio, and G. Hinton, “Deep learning,” nature, vol. 521, no. 7553, p. 436, 2015.

[12]
V. Nair and G. E. Hinton, “Rectified linear units improve restricted boltzmann machines,” in
Proceedings of the 27th international conference on machine learning (ICML10), 2010, pp. 807–814.  [13] G. E. Hinton, S. Osindero, and Y.W. Teh, “A fast learning algorithm for deep belief nets,” Neural computation, vol. 18, no. 7, pp. 1527–1554, 2006.
 [14] T. Gruber, S. Cammerer, J. Hoydis, and S. ten Brink, “On deep learningbased channel decoding,” in Information Sciences and Systems (CISS), 2017 51st Annual Conference on. IEEE, 2017, pp. 1–6.
 [15] T. Wang, C.K. Wen, H. Wang, F. Gao, T. Jiang, and S. Jin, “Deep learning for wireless physical layer: Opportunities and challenges,” China Communications, vol. 14, no. 11, pp. 92–111, 2017.
 [16] M. Kim, N.I. Kim, W. Lee, and D.H. Cho, “Deep LearningAided SCMA,” IEEE Communications Letters, vol. 22, no. 4, pp. 720–723, 2018.
 [17] H. Ye, G. Y. Li, and B.H. Juang, “Power of deep learning for channel estimation and signal detection in OFDM systems,” IEEE Wireless Communications Letters, vol. 7, no. 1, pp. 114–117, 2018.
 [18] G. Gui, H. Huang, Y. Song, and H. Sari, “Deep learning for an effective nonorthogonal multiple access scheme,” IEEE Transactions on Vehicular Technology, vol. 67, no. 9, pp. 8440–8450, 2018.
Comments
There are no comments yet.