Received: 28 January 2021 Revised: 5 July 2021 Accepted: 10 August 2021 DOI: 10.1002/ett.4352 S U R V E Y PA P E R Machine learning for cooperative spectrum sensing and sharing: A survey Dimpal Janu1 Kuldeep Singh1 1 Department of Electronics and Communication Engineering, Malaviya National Institute of Technology, Jaipur, India Sandeep Kumar2 Abstract With the rapid development of next-generation wireless communication technologies and the increasing demand of spectrum resources, it becomes nec- 2 Central Research Laboratory, Bharat Electronics Ltd, Ghaziabad, India Correspondence Sandeep Kumar, Central Research Laboratory, Bharat Electronics Ltd, Ghaziabad, Uttar Pradesh, India. Email: sann.kaushik@gmail.com essary to introduce learning and reasoning capabilities in cognitive radio networks (CRN). In particular, our focus is on two fundamental applications in CRNs, namely spectrum sensing (SS) and spectrum sharing. The application of machine learning (ML) techniques has added new aspects to SS and spectrum sharing. This paper offers a survey on various ML-based algorithms in the cooperative spectrum sensing (CSS) and dynamic spectrum sharing (DSS) domain, with its emphasis on types of features extracted from primary user signal, types of ML algorithm, and performance metrics utilized for evaluation of ML algorithms. Starting with the basic principles and challenges of SS, this paper also justifies the applicability of supervised, unsupervised, and reinforcement ML algorithms in the CSS domain. The application of ML algorithms, to solve the DSS problem has also been reviewed. Finally, the survey paper is concluded with some suggested open research challenges and future directions for ML application in next-generation communication technologies. 1 I N T RO DU CT ION Global data traffic has been increasing since the past years and is expected to grow faster; mobile subscribers will increase from 5.1 billion in 2018 to 5.7 billion in 2023.1 According to a Cisco report, mobile devices will grow from 8.8 billion in 2018 to 13.1 billion in 2023. 5G connection speed will be 13 times higher than the average mobile connection and is expected to be 575 Mbps by 2023. Along with the significant growth in data traffic, new applications of wireless communications such as IoT, wearable devices, etc., are continuously generating a huge amount of data. To address the spectrum scarcity problem and utilize available energy resources, the future wireless communication system must support intelligent processing. This has led to research in the field of cognitive radio (CR). The CR is an intelligent device that senses its electromagnetic environment and can dynamically change/configure its radio operating parameters.2 The concept of CR came into existence to overcome the scarcity problem of limited radio spectrum by exploiting the underutilization of the available spectrum.3 The CR devices opportunistically utilize the spectrum allocated to primary user (PU) when PU is detected idle by it. Due to shadowing4,5 and multi-path fading,6 the quality of the PU signal received at the secondary user (SU) is degraded, leading to wrong sensing decisions about the presence/absence of PU. In certain situations, due to the geographical constraint, the SU cannot receive the signal of Trans Emerging Tel Tech. 2022;33:e4352. https://doi.org/10.1002/ett.4352 wileyonlinelibrary.com/journal/ett © 2021 John Wiley & Sons, Ltd. 1 of 28 JANU et al. 2 of 28 the PU, which is known as the hidden node problem. To overcome these issues, the collaborative spectrum sensing (SS) is performed, which improves the system’s sensing performance.7,8 CR technologies have capabilities to provide intelligent spectrum management to fulfill the ever-increasing demand for spectrum resources. Figure 1 presents the complete spectrum management framework and it shows the four main functions of cognitive radio network (CRN). a) Spectrum sensing: First step is to discover the spectrum holes on the arrival of SUs. SS enables to detect spectrum holes (in terms of duration of availability, frequency and location) and PU activity by continuously sensing the spectrum. b) Spectrum sharing: As the multiple SUs are trying to access the available spectrum, there is possibility of collision among the SUs. Spectrum sharing deals with coordination among the SUs to access the spectrum. c) Spectrum decision: It includes spectrum analysis by choosing the appropriate spectrum bands as per the users’ quality of service requirement and decision making. After the spectrum analysis, a decision has been made to access the vacant spectrum bands. d) Spectrum mobility: Main function of spectrum mobility is spectrum handoff and connection management. SUs need to switch from one spectrum hole to another on the arrival of PU. Connection management ensures continuous data transmission from SU in new spectrum hole. As mentioned above, the various applications of CR are briefly discussed. In this paper, our main focus is on the two fundamental applications such as SS and spectrum sharing. In the cooperative spectrum sensing (CSS) framework, SUs of a CRN exchange their sensing results with a common receiver known as a fusion center (FC), making the final decision for PU’s presence or absence. There are various conventional SS techniques such as energy detection, matched filter detection, cyclostationary detection, etc. These techniques require the optimum threshold to detect the presence of PU and each design has a different algorithm for the calculation of the threshold value. If the threshold is too high, then it can lead to the probability of missed detections. If the threshold is too low, it can lead to the probability of false alarms. After completing the process of detecting the spectrum holes, the concept of spectrum sharing was introduced to solve the spectrum scarcity problem. Spectrum sharing is a means to optimize spectrum usages by enabling the sharing of the same frequency band among the multiple users having different priorities without causing interference to each other. The main advantages of spectrum sharing are increment in capacity and improvements in spectrum utilization. Therefore, to alleviate the problems encountered in conventional SS techniques and dynamic spectrum sharing (DSS), machine learning (ML)-based frameworks are employed for CSS and DSS. The ML algorithms can solve FIGURE 1 Spectrum management framework for CR network JANU et al. 3 of 28 complex problems by analyzing and interpreting structures and patterns in data as they are capable of learning, reasoning, and decision making. Applying ML algorithms in CRN increases perception capability and reconfigurability as well as automation level and network flexibility.9 The three types of ML algorithms: unsupervised, supervised, and reinforcement learning (RL) have been utilized for SS in recent years. Various traditional SS techniques are being used, but ML has been introduced in SS to enhance the probability of detection and accuracy of the system. 1.1 Existing survey papers To the best of our knowledge, an exclusively survey paper on ML-based CSS and DSS is not presented to date. This survey is very specific to the application of ML techniques for CSS and DSS. All the existing survey papers which have a brief discussion on ML-based CSS methods have been discussed here. Alshawaqfeh et al10 discussed ML applications for CSS, learning engine, power allocation, and modulation classification in CRN. Bkassiny et al11 characterized the learning problems in CRs and highlighted the importance of ML in developing practical CRs. In Reference 12, authors have reviewed how intelligent wireless networks can be created with the help of CR and how ML algorithms can be utilized to improve the spectrum efficiency and energy efficiency of the wireless networks. To enhance the intelligence in future wireless communication networks, authors have discussed different ML approaches, environment perception, interference sensing, modeling, multi-dimensional SS, and spectrum measurements. Arjoune and Kaabouch13 provide an overview of advancement in SS technique and discuss the methods that describe the incorporation of compressive sensing and ML into CRN. Table 1 lists the existing survey papers in which ML-based algorithms for CSS and DSS have need discussed in a subsection. In our work, we have presented a detailed discussion on the ML-enabled CSS and DSS techniques from the perspective of various features used, different ML algorithms and performance evaluation metrics and recent development in DSS domain as well. We have reviewed 45 papers related to CSS and 10 papers related to DSS utilizing machine learning algorithms in the recent years 2010-2021. 1.2 Key contributions of the work In this paper, we present a comprehensive survey of the key research work on two applications of CR namely CSS and DSS using ML algorithms. Our main contributions can be summarized as follows: • Various ML-based techniques have been recently applied in the field of CSS to provide efficient solutions to the challenges associated with the traditional CSS methods. Therefore, before introducing the ML-based CSS methods, the conventional approaches are briefly discussed first. T A B L E 1 Existing survey papers Existing survey paper Number of papers employing ML Major area discussed CSS DSS Duration 10 Utilization of ML algorithms for power allocation, modulation classification, and for CSS 6 0 2010-2013 11 A study of ML algorithms and characterization of their applications in CRN 1 0 2009 12 A detailed study on CR technology such as spectrum sensing, spectrum access and resource allocation and application of ML algorithms in CRN and future wireless communication systems 7 0 2010-2014 13 Advancement of ML techniques, Incorporation of ML and compressive sensing 10 3 2013-2017 JANU et al. 4 of 28 • The ML algorithms have been extensively used for solving the issues in CSS in this decade. However, to the best of our knowledge, a survey paper reviewing the advances of ML in the field of CSS is not presented to date. We are presenting the first survey paper exclusively for ML-based CSS techniques. • The classification of CSS techniques based on the different categories of ML algorithms is presented. An extensive review of the feature computation methodologies for implementing ML techniques is also presented in this survey paper. • We have also catered to the performance evaluation perspective of ML-based CSS techniques. Various metrics that have been utilized for performance evaluation of these algorithms are reviewed and discussed. • Along with CSS, we have projected light on DSS domain with emphasizing on ML-based spectrum sharing techniques, which are the key features to improve the spectral efficiency and energy efficiency of a CRN. The rest of the paper is organized as follows. In Section 2, traditional approaches for CSS have been discussed. Section 3 elaborates on the application of ML in the field of CSS. In this section, different types of ML, such as supervised, unsupervised, and reinforcement ML used for SS in the literature is discussed along with a brief introduction of these techniques and in the subsection of this Section 3, the reviewed papers are characterized based on the feature computation techniques and the evaluation metrics utilized for performance comparison. In Section 4, DSS with fundamental approaches along with the application of ML into DSS have been discussed. In the last, conclusions and future scope are discussed for building intelligent wireless communication systems. In order to improve the readability of this paper, we define abbreviations in Table 2. T A B L E 2 Nomenclature Acronyms Definitions A3C Asynchronous advantage actor critic AWGN Additive White Gaussian Noise ACF Auto correlation function ANN Artificial neural networks BSs Base stations BCDLN Bidirectional cognitive deep learning nodes BP-HMM Beta process-hidden Markov model CA Classification accuracy CSS Cooperative spectrum sensing CR Cognitive radio CRN Cognitive radio network CH Cluster heads CNN Convolutional neural network CN Cognitive node CRTP Convex relaxation with tree pruning CSI Channel state interference CVAE Conventional variational autoencoder DL Deep learning DNN Deep neural network DSS Dynamic spectrum sharing DSLSTM dynamic spectrum sharing long short term memory DPPO distributed proximal policy optimization DT Decision tree (Continues) JANU et al. 5 of 28 T A B L E 2 (Continued) Acronyms Definitions DMM Difference between maximum and minimum eigenvalue DRL Deep reinforcement learning DMEAE Difference between maximum eigenvalue and average energy DAR Decomposition and recombination ECOC Error correcting output codes EM Expected maximum EMD Empirical mode decomposition FCM Fuzzy C-means FD Full duplex FC Fusion center FFA Firefly algorithm GAC Genetic algorithm clustering GMM Gaussian mixture model GPU Graphics processing unit ICA Independent component analysis IQ In-phase quadrature-phase IoT Internet of things I-DAR Interval decomposition and recombination KNN k-Nearest neighbors LTE Long-term evolution MAC MAC Medium access control MDP Markov decision process MME Ratio of maximum and minimum eigenvalue MSE Difference between the maximum eigenvalue and mean eigenvalue ML Machine learning NP Non-parallel NSGA Non-dominated sorting genetic algorithm NSGA-RL Non-dominated sorting genetic algorithm reinforcement learning OFDM Orthogonal frequency division multiplexing O-DAR Order decomposition and recombination PCA Principle component analysis RMET Ratio of maximum eigenvalue to matrix trace RL Reinforcement learning ROC Receiver operating characteristic SS Spectrum sensing SVM Support vector machine SNR Signal-to-noise ratio SVD Singular value decomposition SAS Spectrum access sharing SARSA State action reward state SU Secondary User JANU et al. 6 of 28 2 C S S M ET H O D S In this section, various SS techniques, such as conventional SS and advanced SS techniques with their mathematical models, have been discussed. Figure 2 highlights the classification of several SS techniques that handle many challenging problems related to SS in CRN. 2.1 Conventional SS techniques Before starting the data transmission, a SU must ensure the availability of the spectrum. In CR networks, a SU identifies the PU activity without creating any disturbance to the network infrastructure of the PU. A hypothesis testing problem can be analyzed where we consider H0 hypothesis for the absence of PU’s signal and H1 hypothesis for the presence of PU’s signal. This problem can be considered as similar to a binary classification problem in the context of ML. The model for the analysis of the SS problem14 can be expressed as: y[i] = w[i]; H0 when PU is absent (1) y[i] = h ∗ s[i] + w[i]; H1 when at least one PU′s signal is present (2) i = 1, 2, 3, 4, … … ..N where y[i] represents the received signal at the SU node, s[i] is the transmitted signal by the PU node, and w[i] denotes additive White Gaussian noise (AWGN) with zero mean and variance π 2 , h is the channel gain, i is the sensing time, and N Energy detection Cyclostationary feature detection Conventional spectrum sensing techniques Matched filter detection Waveform based detection Cooperative Spectrum Sensing Wideband compressive Adaptive compressive Advanced spectrum sensing techniques Covariance based Machine Learning FIGURE 2 Cooperative spectrum sensing techniques JANU et al. 7 of 28 FIGURE 3 Traditional CSS model represents the number of sampling points. The sensing techniques’ performance can be evaluated in terms of probability of detection and probability of false alarm. We can define the probabilities as: pf = p(H0 βH1 ) (3) pd = p(H0 βH0 ) (4) Figure 3 depicts various components and their work in a traditional CSS model. Different conventional SS techniques are discussed in the following subsections. 2.1.1 Energy detection-based sensing technique The energy detection at SU is the most straightforward technique which does not require prior information of PU’s signal characteristics.15,16 The main advantage of this technique is low implementation and computational complexities in comparison with other sensing techniques.17,18 This method estimates the energy of samples received and compares it with a predefined threshold value to check the spectrum’s availability. If the received signal’s energy is lower/higher than the threshold value, PU is considered absent/present.19 However, this technique’s major challenge is to determine the threshold value for the identification of PU activity without causing any interference to it. Energy estimated at SU node can be defined as 1∑ |y[n]|2 N n=1 N XED = The mathematical expressions are as below. XED < π βΆ H0 is considered. XED > π βΆ H1 is considered. (5) JANU et al. 8 of 28 There are certain issues with the energy detection technique such as noise uncertainty, fading effect of the channel between the PU and SU, etc. Moreover, this method does not give good performance for the communication system that operates on low signal-to-noise ratio (SNR). The performance of the energy detection-based method is analyzed in literature by considering different fading models. 2.1.2 Cyclostationary feature detection-based sensing technique The statistical parameters of signal, such as modulation rate and carrier frequency, are periodic in nature and are considered as cyclostationary features. If the mean and the auto correlation function (ACF) of the received signal are periodic, then the signal is called a cyclostationary signal. The periodicity of mean and ACF of a signal is utilized by cyclostationary features-based detectors to accomplish the sensing task. Mathematically, periodicity of mean and ACF can be defined as: RπΏy (π) = RπΏy (n + N0 , π) (6) my (n) = E[y(n)] = my (n + π) (7) ∑ N−1 my (n) = [y[n]e−j2ππΏn ] (8) n=1 RπΏy (π) = E[y[n]∗ y[n + π]ej2ππΏn ] (9) where, E[.] represents expectation operator, N0 represents time period of received signal, π represents time period, πΏ represents cyclic frequency. RπΏy (π) represents autocorrelation function, and my (n) represents mean of the received signal y[n]. Compared to the energy detection technique, the cyclostationary feature detection technique is less sensitive to noise uncertainty and, therefore, has less probability of false alarm. Further, this technique can detect low SNR signal values. The cyclostationary features-based approach provides better detection performance than the energy detector. This is because noise is stationary and has no correlation and be easily discriminated from the signal by calculating the spectral correlation function.20,21 The noise signals are uncorrelated and non-periodic in nature, so it is easy for cyclostationary detection to distinguish between the PU signal and noise as the PU signal exhibits periodic properties. This technique’s major drawback is that it needs more power consumption, processing complexity, and sensing time. 2.1.3 Matched filter-based sensing technique The Matched filter-based sensing technique requires complete information about the characteristics of the signal transmitted by the PU.22,23 The PU signal’s principal properties are various modulation techniques, operating frequency, order, bandwidth, frame structure, etc. In this technique, there is a requirement of SU to demodulate the received signal. As the PU signal information is available to SU, this technique is most preferred to recognize the PU activity. The received signal at SU can be correlated with the parameters of PU’s signal. Mathematically test statistics can be defined as: 1∑ = y[n]xp∗ (n) N n=1 N TMF (10) where, xp∗ (n) represents test samples collected from PU transmitter, N represents the total number of samples. A comparison is made between the test statistics and the predefined threshold value to determine PU’s presence. TMF < π βΆ H0 is considered. TMF ≥ π βΆ H1 is considered. JANU et al. 9 of 28 This technique uses a dynamically calculated threshold value rather than the static threshold used in the above-discussed methods. The main disadvantage of this technique is that the SU must have accurate knowledge about the spectrum, but this is difficult to acquire information about the spectrum. 2.1.4 Waveform-based sensing technique This technique is known as coherent sensing. Synchronization can be made at the receiver end by using well-known patterns. These known patterns are called waveform and consist of various key elements such as spreading sequence, hopping sequence, pilot signal patterns, etc. To perform waveform-based sensing, the same copy of the signal is used to correlate the signal at the receiver.24 2.2 Advanced SS techniques The conventional sensing techniques deal with a single frequency spectrum band at a particular time. However, the recent advancements in sensing techniques identify multiple frequency spectrum bands by cooperation among multiple SUs and improve sensing precision. Moreover, conventional sensing techniques require more sensing time, but advanced sensing techniques can reduce the sensing time with cooperative decision-making. Despite having advantages of these techniques, there are few challenges, such as overhead problems caused due to the sharing of sensing results among SUs and the presence of malicious SUs in the CR network, which affects the cooperation and reduces the network’s reliability. Various recent advancements in the sensing techniques are discussed in the following subsections, which overcome conventional sensing techniques’ limitations. 2.2.1 Wideband compressive sensing As the advanced technologies in wireless communication systems require high data rates and increased bandwidth, SU is necessary to sense a wide frequency spectrum band. Because of hardware limitations on the sampling rate, the above-discussed techniques sense only a single frequency channel at a time. This technique is employed when signal bandwidth is more significant than the coherence bandwidth. In compressive sensing, sparse signals are recovered from a few measurements.25 In wideband SS, the band is divided into multiple frequency channels, and each channel is sensed concurrently. But this approach requires very high sensing time, hardware cost, and computational complexity. To reduce the sensing time and to expedite the sensing process, compressive sensing was introduced. Compressive sensing, which includes sampling and compressing in one step, is an acquisition process in which signal is compressed with the necessary information and reconstructed by sampling at the Nyquist rate or less than the Nyquist rate.26 To perform this process, there are three primary stages; the first one is the sparse representation of signals. In the second stage, signals are compressed and sampled. In the final step, reconstruction algorithms are performed to recover the wideband signal. 2.2.2 Adaptive compressive sensing technique To sense the wideband spectrum by using an appropriate number of measurements, the adaptive compressive sensing technique27 utilize compressed sensing features. The main idea behind adaptive compressive sensing is that the signal can be reconstructed using a sub-Nyquist sampling rate or if a band is sparse in nature. In this technique, wideband signals are recovered in blocks from various mini-slots, and the spectrum is reconstructed from compressed samples. To perform SS within a frequency band, a specific time slot from the complete periodic time is utilized for sensing, and remaining time slots are utilized for data transmission. Consider that a SU aims to detect the spectrum holes within a particular frequency band (from 0-W Hz). The complete time period T is considered, where from 0 to π sec used for SS and the remaining time period from π to T second reserved for data transmission. The continuous signal received at frontend of SU mainly consists of PU signal and noise signal as all SU keep quiet to follow medium access control (MAC) protocols during sensing time. If the sampling rate is fN over the observation time π, the discrete time samples are obtained as: JANU et al. 10 of 28 ( y(n) = yc n fN ) , n = 0, 1, 2, 3, 4.., N − 1 (11) where N = πfN is considered a natural number. 2.2.3 Covariance-based sensing technique Covariance-based sensing technique employs the covariance matrix of received signal and singular value decomposition (SVD) to detect the presence of PUs.28,29 Ideally, correlations of noise signals would be zero if the AWGN channel is assumed, and the correlations of received signals would be non-zero if calculated at different time instances. The PU signal can be easily differentiated from noise signals as they have zero correlation. A sample covariance matrix of the received signal can be evaluated, and the covariance matrix’s eigenvalues are determined by using SVD method.30,31 The sample covariance matrix can be calculated using the following expression: Ry (N) = L−2+Ns 1 ∑ Μ y(n)Μ yT (n) N n=L−1 (12) From the covariance, matrix eigenvalues can be calculated. A test statistic is calculated by taking the ratio between the maximum eigenvalue Emax to the minimum eigenvalue Emin of the covariance matrix. To decide between test hypotheses, the test statistics is compared with a threshold value πΎc . The test statistics can be calculated as: Tcov [y] = Emax Emin (13) If Tcov [y] < πΎc βΆ H0 is considered. else Tcov [y] < γc βΆ H1 is considered. where πΎc is a predetermined threshold value. 2.2.4 ML-based sensing techniques To make the spectrum resources available to all the users, there should be more intelligence involved in the CRN. To cater this requirement, the ML techniques have been used to solve various problems in CRN.32,33 ML can provide a Machine Learning Supervised Learning SVM, KNN, Naïve bayes, decision trees , linear regression, ANN, Bayesian Learning FIGURE 4 Summary of ML algorithms for CSS Unsupervised Learning K-Means, PCA, GMM, Fuzzy C-means, K-medoids , Genetic Algorithm Reinforcement Learning Markov Decision Process, Multi-agent learning JANU et al. 11 of 28 promising solution to this problem and can introduce intelligence to CR technology. ML algorithms can solve complex problems by analyzing and interpreting structures and patterns in data as they are capable of learning, reasoning, and decision making. Application of ML algorithms in CR networks increases perception capability and reconfigurability and automation level, and network flexibility. This section is aimed to provide an extensive survey of the state-of-the-art performance of the ML algorithm in the field of CSS. Figure 4 portrays the family tree of ML algorithms utilized in CSS. 3 MACHINE LEARNING FO R CSS ML algorithms provide systems the ability to learn from experiences while performing a particular task T, aiming to improve the task’s performance measured by performance metric P by using the experience E.34 ML-based algorithms are broadly classified into three categories supervised learning, unsupervised learning, and RL. In supervised learning techniques, a task is performed by learning from training data which are provided with corresponding labels or outputs. The goal is to derive an objective function based on the labeled training data to predict the correct output for future training samples. In contrast to supervised learning techniques, unsupervised learning exploits unlabeled training data aimed to extract hidden features in the input data. The RL technique is used when some information about the environment is known, and a decision-making agent learns and adapts to the environment. The agent’s main objective is to take appropriate actions and maximize the reward by interacting with the environment. ML algorithms are used for pattern classification, where a feature vector extracted from the pattern is fed into the classifier, which determines a particular class of the feature vector.14 SS process can be formulated as a binary classification problem that can be solved by a supervised or unsupervised ML algorithm, where the classifier has to determine between the availability and unavailability of the two possible states of the radio frequency channel. To detect the availability of the spectrum, the classifier makes use of energy vectors or probability vectors, also referred to as a feature vector in the context of ML. Figure 5 highlights an ML-based CSS framework. In this section, various studies and research papers have been reviewed related to ML’s application for SS in the CRN. FIGURE 5 ML-based CSS model JANU et al. 12 of 28 3.1 System architecture and paradigms Considering P number of PUs, where PUs are indexed as p = 1, 2, 3, … .P, and M number of SUs indexed as m = 1, 2, 3 … .M. The SUs are assumed to be located in a d-dimensional feature space. SUs can be equipped with a single antenna or multiple antennae. Each SU senses the channel and then reports to the FC, which makes the final decision about the availability of spectrum holes in the case of CSS. The FC estimates feature vector based on the sample values received from all SUs. For feature vector realization, different methodologies are described further in detail. If at least one PU is active, the channel is unavailable to access. The channel is considered available when all the PUs is inactive in transmission. The channel availability A can be defined as A= { } − 1, when at least one PU active (channel unavailable) 1, when all PUs are inactive (channel available) (14) Let Ym (i) is the ith signal sample observed by mth SU. Ym (i) received at the respective SU is summation of all the signals from all the PUs14 can be defined as: β§π (i) H0 βΆ When PU signal is absent β« βͺ βͺ Pm Ym (i) = β¨ ∑ β¬ βͺ hp,m Sp (i) + πm (i) H1 βΆ when PU signal is presentβͺ β β©p=1 (15) where hp,m represents the channel gain from the pth PU to mth SU, Sp (i) is the signal transmitted by the pth PU, πm (i) is the noise signal, and π represents the bandwidth of the signal. Various methods are used for feature realization such as principle component analysis (PCA) analysis to remove the noise, energy detection, covariance matrix realization, etc. To achieve high detection probability, it becomes necessary to select unique and relevant features. The FC constructs feature vector, then ML and deep learning (DL) models are employed for CSS. The feature vector can be realized as by using various preprocessing techniques such as PCA, energy detection-based methods. 3.2 3.2.1 ML techniques Supervised ML techniques In the case of supervised ML algorithms, the attributes of input and output datasets are labeled, and the objective of these algorithms is to build a model to map the input to the output. The supervised ML tasks can be further divided into a classification or regression tasks if the output is discrete or continuous. SS problem can be formulated as a classification problem as the output is a binary value representing channel availability. Assuming there are L number of training data points z = {z(1), z(2), z(3) … ..z(L)} with corresponding output labels in terms of channel availability or unavailability a = {a(1), a(2), a(3) … ..a(L)} are fed into the classifier for training as this method requires prior information related to channel availability for each training feature vector. Now when an unknown data point z(i) is presented to the trained model, the ML algorithm has to classify it to one of the class a(i), that is, channel available (a(i) = 1) or unavailable class (a(i) = −1). The supervised ML algorithms such as support vector machine (SVM), k-nearest neighbors (KNN), Bayesian networks, and artificial neural network (ANN) used in SS are introduced before discussing their CSS applications. Table 3 summarizes the use of a supervised ML algorithm for CSS along with the features used. The SVM algorithm is mainly used for classification tasks as the spectrum availability variable can only a discrete binary value. In the SVM algorithm, training vectors can be separated from a linear hyperplane. This algorithm aims to find a maximal margin hyperplane with the help of support vectors (training vectors nearest to the decision surface) to separate the training feature vectors into two classes. When training vectors are not linearly separable, different kernel methods are used for non-linear transformation of input feature vectors. Hence, mapping the training vectors to a higher dimensional feature space by a non-linear function makes the training feature vectors linearly separable. SVM algorithm with the linear kernel is easy to train for the input feature vectors which are linearly separable. JANU et al. 13 of 28 T A B L E 3 Supervised ML methods for CSS References Supervised ML technique Features 14 KNN, SVM-Linear, SVM-Polynomial Energy statistics 35 SVM, KNN, decision tree, Naïve Bayes classifier Energy statistics 36 SVM Probability vector 37 SVM with FFA, naive Bayesian classifier, decision trees, SVM, linear regression Spectrum status 38 Two-Phase SVM Energy statistics 39 Kernel-SVM Eigenvalues 40 SVM Derived from eigenvalues 41 Non-Parallel-Fuzzy SVM Energy statistics 42 Multi-class SVM Energy statistics 43 SVM Energy statistics 44 SVM Derived form received samples 45 Naïve Bayes Classifier Energy statistics 46 Bayesian Learning Energy statistics 47 Bayesian Learning Energy statistics 48 SVM, Logistic regression, K-NN, Gaussian Naive Bayes, and Decision Trees Power and SNR levels 49 SVM Energy statistics 50 ANN Energy statistics and Zhang statistic from the likelihood ratio statistic The KNN is also a classification algorithm that is based on the majority voting of nearest neighbors. In KNN, K refers to the number of nearest neighbors involved in the majority voting process. This algorithm is based on the feature vectors similarity, classifies a new test vector based on the class of its majority voting of neighbors. To determine a test vector class, the KNN classifier finds K number of nearest training vectors among the whole training set based on a specific distance measure. To find K number of neighbors, the test vector’s distance from all training vectors is calculated and then sorted in ascending order. The classifier then chooses the first K training vectors as nearest neighbors and calculates the number of neighbors corresponds to channel available class and channel, unavailable class. Whichever count is maximum, that particular class is assigned to the test vector. Artificial neural networks (ANNs) are very effective tools to solve complex real-world problems as they can adapt to signals’ non-linear characteristics. ANNs are constituted by a set of layers, including one input layer, one or more hidden layers, and one output layer. Both the hidden and output layers are composed of neurons. Hidden layers receive the input vector and output layer and calculate the system’s output through a non-linear activation function. The ANNs learn specific parameters of the complete network from the given training data set. The modern DL architectures such as deep neural network (DNN) and convolutional neural network (CNN) are derived from the ANNs. The decision tree (DT) classifier forms a binary tree based on the node error splitting rule to classify input data points. The DT classifier has two types of nodes; one is a decision node that makes decisions and has multiple branches, whereas the second node is a leaf node, which is the output of those decisions and represents class labels. Naïve Bayes classifier is an independent feature model where the classes are independent of each other, and the prior probabilities for the given classes are computed from the given training examples. To estimate the probability of a test data point belonging to a particular class, “posterior probability” is calculated using the Naïve Bayes assumption and Bayes rule. Bayesian learning, which has roots from Bayes theorem, formulate the knowledge about the situation probabilistically. The model specifies the prior probabilities of unknown parameters and based on the available examples, the posterior 14 of 28 JANU et al. probability distribution for the parameters is computed. These posterior distributions are further utilized for making decisions or conclusions about a new data point. Thilina et al14 proposed a CSS model in CRN using supervised learning techniques such as SVM, weighted KNN algorithm. The energy vector estimated at SUs is treated as a feature vector and fed to the classifiers to detect whether the channel is available or not. The SVM classifier with linear kernel for linear inputs provides better detection probability and requires lesser training and classification delay. Mikaeil et al35 proposed an ML-based algorithm for a data FC to train the model per frame in real-time and detect the channel availability. The performance of four supervised ML algorithms KNN, SVM, Naive Bayes, and DT were compared and concluded that KNN and DT classifiers outperform the other two classifiers in terms of accuracy. In Reference 36, a low-dimensional probability vector is treated as a feature vector along with SVM classifier to accomplish the task of CSS. Making use of low dimensional probability vector instead of N-dimensional energy vector, this method requires less training time, less classification time, and provides equal classification accuracy (CA). Azmat et al37 have presented a spectrum occupancy model in CRN using various ML algorithms such as DT, SVM, SVM with firefly algorithm, and linear regression to achieve higher CA where spectrum status is considered as an input feature vector. Ghazizadeh et al38 utilized two-phase SVM to enhance SVM performance in terms of detection probability and misclassification risk. The energy levels of PUs are treated as a feature vector to train the classifier. In Reference 39, sample covariance matrix eigenvalue estimation-based features and the SVM classifier are blended for SS in the multi-antenna CRs framework. Coluccia et al40 experimented in terms of features by mapping the received signal to high-dimensional feature space and further classified using SVM. The feature vector consists of decision statistics such as energy detection, maximum-minimum eigenvalues ratio, and their higher-order combinations. In Reference 41, a fuzzy SVM algorithm with non-parallel hyper-plane is utilized to address the performance issues of conventional parallel hyper-plane SVM algorithm degraded due to the noise uncertainty, which causes overlapping of different classes. To reduce the noise uncertainty effect, this approach introduces fuzziness in decision-making for SS with the Kernel shadow C-means algorithm’s help. The performance of different supervised classification algorithms, that is, SVM, Logistic regression, KNN, Gaussian Naives Bayes, and DT, were compared based on power and SNR features in Reference 48. Li et al49 tackled the issue of cooperation overhead through user grouping. The idea is to group the CR users based on energy data samples and the SVM model before their participation in the CSS process, which brings safety and reduces the redundancy in the network. In Reference 50, a hybrid SS scheme was presented, which uses a combination of classical energy detection and Likelihood Ratio Test statistics-based features to train the ANN. The above-discussed methods have attempted only binary classification problems, which are limited to temporal SS (detecting unused spectrum bands at a specific time). However, it is also essential to detect spectrum holes both at a particular time and geographic location in SS. In Reference 42, authors have addressed the joint spatio-temporal SS problem under multiple PUs scenarios. The binary classification problem was reformulated as multiple class signal detection problems, where each class further sub-categorized into one or more categories representing system states. Error correcting output codes (ECOC)-based multi-SVM algorithm is used for Spatio-temporal SS. The work was further extended in Reference 43, for multi-antenna SUs. Multi-state signal detection problem under multiple PUs scenario is solved by using multiple antennas with beamforming capabilities and ECOC-based multi-class SVM algorithms. The Beamforming-based preprocessing technique increases the received SNR in order to increase the detection capability of the SVM algorithm. A CSS model has been proposed based on multiple class hypotheses to maximize the throughput of secondary links.44 The problem can be solved using the SVM algorithm with linear, polynomial, and radial basis kernel. The multi-class hypothesis can be classified based on the quantized regions of sensed energy from the received signal and residual energy of the secondary node’s battery. In contrast to the above-mentioned multi-class problem, the hypothesis that PU is absent is further sub-divided based on sensed energy and residual energy of SU node. The input feature vector is composed of three statistical features extracted from the received PU signal and the percentage of residual energy as the fourth feature. The authors claim that this is the first work to select such type of features. SVM with polynomial kernel provides higher throughput and higher accuracy as compared to the conventional two-class hypothesis-based sensing problem using SVM. Tian et al45 have addressed the SS problem for the orthogonal frequency division multiplexing (OFDM) system, which is formulated as a multi-classification problem, adapting to new SNR variations. The average received signal power and cyclic-prefix-induced correlation are considered two-dimensional signal features and further a naïve Bayes classifier is trained to detect the spectrum occupancy. A novel class-reduction assisted prediction method is also introduced to reduce the SS time. The variational Bayesian learning-based CSS model was implemented in Reference 46 for a multi-antenna SU scenario realizing the energy vector as the features. This novel technique has the advantage of finding the number of JANU et al. 15 of 28 active PUs. The proposed sensing technique’s performance is quantified in terms of probability of detection vs probability of false alarm. Another Bayesian ML-based SS method to obtain the global spectrum states was proposed for the challenging scenario of a large-scale heterogeneous network.47 In the earlier SS techniques, the SUs were assumed to be static; however, this work exploited the mobility of SUs since the mobility of SU could increase the spatial-temporal diversity of the received signal and increase the sensing reliability and throughput. This framework is based on non-parametric Bayesian learning, referred to as beta process-hidden Markov model, which finds spatial-temporal correlation within the time series data acquired by SU to discover the total number of spectrum states. Bayesian inference is carried out to perform group sensing, and then based on the inference results, the proposed novel refinement mechanism identifies the availability of spectrum globally. Although the supervised techniques have shown promising results in detecting spectrum availability, various challenges are associated with these algorithms. One main concern is that they require labeled datasets, which is usually quite difficult in practice. Consequently, some researchers employ unsupervised algorithms to mitigate this problem. Besides, most of the above-discussed methods have been tested on simulated data. The performance of these algorithms on real-world data needs to be extensively analyzed. 3.2.2 Unsupervised ML techniques The unsupervised learning techniques do not require any prior information or labels corresponding to each training feature vector as supervised learning techniques. These algorithms extract patterns and similarity in the input data and build profound associations with the help of internalized heuristics. The main function of unsupervised learning is clustering, dimensionality reduction, and determining the distribution of data. A clustering algorithm classifies the input data so that objects with similar attributes are grouped, forming a cluster. The dimensionality reduction involves projecting a high-dimensional data to a low-dimensional space. The third category determines the distribution of data or estimates unknown parameters in the distribution. The following subsections briefly explain different unsupervised learning techniques and their usage in SS. Table 4 summarizes the unsupervised ML algorithms applied in CSS. K-means clustering: The process involves grouping data into different groups, where data within a particular group or cluster have certain similar features. K-means clustering is one of the most popular centroid-based clustering algorithms used to classify unlabeled data. A dataset of L samples having the form {xi }li=1 , where each sample is N-dimensional, is used for K-means clustering. Here K represents the number of clusters, and the word “means” represents an average of all data points in a particular cluster, that is, also referred to as centroid of the cluster. The clusters are formed such that the cost function, the sum of distances of sample data points in a particular cluster to the centroid, is minimized. K-means clustering algorithm has been extensively used in SS with a wide variety of features derived from received signal at SU such as Energy Vector,14,51 Eigen Values derived features52,53,55-57 and Probability vector.36 K-medoids clustering technique54,58 with the variant of K-means is also applied in the SS domain. To improve the sensing performance of the multiple antennae equipped SUs model, genetic algorithm clustering60 is used for decision statistics. Dimensionality reduction: Dimensionality reduction techniques are used for data compression, feature extraction without loss of useful information. The most popular dimensionality reduction technique is PCA and independent component analysis (ICA). PCA’s goal is to project data from high-dimensional space to lower dimensional space and extract meaningful information of data that preserves high variance in the data. PCA has been widely used for data preprocessing for extracting features. In contrast to PCA, ICA aims to preserve maximum statistical independence in the component. In Reference 52, the K-means clustering algorithm is exploited for CSS, where input feature vectors are extracted by PCA analysis. The maximum eigenvalue and trace of the eigenvalues obtained from the sample covariance matrix are selected as features. Compared to the energy feature vector, eigenvalue-based features are more robust to noise. Gaussian mixture model (GMM): GMM is a widely used clustering algorithm that uses multiple Gaussian distributions as parameter models according to the number of clusters. The expected maximum (EM) algorithm obtains the most optimal Gaussian distribution parameters by using the training samples. In SS, the PU signal’s presence and absence can be considered two different Gaussian distributions. In Reference 14, authors have utilized the GMM algorithm for temporal SS. Sobabe et al53 use the GMM algorithm, where the leading eigenvector and maximum and minimum eigenvalues are considered an input feature vector. Making use of more robust features leads to an increase in the computational JANU et al. 16 of 28 T A B L E 4 Unsupervised ML methods for CSS References Unsupervised ML technique Features 14 K-means clustering, Gaussian mixture model Energy detection 36 K-means clustering Probability vector 41 Kernel shadow C-means Energy detection 51 K-means clustering Energy vector 52 K-means clustering The largest eigenvalue and sum of all the eigenvalues are feature vector 53 K-means clustering and Gaussian mixture model Maximum eigenvector, eigenvalues 54 K-means and K-medoids clustering Derived from eigenvalues 55 K-means clustering Derived from eigenvalues 56 K-means clustering Empirical mode decomposition and wavelet transform, derived from eigenvalues 57 Fuzzy c-means clustering Derived from eigenvalues 58 K-medoids clustering Empirical mode decomposition, geodesic distance 59 Fuzzy C-means clustering Geodesic distance 60 Genetic algorithm clustering Preprocessed by PCA, Riemann distance 61 Gaussian mixture model Signal preprocessed by Wavelet transform, derived value from eigenvalues 62 Fuzzy C-Means clustering Energy detection 63 Optimal fuzzy c-means clustering Energy detection (cluster-based approach) 64 Kernel fuzzy c-means clustering Energy detection (cluster-based approach) complexity of the system model. Still, the performance of the CSS model outperforms as compared to using conventional energy-based features. SS performance is affected due to fading, shadowing, noise, and other interfering factors, reducing the impact of these factors, multiple antennae equipped SUs-based model is proposed under the k − π fading channel.61 Signals received by SUs are preprocessed by wavelet transform to remove noise from signals, and two-dimensional input features are formulated with the combination of difference between maximum and minimum eigenvalue (DMM) eigenvalue and ratio of maximum eigenvalue to matrix trace (RMET) eigenvalue. GMM technique is used for the classification of these signal features. Fuzzy C-means (FCM) clustering algorithm: FCM is a clustering algorithm mostly used for pattern recognition. Each data points assigned a certain degree of membership with respect to all clusters’ center, whose summation should be one. Kernel FCM enables mapping of inseparable energy data sensed at low SNR to a higher dimensional space where data can be more easily separated into clusters. In Reference 57, authors have used the FCM algorithm to classify signal features obtained from the covariance matrices’ eigenvalues. To improve the accuracy of extracted features, signals are preprocessed by the Null Space Pursuit algorithm and in-phase quadrature-phase (IQ) decomposition. Zhang et al59 have proposed a CSS model based on empirical mode decomposition (EMD) and information geometry theory, where the FCM clustering algorithm is used for the classification of features based on geodesic distances. As the performance of FCM degrades at low SNR, some variations in FCM63,64 are also applied to address these problems. Cluster-based CSS: To make the CR energy efficient, clusters-based CSS approach is utilized. A large number of SUs in cooperation sense the channel and report the information to the FC, so a considerable amount of energy is wasted. All SUs involved in CSS share their sensing information (FC); this situation leads to excessive computational overhead for FC. A cluster-based CSS approach is proposed to overcome this issue, which groups all the SUs in CRN into clusters. The cluster is formed using ML algorithms, and cluster heads (CHs) are chosen from each cluster. Some selected SUs from each cluster forward sensing information to CH, which forwards the sensed result to FC. Bhatti et al65 have proposed a cluster-based CSS model to increase the energy efficiency of the network, where clusters of SUs are formed using a JANU et al. 17 of 28 FCM clustering algorithm. Further, FCM was utilized for cluster formation of SUs, and the CHs are chosen based on the channel error detection technique by Bhatti et al.62 In Reference 66, a novel ML affinity propagation algorithm was proposed for cluster formation of SUs. The affinity propagation algorithm is based on message passing between users, and thus best delegated SUs are selected for cluster formation. Since the training feature vectors play a crucial role in building the classification models, it is essential to calculate accurate feature vectors; otherwise, inaccurate training models may lead to a wrong decision. 3.2.3 Reinforcement learning RL is useful in control systems to optimize decisions and in the case when some knowledge about an environment is known and an agent, which is the optimal decision-maker, learns from the environment and adapts to it by interacting with the surrounding environment. A finite state machine is generally used to model the environment with inputs such as action received from the agent, and outputs such as rewards and observations sent from environment to agent. The agent’s primary goal is to achieve optimum decisions by exploiting the known decision and exploring new decisions. In the RL-based model, an agent interacts with the environment and receives a response as the next state’s reward and the next action selection. Based on the reward received from the environment, the agent chooses the best action to maximize the reward. The reward is useful for analyzing the performance of the complete learning process. Figure 6 depicts the RL-based framework for CSS. Markov decision process is the most popular RL framework to model the SS problem. An Markov decision process (MDP) problem can be represented by four terms such as β¨S, A, T, Rβ©.Where S represents a set of finite states, and A represents a set of all actions taken by the agent, T represents state transition probability, and R represents the reward functions. Action is the decision made by an agent in each state. In this MDP model, the agent (FC) interacts with SUs present in the environment and requests for local decisions. After receiving the local decisions from selected SUs, the agent uses majority rule to find the final decision on the presence of PU. The agent receives a reward from neighboring SU for the next state, and based on the received reward agent, selects the next action. As the sequence of decisions to be made by an agent on selecting the cooperating SUs, this is the main reason for using MDP. The agent learns from the sequence of decisions received from selected SUs by exploring the unknown states and receiving the rewards from known states. The two major tasks of MDP and agent are a selection of action and calculation of reward. FIGURE 6 Reinforcement learning framework for CSS JANU et al. 18 of 28 Lo et al67 proposed a CSS model based on MDP to minimize the cooperation overhead problem and energy inefficiency. This approach finds an optimal set of cooperative SUs that minimize the reporting delay and improve the detection performance. In Reference 68, multi-agent RL-based SS framework has been proposed to discover more number of the available frequency spectrum as well as to achieve the desired diversity gain. This approach is based on the state action reward state (SARSA) algorithm and linear function approximation as it reduces the dimensionality of SS state-action space. In a multi-agent learning-based sensing policy, each SUs employs a reinforcement algorithm. This approach employs user collaboration to find more frequency bands simultaneously, and an action selection algorithm is utilized to select the number of SUs in collaboration. The work was extended in Reference 69, the multiuser multiband SS problem is formulated as a partially observable stochastic game and is solved using a multi-agent RL algorithm. This approach aims to maximize the number of available spectrum bands by addressing the trades off between the sensing reliability and discovering more frequency channels under the constraint of missed detection probability. It is proved that the proposed model of SS in a multi-agent scenario is computationally efficient and can be deployed in the networks with a large number of SUs and a set of a different frequency band. In contrast to single-agent RL, multi-agent RL addresses more challenges, such as non-stationary and coordination. In a CR ad-hoc network, to make the network energy efficient and to achieve higher throughput, Oksanen et al70 proposed a sensing model based on the RL. The sensing policy employs π-greedy method for selecting frequency bands to be sensed and for the selection of sensing assignments as well as this sensing policy guides the SU to concentrate the search of vacant spectrum to the frequencies that provide high data rates. Zhang et al71 proposed a distributed CSS model based on RL to solve data fusion data between users having different reputations. The main contribution of this work is to reduce the interference of malicious users by selecting honest users and improves the performance of the whole CRN to make it more intelligent and stable. By reasoning which ML algorithm has been used and which have not been used for SS, the provided categorization allows the researchers to identify potential research gaps and, thus, possible future research directions to explore. 3.3 Characterization of surveyed papers In this section, we have characterized each reviewed paper based on feature computation, types of SUs (SU devices equipped with a single antenna or multiple antennas) and the performance metrics used to evaluate the ML algorithm. 3.3.1 Feature computation In ML-based classification problems, feature computation and feature selection have significant importance. Choosing appropriate features improves the classifier’s performance in terms of CA, computational cost, and classification time. A feature selection scheme can be utilized to fulfill the requirement of selecting a better feature combination from the features pool. In this subsection, we have discussed various feature computation methodologies applied to the sensing signal received at SU. Energy detection-based features: The energy of the sensing signal received at SUs is calculated to construct a feature vector. Energy vector has been considered as a feature vector in various research papers.14,38,42,46,51 Energy features are realized using beamforming techniques, which increases the SNR of the received signal to enhance the detection capability of classifier-based learning algorithms.43 The SS model’s performance using the energy feature vector is degraded due to the overlapping of different classes caused by noise uncertainty. The energy vector can be realized as: Zn = πt ∑ |Yn (i)|2 (16) i=1 Z = (Z1 , Z2 … … … ..ZN )T , where energy vector Z can be treated as a feature vector. The energy vector-based feature computation is easy to implement though they are not robust to noise. Features derived from Eigenvalues: Various researchers use features derived from eigenvalues obtained from the sample covariance matrix of received sensing signals as they are more robust to noise uncertainty. In Reference 39, eigenvalues of the covariance matrix of the received signals at SUs are considered as a feature vector. Sobabe et al53 has JANU et al. 19 of 28 considered leading eigenvector, maximum and minimum eigenvalues as a feature vector for implementing the unsupervised ML algorithm. To obtain the eigenvalues of the covariance matrix of the received signal can be determined from Equation (5). Let us consider eigenvalues of the covariance matrix πmax > π2 > π3 > π4 > πmin , where πmax be the maximum eigenvalue and πmin minimum eigenvalue. The derived features from eigenvalues TDMM , TMME , TRMME , and TMSE can be calculated from the equations as below. One of the feature TDMM is calculated as difference between maximum and minimum eigenvalue, and another one TMME is calculated as ratio of maximum eigenvalue to minimum eigenvalue. TDMM = πmax − πmin (17) TMME = πmax βπmin (18) TRMET = πmax βtrace of covariance matrix (19) TMSE = πmax − (Sum of all numer of eigenvalueβtotal eigenvalues) (20) TRMET is calculated as the ratio of maximum eigenvalue and trace of the covariance matrix and TMSE is calculated as the difference between maximum eigenvalue and mean of eigenvalues. Further, the feature vector can be derived from the computed eigenvalue-based features pool T = {TMME , TDMM , TRMET , TMSE } such as a two-dimensional vector or three-dimensional vector can be defined as {TMME , TDMM }, {TRMET , TMSE , TMME } respectively. Various preprocessing methods are implemented to reduce the noise and redundant signals from the sensed signal, such as PCA analysis and EMD. In Reference 52, PCA is applied as a preprocessing step, and further, the maximum eigenvalue and trace of the eigenvalues were obtained from the sample covariance matrix to compute the features. The two-dimensional input feature vector is obtained from I and Q decomposition of the received signal. Zhang et al54 used features derived from eigenvalues such as RMET, MME, and MSE, as defined above. In Reference 55 also, the features derived from eigenvalues such as MME, MSE, and RMET were used to train the K-mean clustering algorithm for SS. Wang et al56 have applied the EMD method and wavelet transform method on the sensing signal to remove the noise component and other interfering signals and further calculated feature vectors as DMM and DMEAE. In Reference 57, the received signals are preprocessed by Null Space Pursuit algorithm and IQ decomposition to extract the two- dimensional features, and eigenvalues of covariance matrices are considered as a feature vector. Zhang et al61 applied a preprocessing technique based on wavelet transform to remove noise from sensed signals and computes a two-dimensional input features composed of DMM, RMET. The features derived from eigenvalues are more robust to noise. However, acquiring more robust features leads to an increase in the computational complexity of the system model, but the CSS model’s performance is better compared to using energy-based features. Information geometry theory-based features: In the information geometry theory, geodesic distance is the most popularly used method to measure the distance between two probability distributions. This distance can be measured between two probability distributions on a statistical manifold. The distance between the two points on manifold curvature depends on the curve’s choice between these points. A curve that minimizes the distance between two points can be defined as a geodesic, and the related distance is called geodesic distance. Consider any two points π1 and π2 on the curve and the curve between two points can be defined as π(t) (t1 ≤ t ≤ t2 ). The geodesic distance between two points can be defined in Reference 72 as D(π1 , π2 ) β t2 ∫t1 √ ( dπ dt )T G(π) dπ dt dt (21) where G(π) is the Fisher information matrix in geometry theory for statistical matrix, and it can be expressed as [ G(π) = E π ln p(x|π) π ln p(x|π) ⋅ ππi ππj ] (22) where, p(x|π) is the probability density function considered in information geometry theory, x denotes the n dimensional sample related to a random variable, and π denotes the parameter vector. In the context of CR technology, the SS signal JANU et al. 20 of 28 matrix is a multivariate Gaussian distribution family with different covariance matrices and the same mean.73 Assuming the two covariance matrices R1 and R2 , geodesic distances between them is calculated in Reference 74 as √ 1 −1β2 −1β2 tr log2 (R1 R2 R1 ) 2 √ √ n √1∑ log2 πu D(R1 , R2 ) = √ 2 u=1 D(R1 , R2 ) = −1β2 (23) (24) −1β2 where, πu denotes the uth eigenvalue of the matrix R1 R2 R1 . In the context of CSS, decomposition and recombination (DAR) is applied to the signal matrix perceived at SU to increase the matrix dimensions. The DAR can be divided into O-DAR and I-DAR. In the process of applying the O-DAR and I-DAR on the signal vector perceived by the SU, corresponding high-dimensional matrices YO−DAR and YI−DAR and covariance matrices RO and RI can be calculated. ( ) 1 T RO = YO−DAR YO−DAR (25) s ( ) 1 T (26) YI−DAR YI−DAR RI = s Here s is the length of the split signal vector after splitting. Now, when the SUs observe K environmental noise matrices, the corresponding covariance matrices ROk (k = 1, 2 … … K) and RIk (k = 1, 2 … … K) are calculated after processing the noise matrix using O-DAR and I-DAR. Further, the Riemann mean objective functions are defined as: K ( )∑ 1 D(ROk , R−O ) K k=1 K ( )∑ 1 D(RIk , R−I ) Ψ(R−I ) = K k=1 Ψ(R−O ) = (27) (28) R−O and R−I are the Riemann Mean of RO and RI (minimizes the Ψ(.)) and D(−, −) is the geodesic distance between two points on the manifold. For the feature extraction, R−O and R−I are considered as reference points and the geodesic distances from RO to R−O and RI to R−I are calculated individually. d1 = D(RO , R−O ) (29) d2 = D(RI , R−I ) (30) These geodesic distances represent the distances between the sensing signals and the reference points on the manifold in the context of information geometry. Further, a feature vector G = [d1 , d2 ] can be constructed for the training of the ML algorithm. Geodesic distance-based features have been utilized in a few research works.58,59 These methods applied preprocessing on the sensed signal using the EMD method, and the signal detection problem is transformed into geometric problems by applying information geometry theory. In Reference 60, the authors proposed an information fusion-based method to avoid complicated matrix decomposition algorithms, as discussed above. Instead of geodesic distance, the Riemann distance is used to extract the signal feature vector. In the information geometry context, the geodesic is considered as the shortest curve between two points on the manifold; however, the Riemann distance is the length of the geodesic, which can be used to analyze the similarity between distributions. The Riemann distance is calculated as: −1β2 D2 (R1 , R2 ) = ||log2 (Rc −1β2 Rd Rc )|| 2 = Tr[log (R−1 c Rd )] = n ∑ log2 πu u=1 (31) JANU et al. 21 of 28 where, ||.|| denotes the Frobenius norm, Rc and Rd represent the coordinate points on the statistical manifold, Tr[.] denotes −1β2 −1β2 trace of the matrix and πu denotes the uth eigenvalue of the matrix Rc Rd Rc . For the SS scenario, R1 and R2 can be considered as the covariance matrices of the divided signal matrix collected at SUs corresponding to authorized channel and Rw1 and Rw2 can be considered as the corresponding covariance matrices in the absence of PU signal. Further, the Riemann distances can be calculated using the reference points dr1 = D2 (Rw1 , R1 ) (32) dr2 = D2 (Rw2 , R2 ) (33) Rw1 and Rw2 are the corresponding Riemann mean and considered as a reference point. The two-dimensional signal feature vector T = [dr1 , dr2 ] can be used to train the models. Other features: Few researchers have utilized other attributes of the sensing signals received at SU as feature vectors. Azmat et al37 have used spectrum status vector of all the considered time slots and frequency bins in the CR network. In Reference 36, a low dimensional probability vector derived from the high dimensional energy vector has been considered a feature vector for training the SVM classifier and K-means clustering algorithm. Basumatary et al48 have treated the Power and SNR levels measured at the testbed’s independent CR device as a feature vector. 3.3.2 Performance evaluation metrics The performance of the ML algorithm for SS can be evaluated using different performance metrics. The most popular metrics for assessing the performance of the ML models are discussed below. a) Classification accuracy is defined as the ratio of correct predictions to the total predictions made by the model. CA can be evaluated at different SNR levels, and the graph between CA and SNR can be analyzed for performance evaluation. CA is defined as: CA = Nc βNt × 100% (34) where Nc denotes correctly predicted states and Nt denotes the total number of states. b) Receiver operating characteristic curve is a plot between the probability of detection against the probability of false alarm. The receiver operating characteristic curve portrays the primary signal detection performance of the algorithm. c) Average throughput is defined as the amount of data that can be transmitted and received at the destination successfully within a timeframe. It is measured in megabits per second or bits per second. d) Energy efficiency is defined as the ratio of average throughput of the network to the total energy dissipated by the whole network. π= π½ E (35) where π½ represents the average throughput of the system and E represents total energy dissipated from the network. e) Probability of detection and probability of false alarm: Probability of detection is defined as the probability of a SU that declares the presence of a PU when the spectrum is actually occupied by the PUs. The probability of false alarm is defined as the probability of a SU that declares the presence of PU when the spectrum is detected free. The formula for the probability of detection and probability of false alarm is given in Equations (3) and (4). Table 5 summarizes the usage of different evaluation metrics in the reviewed research papers. 3.3.3 Type of secondary user The surveyed research papers are characterized into two categories based on the types of SUs considered in the CRN. The first category belongs to SU with a single antenna and the second category involves SU equipped with multiple antennas. JANU et al. 22 of 28 T A B L E 5 Performance evaluation metrics References Performance evaluation metrics 14,36 Average training time, average classification delay, ROC, detection probability 35,40,41,47,51,54,59-61 ROC curve 37 Classification accuracy 38 ROC curve, detection probability, average misclassification error rate 39 Probability of detection 42 ROC curve, classification accuracy 43 ROC curve, classification accuracy, probability of detection, probability of false alarm 44 Accuracy, average throughput 45 ROC curve, probability of detection, average prediction time vs, SS error rate 46 Probability of detection and probability of false alarm 48 Training time, classification time 52,53 ROC curve, Probability of detection 62 Average throughput, energy efficiency 63,64 ROC curve, probability of detection, average energy consumption, average energy consumption, probability of detection 65 Energy efficiency, throughput, energy consumption 66 Average throughput, efficiency 67 Cumulative reward, ROC curve, probability of detection, and probability of false alarm 68 Number of frequency bands sensed 70 Probability of detection, probability of missed detection, expected throughput 71 ROC curve, detection time Authors have concluded that SU equipped with multiple antennas improves the SS performance by overcoming the problems such as path losses and shadows, which is predominantly found in SU equipped with a single antenna. Further, multiple antennas improve spatial diversity and spatial multiplexing gain and hence enhance the SS performance. In References 39,43,46,60,61, the CSS problem is solved using multiple antennae equipped SU; however, in the rest of the ML-based methods reported in the reviewed papers, single antenna-based SUs have been used for SS. 4 DY NA MIC SPECTRUM SHARING In the case of static spectrum sharing technique, spectrum has to be divided in advance and a fixed spectrum band is shared among the users statically, which leads to inefficient utilization of spectrum. Hence, for the efficient utilization of spectrum bands in 5G technology, spectrum sharing has to be addressed to overcome the spectrum shortage issues. The 5G networking technology has been considered as a new communication network that is capable of providing higher data rates and higher-capacity of data transmission. It is flexible to support various devices and services, which makes it possible to combine with IoT. 5G network has the capability to enable wide range of machine type applications and hence leads to development of IoT. In future, IoT technology enabled with 5G can provide various kinds of IoT services with low power consumption, low cost, big data, wide coverage and large connection, which supports mobile designs to support massive IoT with variety of services requirements and enhanced broadband communications. Hence, the major challenge in these new emerging technologies is the requirement of spectrum and bandwidth resources. Now, it becomes necessary to dynamically share the available spectrum to fulfill the resource demand. DSS allows network equipment to share the spectrum amongst the users connected JANU et al. 23 of 28 to long-term evolution and 5G communication technologies. Cognitive IoT combined with CR, which is considered as an effective spectrum access technology for DSS, provides robust operation in a highly dynamic spectrum environment. A multichannel IoT has been proposed in Reference 75 to share the 5G communications dynamically, where IoT nodes are designed in a way to perform 5G communication and IoT communication simultaneously. 4.1 Classification of spectrum sharing techniques Spectrum sharing techniques are classified into three main approaches: network architecture, spectrum access and spectrum allocation behavior. A schematic diagram for the classification of various spectrum sharing techniques is shown in Figure 7. Network architecture categorized into centralized (infrastructure-oriented) and distributed (infrastructure-less). This technique includes distributed sensing devices, base station (BS) and software defined network controller in the system. By using this method incorrect decisions influenced by inconsistent quality of service can be reduced. Spectrum access sharing (SAS) is an emerging method nowadays gaining attention classified into three techniques: dynamic exclusive, open access, and hierarchical model. Dynamically advanced SAS method consists of a common model, shared use, and exclusive use. The open access model can be used any service as it provides unrestricted access of spectrum. Hierarchical model includes interweave, underlay, and overlay mechanism for spectrum sharing. In underlay and overlay spectrum sharing schemes, SU can access the spectrum along with PU transmission by maintaining the constraint that it will not degrade the performance and protect the PU from interference by keeping SU power below than predetermined threshold. Whereas, in interweave scenario, the SU can dynamically access the spectrum when PU is in idle state. Spectrum allocation behavior methods are categorized into cooperative and non-cooperative. 4.1.1 Cooperative spectrum sharing Advanced spectrum sharing techniques can increase the energy efficiency and spectral efficiency from the cost effective perspective. Yang et al76 has performed a survey on cognitive and cooperative spectrum sharing techniques to increase the spectral efficiency and energy efficiency. They have discussed cooperative spectrum sharing methods from two perspectives: economic perspective, which mainly focus on spectrum trading and spectrum leasing and cross layer perspective, which emphasizes on spectrum mobility, routing, relaying, and harvesting. To improve the spectrum utilization, a novel economic tool size-negotiable auction mechanism-based spectrum sharing framework has developed in Reference 77, which aimed at providing the solution between auction and FIGURE 7 Classification of spectrum sharing techniques JANU et al. 24 of 28 negotiation for multiple buyers sharing a common spectrum band. MAC protocol plays important role in spectrum sharing. A distance-dependent multichannel MAC protocol78 is utilized to increase CRN throughput, as it uses a probabilistic channel assignment mechanism that exploits the dependence between the transmission distance and signal’s attenuation model, and allows multiple SUs to make more concurrent transmission on a channel under moderate and high traffic loads. Although, there are numerous spectrum sharing techniques utilized have been studied in order to enhance the spectral efficiency and energy efficiency of future wireless network. The efficient utilization of spectrum achieved by DSS can be further increased by utilizing the full duplex (FD) communication technology. Due to its advantages including doubling the capacity, increased secondary throughput and increased spectrum utilization efficiency, this technology is considered to be candidate for the 5G wireless network and beyond the wireless communication networks. In Reference 79, various FD-enabled DSS techniques and recent advancement are studied and a communication framework is proposed by considering a power control-based self-interference mitigation technique in order to make possible the concurrent sensing and transmission in DSS systems, which improves the system throughput of the network. 4.2 DSS using machine learning The above discussed works are less energy efficient for allocation of spectrum resources as the learning capability of CR devices has not been explored for determining the optimal strategies. Hence, to obtain the potential solution for energy efficient resource allocation, machine learning integrated with CR can be seen as a potential solution. Machine learning is an important research topic for next generation wireless communication and has been widely used in SS and spectrum sharing. To satisfy the quality of service requirement from multiple users, the resource allocation problem is addressed in Reference 80. An efficient heuristic algorithm is developed by considering multiple resource dimensions (frequency, power consumption, and antenna direction) for the optimization as well as multiple objectives also simultaneously optimized. Three algorithms such as convex relaxation with gradual removal, convex relaxation with tree pruning (CRTP), and GA, where GA has shown promising results approaching CRTP with the less computational effort by comparing with two other techniques. RL is considered as most suitable approach for obtaining optimal solution as it does not form any extensive model and it learns with interaction with environment. In multi-agent RL, multiple agent access channel state information from environment and find optimal policy with cooperative or non-cooperative manner. In Reference 81, a model free multi-agent RL approach based on Q-learning and SARSA algorithms with cooperative framework have been utilized to determine the optimal strategy for resource allocation in energy efficient manner in complex dynamic environment, which reduces the channel state interference and maximizes the channel capacity of CR network while maintaining the users’ quality of service. An another solution to DSS is the application of heuristically accelerated RL approach which speed up the RL algorithm in multi-agent domain by guiding the learning process of cognitive cellular system by using additional heuristic information.82 To solve the continuous and high dimensional problem, such as power control for spectrum sharing, a DNN concept along with RL is introduced as the traditional RL approaches are not capable to solve these problems. A deep RL-based power control method is developed for spectrum sharing in scenario where PU and SU work in non-cooperative manner and SU does not know PU’s transmit power, which a SU can intelligently adjust its transmit power such that is can share the spectrum with PU with desired quality of service requirement.83 The power control problem is addressed in Reference 84 by constructing an information interaction model among PU, SU and wireless sensors and investigating with the application of deep RL methods such A3C and distributed proximal policy optimization, etc. Another work for spectrum management and resource allocation is presented by exploring the deep learning (DL) algorithms. Generative model conventional variational autoencoder (CVAE) is utilized by solving linear sum assignment problem for enabling spectrum sharing underlay device to device communication.85 Performance is evaluated for CVAE by exploring three different architectures such as CNN architecture auto-encoder, feed-forward neural network auto-encoder and hybrid auto-encoder. Jacob et al86 present a new spectrum sharing technique using 5G enabled bidirectional cognitive deep learning nodes (BCDLN) along with DSS long short term memory (DSLSTM), where the BCDLN self-learning nodes forward the information to other cognitive node at a constant spectrum sharing targets and cooperate to each other with DSLSTM. The dynamic spectrum allocation is performed with CP-OFDM. In Reference 87, spectrum resource allocation problem is JANU et al. 25 of 28 considered as decentralized contention-based spectrum access for BS and solved by incorporating its features to a distributed RL framework. This problem is formulated as a decentralized partially observable MDP with reward structure that provides long term proportional fairness in terms of throughput and a two-stage MDP is introduced, which uses SS decision and reception quality information to make a medium access decision. A SS and spectrum allocation framework is proposed for CR-enabled IoT cellular network consisting of SU devices as IoT objects and multiple PU base-stations,88 where SU devices discover the spectrum holes in multiband PU spectrum by using SVM and forward spectrum allocation request to intelligent FC. This request is processed through a multi-dimensional feature set, where the SVM classifier assigns vacant spectrum PU band to the SU-IoT device which need for spectrum access in the CR-IoT framework. Spectrum allocation issue is addressed by considering the two objectives such as network capacity and spectrum efficiency has been modeled as a multi-objective optimization problem in CR network.89 To handle these conflicting objectives non-dominated sorting genetic algorithm-II which combines features of evolutionary algorithms and machine learning based on non-dominated sorting genetic algorithm RL has incorporated. 5 CO N C LUSION AN D FU T U RE S CO PE It is well-known that billions of wireless communicating devices have connected to the Internet, so it has become a big challenge for users to access the radio spectrum. CR technology has become a promising solution to address the issues related to spectrum scarcity. The CR technology and ML provide the ability to automatically learn and adapt to the environment and have the potential to dynamically access the spectrum and to utilize spectrum resources. Considering the popularity of ML in the CSS domain, we have presented an exclusive survey of relevant literature. We characterize the surveyed papers based on the information about the sensing signal they use (ie, the features), what ML techniques they apply and what evaluation metrics they use to check the performance of the algorithm. In this survey, we summarized the most up-to-date ML-based SS methods, along with their limitations and advantages. We hope that the outcomes of this survey can bring new directions and methodologies to enhance the research in the CSS domain. DL is becoming the most popular research direction as it can automatically capture complex and hierarchically organized features and outperform the ML-based methods with carefully hand-crafted features. This manual feature extraction process is time inefficient and requires retraining of the network with new sets of data collection whenever the inputs feature changes. DL has shown great success in various fields such as computer vision, natural language processing, signal detection, and resource allocation in wireless communication. In practical scenarios, GPU-enabled computing is becoming more and more popular, a larger volume of complex data is available, hence more effective training algorithms are developed. DL has become a promising research direction for building an intelligent wireless communication system as it plays a major role in predictive analysis. Although researchers have explored neural network architectures in SS, spectrum sharing, and resource allocation, still there is a need to improve spectral efficiency and energy efficiency, hence it becomes necessary to develop the DL algorithms for building the intelligent wireless communication systems. DATA AVAILABILITY STATEMENT Data sharing not applicable to this article as no datasets were generated or analyzed during the current study. ORCID Kuldeep Singh https://orcid.org/0000-0002-2350-1700 Sandeep Kumar https://orcid.org/0000-0002-5750-6112 REFERENCES 1. Cisco. Cisco annual Internet report (2018–2023) white paper; 2020. 2. Rasethuntsa TR, Kumar S. An integrated performance evaluation of ED-based spectrum sensing over α−κ−μ and α−κ−μ-extreme fading channels. Trans Emerg Telecommun Technol. 2019;30(5):e3569. 3. Yadav P, Kumar S, Kumar R. A comprehensive survey of physical layer security over fading channels: classifications, applications, and challenges. Trans Emerg Telecommun Technol. 2021;e4270. https://doi.org/10.1002/ett.4270 4. Cabric D, Mishra SM, Brodersen RW. Implementation issues in spectrum sensing for cognitive radios. Conference Record of the Thirty-Eighth Asilomar Conference on Signals, Systems and Computers. Pacific Grove, CA; IEEE: 2004. 5. Kaur M, Yadav RK. Performance analysis of Beaulieu-Xie fading channel with MRC diversity reception. Trans Emerg Telecommun Technol. 2020;31(7):e3949. 26 of 28 JANU et al. 6. Sahai A, Tandra R, Mishra SM, Hoven N. Fundamental design tradeoffs in cognitive radio systems. Paper presented at: Proc. of TAPAS’06; 2006; Boston, MA. 7. Kumar S, Chauhan PS, Raghuwanshi P, Kaur M. ED performance over α-η-μ/IG and α-κ-μ/IG generalized fading channels with diversity reception and cooperative sensing: a unified approach. AEU Int J Electron Commun. 2018;97:273-279. 8. Kumar S, Kaur M, Singh NK, Singh K, Chauhan PS. Energy detection based spectrum sensing for gamma shadowed α–η–μ and α–κ–μ fading channels. AEU Int J Electron Commun. 2018;93:26-31. 9. Zhou X, Sun M, Li G, Juang B-H. Machine learning and cognitive technology for intelligent wireless networks; 2017. ArXiv, vol. abs/1710.11240. 10. Alshawaqfeh M, Wang X, Ekti AR, Shakir MZ, Qaraqe K, Serpedin E. A survey of machine learning algorithms and their applications in cognitive radio. Cognitive Radio Oriented Wireless Networks. Doha, Qatar: CrownCom; 2015. 11. Bkassiny M, Li Y, Jayaweera SK. A survey on machine-learning techniques in cognitive radios. IEEE Commun Surv Tutor. 2013;15(3):1136-1159. 12. Zhou X, Sun M, Li GY, Juang BF. Intelligent wireless communications enabled by cognitive radio and machine learning. China Commun. 2018;15(12):16-48. 13. Arjoune Y, Kaabouch N. A comprehensive survey on spectrum sensing in cognitive radio networks: recent advances new challenges and future research directions. Sensors. 2019;19(1):126. 14. Thilina KM, Choi KW, Saquib N, Hossain E. Machine learning techniques for cooperative spectrum sensing in cognitive radio networks. IEEE J Selected Areas Commun. 2013;31(11):2209-2221. 15. A. Ranjan, and B. Singh, εDesign and analysis of spectrum sensing in cognitive radio based on energy detection,ε in International Conference on Signal and Information Processing (IConSIP), Vishnupuri, India, IEEE: 2016. 16. Alom MZ, Godder TK, Morshed MN, Maali A. Enhanced spectrum sensing based on energy detection in cognitive radio network using adaptive threshold. International Conference on Networking, Systems and Security. Dhaka, Bangladesh; IEEE: 2017. 17. Kumar S. Energy detection in Hoyt/gamma fading channel with micro-diversity reception. Wirel Pers Commun. 2018;101(2):723-734. 18. Kumar S, Soni SK, Jain P. Performance of MRC receiver over Hoyt-lognormal composite fading channel. Int J Electron. 2018;105(9):1433-1450. 19. Kumar S. Performance of ED based spectrum sensing over α–η–μ fading channel. Wirel Pers Commun. 2018;100(4):1845-1857. 20. Mahapatra R, Krusheel M. Cyclostationary detection for cognitive radio with multiple receivers. Proceedings of the 2008 IEEE International Symposium on Wireless Communication Systems, ISWCS’08. Reykjavik, Iceland; IEEE: 2008. 21. Ilyas I, Paul S, Rahman A, Kundu RK. Comparative evaluation of cyclostationary detection based cognitive spectrum sensing. In Proceedings of the Ubiquitous Computing, Electronics, and Mobile Communication Conference. New York, NY; IEEE: 2016. 22. Salahdine F, Ghazi HE, Kaabouch N, Fihri WF. Matched filter detection with dynamic threshold for cognitive radio networks. International Conference on Wireless Networks and Mobile Communications (WINCOM). Marrakech, Moracco; IEEE: 2015. 23. Zhang X, Chai R, Gao F. Matched filter based spectrum sensing and power level detection for cognitive radio network. IEEE Global Conference on Signal and Information Processing (GlobalSIP). Atlanta, GA; IEEE: 2014. 24. Tang H. Some physical layer issues of wide-band cognitive radio systems. First IEEE International Symposium on New Frontiers in Dynamic Spectrum Access Networks. Baltimore, MD; IEEE: 2005. 25. Salahdine F, Kaabouch N, Ghazi HE. A survey on compressive sensing techniques for cognitive radio networks. Phys Commun. 2016;20:61-73. 26. An T, Song I, Lee S, Min H. Detection of signals with observations in multiple subbands: a scheme of wideband Spectrum sensing for cognitive radio with multiple antennas. IEEE Trans Wirel Commun. 2014;13(12):6968-6981. 27. Sun H, Chiu W, Nallanathan A. Adaptive compressive spectrum sensing for wideband cognitive radios. IEEE Commun Lett. 2012;16(11):1812-1815. 28. Jin M, Guo Q, Li Y. On covariance matrix based Spectrum sensing over frequency-selective channels. IEEE Access. 2018;6:29532-29540. 29. Zeng Y, Liang Y. Spectrum-sensing algorithms for cognitive radio based on statistical covariances. IEEE Trans Veh Technol. 2009;58(4):1804-1815. 30. Zeng Y, Liang Y. Maximum-minimum eigenvalue detection for cognitive radio. International Symposium on Personal, Indoor and Mobile Radio Communications. Athens, Greece; IEEE: 2007. 31. Zeng Y, Liang Y. Covariance based signal detections for cognitive radio. 2007 2nd IEEE International Symposium on New Frontiers in Dynamic Spectrum Access Networks. Dublin; IEEE: 2007. 32. Alsheikh MA, Lin S, Niyato D, Tan H. Machine learning in wireless sensor networks: algorithms, strategies, and applications. IEEE Commun Surv Tutor. 2014;16(4):1996-2018. 33. Jiang C, Zhang H, Ren Y, Han Z, Chen K, Hanzo L. Machine learning paradigms for next-generation wireless networks. IEEE Wirel Commun. 2017;24(2):98-105. 34. Bishop CM. Pattern Recognition and Machine Learning. Germany: Springer; 2006. 35. Mikaeil AM, Guo B, Wang Z. Machine learning to data fusion approach for cooperative Spectrum sensing. International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery. Shanghai, China; IEEE: 2014. 36. Lu Y, Zhu P, Wang D, Fattouche M. Machine learning techniques with probability vector for cooperative spectrum sensing in cognitive radio networks. 2016 IEEE Wireless Communications and Networking Conference. Doha, Qatar; IEEE: 2016. 37. Azmat F, Chen Y, Stocks N. Analysis of Spectrum occupancy using machine learning algorithms. IEEE Trans Veh Technol. 2016;65(9):6853-6860. JANU et al. 27 of 28 38. Ghazizadeh E, Abbasi D, Nezamabadi-pour H. An enhanced two-phase SVM algorithm for cooperative spectrum sensing in cognitive radio networks. Int J Commun Syst. 2018;32(2):e3856. 39. O. P. Awe, Z. Zhu, S. Lambotharan, εEigenvalue and support vector machine techniques for spectrum sensing in cognitive radio networks,ε in Conference on Technologies and Applications of Artificial Intelligence, Taipei, Taiwan, IEEE: 2013. 40. Coluccia A, Fascista A, Ricci G. Spectrum sensing by higher-order SVM-based detection. 27th European Signal Processing Conference (EUSIPCO). A Coruna, Spain; IEEE: 2019. 41. Huang Y-D, Liang Y-C, Yang G. A fuzzy support vector machine algorithm for cooperative Spectrum sensing with noise uncertainty. Global Communications Conference (GLOBECOM). Washington, DC; IEEE: 2016. 42. Awe OP, Lambotharan S. Cooperative spectrum sensing in cognitive radio networks using multi-class support vector machine algorithms. 9th International Conference on Signal Processing and Communication Systems (ICSPCS). Cairns, QLD; IEEE: 2015. 43. Awe OP, Deligiannis A, Lambotharan S. Spatio-temporal Spectrum sensing in cognitive radio networks using Beamformer-aided SVM algorithms. IEEE Access. 2018;6:25377-25388. 44. Jan SU, Vu V-H, Koo I. Throughput maximization using an SVM for multi-class hypothesis-based spectrum sensing in cognitive radio. Appl Sci. 2018;8(3):421. 45. Tian J, Cheng P, Chen Z, et al. A machine learning-enabled Spectrum sensing method for OFDM systems. IEEE Trans Veh Technol. 2019;68(11):11374-11378. 46. Awe OP, Naqvi SM, Lambotharan S. Variational Bayesian learning technique for spectrum sensing in cognitive radio networks. IEEE Global Conference on Signal and Information Processing (GlobalSIP). Atlanta, GA; IEEE: 2014. 47. Xu Y, Cheng P, Chen Z, Li Y, Vucetic B. Mobile collaborative spectrum sensing for heterogeneous networks: a Bayesian machine learning approach. IEEE Trans Signal Process. 2018;66(21):5634-5647. 48. Basumatary N, Sarma N, Nath B. Applying classification methods for spectrum sensing in cognitive radio networks: An empirical study. Advances in Electronics, Communication and Computing. Singapore: Springer; 2017:85-92. 49. Li Z, Wu W, Liu X, Qi P. Improved cooperative spectrum sensing model based on machine learning for cognitive radio networks. IET Commun. 2018;12(19):2485-2492. 50. Vyas MR, Patel DK, Lopez-Benitez M. Artificial neural network based hybrid spectrum sensing scheme for cognitive radio. IEEE 28th Annual International Symposium on Personal, Indoor, and Mobile Radio Communications (PIMRC). Montreal, QC: IEEE: 2017. 51. Kumar V, Kandpal DC, Jain M, Gangopadhyay R, Debnath S. K-mean clustering based cooperative spectrum sensing in generalized k-μ fading channels. 2016 Twenty Second National Conference on Communication (NCC). Guwahati, India; IEEE: 2016. 52. Chen X, Hou F, Huang H, Jing X. Principle component analysis based cooperative spectrum sensing in cognitive radio. 16th International Symposium on Communications and Information Technologies (ISCIT). Qingdao, China; IEEE: 2016. 53. Sobabe GC, Song Y, Bai X, Guo B. A cooperative spectrum sensing algorithm based on unsupervised learning. 10th International Congress on Image and Signal Processing. Shanghai, China: BioMedical Engineering and Informatics (CISP-BMEI); 2017. 54. Zhang Y, Wan P, Zhang S, Wang Y, Li N. A spectrum sensing method based on signal feature and clustering algorithm in cognitive wireless multimedia sensor networks. Adv Multimedia. 2017;2017:1–10. 55. Wang Y, Zhang Y, Zhang S, Li X, Wan P. A cooperative spectrum sensing method based on a feature and clustering algorithm. Chinese Automation Congress. Xi’an, China; IEEE: 2018. 56. Wang Y, Zhang Y, Wan P, Zhang S, Yang J. A spectrum sensing method based on empirical mode decomposition and K-means clustering algorithm. Wirel Commun Mobile Comput. 2018;2018(1):1-10. 57. Zhang Y, Wang Y, Wan P, Zhang S, Li N. A spectrum sensing method based on null space pursuit algorithm and FCM clustering algorithm. 4th International Conference. Haikou, China: ICCCS; 2018. 58. Wang Y, Zhang S, Zhang Y, Wan P, Li J. A cooperative spectrum sensing method based on empirical mode decomposition and information geometry in complex electromagnetic environment. Complexity. 2019;2019:1-13. 59. Zhang S, Wang Y, Li J, Wan P, Zhang Y, Li N. A cooperative spectrum sensing method based on information geometry and fuzzy c-means clustering algorithm. EURASIP J Wirel Commun Netw. 2019;2019(1):17-29. 60. Zhuang J, Wang Y, Zhang S, Wan P, Sun C. A multi-antenna spectrum sensing scheme based on main information extraction and genetic algorithm clustering. IEEE Access. 2019;7:119620-119630. 61. Zhang S, Wang Y, Yuan H, Wan P, Zhang Y, “Multiple-antenna cooperative spectrum sensing based on the wavelet transform and Gaussian mixture model,” Sensors, 2019;19(18):3863–3881. 62. Bhatti DMS, Zaidi SB, Rehman SU. Channel error detection based cluster formation for cooperative Spectrum sensing. International Conference on Information and Communication Technology Convergence (ICTC). Jeju, South Korea; IEEE: 2018. 63. Maity S, Chatterjee S, Tamaghna A. On optimal fuzzy c-means clustering for energy efficient cooperative spectrum sensing in cognitive radio networks. Digit Signal Process. 2016;49:104-115. 64. Paul A, Maity SP. Kernel fuzzy c-means clustering on energy detection based cooperative spectrum sensing. Digit Commun Netw. 2016;2(4):196-205. 65. Bhatti DMS, Shaikh B, Zaidi SIH. Fuzzy C-means and spatial correlation based clustering for cooperative spectrum sensing. International Conference on Information and Communication Technology Convergence (ICTC). Jeju, South Korea; IEEE: 2017. 66. Bhatti DMS, Ahmed S, Chan AS, Saleem K. Clustering formation in cognitive radio networks using machine learning. Int J Electron Commun. 2020;114:152–994. 67. Lo BF, Akyildiz IF. Reinforcement learning-based cooperative sensing in cognitive radio ad hoc networks. 21st Annual IEEE International Symposium on Personal, Indoor and Mobile Radio Communications. Instanbul, Turkey; IEEE: 2010. 28 of 28 JANU et al. 68. Lundén J, Koivunen V, Kulkarni SR, Poor HV. Reinforcement learning based distributed multiagent sensing policy for cognitive radio networks. IEEE International Symposium on Dynamic Spectrum Access Networks (DySPAN). Aachen, Germany: IEEE: 2011. 69. Lundén J, Kulkarni SR, Koivunen V, Poor HV. Multiagent reinforcement learning based Spectrum sensing policies for cognitive radio networks. IEEE J Selected Topics Signal Process. 2013;7(5):858-868. 70. Oksanen J, Lundén J, Koivunen V. Reinforcement learning based sensing policy optimization for energy efficient cognitive radio networks. Neurocomputing. 2012;80:102-110. 71. Zhang M, Wang L, Feng Y. Distributed cooperative spectrum sensing based on reinforcement learning in cognitive radio networks. AEU-Int J Electron C. 2018;94:359-366. 72. Menéndez ML, Morales D, Pardo L, Salicrú M. Statistical tests based on geodesic distances. Appl Math Lett. 1995;8(1):65-69. 73. Arnaudon M, Barbaresco F, Yang L. Riemannian medians and means with applications to radar signal processing. IEEE J Selected Topics Signal Process. 2013;7(4):595-604. 74. Moakher M. A differential geometric approach to the geometric mean of symmetric positive-definite matrices. SIAM J Matrix Anal Appl. 2008;26(3):735-747. 75. Liu X, Jia M, Zhang X, Lu W. A novel multichannel internet of things based on dynamic spectrum sharing in 5G communication. IEEE Internet Things J. 2019;6(4):5962-5970. 76. Yang C, Li J, Guizani M, Anpalagan A, El M. Advanced spectrum sharing in 5G cognitive heterogeneous networks. IEEE Wirel Commun. 2016;23(2):94-101. 77. Wang H, Nguyen DN, Dutkiewicz E, Fang G. Negotiable auction based on mixed graph: a novel Spectrum sharing framework. IEEE Trans Cognit Commun Netw. 2017;3(3):390-403. 78. Bany Salameh HA, Krunz M, Younis O. Cooperative adaptive spectrum sharing in cognitive radio networks. IEEE/ACM Trans Netw. 2010;18(4):1181-1194. 79. Sharma SK, Bogale TE, Le LB, Chatzinot S. Dynamic spectrum sharing in 5G wireless networks with full-duplex technology: recent advances and research challenges. IEEE Commun Surv Tutor. 2018;20(1):674-707. 80. Bhardwaj P, Panwar A, Ozdemir MEO. Enhanced dynamic spectrum access in multiband cognitive radio networks via optimized resource allocation. IEEE Trans Wirel Commun. 2016;15(12):8093-8106. 81. Kaur A, Kumar K. Energy-efficient resource allocation in cognitive radio networks under cooperative multi-agent model-free reinforcement learning schemes. IEEE Trans Netw Serv Manag. 2020;17(3):1337-1348. 82. Morozs N, Clarke T, Grace D. Heuristically accelerated reinforcement learning for dynamic secondary spectrum sharing. IEEE Access. 2015;3:2771-2783. 83. Li X, Fang J, Cheng W, Duan H, Chen Z, Li H. Intelligent power control for spectrum sharing in cognitive radios: a deep reinforcement learning approach. IEEE Access. 2018;6:25463-25473. 84. Zhang H, Yang N, Huangfu W, Long K, Leung VCM. Power control based on deep reinforcement learning for spectrum sharing. IEEE Trans Wirel Commun. 2020;19(6):4209-4219. 85. Zaky AB, Huang JZ, Wu K, ElHalawany BM. Generative neural network based spectrum sharing using linear sum assignment problems. China Commun. 2020;17(2):14–29. 86. Jacob S, Menon VG, Joseph S, et al. A novel spectrum sharing scheme using dynamic long short-term memory with CP-OFDMA in 5G networks. IEEE Trans Cognit Commun Netw. 2020;6(3):926-934. 87. Doshi A, Yerramalli S, Ferrari L, Yoo T, Andrews JG. A deep reinforcement learning framework for contention-based Spectrum sharing. IEEE J Selected Areas Commun. 2021;39(8):2526–2540. 88. Ahmed R, Chen Y, Hassan B, Du L. CR-IoTNet: machine learning based joint spectrum sensing and allocation for cognitive radio enabled IoT cellular networks. Ad Hoc Netw. 2021;112:1570-8705. 89. Kaur A, Kumar K. A reinforcement learning based evolutionary multi-objective optimization algorithm for spectrum allocation in cognitive radio networks. Phys Commun. 2020;43:1874-4907. How to cite this article: Janu D, Singh K, Kumar S. Machine learning for cooperative spectrum sensing and sharing: A survey. Trans Emerging Tel Tech. 2022;33(1):e4352. https://doi.org/10.1002/ett.4352