aDepartment of Electrical Engineering, University of Guilan, Rasht, Iran
bOptical Communications Research Group, Faculty of Engineering and Environment, Northumbria University, Newcastle, UK
A R T I C L E I N F O
Keywords:
Visible light communications (VLC) Carrier-less amplitude and phase (CAP) modulation
Deep learning (DL)
A B S T R A C T
We propose in this paper a carrier-less amplitude and phase visible light communications (VLC) system with deep learning (DL)-based post-equalizer (EQ) to significantly increase the transmission data rate. The proposed system is analyzed for various conditions including modulation order, transmitted signal bandwidth, and non- line of sight VLC channel. Results show that the highest data rate and spectral efficiency of 100 Mb/s and 4.67 b/s/Hz are achieved for the modulation order and signal bandwidth of 64 and 25 MHz, respectively.
In addition, we compare the performance and complexity of the proposed system with different types of EQs including least mean square and Volterra series. The study shows the DL-based EQ is qualified to mitigate mixed linear and nonlinear impairments by providing improved bit error rate performance compared to the other EQs for all modulation orders and the transmitted signal bandwidth.
1. Introduction
Optical wireless communications (OWC) use visible light (VL), in- frared (IR), and ultraviolet (UV) spectral bands to meet some of the demands for wireless connectivity in fifth-generation (5G) and 6G networks. OWC, as a complementary technology to the radio frequency (RF) wireless systems, has unique features such as almost unlimited bandwidth (BW), no spectrum authorization and regulations, much safer to the environment, higher security in the physical layer (PHY), higher energy efficiency and improved sustainability [1]. In the VL band OWC, known as VL communications (VLC), uses white light emitting diodes (LEDs) and photodetectors (PDs) as the transmitter (Tx) and the receiver (Rx); this offers simultaneous illumination and data communications [2]. However, white LEDs have some limitations that may cause VLC systems to underperform in practical applications.
Blue LEDs with phosphor and RGB (red, green, and blue) LEDs are the two most widely used methods of producing white light. Even though, RGB LEDs offer higher data rates,๐ ๐, blue LEDs with phosphor coat- ing have simpler implementation and lower cost. However, the long responsivity of phosphor limits the LED modulation bandwidth,๐ตLED, to only a few MHz, which in turn greatly constrains the achievable transmission capacity of the system. Moreover, white LEDs are another source of nonlinearity in VLC systems, leading to signal distortion and intersymbol interference (ISI) [3,4].
A common solution to mitigate these limitations is to employ linear and nonlinear equalizers (EQs) including adaptive least mean square
โ Corresponding author.
E-mail address: [email protected](G. Baghersalimi).
(LMS) โ the most widely used, Volterra series, and deep learning (DL)- based filters. As (i) in [5] it was shown how an adaptive EQ with the LMS algorithm could suppress the ISI in indoor VLC systems; (ii) in [6], it was shown that a Rx with a decision feedback EQ (DFE) with nonlinear Volterra feed-forward section could effectively mitigate the effects of LEDโs nonlinearity with improved performance by up to 5 dB in terms of optical power compared with a simple DFE; and (iii) in [7]
it was demonstrated that higher๐ ๐ of 170 Mb/s could be achieved by employing artificial neural network (ANN)-based equalization using white LEDs.
DL is a powerful subfield of machine learning (ML), which can learn, recognize, and predict complex patterns among high-dimensional input information data [8]. The DL methods have been used in many application domains including speech recognition, computer vision, self-driving cars, natural language processing, predictive forecasting, and communication systems [9โ11]. For example, authors in [12,13]
introduced several DL applications and improvements in the PHY of communication links. In [14], a deep neural network (DNN) was em- ployed as an accurate tool for channel estimation to reduce both dis- tortion and interference. Authors in [15] reviewed the challenges and applications of using DL in VLC, whereas in [16] DL was utilized to re- duce flickering and increase the dimming level in a VLC system. In [17], the authors determined an autoencoder to mitigate the nonlinearity of the LED and enhance the system bit error rate (BER) performance.
In [18], a DNN-based signal detector namely bi-directional long-short term memory for VLC links with 25 Mbps non-line of sight (NLOS)
https://doi.org/10.1016/j.optcom.2022.128741
Received 7 April 2022; Received in revised form 18 June 2022; Accepted 5 July 2022 Available online 8 July 2022
0030-4018/ยฉ2022 Elsevier B.V. All rights reserved.
Fig. 1. Schematic block diagram of the proposed CAP-VLC system with DL-based post-EQ.
and 50 Mbps NLOS using the second-order and first-order reflection, respectively, were reported with the BER of < 1 ร 10โ4. In [19], an orthogonal frequency division multiplexing (OFDM)-based optical quadrature spatial modulation for multiple-input multiple-output OWC with DNN-aided detection was proposed with a successful outcome.
There are different widely used spectrally efficient modulation schemes in intensity modulation/direct detection (IM/DD) optical sys- tems. In this work, we have adopted the carrier-less amplitude and phase (CAP) modulation scheme for VLC. To the best of the authorsโ
knowledge, the DL-based post-EQ has not been used in CAP-VLC systems. Therefore, inspired by many of the aforementioned research works, the effectiveness of applying DL for post-EQ in an indoor CAP- VLC system is investigated in this work to increase๐ ๐ and improve the BER performance. In our study, we successfully demonstrate ๐ ๐ of 100 Mb/s, which to our knowledge is the highest ๐ ๐ for a single band CAP-VLC system using a single white LED so far. In addition, we compare the BER performance of the proposed system with two common equalization techniques of Volterra series and adaptive LMS algorithm.
The rest of the paper is organized as follows. In Section 2, an overview of the DL-based indoor VLC-CAP system is explained. In Section 3, the most important results are presented and discussed.
Finally, Section4is allocated to conclude the paper.
2. System overview
Fig. 1illustrates the schematic block diagram of the proposed CAP- VLC system with DL-based post-EQ which is explained comprehensively as follows. Note, we assume that each LED follows the Lambertian radiation pattern.
2.1. The Tx
At the Tx, a random data bit stream {๐ผ๐} is mapped into the quadra- ture amplitude modulation (QAM) format prior to being applied to the CAP modulator. CAP is a spectrally efficient and low-cost technique, which can be implemented with ease, due to its simplicity and the lack of need for fast Fourier transform (FFT) and inverse FFT (IFFT) as required in OFDM. Like the QAM format, except that instead of using a local oscillator, CAP generates carrier frequencies using two orthogonal digital filters whose impulse responses are Hilbert transform pairs [20โ
23]. The mapped complex data are then up-sampled by a specific factor ๐๐ =โ2 (1 +๐ฝ)โ, whereโ.โis the ceiling function and๐ฝis the roll-of factor of the square root raised cosine (SRRC) filter. Next, the real and imaginary elements of the up-sampled signal are separated and sent to
the digital in-phase (I) and quadrature (Q) SRRC filters, respectively, which are given as [24]:
๐๐ผ(๐ก) =
โกโข
โขโข
โขโฃ
sin [๐พ(1 โ๐ฝ)] + 4๐ฝ(๐ก
๐๐
) cos [๐พ๐ฟ]
๐พ [
1 โ(4๐ฝ๐ก
๐๐
)2]
โคโฅ
โฅโฅ
โฅโฆ
.cos [๐พ๐ฟ] (1)
๐๐(๐ก) =
โกโข
โขโข
โขโฃ
sin [๐พ(1 โ๐ฝ)] + 4๐ฝ(
๐ก ๐๐
) cos [๐พ๐ฟ]
๐พ [
1 โ(4๐ฝ๐ก
๐๐
)2]
โคโฅ
โฅโฅ
โฅโฆ
.sin[๐พ๐ฟ] (2)
where๐๐ is the symbol duration,๐พ=๐๐กโ๐๐ , and๐ฟ= 1 +๐ฝ.
The outputs of the digital finite impulse response (FIR) filters are summed together to produce the desired CAP signal, which is given by [24]:
๐ (๐ก) =๐ ๐ผ(๐ก)โ ๐๐ผ(๐ก) โ๐ ๐(๐ก)โ ๐๐(๐ก) (3) where๐ ๐ผ(๐ก)and๐ ๐(๐ก)are up-sampledIandQcomponents, respectively andโindicates the time domain convolution. Finally,s (t)is scaled and a direct current (DC)-bias is added to it which makes it nonnegative within the dynamic range of the LED prior to IM of the light source as follow [25]:
๐ผ๐๐(๐ก) =๐ผ๐ท๐ถ+๐ผ๐ (๐ก), (4)
where๐ผis the scaling factor and defined as: ๐ผ= ๐ ๐ผร๐ผmax
(๐ ๐ผ+ 1) ร max{๐ (๐ก)} (5)
where๐ผmax=๐ผDC+ 0.5๐ผPPis the maximum value of๐ผin,๐ผDC is the DC- bias current,๐ผPPis the peak-to-peak current, and๐ ๐ผis the modulation index, which is defined as:
๐ ๐ผ=๐ผmaxโ๐ผDC
๐ผDC (6)
The Hammerstein method is adopted to model LED characteristics in this study. It includes a nonlinear function and a linear component to represent the LED with the limited bandwidth, which are characterized as a third-order polynomial and a first-order low pass Butterworth filter, respectively, as given by [25]:
๐NL(๐ฅ) =
โงโช
โจโช
โฉ
0.1947, ๐ฅ <0.1
0.2855๐ฅ3โ 1.0886๐ฅ2+ 2.0565๐ฅโ 0.0003, 0.1โค๐ฅ <1 1.2531, ๐ฅโฅ1
(7)
โLED(๐ก) =๐โ2๐๐LED๐ก (8)
where๐๐ฟ๐ธ๐ทis the 3-dB cut-off frequency of the LED. The LED transmit output power is defined as:
๐out(๐ก) =๐NL( ๐ผin(๐ก))
โ โLED(๐ก) (9)
the Tx and the Rx, respectively.๐๐ (๐)and๐(๐)are the optical filter and non-imaging concentrator gain, respectively. For the NLOS paths, every wall is divided into๐ฟ๐ small squares, which can be assumed as both the Tx and Rx.โ(
๐ก, ๐ , ๐๐๐)
represents the NLOS impulse response between the main Tx and the reflecting element๐๐๐, which acts as a Rx.
โ( ๐ก, ๐๐๐, ๐ท)
is the impulse response between๐๐๐ (i.e., acting as a Tx) and the main Rx,๐ท. It is worth to mention that, only a single reflection per wall is considered in this study and other higher order reflections are ignored since they contribute very little to the total received optical power [24]
2.3. The Rx
At the Rx side, the received signal is detected by an optical Rx composed of a single PD and a trans-impedance amplifier (TIA). The regenerated electrical received signal is given by:
๐ฆ(๐ก) =๐ (๐ก) โโ๐(๐ก) +๐(๐ก) (13)
where๐(๐ก)is the additive white Gaussian noise with the power๐๐ = ๐0๐ตRx, ๐0 is the noise power spectral density, and๐ตRx is the band- width of the Rx. Note,๐(๐ก)is mostly dominated by the ambient light induced shot noise. Following DC removal, a DL-based EQ is employed, which is explained in Subsection D in details. The equalized signal is applied to a set of inverse matched filters ๐๐ผ(๐ก) = ๐๐ผ(โ๐ก) and ๐๐(๐ก) = ๐๐(โ๐ก) to split theI andQ components of the CAP signal, respectively. Next, the detached signals are down-sampled prior to QAM demodulation.
2.4. DL-based EQ
As mentioned before, DL is a powerful subfield of ML, which em- ploys DNN to solve complex problems. DNN is a class of ANNs with two or more hidden layers of nodes [27], see Fig. 2. As shown, it consists of four layers including the input layer, two hidden layers, and output layer. The number of nodes, shown in circles, 120, 300, and 400 for both the input and output layer, and the 1st and 2nd hidden layers, respectively. All layers are fully connected, which means that every node in each layer is connected to all nodes in the next layer. The DL method splits the modeling dataset into training and test sets. The process for the training set is offline and includes two steps: (i) forward; and (ii) backward propagation [28]. First, in forward propagation, a subset of training dataset named minibatch crosses from the input layer to the hidden layers and then the output layer. The mathematical computations of forward propagation in the 1st hidden layer is given by [29]:
๐[1]=๐[1]๐+๐[1] (14)
๐ด[1]=๐[1](๐[1]) (15)
where[.]is the number of layer,๐and๐ด[1]are the input and output of the 1st layer, respectively,๐[1],๐[1]and๐[1](.)are the weights, biases,
๐=
โขโข
โขโข
โขโข
โขโข
โขโข
โฃ
1 โฆ 1
๐ฅ(1)[1]
2 โฆ๐ฅ(128)[1]
2
. . . ๐ฅ(1)[1]
120 โฆ๐ฅ(128)[1]
120
โฅโฅ
โฅโฅ
โฅโฅ
โฅโฅ
โฅโฅ
โฆ(๐[0]ร๐)=(120ร128)
(17)
๐[1]=
โกโข
โขโข
โขโข
โขโข
โขโข
โขโข
โขโฃ ๐(1)[1]
1 โฆ๐(128)[1]
1
๐(1)[1]
2 โฆ๐(128)[1]
2
๐(1)[1]
3 โฆ๐(128)[1]
3
. . . ๐(1)[1]
300 โฆ๐(128)[1]
300
โคโฅ
โฅโฅ
โฅโฅ
โฅโฅ
โฅโฅ
โฅโฅ
โฅโฆ(๐[1]ร๐)=(300ร128)
(18)
๐[1]=
โกโข
โขโข
โขโข
โขโข
โขโข
โขโข
โขโฃ ๐ง(1)[1]
1 โฆ๐ง(128)[1]
1
๐ง(1)[1]
2 โฆ๐ง(128)[1]
2
๐ง(1)[1]
3 โฆ๐ง(128)[1]
3
. . . ๐ง(1)[1]
300 โฆ๐ง(128)[1]
300
โคโฅ
โฅโฅ
โฅโฅ
โฅโฅ
โฅโฅ
โฅโฅ
โฅโฆ(๐[1]ร๐)=(300ร128)
(19)
๐ด[1]=
โกโข
โขโข
โขโข
โขโข
โขโข
โขโข
โขโฃ ๐(1)[1]
1 โฆ๐(128)[1]
1
๐(1)[1]2 โฆ๐(128)[1]2 ๐(1)[1]3 โฆ๐(128)[1]3
. . . ๐(1)[1]
300 โฆ๐(128)[1]
300
โคโฅ
โฅโฅ
โฅโฅ
โฅโฅ
โฅโฅ
โฅโฅ
โฅโฆ(๐[1]ร๐)=(300ร128)
(20)
where๐[๐] is the number of nodes in๐thlayer. The matrix dimension for๐[๐],๐[๐],๐[๐], and๐ด[๐]are(
๐[๐]ร๐[๐โ1])
,(๐[๐]ร๐),(๐[๐]ร๐), and(๐[๐]ร๐), respectively, where๐ is the size of minibatch, which is assumed to be 128 in this work. Generally, the forward propagation process for all layers is given as:
๐[๐]=๐[๐]๐ด[๐โ1]+๐[๐] (21)
๐ด[๐]=๐[๐](๐[๐]) (22)
where๐ด[๐โ1]is the input of the๐th layer.
Fig. 2. The structure of the proposed DL-based post-EQ.
Fig. 3. The BER performance against (๐ธbโ๐0)testwith the DL-based EQ for a range of (๐ธbโ๐0)trainfor 128 QAM and signal BW of 5 MHz.
The rectified linear unit (ReLU) and Sigmoid values are used for the intermediate and output layers, respectively, which are given as [28]:
ReLU (๐ฅ) =
{๐ฅ, ๐ฅ >0
0, ๐ฅโค0, (23)
Sigmoid= 1
1 + exp(โ๐ฅ) (24)
There are two types of learning process for ML algorithms: su- pervised and unsupervised. In the former, which is used here, the network is trained based on the input/output pairs known as the labeled data [27]. The labeled data (๐ฅ, ๐ฆ) is shown as portsAandB inFig. 1. PortA, i.e., the input to DNN, is the received signal with the LED and channel induced impairment, and portBis the desired CAP signal, which is used as the prediction target.
Accordingly, a loss functionL is defined to determine the error, which indicates the difference between the outputs of DNN and the correct answers.Lis expressed by various metrics [30], and here we have used the mean square error (MSE), which is defined as:
MSE(๐, ๐ฆ) = 1 ๐
โ๐
๐=1
(๐๐โ๐ฆ๐)2, (25)
where๐is the estimated real output of DNN, and๐ฆis the desired labeled value.
In the backward propagation phase, the main aim of DL is to find the correct weights and biases parameters of (๐ค,๐), respectively, for minimizing L using different optimization algorithms. Stochastic gradient descent (SGD) [31] is the most popular optimization algorithm in DL, which is also adopted in this study. First,๐คand๐are initialized to a random values and the gradients of differentiable๐ฟare calculated based on๐ค and ๐using the chain rule. Here the gradients of cross entropy function are calculated as a useful example. Note, we have the followings:
๐๐ฟ
๐๐ฅ=๐๐ฅ, (26)
๐๐[2]= ๐๐ฟ ๐๐[2] = โ๐ฆ
๐[2]+ 1 โ๐ฆ
1 โ๐[2], (27)
๐๐ง[2]= ๐๐ฟ ๐๐ง[2] = ๐๐ฟ
๐๐[2]ร๐๐[2]
๐๐ง[2] =๐๐[2]ร๐โฒ( ๐ง[2])
= (โ๐ฆ
๐[2]+ 1 โ๐ฆ 1 โ๐[2]
)
ร( ๐[2](
1 โ๐[2]))
=๐[2]โ๐ฆ, (28)
๐๐ค[2]= ๐๐ฟ ๐๐ค[2] = ๐๐ฟ
๐๐[2] ร ๐๐[2]
๐๐ค[2]
= (โ๐ฆ
๐[2] + 1 โ๐ฆ 1 โ๐[2]
)
ร( ๐[2](
1 โ๐[2])
ร๐[1]๐)
=( ๐[2]โ๐ฆ)
ร๐[1]๐=๐๐ง[2]ร๐[1]๐, (29)
๐๐[2]= ๐๐ฟ ๐๐[2] = ๐๐ฟ
๐๐[2]ร๐๐[2]
๐๐[2] =๐๐ง[2]=๐[2]โ๐ฆ, (30)
Fig. 4. Flowchart of training and test process.
Fig. 5. BER performance of the proposed system for 64- and 128-QAM for a range of transmit signal BW.
Generally, the equation for the backward propagation for๐training example is given as:
๐๐[๐]=๐๐ด[๐]ร๐โฒ[๐](๐[๐]), (31) ๐๐[๐]= 1
๐ ร๐๐[๐]ร๐ด[๐โ1]๐, (32)
๐๐[๐]= 1
๐ ร๐๐[๐], (33)
๐๐ด[๐โ1]=๐[๐]๐ร๐๐[๐], (34)
where the dimension of matrices๐๐[๐],๐๐[๐],๐๐[๐], and๐๐ด[๐โ1]equal to(๐[๐]ร๐[๐โ1]),(๐[๐]ร๐),(๐[๐]ร๐), and(๐[๐]ร๐), respectively. The updated and optimized weights and biases in each layer are given as:
๐คโถ =๐คโ๐ผ๐๐ค, (35)
๐โถ =๐โ๐ผ๐๐, (36)
where,๐ผ is the learning rate. The backward propagation algorithm is repeated for๐โ๐times to complete one epoch, where๐is the number of complete training features for each node in the input layer. An epoch is completed when all dataset is applied to the DL algorithm. Although determining the number of epochs is different due to the convergence of algorithm, but it is considered to 100 in this study.
As shown inFig. 2, prior to the use of activation function, the batch normalization method is applied, which speeds up the optimization process to provide higher๐ผ. Note, (i) according to [32], the demand for the regularization technique is reduced noticeably; and (ii) the adaptive moment estimation (Adam) optimizer is employed due to faster conver- gence. More detailed information can be found in [33]. The pseudocode for the utilized learning algorithm is explained in Algorithm 1. To avoid overfitting and make the model more generalized, the dataset is divided into two sets of training and validation [34]. In the validation set, the hyperparameters can be changed and tuned to minimize the error in the produced model, which is fitted on the training dataset. In the test
Table 1 DNN parameters.
Symbol Parameter Value
๐[0] Number of nodes for input layer 120
โ Number of hidden layers 2
๐[1] Number of nodes for 1st hidden layer 300 ๐[2] Number of nodes for 2nd hidden layer 400
๐[3] Number of nodes for output layer 120
๐(.) Activation function for hidden layers ReLU ๐(.) Activation function for output layer Sigmoid
โ Number of Batch normalization layers 2
โ Optimizer SGD with Adam
โ (๐ธbโ๐0)train 5, 10, 15, 20 dB
โ Default (optimum) (๐ธbโ๐0)train 10 dB
L Loss function MSE
๐ผ Learning rate 0.01
๐ Size of minibatch 128
๐ Number of training samples for each node 106
โ Size of training dataset 120ร106
โ Size of test dataset 30000
โ Number of epoch 100
Table 2 System parameters.
Symbol Parameter Value
โ Room size 5ร5ร3 m3
๐1โ2 Semi-power half angle 70โฆ
๐๐ PD field of view 60โฆ
๐ด PD detector area 1 cm2
๐ PD Responsivity of PD 0.54 A/W
๐๐ (๐) Optical filter gain 1 ๐ต๐ LED modulation bandwidth 4.5 MHz ๐ฝ Roll of factor ofI/Qfilters 0.5
๐๐ Up-sampling factor 6
๐ฟ๐ I/Qfilter length 10 sym.
๐ต๐ก Total bandwidth 5โ25 MHz
โ Modulation type QAM
๐ Modulation order 64 & 128
๐บTIA TIA gain 50โ60 dB
โ Tx position Center of ceiling
โ Rx position LOS scenario: Center of floor
NLOS scenario: (1.15, 1.15, 0.85) m
phase, which is done online, the new and never-before-seen received CAP signals are used as the input dataset in the DNN EQs. The trained network attempts to recover them and minimize the error based on the gained knowledge. All the parameters for DNN are summarized in Table 1.
3. Result and discussion
In this section, we investigate the performance of the proposed CAP-VLC system with a DL-based post-EQ by means of computer simu- lations. MATLAB and Python are used to simulate VLC subsystem and the DL-based EQ, respectively. All key system parameters adopted are summarized inTable 2. To train the network, we have used a range of energy per bit to noise ratios (๐ธbโ๐0)train. Due to the computation time, we considered only (๐ธbโ๐0)train values of 5, 10, 15, and 20 dB.
Fig. 3depicts the BER performance against the (๐ธbโ๐0)testfor a range of (๐ธbโ๐0)trainvalues, 128-QAM and a signal bandwidth of 5 MHz. It is shown that, the best (optimum) BER performance is achieved for (๐ธbโ๐0)trainof 10 dB, with the (๐ธbโ๐0)testgains of 0.2, 0.8, and 5.7 dB compared with the (๐ธbโ๐0)trainvalues of 15, 20 and 5 dB, respectively, at the 7% forward error correlation (FEC) BER limit of 3.8 ร10โ3. Considering the trade-off between the simulation time and estimation of the BER, the coefficients of the DNN network corresponding to the optimum (๐ธbโ๐0)train of 10 dB is adopted for the rest of evaluation during the test phase.Fig. 4 illustrates the flowchart of training and test processes.
Fig. 5 compares the BER performance of the proposed system for 64- and 128-QAM for a range of transmit signal BW. Referring to the
7% FEC BER limit, 128-QAM with a signal BW of 5 MHz offers the best performance with the (๐ธbโ๐0)testgains of 1, 1.8, 6, and 6 dB compared with 10, 15, and 20 MHz 128-QAM, and 25 MHz 64-QAM, respectively.
๐ ๐of 93.4 Mb/s and the spectral efficiency of 4.67 b/s/Hz are attained for 20 MHz 128-QAM with respect to the 7% FEC BER limit. To achieve higher๐ ๐, the proposed system is also examined for 64-QAM and a signal BW of 25 MHz. Results demonstrate that๐ ๐is improved by up toโผ6 Mb/s reaching 100 Mb/s with respect to the 7% FEC BER limit.
At the Rx side, the small value of peak-to-peak injection current๐ผpp leads to lower๐ธbโ๐0 deterioration, which degrades the BER perfor- mance. Furthermore, large values of ๐ผpp may exceed the LED linear range, thus resulting in nonlinear distortions.Fig. 6depicts the BER performance against๐ผpp for (๐ธbโ๐0)train = 10dB,๐ = 64and BW of 25 MHz, respectively. As can be seen, the lowest BER ofโผ2.26ร10โ3 is achieved at 0.7โค๐ผppofโค0.8 A. Therefore,๐ผppof 0.8 A is considered as the default value in this study.
Fig. 7(a) depicts the BER performance of the proposed system with 64-QAM and a BW of 25 MHz for three different post-EQ schemes. It is clearly seen that, at the 7% FEC BER limit DL offers (๐ธbโ๐0)testgains 2 and 8 dB compared with Volterra and LMS EQs, respectively, which can be used to trade-off transmission range against๐ ๐. Note, reducing the bandwidth results in improved BER performance as shown inFig. 7(b) with the DL-based EQ outperforming LMS and Volterra. For instance, at the BER of 3.8ร10โ3, the (๐ธbโ๐0)testpenalties areโผ6 and 10 dB for Volterra and LMS, respectively compared with the DL method.
Next, we evaluated the BER performance of LOS and LOS + NLOS 64-QAM VLC links with a signal BW of 25 MHz. As can be seen in Fig. 8, the overall BER performance of pure LOS and LOS + NLOS links are almost the same.
Finally, the complexity of the DL-based EQ is compared with LMS and the 2nd order Volterra series EQ as summarized inTable 3[35].
As it is expected, the computational complexity of the DL-based EQ is approximately 104 times larger than other two EQs. However, in this work the proposed system with only two hidden layers is considered, which is a simple DNN using DL technique, while more complex net- works can be implemented using field programmable gate array (FPGA) boards. As an example, in [36] a DNN with two hidden layers was implemented using FPGA (Xilinx Zynq UltraScale) with the complexity of about 182007, thus demonstrating the potential use of FPGA in DL- based EQs in wireless communications. Note, ๐๐ฟ and๐NL are the linear and nonlinear memory depths, respectively.๐1, ๐ป1,๐ป2, and ๐ป3are the number of nodes in input, first hidden layer, second hidden layer, and output layer, respectively.
4. Conclusion
In this paper, we introduced a deep learning-based post-EQ for
โโsingleโโ CAP-VLC system considering both the LEDโs bandwidth lim- itation and its nonlinearity effects. The results showed that, the highest data rate of about 100 Mb/s is achieved for 64-QAM, the signal bandwidth of 25 MHz and the LED modulation bandwidth of 4.5 MHz (i.e., standard white LED). Finally, we compared the performance of the proposed scheme with two common EQs of LMS and Volterra series, demonstrating improved BER results.
Algorithm 1. Pseudo-code of DNN training algorithm Input:Number of layers๐ฟ, number of neurons for all layers, activation functions, loss function, number of epochs๐ธ, number of transmitted batches in one epoch๐, number of training example in each minibatch๐, learning rate๐ผ= 0.01,๐ฝ1= 0.9,๐ฝ2= 0.999, ๐= 10โ8
Output:A trained DNN
1:Initializeweights and biases for all layers randomly 2: forepoch = 1 to๐ธdo
3: for๐ก= 1to๐do
Fig. 6. The BER against๐ผppfor the 64-QAM CAP-VLC system with the DL-based EQ.
Fig. 7.The BER performance against (๐ธbโ๐0)testfor three different EQs schemes for the signal bandwidth of: (a) 25 MHz, and (b) 20 MH.
Table 3
Comparison of complexity for three different EQs.
Type of EQs ๐๐ฟ ๐๐ ๐ฟ ๐1 ๐ป1 ๐ป2 ๐ป3 Complexity Value of complexity
DL-based โ โ 120 300 400 120 ๐1ร๐ป1+๐ป1ร๐ป2+๐ป3 156400
LMS 6 โ โ โ โ โ ๐๐ฟ 6
Volterra series 6 6 โ โ โ โ ๐๐ฟ+๐๐ ๐ฟ(๐๐ ๐ฟ+ 1)โ2 27
Algorithm 1. Pseudo-code of DNN training algorithm 4: for๐= 2 to๐ฟdo
5: doBatch normalization as follow:
6:๐๐ฝโ 1
๐
โ๐ ๐=1๐ฅ๐ 7:๐๐ฝ2โ 1
๐
โ๐
๐=1(๐ฅ๐โ๐๐ฝ)2 8:๐ฅโง๐ โ๐ฅ๐-โ๐๐ฝ
๐2๐ฝ+๐
9:๐ ๐โ๐พ ๐ฅโง๐ +๐ฝ
10: doforward propagation 11:end for
12: calculateloss function 13: for๐= 2to L
14: dobackward propagation
15: initialize๐๐๐ค,๐๐๐,๐๐๐ค,๐๐๐to zero 16:๐๐๐ค=๐ฝ1๐๐๐ค+(
1 โ๐ฝ1) ๐๐ค 17:๐๐๐=๐ฝ1๐๐๐+(
1 โ๐ฝ1) ๐๐
Algorithm 1. Pseudo-code of DNN training algorithm 18:๐๐๐ค=๐ฝ2๐๐๐ค+(
1 โ๐ฝ2) ๐๐ค2 19:๐๐๐=๐ฝ2๐๐๐+(
1 โ๐ฝ2) ๐๐2 20:๐๐ถ๐๐๐๐๐๐ก๐๐
๐๐ค = Vdw
1โ๐ฝ1๐ก
21:๐๐ถ๐๐๐๐๐๐ก๐๐
๐๐ =1โ๐ฝVdb๐ก 1
22:๐๐ถ๐๐๐๐๐๐ก๐๐
๐๐ค = Sdw
1โ๐ฝ๐ก 2
23:๐๐๐๐ถ๐๐๐๐๐๐ก๐๐= Sdb
1โ๐ฝ๐ก 2
24: updateall weights and biases as follow:
25:๐ค=๐คโ๐ผ ๐
๐ถ๐๐๐๐๐๐ก๐๐
โ๐๐ค ๐๐ถ๐๐๐๐๐๐ก๐๐๐๐ค +๐
26:๐=๐โ๐ผ ๐
๐ถ๐๐๐๐๐๐ก๐๐
โ๐๐ ๐๐ถ๐๐๐๐๐๐ก๐๐๐๐ +๐
27:end for 28:end for 29:end for
Fig. 8. The BER performance against (๐ธbโ๐0)testfor LOS and LOS + NLOS 64-QAM VLC systems.
Declaration of competing interest
The authors declare that they have no known competing finan- cial interests or personal relationships that could have appeared to influence the work reported in this paper.
Data availability
No data was used for the research described in the article.
Acknowledgments
The authors would like to thank anonymous reviewers for their helpful comments.
Funding
We have not received any funding support.
References
[1] Harald Haas, Jaafar Elmirghani, Ian White, Optical wireless communication, Phil.
Trans. R. Soc. (2020)http://dx.doi.org/10.1098/rsta.2020.0051.
[2] Galefang Allycan Mapunda, Reuben Ramogomana, Leatile Marata, Bokamoso Basutli, Amjad Saeed Khan, Joseph Monamati Chuma, Indoor visible light communication: A tutorial and survey, hindawi, Wirel. Commun. Mob. Comput.
(2020)http://dx.doi.org/10.1155/2020/8881305.
[3] Carlos Medina, Mayteรฉ Zambrano, Kiara Navarro, LED based visible light communication: technology, applications and challenges โ a survey, Int. J. Adv.
Eng. Technol. 8 (4) (2015) 482โ495,http://dx.doi.org/10.7323/ijaet/v8iss4.
[4] A. Burton, Z. Ghassemlooy, S. Zvanovec, P.A. Haigh, H.L. Minh, X. Li, Investiga- tion into using compensation for the nonlinear effects of the output of LEDs in visible light communication systems, in: 2nd West Asian Colloquium on Optical Wireless Communications, WACOWC, Tehran, Iran, 2019, pp. 80โ84.
[5] Toshihiko Komine, Jun Hwan Lee, Shinichiro Haruyama, Masao Nakagawa, Adaptive equalization system for visible light wireless communication utilizing multiple white LED lighting equipment, IEEE Trans. Wirel. Commun. 8 (6) (2009) 2892โ2900,http://dx.doi.org/10.1109/TWC.2009.060258.
[6] Grzegorz Stepniak, Jerzy Siuzdak, Piotr Zwierko, Compensation of a VLC phosphorescent white LED nonlinearity by means of volterra DFE, IEEE Photonics Technol. Lett. 25 (16) (2013) 1597โ1600,http://dx.doi.org/10.1109/LPT.2013.
2272511.
[7] P.A. Haigh, Z. Ghassemlooy, S. Rajbhandari, I. Papakonstantinou, W. Popoola, Visible light communications: 170 Mb/s using an artificial neural network equalizer in a low bandwidth white light configuration, J. Lightwave Technol.
32 (9) (2014) 1807โ1813,http://dx.doi.org/10.1109/JLT.2014.2314635.
[8] Shaveta Dargan, Munish Kumar, Maruthi Rohit Ayyagari, Gulshan Kumar, A survey of deep learning and its applications: a new paradigm to machine learning, Arch. Comput. Methods Eng. 27 (2020) 1071โ1092,http://dx.doi.org/
10.1007/s11831-019-09344-w.
[9] Shi Dong, Ping Wang, Khushnood Abbas, A survey on deep learning and its applications, Comp. Sci. Rev. 40 (2021)http://dx.doi.org/10.1016/j.cosrev.2021.
100379.
[10] Feifan Liao, Shengyun Wei, Shun Zou, Deep learning methods in communication systems: a review, in: 2nd International Conference on Electronic Engineering and Informatics, IOP Publishing, 2020,http://dx.doi.org/10.1088/1742-6596/
1617/1/012024.
[11] Hoon Lee, Sang Hyun Lee, Tony Q.S. Quek, Inkyu Lee, Deep learning framework for wireless systems: applications to optical wireless communications, IEEE Commun. Mag. 57 (3) (2019) 35โ41, http://dx.doi.org/10.1109/MCOM.2019.
1800584.
[12] Tim OโShea, Jakob Hoydis, An introduction to deep learning for the physical layer, IEEE Trans. Cogn. Commun. Netw. 3 (4) (2017) 563โ575,http://dx.doi.
org/10.1109/TCCN.2017.2758370.
[13] Zhijin Qin, Hao Ye, Geoffrey Ye Li, Biing-Hwang Juang, Deep learning in physical layer communications, IEEE Wirel. Commun. 26 (2) (2019) 93โ99, http://dx.doi.org/10.1109/MWC.2019.1800601.
[14] Hao Ye, Geoffrey Ye Li, Biing-Hwang Juang, Power of deep learning for channel estimation and signal detection in OFDM systems, IEEE Wirel. Commun. Lett. 7 (1) (2018) 114โ117,http://dx.doi.org/10.1109/LWC.2017.2757490.
[15] Nan Chi, Junlian Jia, Fangchen Hu, Yiheng Zhao, Peng Zou, Challenges and prospects of machine learning in visible light communication, J. Commun. Inf.
Netw. 5 (3) (2020) 302โ309,http://dx.doi.org/10.23919/JCIN.2020.9200893.
[16] Mehmet Gorkem Ulkar, Tuncer Baykas, Ali E. Pusane, VLCnet: deep learning based end-to-end visible light communication system, J. Lightwave Technol. 38 (21) (2020) 5937โ5948,http://dx.doi.org/10.1109/JLT.2020.3006827.
[17] Pu Miao, Pu Miao, Chenhao Qi, Yi Jin, Chong Lin, A model-driven deep learning method for LED nonlinearity mitigation in OFDM-based optical communications, IEEE Access 7 (2019) 71436โ71446, http://dx.doi.org/10.1109/ACCESS.2019.
2919983.
[18] Yonghe Zhu, Chen Gong, Jianghua Luo, Meiyu Jin, Xianqing Jin, Zhengyuan Xu, Indoor non-line of sight visible light communication with a bi-LSTM neural network, in: IEEE International Conference on Communications Workshops (ICC Workshops), 2020, http://dx.doi.org/10.1109/ICCWorkshops49005.2020.
9145317.
[19] C. Chen, L. Zeng, X. Zhong, S. Fu, M. Liu, P. Du, Deep learning-aided OFDM- based generalized optical quadrature spatial modulation, IEEE Photonics J. 14 (1) (2022) 1โ6,http://dx.doi.org/10.1109/JPHOT.2021.3129541.
[20] Mahdi Nassiri, Saeed Ashouri, Gholamreza Baghersalimi, Zabih Ghassemlooy, A comparative performance assessment between pre-equalization and post equal- ization techniques in m-CAP indoor VLC systems with bit loading, in: 1st West Asian Colloquium on Optical Wireless Communications, WACOWC2018.
[21] Zhaocheng Wang, Tianqi Mao, Qi Wang, Optical OFDM for visible light com- munications, in: IEEE, 13th International Wireless Communications and Mobile Computing Conference, IWCMC, 2017,http://dx.doi.org/10.1109/IWCMC.2017.
7986454.
[22] Mahdi Nassiri, Atiyeh Pouralizadeh, Gholamreza Baghersalimi, Zabih Ghassem- looy, A hybrid variable m-CAP based indoor visible light communications and fingerprint positioning system, in: 3rd West Asian Colloquium on Optical Wireless Communications, WACOWC, 2020, http://dx.doi.org/10.1109/WASOWC49739.
2020.9410163.
[23] N. Bamiedakis, R.V. Penty, I.H. White, Carrier-less amplitude and phase modula- tion in wireless visible light communication systems, Phil. Trans. R. Soc. (2019) http://dx.doi.org/10.1098/rsta.2019.0181.
[24] Z. Ghassemlooy, W. Popoola, S. Rajbhandari, Optical Wireless Communications:
System and Channel Modelling with Matlabยฎ, CRC Press, 2019.
[25] M. Nassiri, G. Baghersalimi, Z. Ghassemlooy, Optical OFDM based on fractional fourier transform for an indoor VLC system, Appl. Opt. 60 (9) (2021) 2664โ2671, http://dx.doi.org/10.1364/AO.416565.
[26] M. Nassiri, S. Ashouri, G. Baghersalimi, Z. Ghassemlooy, Performance evaluation and comparison of spatial modulation and non-DC MIMO m-CAP techniques in indoor VLC system, IET Optoelectron. (2019) 281โ287,http://dx.doi.org/10.
1049/iet-opt.2018.5076.
[27] Md Zahangir Alom, Tarek M. Taha, Chris Yakopcic, Stefan Westberg, Pa- heding Sidike, Mst Shamima Nasrin, Mahmudul Hasan, Brian C. Van Essen, Abdul A.S. Awwal, Abdul A.S. Awwal, A state-of-the-art survey on deep learning theory and architectures, Electronics 8 (3) (2019) http://dx.doi.org/10.3390/
electronics8030292.
[28] Priti G. Pachpande, Monette H. Khadr, Ahmed F. Hussein, Hany Elgala, Visible light communication using deep learning techniques, in: IEEE 39th Sarnoff Symposium, 2018,http://dx.doi.org/10.1109/SARNOF.2018.8720493.
[29] Ian Goodfellow, Yoshua Bengio, Aaron Courville, Deep Learning, MIT Press, 2016.
[30] Qi Wang, Yue Ma, Kun Zhao, Yingjie Tian, A comprehensive survey of loss function in machine learning, J. Ann. Data Sci. (2020) http://dx.doi.org/10.
1007/s40745-020-00253-5.