• Non ci sono risultati.

ML in Visible light communication

N/A
N/A
Nora Pouralizadeh

Academic year: 2024

Condividi "ML in Visible light communication"

Copied!
9
0
0

Testo completo

(1)

aDepartment of Electrical Engineering, University of Guilan, Rasht, Iran

bOptical Communications Research Group, Faculty of Engineering and Environment, Northumbria University, Newcastle, UK

A R T I C L E I N F O

Keywords:

Visible light communications (VLC) Carrier-less amplitude and phase (CAP) modulation

Deep learning (DL)

A B S T R A C T

We propose in this paper a carrier-less amplitude and phase visible light communications (VLC) system with deep learning (DL)-based post-equalizer (EQ) to significantly increase the transmission data rate. The proposed system is analyzed for various conditions including modulation order, transmitted signal bandwidth, and non- line of sight VLC channel. Results show that the highest data rate and spectral efficiency of 100 Mb/s and 4.67 b/s/Hz are achieved for the modulation order and signal bandwidth of 64 and 25 MHz, respectively.

In addition, we compare the performance and complexity of the proposed system with different types of EQs including least mean square and Volterra series. The study shows the DL-based EQ is qualified to mitigate mixed linear and nonlinear impairments by providing improved bit error rate performance compared to the other EQs for all modulation orders and the transmitted signal bandwidth.

1. Introduction

Optical wireless communications (OWC) use visible light (VL), in- frared (IR), and ultraviolet (UV) spectral bands to meet some of the demands for wireless connectivity in fifth-generation (5G) and 6G networks. OWC, as a complementary technology to the radio frequency (RF) wireless systems, has unique features such as almost unlimited bandwidth (BW), no spectrum authorization and regulations, much safer to the environment, higher security in the physical layer (PHY), higher energy efficiency and improved sustainability [1]. In the VL band OWC, known as VL communications (VLC), uses white light emitting diodes (LEDs) and photodetectors (PDs) as the transmitter (Tx) and the receiver (Rx); this offers simultaneous illumination and data communications [2]. However, white LEDs have some limitations that may cause VLC systems to underperform in practical applications.

Blue LEDs with phosphor and RGB (red, green, and blue) LEDs are the two most widely used methods of producing white light. Even though, RGB LEDs offer higher data rates,๐‘…๐‘‘, blue LEDs with phosphor coat- ing have simpler implementation and lower cost. However, the long responsivity of phosphor limits the LED modulation bandwidth,๐ตLED, to only a few MHz, which in turn greatly constrains the achievable transmission capacity of the system. Moreover, white LEDs are another source of nonlinearity in VLC systems, leading to signal distortion and intersymbol interference (ISI) [3,4].

A common solution to mitigate these limitations is to employ linear and nonlinear equalizers (EQs) including adaptive least mean square

โˆ— Corresponding author.

E-mail address: [email protected](G. Baghersalimi).

(LMS) โ€“ the most widely used, Volterra series, and deep learning (DL)- based filters. As (i) in [5] it was shown how an adaptive EQ with the LMS algorithm could suppress the ISI in indoor VLC systems; (ii) in [6], it was shown that a Rx with a decision feedback EQ (DFE) with nonlinear Volterra feed-forward section could effectively mitigate the effects of LEDโ€™s nonlinearity with improved performance by up to 5 dB in terms of optical power compared with a simple DFE; and (iii) in [7]

it was demonstrated that higher๐‘…๐‘‘ of 170 Mb/s could be achieved by employing artificial neural network (ANN)-based equalization using white LEDs.

DL is a powerful subfield of machine learning (ML), which can learn, recognize, and predict complex patterns among high-dimensional input information data [8]. The DL methods have been used in many application domains including speech recognition, computer vision, self-driving cars, natural language processing, predictive forecasting, and communication systems [9โ€“11]. For example, authors in [12,13]

introduced several DL applications and improvements in the PHY of communication links. In [14], a deep neural network (DNN) was em- ployed as an accurate tool for channel estimation to reduce both dis- tortion and interference. Authors in [15] reviewed the challenges and applications of using DL in VLC, whereas in [16] DL was utilized to re- duce flickering and increase the dimming level in a VLC system. In [17], the authors determined an autoencoder to mitigate the nonlinearity of the LED and enhance the system bit error rate (BER) performance.

In [18], a DNN-based signal detector namely bi-directional long-short term memory for VLC links with 25 Mbps non-line of sight (NLOS)

https://doi.org/10.1016/j.optcom.2022.128741

Received 7 April 2022; Received in revised form 18 June 2022; Accepted 5 July 2022 Available online 8 July 2022

0030-4018/ยฉ2022 Elsevier B.V. All rights reserved.

(2)

Fig. 1. Schematic block diagram of the proposed CAP-VLC system with DL-based post-EQ.

and 50 Mbps NLOS using the second-order and first-order reflection, respectively, were reported with the BER of < 1 ร— 10โˆ’4. In [19], an orthogonal frequency division multiplexing (OFDM)-based optical quadrature spatial modulation for multiple-input multiple-output OWC with DNN-aided detection was proposed with a successful outcome.

There are different widely used spectrally efficient modulation schemes in intensity modulation/direct detection (IM/DD) optical sys- tems. In this work, we have adopted the carrier-less amplitude and phase (CAP) modulation scheme for VLC. To the best of the authorsโ€™

knowledge, the DL-based post-EQ has not been used in CAP-VLC systems. Therefore, inspired by many of the aforementioned research works, the effectiveness of applying DL for post-EQ in an indoor CAP- VLC system is investigated in this work to increase๐‘…๐‘‘ and improve the BER performance. In our study, we successfully demonstrate ๐‘…๐‘‘ of 100 Mb/s, which to our knowledge is the highest ๐‘…๐‘‘ for a single band CAP-VLC system using a single white LED so far. In addition, we compare the BER performance of the proposed system with two common equalization techniques of Volterra series and adaptive LMS algorithm.

The rest of the paper is organized as follows. In Section 2, an overview of the DL-based indoor VLC-CAP system is explained. In Section 3, the most important results are presented and discussed.

Finally, Section4is allocated to conclude the paper.

2. System overview

Fig. 1illustrates the schematic block diagram of the proposed CAP- VLC system with DL-based post-EQ which is explained comprehensively as follows. Note, we assume that each LED follows the Lambertian radiation pattern.

2.1. The Tx

At the Tx, a random data bit stream {๐ผ๐‘›} is mapped into the quadra- ture amplitude modulation (QAM) format prior to being applied to the CAP modulator. CAP is a spectrally efficient and low-cost technique, which can be implemented with ease, due to its simplicity and the lack of need for fast Fourier transform (FFT) and inverse FFT (IFFT) as required in OFDM. Like the QAM format, except that instead of using a local oscillator, CAP generates carrier frequencies using two orthogonal digital filters whose impulse responses are Hilbert transform pairs [20โ€“

23]. The mapped complex data are then up-sampled by a specific factor ๐‘›๐‘  =โŒˆ2 (1 +๐›ฝ)โŒ‰, whereโŒˆ.โŒ‰is the ceiling function and๐›ฝis the roll-of factor of the square root raised cosine (SRRC) filter. Next, the real and imaginary elements of the up-sampled signal are separated and sent to

the digital in-phase (I) and quadrature (Q) SRRC filters, respectively, which are given as [24]:

๐‘“๐ผ(๐‘ก) =

โŽกโŽข

โŽขโŽข

โŽขโŽฃ

sin [๐›พ(1 โˆ’๐›ฝ)] + 4๐›ฝ(๐‘ก

๐‘‡๐‘ 

) cos [๐›พ๐›ฟ]

๐›พ [

1 โˆ’(4๐›ฝ๐‘ก

๐‘‡๐‘ 

)2]

โŽคโŽฅ

โŽฅโŽฅ

โŽฅโŽฆ

.cos [๐›พ๐›ฟ] (1)

๐‘“๐‘„(๐‘ก) =

โŽกโŽข

โŽขโŽข

โŽขโŽฃ

sin [๐›พ(1 โˆ’๐›ฝ)] + 4๐›ฝ(

๐‘ก ๐‘‡๐‘ 

) cos [๐›พ๐›ฟ]

๐›พ [

1 โˆ’(4๐›ฝ๐‘ก

๐‘‡๐‘ 

)2]

โŽคโŽฅ

โŽฅโŽฅ

โŽฅโŽฆ

.sin[๐›พ๐›ฟ] (2)

where๐‘‡๐‘ is the symbol duration,๐›พ=๐œ‹๐‘กโˆ•๐‘‡๐‘ , and๐›ฟ= 1 +๐›ฝ.

The outputs of the digital finite impulse response (FIR) filters are summed together to produce the desired CAP signal, which is given by [24]:

๐‘ (๐‘ก) =๐‘ ๐ผ(๐‘ก)โŠ— ๐‘“๐ผ(๐‘ก) โˆ’๐‘ ๐‘„(๐‘ก)โŠ— ๐‘“๐‘„(๐‘ก) (3) where๐‘ ๐ผ(๐‘ก)and๐‘ ๐‘„(๐‘ก)are up-sampledIandQcomponents, respectively andโŠ—indicates the time domain convolution. Finally,s (t)is scaled and a direct current (DC)-bias is added to it which makes it nonnegative within the dynamic range of the LED prior to IM of the light source as follow [25]:

๐ผ๐‘–๐‘›(๐‘ก) =๐ผ๐ท๐ถ+๐›ผ๐‘ (๐‘ก), (4)

where๐›ผis the scaling factor and defined as: ๐›ผ= ๐‘€ ๐ผร—๐ผmax

(๐‘€ ๐ผ+ 1) ร— max{๐‘ (๐‘ก)} (5)

where๐ผmax=๐ผDC+ 0.5๐ผPPis the maximum value of๐ผin,๐ผDC is the DC- bias current,๐ผPPis the peak-to-peak current, and๐‘€ ๐ผis the modulation index, which is defined as:

๐‘€ ๐ผ=๐ผmaxโˆ’๐ผDC

๐ผDC (6)

The Hammerstein method is adopted to model LED characteristics in this study. It includes a nonlinear function and a linear component to represent the LED with the limited bandwidth, which are characterized as a third-order polynomial and a first-order low pass Butterworth filter, respectively, as given by [25]:

๐‘“NL(๐‘ฅ) =

โŽงโŽช

โŽจโŽช

โŽฉ

0.1947, ๐‘ฅ <0.1

0.2855๐‘ฅ3โˆ’ 1.0886๐‘ฅ2+ 2.0565๐‘ฅโˆ’ 0.0003, 0.1โ‰ค๐‘ฅ <1 1.2531, ๐‘ฅโ‰ฅ1

(7)

โ„ŽLED(๐‘ก) =๐‘’โˆ’2๐œ‹๐‘“LED๐‘ก (8)

where๐‘“๐ฟ๐ธ๐ทis the 3-dB cut-off frequency of the LED. The LED transmit output power is defined as:

๐‘ƒout(๐‘ก) =๐‘“NL( ๐ผin(๐‘ก))

โŠ— โ„ŽLED(๐‘ก) (9)

(3)

the Tx and the Rx, respectively.๐‘‡๐‘ (๐œ“)and๐‘”(๐œ“)are the optical filter and non-imaging concentrator gain, respectively. For the NLOS paths, every wall is divided into๐ฟ๐‘…small squares, which can be assumed as both the Tx and Rx.โ„Ž(

๐‘ก, ๐‘ , ๐œ–๐‘›๐‘Ÿ)

represents the NLOS impulse response between the main Tx and the reflecting element๐œ–๐‘›๐‘Ÿ, which acts as a Rx.

โ„Ž( ๐‘ก, ๐œ–๐‘›๐‘†, ๐ท)

is the impulse response between๐œ–๐‘›๐‘† (i.e., acting as a Tx) and the main Rx,๐ท. It is worth to mention that, only a single reflection per wall is considered in this study and other higher order reflections are ignored since they contribute very little to the total received optical power [24]

2.3. The Rx

At the Rx side, the received signal is detected by an optical Rx composed of a single PD and a trans-impedance amplifier (TIA). The regenerated electrical received signal is given by:

๐‘ฆ(๐‘ก) =๐‘ (๐‘ก) โˆ—โ„Ž๐‘(๐‘ก) +๐‘›(๐‘ก) (13)

where๐‘›(๐‘ก)is the additive white Gaussian noise with the power๐‘ƒ๐‘› = ๐‘0๐ตRx, ๐‘0 is the noise power spectral density, and๐ตRx is the band- width of the Rx. Note,๐‘›(๐‘ก)is mostly dominated by the ambient light induced shot noise. Following DC removal, a DL-based EQ is employed, which is explained in Subsection D in details. The equalized signal is applied to a set of inverse matched filters ๐‘”๐ผ(๐‘ก) = ๐‘“๐ผ(โˆ’๐‘ก) and ๐‘”๐‘„(๐‘ก) = ๐‘“๐‘„(โˆ’๐‘ก) to split theI andQ components of the CAP signal, respectively. Next, the detached signals are down-sampled prior to QAM demodulation.

2.4. DL-based EQ

As mentioned before, DL is a powerful subfield of ML, which em- ploys DNN to solve complex problems. DNN is a class of ANNs with two or more hidden layers of nodes [27], see Fig. 2. As shown, it consists of four layers including the input layer, two hidden layers, and output layer. The number of nodes, shown in circles, 120, 300, and 400 for both the input and output layer, and the 1st and 2nd hidden layers, respectively. All layers are fully connected, which means that every node in each layer is connected to all nodes in the next layer. The DL method splits the modeling dataset into training and test sets. The process for the training set is offline and includes two steps: (i) forward; and (ii) backward propagation [28]. First, in forward propagation, a subset of training dataset named minibatch crosses from the input layer to the hidden layers and then the output layer. The mathematical computations of forward propagation in the 1st hidden layer is given by [29]:

๐‘[1]=๐‘Š[1]๐‘‹+๐‘[1] (14)

๐ด[1]=๐‘”[1](๐‘[1]) (15)

where[.]is the number of layer,๐‘‹and๐ด[1]are the input and output of the 1st layer, respectively,๐‘Š[1],๐‘[1]and๐‘”[1](.)are the weights, biases,

๐‘‹=

โŽขโŽข

โŽขโŽข

โŽขโŽข

โŽขโŽข

โŽขโŽข

โŽฃ

1 โ€ฆ 1

๐‘ฅ(1)[1]

2 โ€ฆ๐‘ฅ(128)[1]

2

. . . ๐‘ฅ(1)[1]

120 โ€ฆ๐‘ฅ(128)[1]

120

โŽฅโŽฅ

โŽฅโŽฅ

โŽฅโŽฅ

โŽฅโŽฅ

โŽฅโŽฅ

โŽฆ(๐‘›[0]ร—๐‘)=(120ร—128)

(17)

๐‘[1]=

โŽกโŽข

โŽขโŽข

โŽขโŽข

โŽขโŽข

โŽขโŽข

โŽขโŽข

โŽขโŽฃ ๐‘(1)[1]

1 โ€ฆ๐‘(128)[1]

1

๐‘(1)[1]

2 โ€ฆ๐‘(128)[1]

2

๐‘(1)[1]

3 โ€ฆ๐‘(128)[1]

3

. . . ๐‘(1)[1]

300 โ€ฆ๐‘(128)[1]

300

โŽคโŽฅ

โŽฅโŽฅ

โŽฅโŽฅ

โŽฅโŽฅ

โŽฅโŽฅ

โŽฅโŽฅ

โŽฅโŽฆ(๐‘›[1]ร—๐‘)=(300ร—128)

(18)

๐‘[1]=

โŽกโŽข

โŽขโŽข

โŽขโŽข

โŽขโŽข

โŽขโŽข

โŽขโŽข

โŽขโŽฃ ๐‘ง(1)[1]

1 โ€ฆ๐‘ง(128)[1]

1

๐‘ง(1)[1]

2 โ€ฆ๐‘ง(128)[1]

2

๐‘ง(1)[1]

3 โ€ฆ๐‘ง(128)[1]

3

. . . ๐‘ง(1)[1]

300 โ€ฆ๐‘ง(128)[1]

300

โŽคโŽฅ

โŽฅโŽฅ

โŽฅโŽฅ

โŽฅโŽฅ

โŽฅโŽฅ

โŽฅโŽฅ

โŽฅโŽฆ(๐‘›[1]ร—๐‘)=(300ร—128)

(19)

๐ด[1]=

โŽกโŽข

โŽขโŽข

โŽขโŽข

โŽขโŽข

โŽขโŽข

โŽขโŽข

โŽขโŽฃ ๐‘Ž(1)[1]

1 โ€ฆ๐‘Ž(128)[1]

1

๐‘Ž(1)[1]2 โ€ฆ๐‘Ž(128)[1]2 ๐‘Ž(1)[1]3 โ€ฆ๐‘Ž(128)[1]3

. . . ๐‘Ž(1)[1]

300 โ€ฆ๐‘Ž(128)[1]

300

โŽคโŽฅ

โŽฅโŽฅ

โŽฅโŽฅ

โŽฅโŽฅ

โŽฅโŽฅ

โŽฅโŽฅ

โŽฅโŽฆ(๐‘›[1]ร—๐‘)=(300ร—128)

(20)

where๐‘›[๐“] is the number of nodes in๐“thlayer. The matrix dimension for๐‘Š[๐“],๐‘[๐“],๐‘[๐“], and๐ด[๐“]are(

๐‘›[๐“]ร—๐‘›[๐“โˆ’1])

,(๐‘›[๐“]ร—๐‘),(๐‘›[๐“]ร—๐‘), and(๐‘›[๐“]ร—๐‘), respectively, where๐‘ is the size of minibatch, which is assumed to be 128 in this work. Generally, the forward propagation process for all layers is given as:

๐‘[๐“]=๐‘Š[๐“]๐ด[๐“โˆ’1]+๐‘[๐“] (21)

๐ด[๐“]=๐‘”[๐“](๐‘[๐“]) (22)

where๐ด[๐“โˆ’1]is the input of the๐“th layer.

(4)

Fig. 2. The structure of the proposed DL-based post-EQ.

Fig. 3. The BER performance against (๐ธbโˆ•๐‘0)testwith the DL-based EQ for a range of (๐ธbโˆ•๐‘0)trainfor 128 QAM and signal BW of 5 MHz.

The rectified linear unit (ReLU) and Sigmoid values are used for the intermediate and output layers, respectively, which are given as [28]:

ReLU (๐‘ฅ) =

{๐‘ฅ, ๐‘ฅ >0

0, ๐‘ฅโ‰ค0, (23)

Sigmoid= 1

1 + exp(โˆ’๐‘ฅ) (24)

There are two types of learning process for ML algorithms: su- pervised and unsupervised. In the former, which is used here, the network is trained based on the input/output pairs known as the labeled data [27]. The labeled data (๐‘ฅ, ๐‘ฆ) is shown as portsAandB inFig. 1. PortA, i.e., the input to DNN, is the received signal with the LED and channel induced impairment, and portBis the desired CAP signal, which is used as the prediction target.

Accordingly, a loss functionL is defined to determine the error, which indicates the difference between the outputs of DNN and the correct answers.Lis expressed by various metrics [30], and here we have used the mean square error (MSE), which is defined as:

MSE(๐‘Ž, ๐‘ฆ) = 1 ๐‘

โˆ‘๐‘

๐‘–=1

(๐‘Ž๐‘–โˆ’๐‘ฆ๐‘–)2, (25)

where๐‘Žis the estimated real output of DNN, and๐‘ฆis the desired labeled value.

In the backward propagation phase, the main aim of DL is to find the correct weights and biases parameters of (๐‘ค,๐‘), respectively, for minimizing L using different optimization algorithms. Stochastic gradient descent (SGD) [31] is the most popular optimization algorithm in DL, which is also adopted in this study. First,๐‘คand๐‘are initialized to a random values and the gradients of differentiable๐ฟare calculated based on๐‘ค and ๐‘using the chain rule. Here the gradients of cross entropy function are calculated as a useful example. Note, we have the followings:

๐‘‘๐ฟ

๐‘‘๐‘ฅ=๐‘‘๐‘ฅ, (26)

๐‘‘๐‘Ž[2]= ๐‘‘๐ฟ ๐‘‘๐‘Ž[2] = โˆ’๐‘ฆ

๐‘Ž[2]+ 1 โˆ’๐‘ฆ

1 โˆ’๐‘Ž[2], (27)

๐‘‘๐‘ง[2]= ๐‘‘๐ฟ ๐‘‘๐‘ง[2] = ๐‘‘๐ฟ

๐‘‘๐‘Ž[2]ร—๐‘‘๐‘Ž[2]

๐‘‘๐‘ง[2] =๐‘‘๐‘Ž[2]ร—๐‘”โ€ฒ( ๐‘ง[2])

= (โˆ’๐‘ฆ

๐‘Ž[2]+ 1 โˆ’๐‘ฆ 1 โˆ’๐‘Ž[2]

)

ร—( ๐‘Ž[2](

1 โˆ’๐‘Ž[2]))

=๐‘Ž[2]โˆ’๐‘ฆ, (28)

๐‘‘๐‘ค[2]= ๐‘‘๐ฟ ๐‘‘๐‘ค[2] = ๐‘‘๐ฟ

๐‘‘๐‘Ž[2] ร— ๐‘‘๐‘Ž[2]

๐‘‘๐‘ค[2]

= (โˆ’๐‘ฆ

๐‘Ž[2] + 1 โˆ’๐‘ฆ 1 โˆ’๐‘Ž[2]

)

ร—( ๐‘Ž[2](

1 โˆ’๐‘Ž[2])

ร—๐‘Ž[1]๐‘‡)

=( ๐‘Ž[2]โˆ’๐‘ฆ)

ร—๐‘Ž[1]๐‘‡=๐‘‘๐‘ง[2]ร—๐‘Ž[1]๐‘‡, (29)

๐‘‘๐‘[2]= ๐‘‘๐ฟ ๐‘‘๐‘[2] = ๐‘‘๐ฟ

๐‘‘๐‘Ž[2]ร—๐‘‘๐‘Ž[2]

๐‘‘๐‘[2] =๐‘‘๐‘ง[2]=๐‘Ž[2]โˆ’๐‘ฆ, (30)

(5)

Fig. 4. Flowchart of training and test process.

Fig. 5. BER performance of the proposed system for 64- and 128-QAM for a range of transmit signal BW.

Generally, the equation for the backward propagation for๐‘training example is given as:

๐‘‘๐‘[๐“]=๐‘‘๐ด[๐“]ร—๐‘”โ€ฒ[๐“](๐‘[๐“]), (31) ๐‘‘๐‘Š[๐“]= 1

๐‘ ร—๐‘‘๐‘[๐“]ร—๐ด[๐“โˆ’1]๐‘‡, (32)

๐‘‘๐‘[๐“]= 1

๐‘ ร—๐‘‘๐‘[๐“], (33)

๐‘‘๐ด[๐“โˆ’1]=๐‘Š[๐“]๐‘‡ร—๐‘‘๐‘[๐“], (34)

where the dimension of matrices๐‘‘๐‘Š[๐“],๐‘‘๐‘[๐“],๐‘‘๐‘[๐“], and๐‘‘๐ด[๐“โˆ’1]equal to(๐‘›[๐“]ร—๐‘›[๐“โˆ’1]),(๐‘›[๐“]ร—๐‘),(๐‘›[๐“]ร—๐‘), and(๐‘›[๐“]ร—๐‘), respectively. The updated and optimized weights and biases in each layer are given as:

๐‘คโˆถ =๐‘คโˆ’๐›ผ๐‘‘๐‘ค, (35)

๐‘โˆถ =๐‘โˆ’๐›ผ๐‘‘๐‘, (36)

where,๐›ผ is the learning rate. The backward propagation algorithm is repeated for๐‘šโˆ•๐‘times to complete one epoch, where๐‘šis the number of complete training features for each node in the input layer. An epoch is completed when all dataset is applied to the DL algorithm. Although determining the number of epochs is different due to the convergence of algorithm, but it is considered to 100 in this study.

As shown inFig. 2, prior to the use of activation function, the batch normalization method is applied, which speeds up the optimization process to provide higher๐›ผ. Note, (i) according to [32], the demand for the regularization technique is reduced noticeably; and (ii) the adaptive moment estimation (Adam) optimizer is employed due to faster conver- gence. More detailed information can be found in [33]. The pseudocode for the utilized learning algorithm is explained in Algorithm 1. To avoid overfitting and make the model more generalized, the dataset is divided into two sets of training and validation [34]. In the validation set, the hyperparameters can be changed and tuned to minimize the error in the produced model, which is fitted on the training dataset. In the test

(6)

Table 1 DNN parameters.

Symbol Parameter Value

๐‘›[0] Number of nodes for input layer 120

โ€“ Number of hidden layers 2

๐‘›[1] Number of nodes for 1st hidden layer 300 ๐‘›[2] Number of nodes for 2nd hidden layer 400

๐‘›[3] Number of nodes for output layer 120

๐‘”(.) Activation function for hidden layers ReLU ๐‘”(.) Activation function for output layer Sigmoid

โ€“ Number of Batch normalization layers 2

โ€“ Optimizer SGD with Adam

โ€“ (๐ธbโˆ•๐‘0)train 5, 10, 15, 20 dB

โ€“ Default (optimum) (๐ธbโˆ•๐‘0)train 10 dB

L Loss function MSE

๐›ผ Learning rate 0.01

๐‘ Size of minibatch 128

๐‘š Number of training samples for each node 106

โ€“ Size of training dataset 120ร—106

โ€“ Size of test dataset 30000

โ€“ Number of epoch 100

Table 2 System parameters.

Symbol Parameter Value

โ€“ Room size 5ร—5ร—3 m3

๐œ™1โˆ•2 Semi-power half angle 70โ—ฆ

๐œ“๐‘ PD field of view 60โ—ฆ

๐ด PD detector area 1 cm2

๐‘…PD Responsivity of PD 0.54 A/W

๐‘‡๐‘ (๐œ“) Optical filter gain 1 ๐ต๐‘š LED modulation bandwidth 4.5 MHz ๐›ฝ Roll of factor ofI/Qfilters 0.5

๐‘›๐‘  Up-sampling factor 6

๐ฟ๐‘“ I/Qfilter length 10 sym.

๐ต๐‘ก Total bandwidth 5โ€“25 MHz

โ€“ Modulation type QAM

๐‘€ Modulation order 64 & 128

๐บTIA TIA gain 50โ€“60 dB

โ€“ Tx position Center of ceiling

โ€“ Rx position LOS scenario: Center of floor

NLOS scenario: (1.15, 1.15, 0.85) m

phase, which is done online, the new and never-before-seen received CAP signals are used as the input dataset in the DNN EQs. The trained network attempts to recover them and minimize the error based on the gained knowledge. All the parameters for DNN are summarized in Table 1.

3. Result and discussion

In this section, we investigate the performance of the proposed CAP-VLC system with a DL-based post-EQ by means of computer simu- lations. MATLAB and Python are used to simulate VLC subsystem and the DL-based EQ, respectively. All key system parameters adopted are summarized inTable 2. To train the network, we have used a range of energy per bit to noise ratios (๐ธbโˆ•๐‘0)train. Due to the computation time, we considered only (๐ธbโˆ•๐‘0)train values of 5, 10, 15, and 20 dB.

Fig. 3depicts the BER performance against the (๐ธbโˆ•๐‘0)testfor a range of (๐ธbโˆ•๐‘0)trainvalues, 128-QAM and a signal bandwidth of 5 MHz. It is shown that, the best (optimum) BER performance is achieved for (๐ธbโˆ•๐‘0)trainof 10 dB, with the (๐ธbโˆ•๐‘0)testgains of 0.2, 0.8, and 5.7 dB compared with the (๐ธbโˆ•๐‘0)trainvalues of 15, 20 and 5 dB, respectively, at the 7% forward error correlation (FEC) BER limit of 3.8 ร—10โˆ’3. Considering the trade-off between the simulation time and estimation of the BER, the coefficients of the DNN network corresponding to the optimum (๐ธbโˆ•๐‘0)train of 10 dB is adopted for the rest of evaluation during the test phase.Fig. 4 illustrates the flowchart of training and test processes.

Fig. 5 compares the BER performance of the proposed system for 64- and 128-QAM for a range of transmit signal BW. Referring to the

7% FEC BER limit, 128-QAM with a signal BW of 5 MHz offers the best performance with the (๐ธbโˆ•๐‘0)testgains of 1, 1.8, 6, and 6 dB compared with 10, 15, and 20 MHz 128-QAM, and 25 MHz 64-QAM, respectively.

๐‘…๐‘‘of 93.4 Mb/s and the spectral efficiency of 4.67 b/s/Hz are attained for 20 MHz 128-QAM with respect to the 7% FEC BER limit. To achieve higher๐‘…๐‘‘, the proposed system is also examined for 64-QAM and a signal BW of 25 MHz. Results demonstrate that๐‘…๐‘‘is improved by up toโˆผ6 Mb/s reaching 100 Mb/s with respect to the 7% FEC BER limit.

At the Rx side, the small value of peak-to-peak injection current๐ผpp leads to lower๐ธbโˆ•๐‘0 deterioration, which degrades the BER perfor- mance. Furthermore, large values of ๐ผpp may exceed the LED linear range, thus resulting in nonlinear distortions.Fig. 6depicts the BER performance against๐ผpp for (๐ธbโˆ•๐‘0)train = 10dB,๐‘€ = 64and BW of 25 MHz, respectively. As can be seen, the lowest BER ofโˆผ2.26ร—10โˆ’3 is achieved at 0.7โ‰ค๐ผppofโ‰ค0.8 A. Therefore,๐ผppof 0.8 A is considered as the default value in this study.

Fig. 7(a) depicts the BER performance of the proposed system with 64-QAM and a BW of 25 MHz for three different post-EQ schemes. It is clearly seen that, at the 7% FEC BER limit DL offers (๐ธbโˆ•๐‘0)testgains 2 and 8 dB compared with Volterra and LMS EQs, respectively, which can be used to trade-off transmission range against๐‘…๐‘‘. Note, reducing the bandwidth results in improved BER performance as shown inFig. 7(b) with the DL-based EQ outperforming LMS and Volterra. For instance, at the BER of 3.8ร—10โˆ’3, the (๐ธbโˆ•๐‘0)testpenalties areโˆผ6 and 10 dB for Volterra and LMS, respectively compared with the DL method.

Next, we evaluated the BER performance of LOS and LOS + NLOS 64-QAM VLC links with a signal BW of 25 MHz. As can be seen in Fig. 8, the overall BER performance of pure LOS and LOS + NLOS links are almost the same.

Finally, the complexity of the DL-based EQ is compared with LMS and the 2nd order Volterra series EQ as summarized inTable 3[35].

As it is expected, the computational complexity of the DL-based EQ is approximately 104 times larger than other two EQs. However, in this work the proposed system with only two hidden layers is considered, which is a simple DNN using DL technique, while more complex net- works can be implemented using field programmable gate array (FPGA) boards. As an example, in [36] a DNN with two hidden layers was implemented using FPGA (Xilinx Zynq UltraScale) with the complexity of about 182007, thus demonstrating the potential use of FPGA in DL- based EQs in wireless communications. Note, ๐‘€๐ฟ and๐‘€NL are the linear and nonlinear memory depths, respectively.๐‘€1, ๐ป1,๐ป2, and ๐ป3are the number of nodes in input, first hidden layer, second hidden layer, and output layer, respectively.

4. Conclusion

In this paper, we introduced a deep learning-based post-EQ for

โ€˜โ€˜singleโ€™โ€™ CAP-VLC system considering both the LEDโ€™s bandwidth lim- itation and its nonlinearity effects. The results showed that, the highest data rate of about 100 Mb/s is achieved for 64-QAM, the signal bandwidth of 25 MHz and the LED modulation bandwidth of 4.5 MHz (i.e., standard white LED). Finally, we compared the performance of the proposed scheme with two common EQs of LMS and Volterra series, demonstrating improved BER results.

Algorithm 1. Pseudo-code of DNN training algorithm Input:Number of layers๐ฟ, number of neurons for all layers, activation functions, loss function, number of epochs๐ธ, number of transmitted batches in one epoch๐‘, number of training example in each minibatch๐‘š, learning rate๐›ผ= 0.01,๐›ฝ1= 0.9,๐›ฝ2= 0.999, ๐œ‚= 10โˆ’8

Output:A trained DNN

1:Initializeweights and biases for all layers randomly 2: forepoch = 1 to๐ธdo

3: for๐‘ก= 1to๐‘do

(7)

Fig. 6. The BER against๐ผppfor the 64-QAM CAP-VLC system with the DL-based EQ.

Fig. 7.The BER performance against (๐ธbโˆ•๐‘0)testfor three different EQs schemes for the signal bandwidth of: (a) 25 MHz, and (b) 20 MH.

Table 3

Comparison of complexity for three different EQs.

Type of EQs ๐‘€๐ฟ ๐‘€๐‘ ๐ฟ ๐‘€1 ๐ป1 ๐ป2 ๐ป3 Complexity Value of complexity

DL-based โ€“ โ€“ 120 300 400 120 ๐‘€1ร—๐ป1+๐ป1ร—๐ป2+๐ป3 156400

LMS 6 โ€“ โ€“ โ€“ โ€“ โ€“ ๐‘€๐ฟ 6

Volterra series 6 6 โ€“ โ€“ โ€“ โ€“ ๐‘€๐ฟ+๐‘€๐‘ ๐ฟ(๐‘€๐‘ ๐ฟ+ 1)โˆ•2 27

Algorithm 1. Pseudo-code of DNN training algorithm 4: for๐“= 2 to๐ฟdo

5: doBatch normalization as follow:

6:๐œ‡๐›ฝโ† 1

๐‘š

โˆ‘๐‘š ๐‘–=1๐‘ฅ๐‘– 7:๐œŽ๐›ฝ2โ† 1

๐‘š

โˆ‘๐‘š

๐‘–=1(๐‘ฅ๐‘–โˆ’๐œ‡๐›ฝ)2 8:๐‘ฅโˆง๐‘– โ†๐‘ฅ๐‘–-โˆš๐œ‡๐›ฝ

๐œŽ2๐›ฝ+๐œ–

9:๐‘ ๐‘–โ†๐›พ ๐‘ฅโˆง๐‘– +๐›ฝ

10: doforward propagation 11:end for

12: calculateloss function 13: for๐“= 2to L

14: dobackward propagation

15: initialize๐‘‰๐‘‘๐‘ค,๐‘‰๐‘‘๐‘,๐‘†๐‘‘๐‘ค,๐‘†๐‘‘๐‘to zero 16:๐‘‰๐‘‘๐‘ค=๐›ฝ1๐‘‰๐‘‘๐‘ค+(

1 โˆ’๐›ฝ1) ๐‘‘๐‘ค 17:๐‘‰๐‘‘๐‘=๐›ฝ1๐‘‰๐‘‘๐‘+(

1 โˆ’๐›ฝ1) ๐‘‘๐‘

Algorithm 1. Pseudo-code of DNN training algorithm 18:๐‘†๐‘‘๐‘ค=๐›ฝ2๐‘†๐‘‘๐‘ค+(

1 โˆ’๐›ฝ2) ๐‘‘๐‘ค2 19:๐‘†๐‘‘๐‘=๐›ฝ2๐‘†๐‘‘๐‘+(

1 โˆ’๐›ฝ2) ๐‘‘๐‘2 20:๐‘‰๐ถ๐‘œ๐‘Ÿ๐‘Ÿ๐‘’๐‘๐‘ก๐‘’๐‘‘

๐‘‘๐‘ค = Vdw

1โˆ’๐›ฝ1๐‘ก

21:๐‘‰๐ถ๐‘œ๐‘Ÿ๐‘Ÿ๐‘’๐‘๐‘ก๐‘’๐‘‘

๐‘‘๐‘ =1โˆ’๐›ฝVdb๐‘ก 1

22:๐‘†๐ถ๐‘œ๐‘Ÿ๐‘Ÿ๐‘’๐‘๐‘ก๐‘’๐‘‘

๐‘‘๐‘ค = Sdw

1โˆ’๐›ฝ๐‘ก 2

23:๐‘†๐‘‘๐‘๐ถ๐‘œ๐‘Ÿ๐‘Ÿ๐‘’๐‘๐‘ก๐‘’๐‘‘= Sdb

1โˆ’๐›ฝ๐‘ก 2

24: updateall weights and biases as follow:

25:๐‘ค=๐‘คโˆ’๐›ผ ๐‘‰

๐ถ๐‘œ๐‘Ÿ๐‘Ÿ๐‘’๐‘๐‘ก๐‘’๐‘‘

โˆš๐‘‘๐‘ค ๐‘†๐ถ๐‘œ๐‘Ÿ๐‘Ÿ๐‘’๐‘๐‘ก๐‘’๐‘‘๐‘‘๐‘ค +๐œ€

26:๐‘=๐‘โˆ’๐›ผ ๐‘‰

๐ถ๐‘œ๐‘Ÿ๐‘Ÿ๐‘’๐‘๐‘ก๐‘’๐‘‘

โˆš๐‘‘๐‘ ๐‘†๐ถ๐‘œ๐‘Ÿ๐‘Ÿ๐‘’๐‘๐‘ก๐‘’๐‘‘๐‘‘๐‘ +๐œ€

27:end for 28:end for 29:end for

(8)

Fig. 8. The BER performance against (๐ธbโˆ•๐‘0)testfor LOS and LOS + NLOS 64-QAM VLC systems.

Declaration of competing interest

The authors declare that they have no known competing finan- cial interests or personal relationships that could have appeared to influence the work reported in this paper.

Data availability

No data was used for the research described in the article.

Acknowledgments

The authors would like to thank anonymous reviewers for their helpful comments.

Funding

We have not received any funding support.

References

[1] Harald Haas, Jaafar Elmirghani, Ian White, Optical wireless communication, Phil.

Trans. R. Soc. (2020)http://dx.doi.org/10.1098/rsta.2020.0051.

[2] Galefang Allycan Mapunda, Reuben Ramogomana, Leatile Marata, Bokamoso Basutli, Amjad Saeed Khan, Joseph Monamati Chuma, Indoor visible light communication: A tutorial and survey, hindawi, Wirel. Commun. Mob. Comput.

(2020)http://dx.doi.org/10.1155/2020/8881305.

[3] Carlos Medina, Mayteรฉ Zambrano, Kiara Navarro, LED based visible light communication: technology, applications and challenges โ€“ a survey, Int. J. Adv.

Eng. Technol. 8 (4) (2015) 482โ€“495,http://dx.doi.org/10.7323/ijaet/v8iss4.

[4] A. Burton, Z. Ghassemlooy, S. Zvanovec, P.A. Haigh, H.L. Minh, X. Li, Investiga- tion into using compensation for the nonlinear effects of the output of LEDs in visible light communication systems, in: 2nd West Asian Colloquium on Optical Wireless Communications, WACOWC, Tehran, Iran, 2019, pp. 80โ€“84.

[5] Toshihiko Komine, Jun Hwan Lee, Shinichiro Haruyama, Masao Nakagawa, Adaptive equalization system for visible light wireless communication utilizing multiple white LED lighting equipment, IEEE Trans. Wirel. Commun. 8 (6) (2009) 2892โ€“2900,http://dx.doi.org/10.1109/TWC.2009.060258.

[6] Grzegorz Stepniak, Jerzy Siuzdak, Piotr Zwierko, Compensation of a VLC phosphorescent white LED nonlinearity by means of volterra DFE, IEEE Photonics Technol. Lett. 25 (16) (2013) 1597โ€“1600,http://dx.doi.org/10.1109/LPT.2013.

2272511.

[7] P.A. Haigh, Z. Ghassemlooy, S. Rajbhandari, I. Papakonstantinou, W. Popoola, Visible light communications: 170 Mb/s using an artificial neural network equalizer in a low bandwidth white light configuration, J. Lightwave Technol.

32 (9) (2014) 1807โ€“1813,http://dx.doi.org/10.1109/JLT.2014.2314635.

[8] Shaveta Dargan, Munish Kumar, Maruthi Rohit Ayyagari, Gulshan Kumar, A survey of deep learning and its applications: a new paradigm to machine learning, Arch. Comput. Methods Eng. 27 (2020) 1071โ€“1092,http://dx.doi.org/

10.1007/s11831-019-09344-w.

[9] Shi Dong, Ping Wang, Khushnood Abbas, A survey on deep learning and its applications, Comp. Sci. Rev. 40 (2021)http://dx.doi.org/10.1016/j.cosrev.2021.

100379.

[10] Feifan Liao, Shengyun Wei, Shun Zou, Deep learning methods in communication systems: a review, in: 2nd International Conference on Electronic Engineering and Informatics, IOP Publishing, 2020,http://dx.doi.org/10.1088/1742-6596/

1617/1/012024.

[11] Hoon Lee, Sang Hyun Lee, Tony Q.S. Quek, Inkyu Lee, Deep learning framework for wireless systems: applications to optical wireless communications, IEEE Commun. Mag. 57 (3) (2019) 35โ€“41, http://dx.doi.org/10.1109/MCOM.2019.

1800584.

[12] Tim Oโ€™Shea, Jakob Hoydis, An introduction to deep learning for the physical layer, IEEE Trans. Cogn. Commun. Netw. 3 (4) (2017) 563โ€“575,http://dx.doi.

org/10.1109/TCCN.2017.2758370.

[13] Zhijin Qin, Hao Ye, Geoffrey Ye Li, Biing-Hwang Juang, Deep learning in physical layer communications, IEEE Wirel. Commun. 26 (2) (2019) 93โ€“99, http://dx.doi.org/10.1109/MWC.2019.1800601.

[14] Hao Ye, Geoffrey Ye Li, Biing-Hwang Juang, Power of deep learning for channel estimation and signal detection in OFDM systems, IEEE Wirel. Commun. Lett. 7 (1) (2018) 114โ€“117,http://dx.doi.org/10.1109/LWC.2017.2757490.

[15] Nan Chi, Junlian Jia, Fangchen Hu, Yiheng Zhao, Peng Zou, Challenges and prospects of machine learning in visible light communication, J. Commun. Inf.

Netw. 5 (3) (2020) 302โ€“309,http://dx.doi.org/10.23919/JCIN.2020.9200893.

[16] Mehmet Gorkem Ulkar, Tuncer Baykas, Ali E. Pusane, VLCnet: deep learning based end-to-end visible light communication system, J. Lightwave Technol. 38 (21) (2020) 5937โ€“5948,http://dx.doi.org/10.1109/JLT.2020.3006827.

[17] Pu Miao, Pu Miao, Chenhao Qi, Yi Jin, Chong Lin, A model-driven deep learning method for LED nonlinearity mitigation in OFDM-based optical communications, IEEE Access 7 (2019) 71436โ€“71446, http://dx.doi.org/10.1109/ACCESS.2019.

2919983.

[18] Yonghe Zhu, Chen Gong, Jianghua Luo, Meiyu Jin, Xianqing Jin, Zhengyuan Xu, Indoor non-line of sight visible light communication with a bi-LSTM neural network, in: IEEE International Conference on Communications Workshops (ICC Workshops), 2020, http://dx.doi.org/10.1109/ICCWorkshops49005.2020.

9145317.

[19] C. Chen, L. Zeng, X. Zhong, S. Fu, M. Liu, P. Du, Deep learning-aided OFDM- based generalized optical quadrature spatial modulation, IEEE Photonics J. 14 (1) (2022) 1โ€“6,http://dx.doi.org/10.1109/JPHOT.2021.3129541.

[20] Mahdi Nassiri, Saeed Ashouri, Gholamreza Baghersalimi, Zabih Ghassemlooy, A comparative performance assessment between pre-equalization and post equal- ization techniques in m-CAP indoor VLC systems with bit loading, in: 1st West Asian Colloquium on Optical Wireless Communications, WACOWC2018.

[21] Zhaocheng Wang, Tianqi Mao, Qi Wang, Optical OFDM for visible light com- munications, in: IEEE, 13th International Wireless Communications and Mobile Computing Conference, IWCMC, 2017,http://dx.doi.org/10.1109/IWCMC.2017.

7986454.

[22] Mahdi Nassiri, Atiyeh Pouralizadeh, Gholamreza Baghersalimi, Zabih Ghassem- looy, A hybrid variable m-CAP based indoor visible light communications and fingerprint positioning system, in: 3rd West Asian Colloquium on Optical Wireless Communications, WACOWC, 2020, http://dx.doi.org/10.1109/WASOWC49739.

2020.9410163.

[23] N. Bamiedakis, R.V. Penty, I.H. White, Carrier-less amplitude and phase modula- tion in wireless visible light communication systems, Phil. Trans. R. Soc. (2019) http://dx.doi.org/10.1098/rsta.2019.0181.

[24] Z. Ghassemlooy, W. Popoola, S. Rajbhandari, Optical Wireless Communications:

System and Channel Modelling with Matlabยฎ, CRC Press, 2019.

[25] M. Nassiri, G. Baghersalimi, Z. Ghassemlooy, Optical OFDM based on fractional fourier transform for an indoor VLC system, Appl. Opt. 60 (9) (2021) 2664โ€“2671, http://dx.doi.org/10.1364/AO.416565.

[26] M. Nassiri, S. Ashouri, G. Baghersalimi, Z. Ghassemlooy, Performance evaluation and comparison of spatial modulation and non-DC MIMO m-CAP techniques in indoor VLC system, IET Optoelectron. (2019) 281โ€“287,http://dx.doi.org/10.

1049/iet-opt.2018.5076.

[27] Md Zahangir Alom, Tarek M. Taha, Chris Yakopcic, Stefan Westberg, Pa- heding Sidike, Mst Shamima Nasrin, Mahmudul Hasan, Brian C. Van Essen, Abdul A.S. Awwal, Abdul A.S. Awwal, A state-of-the-art survey on deep learning theory and architectures, Electronics 8 (3) (2019) http://dx.doi.org/10.3390/

electronics8030292.

[28] Priti G. Pachpande, Monette H. Khadr, Ahmed F. Hussein, Hany Elgala, Visible light communication using deep learning techniques, in: IEEE 39th Sarnoff Symposium, 2018,http://dx.doi.org/10.1109/SARNOF.2018.8720493.

[29] Ian Goodfellow, Yoshua Bengio, Aaron Courville, Deep Learning, MIT Press, 2016.

[30] Qi Wang, Yue Ma, Kun Zhao, Yingjie Tian, A comprehensive survey of loss function in machine learning, J. Ann. Data Sci. (2020) http://dx.doi.org/10.

1007/s40745-020-00253-5.

(9)

Riferimenti

Documenti correlati