The Utilization of Artificial Neural Network Equalizer in Optical Camera Communications
Article

The Utilization of Artificial Neural Network Equalizer in Optical Camera Communications †

1
Optical Communications Research Group, Faculty of Engineering and Environment, Northumbria University, Newcastle upon Tyne NE1 8ST, UK
2
Institute of Photonics, University of Strathclyde, Glasgow G1 1XQ, UK
3
Department of Electromagnetic Field, Faculty of Electrical Engineering, Czech Technical University in Prague, 16627 Prague, Czech Republic
4
Instituto de Telecomunicações and Departamento de Electrónica, Telecomunicações e Informática, Universidade de Aveiro, 3810-193 Aveiro, Portugal
*
Author to whom correspondence should be addressed.
† This paper is an extended version of our paper published in Younus, O.I.; Hassan, N.B.; Ghassemlooy, Z.; Zvanovec, S.; Alves, L.N.; Haigh, P.A.; Minh, H.L. An Artificial Neural Network Equalizer for Constant Power 4-PAM in Optical Camera Communications. In Proceedings of the 2020 12th International Symposium on Communication Systems, Networks and Digital Signal Processing (CSNDSP), Porto, Portugal, 20–22 July 2020.
Sensors 2021, 21(8), 2826; https://doi.org/10.3390/s21082826
Submission received: 10 March 2021 / Revised: 29 March 2021 / Accepted: 14 April 2021 / Published: 16 April 2021
(This article belongs to the Special Issue Visible Light Communication, Networking, and Sensing)

Abstract

In this paper, we propose and validate an artificial neural network-based equalizer for constant power 4-level pulse amplitude modulation in an optical camera communications system. We introduce a new metric for the quality of the communications link, the number of row pixels per symbol $N_{pps}$, which allows a fair comparison given the progress made in image sensor frame rates and frame resolutions. Using the proposed equalizer, we experimentally demonstrate a non-flickering system using a single light-emitting diode (LED) with $N_{pps}$ of 30 and 20 pixels/symbol for the unequalized and equalized systems, respectively. Potential transmission rates of up to 24.4 and 18.6 kbps are achieved with and without the equalization, respectively. The quality of the received signal is assessed using the eye-diagram opening, the eye linearity, and the bit error rate performance. An acceptable bit error rate (below the forward error correction limit) and an improvement of ~66% in the eye linearity are achieved using a single LED and a typical commercial camera with equalization.

1. Introduction

Optical camera communication (OCC) systems, which are part of optical wireless communications (OWC), leverage off-the-shelf conventional complementary metal-oxide-semiconductor (CMOS) image sensors (ISs) and standard light-emitting diodes (LEDs) as the receiver (Rx) and the transmitter (Tx), respectively.
Camera-based Rxs can capture intensity-modulated light signals from a range of LED light sources (e.g., traffic lights, advertising boards, signage, display screens, vehicle headlights and taillights, and streetlights). OCC technology, together with visible and infrared light transmission, could be used in different low data rate $R_b$ applications, such as the Internet of Things (IoT) (e.g., as part of fifth-generation wireless and beyond), motion capturing [1], intelligent transportation systems [2], indoor localization, security, virtual reality, and advertising [3]. The IS in OCC comprises a plurality of pixels (i.e., photodetectors (PDs)), where the signal strength of each pixel depends on the intensity of the incident light [4]. Each pixel can detect signals at different wavelengths over the visible range, e.g., red, green, and blue (RGB), hence offering parallel detection capabilities and an adaptive field of view (FoV) feature. In addition, the information transmitted from many light sources, directions, and locations via line-of-sight (LOS) [5,6], non-LOS, and/or a combination of both paths [7] can be captured using a single-pixel or a pixel-array IS-based Rx, thus resulting in a higher signal-to-noise ratio, improved mobility, and flexibility over a linkspan of up to hundreds of meters [8].
On the other hand, the IS requires a longer sampling duration and offers a lower number of quantization levels compared with PDs, due to the light integration time (known as the exposure time $T_{exp}$) and the built-in analog-to-digital converter circuit [9]. The sequential-readout nature of the CMOS IS-based Rx allows each pixel row to capture the incident light at a different time, thus resulting in the so-called rolling shutter (RS) effect [9]. Note that the performance of visible light communications (VLC) with an IS-based Rx is limited mainly by the camera capabilities, i.e., the frame rate $R_f$, $T_{exp}$, and FoV. As a result, in OCC, the transmission bandwidth is rather low and limited to a few tens of kHz compared with PD-based VLC systems. A low data rate should not be seen as a problem in itself, since there are many applications where a low $R_b$ is not critical at all (e.g., IoT). However, in OCC, a lower $R_b$ may result in flickering at the Tx [10,11]. In the IEEE 802.15.7m standard [12], different schemes have been proposed for OCC to mitigate flickering and to increase $R_b$ [13]. For example, in [14], an optical orthogonal frequency division multiplexing VLC with a special IS-based Rx with a built-in PD-array was used to achieve a very high $R_b$ of 55 Mbps. However, the fabrication process of the IS was too complex and, therefore, the IS is not commercially available. In [15,16], under-sampled frequency and phase shift on-off keying (OOK) modulation schemes were proposed to mitigate flickering in OCC with low $R_b$. In [17], Manchester coding was proposed to alleviate flickering in the RS mode, where it was shown that link performance in terms of $R_b$ deteriorated with the transmission range [8,17].
Moreover, an OCC link with under-sampled pulse amplitude modulation (PAM) with subcarriers was experimentally demonstrated with $R_b$ increased to 250 bps [18,19]. In addition, a multilevel-intensity modulation scheme for RS-based OCC with a frame rate $R_f$ of 30 fps was proposed in [20] with $R_b$ of 10 kbps over a link range of up to 2 m. Furthermore, a parallel transmission VLC system with color-shift-keying (i.e., RGB-LEDs of different colors) was reported in [21] with an overall $R_b$ of 5.2 kbps. In [22], the concept of parallel transmission was demonstrated over a range of up to 60 m with $R_b$ of 150 bps, whereas a link using a 16 × 16 μLED array and a high-speed camera with $R_f$ of 960 fps, offering $R_b$ of 122.88 kb/s, was reported in [23].
In OCC systems, equalization methods can also be deployed to compensate for spatially and temporally induced dispersion. In [24], an OOK VLC link with a single LED and a camera-based Rx using a dual equalization scheme to compensate for both spatial and temporal dispersion was reported with $R_b$ increased up to 14.37 kb/s. The artificial neural network (ANN) architecture has also been proposed for post-equalization to combat non-linear impairments in OWC [25,26]. The ANN-based equalizer is one of the notable solutions adopted in PD-based OWC, wherein the ANN acts as a universal classifier [27]. In [25], a 170 Mb/s OOK VLC link using an LED with a modulation bandwidth of 4.5 MHz and the ANN-based equalizer at the Tx was reported, where the superiority of ANN equalizers in mitigating intersymbol interference (ISI) was demonstrated compared with other equalization techniques. Note that, in OCC with the ANN-based equalizer, the network needs to be trained only once for a range of $T_{exp}$, with the data being stored in a look-up table within the camera.
In [28], the variable transparent amplitude shape code (VTASC) scheme was experimentally evaluated for device-to-device (D2D) (i.e., smartphone) communications in the form of high-density modulation (HDM) with an ANN-assisted demodulator, and $R_b$ of 2.66 Mbps over a 20 cm long transmission link was achieved. Note that the concept of D2D is one form of the multiple-input multiple-output system, where every pixel is transmitted and detected. Similarly, to allow transmission and reception of information under bad weather conditions, a convolutional neural network (CNN)-based OCC was proposed in [29]. The CNN was used for classification and recognition of LED patterns and to decode the transmitted data streams even under an unclear state, where LED patterns are not visible to the camera due to blocking of the transmission path and/or weather conditions. In [30], an OCC link with an ANN-based decoder was reported to mitigate the gap-time effect between two adjacent frames, where OOK was transmitted using an RGB-LED with $R_b$ of 47 kb/s. In [4], an OCC link using a single LED source and Manchester line code with the non-return-to-zero format was reported with $R_b$ of 14 kb/s.
In this work, the aim is to establish a flicker-free OCC system with improved $R_b$ using a single LED and an ANN-based equalizer. The key contributions extended from our previous work [31] are:
  • Comprehensive and systematic investigation of the applicability of CP-PAM for LED- and camera-based VLC.
  • Development of a practical CP-PAM OCC prototype with a single Luxeon Rebel white LED (SR-01-WC310) and an IS (Thorlabs DCC1645C) as the Tx and the Rx, respectively.
  • Development of an efficient signal extraction algorithm for the RS-based OCC system.
  • Implementation of an ANN-based equalizer at the Rx to enhance the system performance.
  • Development of an experimental test-bed for the proposed system and its evaluation in terms of the Tx’s frequency, eye diagrams, and the bit error rate (BER) with and without the ANN-based equalizer.
  • Proposal of a new metric for assessing the quality of the communications link in terms of the number of row pixels/symbol.
The remainder of the paper is organized as follows. Section 2 introduces the proposed CP-PAM scheme, whereas Section 3 outlines the ANN equalizer model for IS-based OCC. The experimental setup is described in Section 4. Results and discussion are presented in Section 5. Finally, conclusions are given in Section 6.

2. Constant Power-PAM in RS-Based OCC System

The OCC system is mainly composed of a light source-based Tx with a normalized length (diameter) denoted by $L$, and a camera-based Rx, which is modeled using a single convex lens with a focal length $f$. The transmission speed in the RS-based OCC system is defined by the amount of information that can be captured in an image at the distance $d$, which depends on the acquired number of samples (i.e., pixel rows) and is given by [32]:
$$N_{row} = 2f \times \tan\left(\frac{\mathrm{FoV}}{2}\right) = 2f \times \frac{L}{2d},$$
where FoV is the angular field of view.
Note that the acquired $N_{row}$ is tied to the sampling frequency of the IS, known as the rolling rate of the IS, $F_s$ (i.e., the frequency at which the row pixels are sampled at the image plane).
Therefore, the maximum frequency of the transmitted signal is limited to $F_s/2$ according to Nyquist’s theorem. The value of $F_s$ depends on the pixel clock and $T_{exp}$ (i.e., the time that every sample (pixel) of the IS is exposed to the light). Note that $T_{exp}$ acts as a moving-average filter [10,33] with the frequency resolution given by:
$$\Delta f = \frac{1}{T_{exp}} = \frac{F_s}{N_{row}(d)},$$
where $F_s$ is defined in terms of the bandwidth of the transmitted signal $f_{Tx}$ and the number of received pixels per symbol $N_{pps}$, and is given by:
$$F_s = N_{pps} \cdot f_{Tx}.$$
Note that (i) $N_{pps}$ varies with the payload $P_{bit}$; and (ii) the maximum transmission distance is proportional to both $\Delta f$ and the size (diameter) of the light source. A higher $T_{exp}$ results in increased signal intensity levels and, therefore, a higher signal-to-noise ratio (SNR) at the cost of reduced Rx bandwidth. With reference to Equations (1)–(3), a communications link can be established at low $R_f$ but with flickering, which is due to the variation in the mean value of the light intensity over a time period longer than the human eye can resolve. This may occur when there are many consecutive symbols with the same logical state.
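As a rough numerical illustration of the relations above, the following sketch evaluates the Nyquist limit $F_s/2$, the frequency resolution $1/T_{exp}$, and $N_{pps} = F_s / f_{Tx}$ (the relation $F_s = N_{pps} \cdot f_{Tx}$ rearranged). The rolling rate of 13.31 kHz, the exposure time of 2 ms, and the $f_{Tx}$ values are those reported in Section 5; the function and variable names are illustrative and are not taken from the original work.

```python
def nyquist_limit_hz(f_s_hz: float) -> float:
    """Maximum transmit frequency allowed by Nyquist's theorem: F_s / 2."""
    return f_s_hz / 2.0


def frequency_resolution_hz(t_exp_s: float) -> float:
    """Frequency resolution of the exposure (moving-average) filter: 1 / T_exp."""
    if t_exp_s <= 0:
        raise ValueError("t_exp_s must be positive")
    return 1.0 / t_exp_s


def pixels_per_symbol(f_s_hz: float, f_tx_hz: float) -> float:
    """N_pps obtained by rearranging F_s = N_pps * f_Tx."""
    if f_tx_hz <= 0:
        raise ValueError("f_tx_hz must be positive")
    return f_s_hz / f_tx_hz


# Values quoted in Section 5 of the paper.
F_S_HZ = 13.31e3
T_EXP_S = 2e-3
F_TX_HZ = [220, 320, 420, 520, 1120, 1520]

print(f"Nyquist limit: {nyquist_limit_hz(F_S_HZ):.0f} Hz")
print(f"Frequency resolution: {frequency_resolution_hz(T_EXP_S):.0f} Hz")
for f_tx in F_TX_HZ:
    print(f"f_Tx = {f_tx:5d} Hz -> N_pps = {pixels_per_symbol(F_S_HZ, f_tx):.2f}")
```

For $f_{Tx}$ of 1520 Hz this gives roughly 8.76 pixels per symbol, which is in line with the $N_{pps}$ of 8.7 quoted for the example waveform in Section 5.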
The flicker index is a relative measure of the cyclic variation in the output of various sources at given frequencies [34,35]. It takes into account the waveform of the light output and its amplitude, and is determined by dividing the area above the line of average light output by the total area under the light output curve for a single cycle, see Figure 1:
$$\text{Flicker index} = \frac{\text{area 1}}{\text{area 1} + \text{area 2}}.$$
The flicker index has a range of 0 to 1.0, with 0 representing a steady light output level. Area 2 may be close to zero when the light output varies as periodic spikes, thus leading to a flicker index close to 1. Higher values indicate an increased possibility of noticeable flickering.
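The definition above can be sketched numerically for one cycle of a uniformly sampled, non-negative light output waveform. With uniform sampling the sample spacing cancels, so area 1 reduces to the sum of the excursions above the mean and area 1 + area 2 to the sum of all samples. The function name and the test waveforms are illustrative assumptions, not data from the paper.

```python
def flicker_index(samples):
    """Flicker index of one cycle of a uniformly sampled light output.

    area 1          -> sum of the parts of the waveform above its mean
    area 1 + area 2 -> total area under the waveform
    """
    if not samples:
        raise ValueError("samples must not be empty")
    if min(samples) < 0:
        raise ValueError("light output samples must be non-negative")
    total = sum(samples)
    if total == 0:
        raise ValueError("waveform has zero total light output")
    mean = total / len(samples)
    area_above_mean = sum(s - mean for s in samples if s > mean)
    return area_above_mean / total


steady = [1.0] * 100                 # constant light output
square = [1.0] * 50 + [0.0] * 50     # 50% duty-cycle on-off waveform
spike = [1.0] + [0.0] * 99           # narrow periodic spike

print(flicker_index(steady), flicker_index(square), flicker_index(spike))
```

The steady waveform gives 0, while the narrow spike gives a value close to 1, matching the limiting cases described in the text.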
To mitigate flickering, CP-PAM can be adopted to equalize the mean intensity value of all symbols, i.e., $I_{ave}$ [18]. In CP-PAM, each PAM symbol is temporally divided into two equal chips: (i) the 1st chip carries the intensity of the PAM symbol $I_S$; and (ii) the 2nd chip carries the stabilization level, i.e., $2I_{ave} - I_S$, see Figure 2. For example, a symbol with a level of “2” will be stabilized in the following chip with a level of “1” so that the mean intensity is the same for all symbols, as clarified in Table 1.
It is also noted that, although the $R_b$ efficiency of CP 4-PAM is reduced by half due to the stabilization level (which is also used for error detection), CP N-PAM offers a higher coding efficiency compared with Manchester coding [17].
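The two-chip structure described above can be sketched as follows. The sketch assumes 4-PAM intensity levels of 0, 1, 2, and 3, so that $I_{ave} = 1.5$, which is consistent with the example of level “2” being stabilized by level “1”; the bit-to-level mapping of Table 1 is not reproduced here, and the function names are hypothetical.

```python
I_AVE = 1.5  # assumption: 4-PAM levels are 0, 1, 2, 3, so their mean is 1.5


def cp_pam_encode(levels, i_ave=I_AVE):
    """Map each PAM level I_S to two chips: [I_S, 2 * I_ave - I_S]."""
    chips = []
    for level in levels:
        chips.append(level)              # 1st chip: PAM symbol intensity
        chips.append(2 * i_ave - level)  # 2nd chip: stabilization level
    return chips


def cp_pam_is_consistent(chips, i_ave=I_AVE, tol=1e-9):
    """Simple error check: every chip pair must average to I_ave."""
    if len(chips) % 2 != 0:
        return False
    return all(
        abs(chips[i] + chips[i + 1] - 2 * i_ave) <= tol
        for i in range(0, len(chips), 2)
    )


chips = cp_pam_encode([0, 1, 2, 3])
print(chips)                        # every chip pair sums to 2 * I_AVE
print(cp_pam_is_consistent(chips))
```

Because every chip pair has the same mean, long runs of identical symbols no longer change the average light output, which is the property that suppresses flicker.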

3. ANN Equalizer

In RS-based OCC systems, the IS sampling process limits the available bandwidth and results in ISI at higher data rates, thus impacting the performance of the communications link. The ability to detect a symbol with a slow rise time may be impaired by the transitions between different illumination levels. Equalization is one option that is being adopted to mitigate the ISI. Note that the ISI is predicted by training the filter coefficients based on a training sequence. Alternatively, the ISI can be viewed as a classification problem, where class decision boundaries are created to classify symbols based on training [4]. Hence, determining the optimal threshold boundaries in a practical channel can be seen as a nonlinear process, and consequently, the ANN-based equalizer with an adaptive algorithm can be employed to mitigate ISI and, therefore, increase the data rate. Unlike in other communication systems, in OCC the training of the ANN is carried out only once for a specific exposure time, with the data being stored in a look-up table [25,26].
An ANN is an interconnected network of processing elements (neurons). Its use comprises two distinct stages: (i) the training phase, where the ANN estimates an input-output map between the received and training data to determine the weighted input from each neuron. The weighted values are updated in each training iteration until either the required performance is achieved or the entire training set is used; and (ii) the operation phase, where the ANN is deployed without knowledge of the dataset under test. The multilayer perceptron (MLP) is a popular ANN architecture, which has been shown to be highly effective in signal equalization [36]. It offers the ability to map any non-linear input-output sequence, provided there are sufficient neurons in the hidden layer(s) and the SNR is sufficiently high.
The MLP structure consists of at least three layers: (i) a single input layer $\mathbf{x}$; (ii) $(M-1)$ hidden layers; and (iii) a single output layer $\mathbf{y}$. The input layer (also called the observation vector) has the same structure as a conventional linear equalizer for sequential equalization, i.e., it is a tapped delay line $\mathbf{o}^{(m-1)} = [o_1^{(m-1)}, o_2^{(m-1)}, \ldots, o_{N_{m-1}}^{(m-1)}]$, where $N_{m-1}$ is the number of neurons, and $m$ is the layer number. This is illustrated in Figure 3, where the weights $w_{kn}^{(m)}$ relate the $n$th input to the $k$th neuron. Each neuron can be biased with a value $C^{(m)}$, which is in turn scaled by a threshold factor $v_k^{(m)}$.
The output $o_k^{(m)}$ of the $k$th neuron is mapped via a non-linear activation function $f(\cdot)$ as given by [25]:
$$o_k^{(m)} = f\left(\sum_{n=1}^{N_{m-1}} w_{kn}^{(m)} o_n^{(m-1)} + C^{(m)} v_k^{(m)}\right).$$
The output of each layer is usually connected to each of the neurons in the next layer, i.e., a fully connected mode. Therefore, using the observation vector $\mathbf{o}^{(m)}$ for the $m$th layer and the $N_m \times N_{m-1}$ connection matrix between layers $m$ and $m-1$, the output is given in vector form by:
$$\mathbf{o}^{(m)} = f\left(\mathbf{W}^{(m)} \mathbf{o}^{(m-1)} + C^{(m)} \mathbf{v}^{(m)}\right),$$
where $\mathbf{W}^{(m)}$ and $\mathbf{v}^{(m)}$ are given by:
$$\mathbf{W}^{(m)} = \left[\mathbf{w}_1^{(m)} \;\; \mathbf{w}_2^{(m)} \;\; \cdots \;\; \mathbf{w}_{N_m}^{(m)}\right],$$
$$\mathbf{v}^{(m)} = \left[v_1^{(m)} \;\; v_2^{(m)} \;\; \cdots \;\; v_{N_m}^{(m)}\right]^T.$$
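Considering the $N_0 \times 1$ input vector and the $N_M \times 1$ output vector, with $\mathbf{o}^{(0)} = \mathbf{x}$ and $\mathbf{o}^{(M)} = \mathbf{y}$, the corresponding observation vectors are given by:
$$\mathbf{x} = \left[x_1 \;\; x_2 \;\; \cdots \;\; x_{N_0}\right]^T,$$
$$\mathbf{y} = \left[y_1 \;\; y_2 \;\; \cdots \;\; y_{N_M}\right]^T.$$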
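Therefore,
$$\mathbf{o}^{(1)} = f\left(\mathbf{W}^{(1)} \mathbf{x} + C^{(1)} \mathbf{v}^{(1)}\right),$$
$$\mathbf{o}^{(2)} = f\left(\mathbf{W}^{(2)} \mathbf{o}^{(1)} + C^{(2)} \mathbf{v}^{(2)}\right),$$
$$\mathbf{y} = f\left(\mathbf{W}^{(M)} \mathbf{o}^{(M-1)} + C^{(M)} \mathbf{v}^{(M)}\right).$$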
The MLP records its trained information in the weights $w_{kn}^{(m)}$ and in the threshold factors $v_k^{(m)}$, since $C^{(m)}$ is a constant for all layers (i.e., set as $C^{(m)} = 1$, $m = 1, 2, \ldots, M$). Resilient back-propagation (RBP) is a supervised back-propagation (BP) training method, which updates the weights so as to converge more rapidly than the standard BP training technique [26]. Figure 3 depicts a single neuron for the case where the layers are interconnected with different weight coefficients. The RBP adjusts the MLP weights to reduce the error cost function $E_k$ as given by [25]:
$$E_k = \lVert d_k - y_k \rVert^2,$$
where $d_k$ and $y_k$ are the ideal and actual received symbols, respectively. It should be noted that, for the training sequence, $d_k$ is known. Each iteration of the RBP algorithm has a dynamic step size, which varies based on the magnitude of the gradient of $E_k$.
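The layer recursion $\mathbf{o}^{(m)} = f(\mathbf{W}^{(m)} \mathbf{o}^{(m-1)} + C^{(m)} \mathbf{v}^{(m)})$ and the cost $E_k$ can be sketched in a few lines of plain Python. This is only a forward-pass illustration under stated assumptions: the tanh hidden activation, the linear output, the layer sizes, and all weight values are hypothetical choices, not the trained network or the RBP training used in this work.

```python
import math


def layer_output(weights, thresholds, inputs, bias=1.0, activation=math.tanh):
    """One fully connected layer: o_k = f(sum_n w_kn * o_n + C * v_k)."""
    return [
        activation(sum(w * o for w, o in zip(row, inputs)) + bias * v)
        for row, v in zip(weights, thresholds)
    ]


def mlp_forward(layers, x):
    """Propagate the tapped-delay-line input x through all layers in turn."""
    o = list(x)
    for weights, thresholds, activation in layers:
        o = layer_output(weights, thresholds, o, activation=activation)
    return o


def error_cost(desired, actual):
    """Squared Euclidean distance between ideal and actual outputs."""
    return sum((d - y) ** 2 for d, y in zip(desired, actual))


def identity(z):
    return z


# Hypothetical network: 3 input taps, 2 hidden neurons (tanh), 1 linear output.
layers = [
    ([[0.5, -0.2, 0.1], [0.3, 0.8, -0.5]], [0.0, 0.1], math.tanh),
    ([[1.2, -0.7]], [0.05], identity),
]
taps = [0.2, 0.9, 0.4]  # three consecutive received samples (illustrative)
y = mlp_forward(layers, taps)
print(y, error_cost([1.0], y))
```

During training, the weights and threshold factors would be adjusted to reduce `error_cost` over the training sequence; that update step is deliberately left out of this sketch.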

4. Experimental Setup

The schematic block diagram of the proposed OCC system is shown in Figure 4a. A pseudorandom binary sequence (PRBS) with a length of $2^{16}-1$ bits was generated using MATLAB, which was then up-sampled with $n_{samp}$ of 50 and modulated using CP-PAM. Based on the output labels provided for 4-PAM, see Table 1, mapping of the data to the corresponding symbols was carried out. The PRBS $s(t)$ was divided into sub-sequences, each with a length of $P_{bit}$ effective symbols per packet, which depends on the transmitter bandwidth $f_{Tx}$. Each sub-sequence was encoded with a pre- and post-amble to form the $Q$th Tx packet, where $Q$ represents the packet number and each packet consists of a 3-symbol pre-amble [1.5 1.5 1.5], a $P_{bit}$-symbol payload, and a 3-symbol post-amble [1.5 1.5 1.5].
The symbols in the overhead signal (i.e., pre-amble and post-amble) were chosen to ensure constant average optical power when compared with the payload. The signal was then sequentially uploaded onto an arbitrary waveform generator (AWG, AFG3252C, 240 MHz bandwidth), see Figure 4b. The uploading process was done by generating the $Q$th Tx packet at different $f_{Tx}$, the output of which was used for intensity modulation of a Luxeon Rebel LED (SR-01-WC310) with a peak wavelength of 630 nm and a linewidth of 118 nm. The modulated light was transmitted over a short LOS free-space channel (i.e., 50 cm).
At the Rx, a diffuser was used to scatter the light over the capturing area of the IS (Thorlabs DCC1645C RS), and a standard $T_{exp}$ of 2 ms was adopted in this study. The observed frames $P^{Q}_{U \times V \times 3}$ at the output of the camera were processed off-line in MATLAB using both Algorithms 1 and 2. In Algorithm 1, the data set $z_i^Q$ was retrieved by accumulating the intensities of all pixels in each row. The received signal was then normalized to remove the DC level, which was obtained by capturing 20 frames with no signal, see Figure 5a,b.
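The packet structure described above can be sketched as follows, using the pre- and post-amble of [1.5 1.5 1.5] and $n_{samp}$ of 50 given in the text. The order of operations is simplified here (framing first, then up-sampling), the payload is represented directly by symbol levels, and the function names are hypothetical; the original processing was carried out in MATLAB.

```python
PREAMBLE = [1.5, 1.5, 1.5]
POSTAMBLE = [1.5, 1.5, 1.5]


def build_packets(symbols, p_bit):
    """Split symbols into payloads of p_bit symbols and frame each one.

    The last packet is shorter if len(symbols) is not a multiple of p_bit.
    """
    if p_bit < 1:
        raise ValueError("p_bit must be at least 1")
    packets = []
    for start in range(0, len(symbols), p_bit):
        payload = list(symbols[start:start + p_bit])
        packets.append(PREAMBLE + payload + POSTAMBLE)
    return packets


def upsample(symbols, n_samp=50):
    """Repeat every symbol n_samp times (rectangular pulse shape)."""
    return [s for s in symbols for _ in range(n_samp)]


packets = build_packets([0, 3, 1, 2, 2, 1, 3, 0, 1, 2], p_bit=5)
waveform = upsample(packets[0])
print(len(packets), len(packets[0]), len(waveform))
```

Each full packet therefore carries $P_{bit} + 6$ symbols, of which six are overhead.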
Algorithm 1. Signal extraction algorithm (provided as an image in the original article).
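Since the listing of Algorithm 1 is not reproduced here, the following is only a minimal sketch of the two steps described in the text: accumulating the pixel intensities in each row, and removing the DC level estimated from frames captured with no signal. It assumes that a frame is a list of rows, each row a list of (R, G, B) pixel tuples, and that the DC level is removed by subtraction; both are assumptions, and the function names are hypothetical.

```python
def row_profile(frame):
    """Accumulate the intensities of all pixels and channels in each row."""
    return [sum(sum(pixel) for pixel in row) for row in frame]


def mean_row_profile(frames):
    """Average the row profiles of several frames (e.g., no-signal frames)."""
    if not frames:
        raise ValueError("at least one frame is required")
    profiles = [row_profile(frame) for frame in frames]
    return [sum(values) / len(profiles) for values in zip(*profiles)]


def extract_signal(frame, dark_frames):
    """Row-wise signal with the DC level of the no-signal frames removed."""
    dc = mean_row_profile(dark_frames)
    return [z - d for z, d in zip(row_profile(frame), dc)]


# Tiny 2-row by 2-pixel example with made-up values.
frame = [[(1, 1, 1), (2, 2, 2)], [(0, 0, 0), (1, 0, 0)]]
dark = [[(1, 0, 0), (0, 0, 0)], [(0, 0, 0), (0, 0, 0)]]
print(extract_signal(frame, [dark, dark]))
```

In the experiment, the DC level was estimated from 20 frames; the two identical dark frames above merely keep the example short.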
Next, to ensure that a full packet was captured by the IS, Algorithm 2 was applied to select the optimum frame (i.e., including both pre- and post-ambles) for each $Q$th Tx packet per $f_{Tx}$, in which 10 frames $P^{Q}_{U \times V \times 3}$ were captured for each $Q$th Tx packet to maintain the synchronization between the Tx and the Rx.
A resampling process was then applied to resize the signal length based on the packet size observed in pixels. Next, a correlation algorithm was used to maintain the synchronization between the transmitted $Q$th Tx packet and the received signal $z_{cal}^{Q}$, where a filtered version of $z_{cal}^{Q}$ was simulated based on the encoded $Q$th packet using a moving average filter. Note that the window size of the filter was set to $n_{samp}$ since it provided an optimal match with the observed signal. Next, $z_{cal}$ in vector form was applied to an MLP equalizer using an array of tapped-delay lines as previously described. The MLP used here included a single input layer, a single hidden layer, and a single output layer. All the key system parameters are listed in Table 2.
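The moving-average model and the correlation-based alignment described above can be sketched as follows. The causal window, the zero-mean template, and the exhaustive lag search are assumptions made to keep the example short; they are not claimed to match the correlation algorithm used in the experiment, and the function names are hypothetical.

```python
def moving_average(signal, window):
    """Causal moving average; the first samples use a shorter window."""
    if window < 1:
        raise ValueError("window must be at least 1")
    out = []
    for i in range(len(signal)):
        segment = signal[max(0, i - window + 1): i + 1]
        out.append(sum(segment) / len(segment))
    return out


def best_lag(received, template):
    """Lag at which the zero-mean template correlates best with the signal."""
    if len(template) > len(received):
        raise ValueError("template is longer than the received signal")
    mean_t = sum(template) / len(template)
    zero_mean = [t - mean_t for t in template]
    best, best_score = 0, float("-inf")
    for lag in range(len(received) - len(template) + 1):
        score = sum(r * t for r, t in zip(received[lag:], zero_mean))
        if score > best_score:
            best, best_score = lag, score
    return best


template = [0.0, 1.0, 3.0, 1.0, 0.0]              # illustrative packet shape
received = [0.0] * 7 + template + [0.0] * 4      # same shape, delayed by 7
print(moving_average([0, 0, 4, 4], 2), best_lag(received, template))
```

In the setup described above, the template would be the encoded packet passed through `moving_average` with a window of $n_{samp}$ samples, and the returned lag would locate the packet within the extracted row signal.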
Algorithm 2. Find the frame with full packet inclusion, i.e., including both pre- and post-ambles (provided as an image in the original article).

5. Results and Discussion

The experimental work focused on deploying MLP-based equalization to mitigate the ISI due to the limited modulation bandwidth of the CMOS IS-based Rx. The measured and simulated frequency responses of the CMOS IS for $T_{exp}$ of 2 ms are shown in Figure 6, which indicates that the obtained IS bandwidth (i.e., the 3 dB point) was 250 Hz. It is also noted that the mismatch between the measured and simulated responses was caused by aliasing due to the limited sampling frequency of the IS and the use of image compression techniques [38]. The CP 4-PAM encoded signal was then generated at different bandwidths $f_{Tx}$ of up to 1520 Hz.
The captured frames at the Rx were processed with $P_{bit}$ of up to 70 symbols per packet. Figure 7 illustrates examples of the captured frames and the processed signals for $P_{bit}$ of 5, 10, 15, 20, 50, and 70, i.e., $f_{Tx}$ of 220, 320, 420, 520, 1120, and 1520 Hz, respectively. Note that the width of the received $Q$th packet and the recorded $F_s$ are 666 pixels and 13.31 kHz, respectively, based on the demodulated signal, see Figure 7. Increasing $f_{Tx}$ decreases the number of received pixels for each CP 4-PAM symbol, thus reducing the quality of data transmission. The number of pixels utilized for each CP 4-PAM symbol is indicated in Table 3. For the link with the ANN-based equalizer deployed at the Rx side, the quality of the received signal was measured using the eye diagrams and the BER performance. As illustrated in the eye diagrams, see Figure 8, the eye-openings indicate the impact of the ISI on the received signal. Note that (i) the threshold levels can be differentiated for $P_{bit}$ of 5 and 20 symbols, see Figure 8a,b, respectively, but not for $P_{bit}$ of 50 and 70 symbols, see Figure 8c,d, respectively; (ii) five levels are shown in the eye diagrams, where one of the levels represents the packet overhead designed to maintain the same average power for CP 4-PAM; and (iii) the overhead level is removed at the Rx side using Algorithm 1.
An example of the transmitted and received signals with and without the equalizer for $N_{pps}$ of 8.7 pixels per symbol is illustrated in Figure 9. The equalized signal at the Rx side shows a significant reduction in the impact of the ISI on the received signal, with minimal signal distortion.
The eye linearity of the received signals is measured based on the average amplitude levels and is given by [39]:
$$\text{Eye linearity} = \frac{\min(V_{up}, V_{mid}, V_{low})}{\max(V_{up}, V_{mid}, V_{low})},$$
where $V_{up}$, $V_{mid}$, and $V_{low}$ are the average amplitude levels.
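The eye linearity ratio above is straightforward to evaluate; the sketch below also shows one way the quoted ~66% figure could be reproduced, assuming it denotes the relative change from the unequalized maximum of about 0.6 to the equalized value of almost 1 reported in this section. The function names and the input amplitudes are illustrative.

```python
def eye_linearity(v_up, v_mid, v_low):
    """Ratio of the smallest to the largest of the three amplitude levels."""
    values = (v_up, v_mid, v_low)
    if min(values) <= 0:
        raise ValueError("amplitude levels must be positive")
    return min(values) / max(values)


def relative_improvement(before, after):
    """Relative change (after - before) / before."""
    if before <= 0:
        raise ValueError("before must be positive")
    return (after - before) / before


print(eye_linearity(1.0, 0.8, 0.6))     # unequal eyes give a ratio below 1
print(relative_improvement(0.6, 1.0))   # about 0.667, i.e., roughly 66%
```

A value of 1 corresponds to three equal amplitude levels, i.e., a perfectly linear eye.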
Figure 10 shows the eye linearity of the received signals with respect to $N_{pps}$ for the link with and without the ANN equalizer and for $T_{exp}$ of 2 ms. Note that we have used $N_{pps}$ as a new metric for a fair comparison, considering the progress made in the development of ISs. As shown, for the link with no equalizer, the eye linearity increases with $N_{pps}$, reaching a maximum level of 0.6 at $N_{pps}$ of ~26, beyond which it drops rapidly and linearly with $N_{pps}$. However, with the ANN equalizer, the eye linearity is improved significantly for both the test and trained cases, reaching the optimal linearity of almost 1 at $N_{pps}$ of 18 and remaining constant for $N_{pps}$ > 18 (i.e., being independent of $N_{pps}$). Thus, the ANN equalizer shows an improvement of ~66% in the eye linearity for $N_{pps}$ > 18 pixels/symbol (i.e., $f_{Tx}$ < 920 Hz) for both training and testing sets.
Next, we measured the BER as a function of $N_{pps}$ for the link with and without the ANN equalizer, as illustrated in Figure 11. Also shown is the forward error correction (FEC) BER limit line of $3.8 \times 10^{-3}$. Note that, at the FEC limit, the required $N_{pps}$ is reduced from 30 pixels per symbol for the link without the ANN equalizer to 20 pixels per symbol for the link with the ANN equalizer, based on the test plot. Thus, the effective $R_b$ (i.e., with no post- and pre-ambles) is estimated by:
$$R_b = \frac{2V}{N_{pps}} \cdot R_f,$$
where $V$ represents the number of pixel rows.
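The scaling of the effective bit rate with $N_{pps}$ and $R_f$ can be checked with a short sketch of the expression above. The value of 1080 pixel rows is a hypothetical resolution chosen for illustration only, so the printed numbers are not the entries of Table 4.

```python
def effective_bit_rate(v_rows, n_pps, r_f):
    """Effective R_b in bit/s: 2 bits per 4-PAM symbol, V / N_pps symbols per frame."""
    if n_pps <= 0:
        raise ValueError("n_pps must be positive")
    return 2 * v_rows / n_pps * r_f


V_ROWS = 1080  # hypothetical number of pixel rows, not taken from the paper

for r_f in (30, 60):
    for n_pps in (30, 20):
        rate = effective_bit_rate(V_ROWS, n_pps, r_f)
        print(f"R_f = {r_f} fps, N_pps = {n_pps}: R_b = {rate:.0f} bit/s")
```

Doubling $R_f$ doubles $R_b$, and lowering the required $N_{pps}$ raises $R_b$ in inverse proportion, which is why the equalized link supports the higher rate.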
The effective $R_b$ at the FEC limit for the cases with and without the ANN equalizer for a range of IS resolutions is given in Table 4. It shows that, with the ANN equalizer, $R_b$ of 24.4 and 12.2 kbps can be achieved for $R_f$ of 60 and 30 fps, respectively, compared with $R_b$ of 18.6 and 9.3 kbps for $R_f$ of 60 and 30 fps, respectively, for the case with no equalizer.

6. Conclusions

We proposed an ANN-based equalization technique for a CP 4-PAM based OCC system. An experimental setup was developed to demonstrate non-flickering communications using a single light-emitting diode with a transmission rate $R_b$ of 24.4 kbps. The quality of the received signals was measured based on the eye-diagram opening, the eye linearity, and the BER. We demonstrated the ability to mitigate the intersymbol interference and hence to transmit a signal with an acceptable BER (below the FEC limit) for $N_{pps}$ of 30 and 20 for the unequalized and equalized systems, respectively. An improvement of ~66% in the eye linearity was achieved using a single LED and a typical commercial camera with the equalization technique. The limitations of the proposed system were assessed in terms of the system complexity, including the associative memories needed for the look-up table training data, as well as the IS resolution, gap-time, exposure time, and reading time.

Author Contributions

Conceptualization: O.I.Y., N.B.H. and Z.G.; investigation: O.I.Y. and N.B.H.; project administration: Z.G., S.Z., L.N.A. and H.L.-M.; software: O.I.Y. and N.B.H.; writing—original draft preparation, O.I.Y.; writing—review and editing, N.B.H., Z.G., S.Z., L.N.A. and H.L.-M. All authors have read and agreed to the published version of the manuscript.

Funding

One of the authors (Othman Younus) is funded by the Northumbria University PhD scholarship. The work is supported by the European Union’s Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement no 764461 (VISION), and the EU COST Action on Newfocus (CA19111).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Teli, S.R.; Zvanovec, S.; Ghassemlooy, Z. Performance evaluation of neural network assisted motion detection schemes implemented within indoor optical camera based communications. Opt. Express 2019, 27, 24082–24092. [Google Scholar] [CrossRef]
  2. Eso, E.; Ghassemlooy, Z.; Zvanovec, S.; Gholami, A.; Burton, A.; Hassan, N.B.; Younus, O.I. Experimental Demonstration of Vehicle to Road Side Infrastructure Visible Light Communications. In Proceedings of the 2019 2nd West Asian Colloquium on Optical Wireless Communications (WACOWC), Tehran, Iran, 27–28 April 2019; pp. 85–89. [Google Scholar]
  3. Saeed, N.; Guo, S.; Park, K.-H.; Al-Naffouri, T.Y.; Alouini, M.-S. Optical camera communications: Survey, use cases, challenges, and future trends. Phys. Commun. 2019, 37, 100900. [Google Scholar] [CrossRef] [Green Version]
  4. Younus, O.I.; Hassan, N.B.; Ghassemlooy, Z.; Haigh, P.A.; Zvanovec, S.; Alves, L.N.; Minh, H.L. Data Rate Enhancement in Optical Camera Communications Using an Artificial Neural Network Equaliser. IEEE Access 2020, 8, 42656–42665. [Google Scholar] [CrossRef]
  5. Lee, H.-Y.; Lin, H.-M.; Wei, Y.-L.; Wu, H.-I.; Tsai, H.-M.; Lin, K.C.-J. RollingLight: Enabling Line-of-Sight Light-to-Camera Communications. In Proceedings of the 13th Annual International Conference on Mobile Systems, Applications, and Services, Florence, Italy, 18–22 May 2015; pp. 167–180. [Google Scholar]
  6. Nguyen, T.; Islam, A.; Hossan, T.; Jang, Y.M. Current Status and Performance Analysis of Optical Camera Communication Technologies for 5G Networks. IEEE Access 2017, 5, 4574–4594. [Google Scholar] [CrossRef]
  7. Bani Hassan, N.; Ghassemlooy, Z.; Zvanovec, S.; Biagi, M.; Vegni, A.M.; Zhang, M.; Luo, P. Non-Line-of-Sight MIMO Space-Time Division Multiplexing Visible Light Optical Camera Communications. J. Lightwave Technol. 2019, 37, 2409–2417. [Google Scholar] [CrossRef]
  8. Eso, E.; Teli, S.; Bani Hassan, N.; Vitek, S.; Ghassemlooy, Z.; Zvanovec, S. 400 m rolling-shutter-based optical camera communications link. Opt. Lett. 2020, 45, 1059–1062.
  9. Lauxtermann, S.; Lee, A.; Stevens, J.; Joshi, A. Comparison of Global Shutter Pixels for CMOS Image Sensors. In Proceedings of the 2007 International Image Sensor Workshop, Ogunquit, ME, USA, 7–10 June 2007.
  10. Li, X.; Hassan, N.B.; Burton, A.; Ghassemlooy, Z.; Zvanovec, S.; Perez-Jimenez, R. A Simplified Model for the Rolling Shutter Based Camera in Optical Camera Communications. In Proceedings of the 2019 15th International Conference on Telecommunications (ConTEL), Graz, Austria, 3–5 July 2019; pp. 1–5.
  11. Sturniolo, A.; Cossu, G.; Ciaramella, E.; Hassan, N.B.; Shou, Z.; Huang, Y.; Ghassemlooy, Z. ROI Assisted Digital Signal Processing for Rolling Shutter Optical Camera Communications. In Proceedings of the 2018 11th International Symposium on Communication Systems, Networks & Digital Signal Processing (CSNDSP), Budapest, Hungary, 18–20 July 2018; pp. 1–6.
  12. Nguyen, T.; Islam, A.; Yamazato, T.; Jang, Y.M. Technical Issues on IEEE 802.15.7m Image Sensor Communication Standardization. IEEE Commun. Mag. 2018, 56, 213–218.
  13. Cahyadi, W.A.; Chung, Y.H.; Ghassemlooy, Z.; Hassan, N.B. Optical Camera Communications: Principles, Modulations, Potential and Challenges. Electronics 2020, 9, 1339.
  14. Goto, Y.; Takai, I.; Yamazato, T.; Okada, H.; Fujii, T.; Kawahito, S.; Arai, S.; Yendo, T.; Kamakura, K. A New Automotive VLC System Using Optical Communication Image Sensor. IEEE Photonics J. 2016, 8, 1–17.
  15. Roberts, R.D. Undersampled frequency shift ON-OFF keying (UFSOOK) for camera communications (CamCom). In Proceedings of the 2013 22nd Wireless and Optical Communication Conference, Chongqing, China, 16–18 May 2013; pp. 645–648.
  16. Luo, P.; Ghassemlooy, Z.; Le Minh, H.; Tang, X.; Tsai, H. Undersampled phase shift ON-OFF keying for camera communication. In Proceedings of the Sixth International Conference on Wireless Communications and Signal Processing (WCSP), Hefei, China, 23–25 October 2014.
  17. Nguyen, D.T.; Chae, Y.; Park, Y. Enhancement of Data Rate and Packet Size in Image Sensor Communications by Employing Constant Power 4-PAM. IEEE Access 2018, 6, 8000–8010.
  18. Luo, P.; Ghassemlooy, Z.; Minh, H.L.; Tsai, H.; Tang, X. Undersampled-PAM with subcarrier modulation for camera communications. In Proceedings of the 2015 Opto-Electronics and Communications Conference (OECC), Shanghai, China, 28 June–2 July 2015; pp. 1–3.
  19. Luo, P.; Zhang, M.; Ghassemlooy, Z.; Zvanovec, S.; Feng, S.; Zhang, P. Undersampled-Based Modulation Schemes for Optical Camera Communications. IEEE Commun. Mag. 2018, 56, 204–212.
  20. Rachim, V.P.; Chung, W. Multilevel Intensity-Modulation for Rolling Shutter-Based Optical Camera Communication. IEEE Photonics Technol. Lett. 2018, 30, 903–906.
  21. Hu, P.; Pathak, P.H.; Zhang, H.; Yang, Z.; Mohapatra, P. High Speed LED-to-Camera Communication using Color Shift Keying with Flicker Mitigation. IEEE Trans. Mobile Comput. 2020, 19, 1603–1617.
  22. Luo, P.; Zhang, M.; Ghassemlooy, Z.; Minh, H.L.; Tsai, H.; Tang, X.; Png, L.C.; Han, D. Experimental Demonstration of RGB LED-Based Optical Camera Communications. IEEE Photonics J. 2015, 7, 1–12.
  23. Griffiths, A.D.; Herrnsdorf, J.; Strain, M.J.; Dawson, M.D. Scalable visible light communications with a micro-LED array projector and high-speed smartphone camera. Opt. Express 2019, 27, 15585–15594.
  24. Liu, L.; Deng, R.; Chen, L. Spatial and Time Dispersions Compensation With Double-Equalization for Optical Camera Communications. IEEE Photonics Technol. Lett. 2019, 31, 1753–1756.
  25. Haigh, P.A.; Ghassemlooy, Z.; Rajbhandari, S.; Papakonstantinou, I.; Popoola, W. Visible Light Communications: 170 Mb/s Using an Artificial Neural Network Equalizer in a Low Bandwidth White Light Configuration. J. Lightwave Technol. 2014, 32, 1807–1813.
  26. Rajbhandari, S.; Ghassemlooy, Z.; Angelova, M. Effective Denoising and Adaptive Equalization of Indoor Optical Wireless Channel with Artificial Light Using the Discrete Wavelet Transform and Artificial Neural Network. J. Lightwave Technol. 2009, 27, 4493–4500.
  27. Rajbhandari, S.; Faith, J.; Ghassemlooy, Z.; Angelova, M. Comparative study of classifiers to mitigate intersymbol interference in diffuse indoor optical wireless communication links. Optik 2013, 124, 4192–4196.
  28. Willy Anugrah, C.; Yeon Ho, C. Smartphone camera-based device-to-device communication using neural network-assisted high-density modulation. Opt. Eng. 2018, 57, 1–9.
  29. Islam, A.; Hossan, M.T.; Jang, Y.M. Convolutional neural network scheme–based optical camera communication system for intelligent Internet of vehicles. Int. J. Distrib. Sens. Netw. 2018, 14, 1550147718770153.
  30. Liu, L.; Deng, R.; Chen, L.-K. 47-kbit/s RGB-LED-based optical camera communication based on 2D-CNN and XOR-based data loss compensation. Opt. Express 2019, 27, 33840–33846.
  31. Younus, O.I.; Hassan, N.B.; Ghassemlooy, Z.; Zvanovec, S.; Alves, L.N.; Haigh, P.A.; Minh, H.L. An Artificial Neural Network Equalizer for Constant Power 4-PAM in Optical Camera Communications. In Proceedings of the 2020 12th International Symposium on Communication Systems, Networks and Digital Signal Processing (CSNDSP), Porto, Portugal, 20–22 July 2020; pp. 1–6.
  32. Nguyen, H.; Pham, T.L.; Nguyen, H.; Jang, Y.M. Trade-off Communication distance and Data rate of Rolling shutter OCC. In Proceedings of the 2019 Eleventh International Conference on Ubiquitous and Future Networks (ICUFN), Zagreb, Croatia, 2–5 July 2019; pp. 148–151.
  33. Chau, J.C.; Little, T.D.C. Analysis of CMOS active pixel sensors as linear shift-invariant receivers. In Proceedings of the 2015 IEEE International Conference on Communication Workshop (ICCW), London, UK, 8–12 June 2015; pp. 1398–1403.
  34. Lehman, B.; Wilkins, A.; Berman, S.; Poplawski, M.; Miller, N.J. Proposing measures of flicker in the low frequencies for lighting applications. In Proceedings of the 2011 IEEE Energy Conversion Congress and Exposition, Phoenix, AZ, USA, 16–21 September 2011; pp. 2865–2872.
  35. DiLaura, D.L.; Houser, K.W.; Mistrick, R.; Stey, S.G. The Lighting Handbook Reference and Application, 10th ed.; Illuminating Engineering Society of North America (IES): New York, NY, USA, 2011; p. 247.
  36. Rajbhandari, S.; Chun, H.; Faulkner, G.; Haas, H.; Xie, E.; McKendry, J.J.D.; Herrnsdorf, J.; Gu, E.; Dawson, M.D.; O’Brien, D. Neural Network-Based Joint Spatial and Temporal Equalization for MIMO-VLC System. IEEE Photonics Technol. Lett. 2019, 31, 821–824.
  37. 1.3Mp CMOS Digital Image Sensor. In ON Semiconductor; Application Note 98AON94065F; Semiconductor Components Industries, LLC: Aurora, CO, USA, 2017; Available online: https://www.onsemi.com/pdf/datasheet/mt9m131-d.pdf (accessed on 20 March 2021).
  38. Cossu, G.; Sturniolo, A.; Ciaramella, E. Modelization and Characterization of a CMOS Camera as an Optical Real-Time Oscilloscope. IEEE Photonics J. 2020, 12, 1–13.
  39. AN 835: PAM4 Signaling Fundamentals. Intel; Application Note AN-835; Intel Corporation: Santa Clara, CA, USA, 2019; Available online: https://www.intel.com/content/dam/www/programmable/us/en/pdfs/literature/an/an835.pdf (accessed on 10 March 2021).
Figure 1. Defining Flicker Index [34,35].
Figure 2. An example of a generated packet signal with f_Tx of 220 Hz.
Figure 3. A structure of the kth neuron in the layer m.
Figure 4. The CP 4-PAM OCC scheme: (a) System block diagram, and (b) photograph of the experimental setup.
Figure 5. An example of the received Qth Tx packet signal at a Texp of 2 ms: (a) without DC gain normalization, and (b) with normalization.
Figure 6. Measured and estimated bandwidth of the IS with Texp of 2 ms.
Figure 7. Examples of the frame acquisition based on CIS for CP 4-PAM and f_Tx of: (a) 220 Hz, (b) 320 Hz, (c) 420 Hz, (d) 520 Hz, (e) 1120 Hz, and (f) 1520 Hz.
Figure 8. Examples of the captured eye diagrams of the CIS received signal for CP 4-PAM with f_Tx of: (a) 220 Hz, (b) 320 Hz, (c) 420 Hz, (d) 520 Hz, (e) 1120 Hz, and (f) 1520 Hz.
Figure 9. An example of the transmitted and received signal with and without equalization for CP 4-PAM and N_pps of 18.5 row pixels/symbol: (a) training sets, and (b) testing sets.
Figure 10. The eye linearity against N_pps for the proposed system with and without equalization and for Texp of 2 ms.
Figure 11. The BER measurements as a function of N_pps for the proposed system with and without equalization and for Texp of 2 ms.
Table 1. Proposed CP 4-PAM levels.
| Input Data | Conventional PAM Level | Constant Power 4-PAM: First Level (I_S) | Constant Power 4-PAM: Stabilization Level (2I_ave − I_S) |
| --- | --- | --- | --- |
| 11 | 3 | 3 | 0 |
| 10 | 2 | 2 | 1 |
| 01 | 1 | 1 | 2 |
| 00 | 0 | 0 | 3 |
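The mapping in Table 1 can be sketched in a few lines of Python. This is only an illustration of the table, not the authors' packet generator: the function name `cp_4pam_encode` is hypothetical, and the value I_ave = 1.5 is an assumption read off the table (each first level and its stabilization level sum to 3, i.e., 2I_ave).

```python
# Illustrative sketch of the Table 1 mapping (not the authors' implementation).
# Assumption: I_ave = 1.5, inferred from the table rows, where the first level
# and the stabilization level always sum to 3.
I_AVE = 1.5


def cp_4pam_encode(bits):
    """Map a bit string to (first_level, stabilization_level) pairs."""
    if len(bits) % 2:
        raise ValueError("bit string length must be even")
    pairs = []
    for i in range(0, len(bits), 2):
        first = int(bits[i:i + 2], 2)   # '11' -> 3, '10' -> 2, '01' -> 1, '00' -> 0
        stab = int(2 * I_AVE - first)   # stabilization level 2*I_ave - I_S
        pairs.append((first, stab))
    return pairs


levels = cp_4pam_encode("11100100")
print(levels)
```

Every pair sums to the same value, which is what keeps the average transmitted power constant regardless of the data.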
Table 2. System parameters.
| | Description | Value |
| --- | --- | --- |
| Tx | LED type | Luxeon Rebel LED (SR-01-WC310) |
| Tx | Tx signal bandwidth f_Tx | 220–1520 Hz |
| Tx | Tx bias current | 180 mA |
| Camera Rx | Camera model | Thorlabs DCC1645C-HQ |
| Camera Rx | Exposure time Texp | 2 ms |
| Camera Rx | Maximum SNR of IS | 44 dB [37] |
| Camera Rx | Lens type | Navitar 12 mm F/1.8 2/3” 10 MP |
| Camera Rx | Pixel clock | 10 MHz |
| Camera Rx | Camera raw image resolution | 1280 × 1024 pixels |
| Camera Rx | Captured symbols per frame | 11–76 symbols |
| Packet Generator | Data format | CP-PAM |
| Packet Generator | Symbol per packet Pbit | 5–70 symbols |
| Packet Generator | Packet generator sample rate | 11.125 kHz |
| Packet Generator | Number of samples n_samp | 10 |
| Channel | Channel length | 50 cm |
| ANN Equalizer | Activation function | Hyperbolic tangent sigmoid |
| ANN Equalizer | Number of neurons in input layer | 200 |
| ANN Equalizer | Number of neurons in output layer | 1 |
| ANN Equalizer | Number of neurons in hidden layer | 200 |
| ANN Equalizer | Number of hidden layers | 2 |
| ANN Equalizer | Percentage of the train to test | 0.8 |
| ANN Equalizer | Maximum epochs | 1000 |
| ANN Equalizer | Learning rate parameter η | 0.01 |
| ANN Equalizer | Network training function | Resilient back-propagation |
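To give a sense of the size of the ANN equalizer listed in Table 2, the following is a minimal forward-pass sketch of a network with that topology (200 inputs, two hidden layers of 200 neurons, one output, hyperbolic tangent activation). It is not the authors' implementation: the weights are random placeholders, training by resilient back-propagation is not shown, and the linear output neuron and the treatment of the input layer as 200 plain inputs are assumptions.

```python
import math
import random

# Layer sizes from Table 2: input, two hidden layers, output.
LAYER_SIZES = [200, 200, 200, 1]


def init_layers(sizes, seed=0):
    """Create placeholder (weights, biases) per layer; real values come from training."""
    rng = random.Random(seed)
    layers = []
    for n_in, n_out in zip(sizes[:-1], sizes[1:]):
        weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
        biases = [0.0] * n_out
        layers.append((weights, biases))
    return layers


def forward(layers, x):
    """Propagate one input vector; tanh on hidden layers, linear output (assumed)."""
    for idx, (weights, biases) in enumerate(layers):
        z = [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, biases)]
        is_output = idx == len(layers) - 1
        x = z if is_output else [math.tanh(v) for v in z]
    return x[0]


def count_parameters(sizes):
    """Number of weights plus biases for fully connected layers."""
    return sum(n_in * n_out + n_out for n_in, n_out in zip(sizes[:-1], sizes[1:]))


layers = init_layers(LAYER_SIZES)
window = [0.5] * LAYER_SIZES[0]          # hypothetical window of 200 received samples
estimate = forward(layers, window)
n_params = count_parameters(LAYER_SIZES)
print(n_params)
```

Under these assumptions the network has 80,601 trainable parameters, most of them in the two 200 × 200 weight matrices.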
Table 3. Results with Rf of 30 fps and CIS width of 1024 px.
| Payload Symbol/Packet (Pbit) | Total Number of Symbols/Packet | Number of Row Pixels/Symbol (N_pps) | f_Tx (Hz) |
| --- | --- | --- | --- |
| 5 | 11 | 60.54 | 220 |
| 10 | 16 | 41.62 | 320 |
| 15 | 21 | 31.71 | 420 |
| 20 | 26 | 25.61 | 520 |
| 30 | 36 | 18.50 | 720 |
| 35 | 41 | 16.24 | 820 |
| 40 | 46 | 14.48 | 920 |
| 50 | 56 | 11.89 | 1120 |
| 70 | 76 | 8.76 | 1520 |
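The rows of Table 3 are tightly related to one another, which the short check below makes explicit. It only does arithmetic on the tabulated values; reading the results as a fixed 6-symbol overhead per packet, a roughly constant number of pixel rows per packet, and 20 Hz of f_Tx per symbol is an interpretation of the numbers, not a statement taken from the table itself.

```python
# Rows of Table 3: (payload symbols, total symbols, N_pps, f_Tx in Hz)
TABLE_3 = [
    (5, 11, 60.54, 220),
    (10, 16, 41.62, 320),
    (15, 21, 31.71, 420),
    (20, 26, 25.61, 520),
    (30, 36, 18.50, 720),
    (35, 41, 16.24, 820),
    (40, 46, 14.48, 920),
    (50, 56, 11.89, 1120),
    (70, 76, 8.76, 1520),
]


def derived(row):
    """Quantities implied by one table row (plain arithmetic, no model assumed)."""
    payload, total, n_pps, f_tx = row
    return {
        "overhead_symbols": total - payload,   # non-payload symbols per packet
        "rows_per_packet": n_pps * total,      # pixel rows spanned by one packet
        "hz_per_symbol": f_tx / total,         # f_Tx divided by symbols per packet
    }


checks = [derived(row) for row in TABLE_3]
for row, c in zip(TABLE_3, checks):
    print(row[0], c["overhead_symbols"], round(c["rows_per_packet"], 1), c["hz_per_symbol"])
```

Each row yields the same overhead and the same f_Tx per symbol, while the rows-per-packet product stays within a fraction of a pixel row of 666; the small spread is consistent with the two-decimal precision of N_pps.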
Table 4. Effective Rb at different IS resolutions at the FEC limits.
| IS Resolution | Rb (bps) at N_pps = 26 (w/o equalization), Rf = 30 fps | Rb (bps) at N_pps = 26 (w/o equalization), Rf = 60 fps | Rb (bps) at N_pps = 20 (with equalization), Rf = 30 fps | Rb (bps) at N_pps = 20 (with equalization), Rf = 60 fps |
| --- | --- | --- | --- | --- |
| 1200 × 1800 | 3794 | 7588 | 5040 | 10,080 |
| 1500 × 2100 | 4486 | 8972 | 5940 | 11,880 |
| 1800 × 2400 | 5178 | 10,357 | 6840 | 13,680 |
| 2100 × 3000 | 6563 | 13,126 | 8640 | 17,280 |
| 2400 × 3000 | 6563 | 13,126 | 8640 | 17,280 |
| 3300 × 4200 | 9332 | 18,665 | 12,240 | 24,480 |
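The entries of Table 4 can be reproduced with a simple expression, sketched below. The formula is inferred from the tabulated numbers rather than quoted from the text: it assumes the second figure of each resolution is the number of pixel rows swept by the rolling shutter, 2 bits per 4-PAM symbol, and the 6 overhead symbols per frame suggested by Table 3. The function name `effective_bit_rate` is hypothetical.

```python
BITS_PER_SYMBOL = 2     # 4-PAM carries 2 bits per symbol
OVERHEAD_SYMBOLS = 6    # assumption: total minus payload symbols in Table 3


def effective_bit_rate(rows, n_pps, frame_rate):
    """Inferred relation: payload symbols per frame x bits per symbol x frame rate."""
    symbols_per_frame = rows / n_pps                 # symbols that fit in one frame
    payload_symbols = symbols_per_frame - OVERHEAD_SYMBOLS
    return round(payload_symbols * BITS_PER_SYMBOL * frame_rate)


# Row counts taken as the second figure of each resolution in Table 4.
for rows in (1800, 2100, 2400, 3000, 4200):
    print(rows,
          effective_bit_rate(rows, 26, 30), effective_bit_rate(rows, 26, 60),
          effective_bit_rate(rows, 20, 30), effective_bit_rate(rows, 20, 60))
```

Under these assumptions the gain from equalization comes entirely from the smaller N_pps: tolerating 20 instead of 26 row pixels per symbol packs more symbols into each frame, which is why the 2100 × 3000 and 2400 × 3000 sensors give identical rates.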
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Younus, O.I.; Hassan, N.B.; Ghassemlooy, Z.; Zvanovec, S.; Alves, L.N.; Le-Minh, H. The Utilization of Artificial Neural Network Equalizer in Optical Camera Communications. Sensors 2021, 21, 2826. https://doi.org/10.3390/s21082826
