Data-Driven Symbol Detection for Intersymbol Interference Channels with Bursty Impulsive Noise (2405.10814v1)
Abstract: We developed machine learning approaches for data-driven trellis-based soft symbol detection in coded transmission over intersymbol interference (ISI) channels in presence of bursty impulsive noise (IN), for example encountered in wireless digital broadcasting systems and vehicular communications. This enabled us to obtain optimized detectors based on the Bahl-Cocke-Jelinek-Raviv (BCJR) algorithm while circumventing the use of full channel state information (CSI) for computing likelihoods and trellis state transition probabilities. First, we extended the application of the neural network (NN)-aided BCJR, recently proposed for ISI channels with additive white Gaussian noise (AWGN). Although suitable for estimating likelihoods via labeling of transmission sequences, the BCJR-NN method does not provide a framework for learning the trellis state transitions. In addition to detection over the joint ISI and IN states we also focused on another scenario where trellis transitions are not trivial: detection for the ISI channel with AWGN with inaccurate knowledge of the channel memory at the receiver. Without access to the accurate state transition matrix, the BCJR- NN performance significantly degrades in both settings. To this end, we devised an alternative approach for data-driven BCJR detection based on the unsupervised learning of a hidden Markov model (HMM). The BCJR-HMM allowed us to optimize both the likelihood function and the state transition matrix without labeling. Moreover, we demonstrated the viability of a hybrid NN and HMM BCJR detection where NN is used for learning the likelihoods, while the state transitions are optimized via HMM. While reducing the required prior channel knowledge, the examined data-driven detectors with learned trellis state transitions achieve bit error rates close to the optimal full CSI-based BCJR, significantly outperforming detection with inaccurate CSI.
- O. Simeone, “A very brief introduction to machine learning with applications to communication systems,” IEEE Trans. Cogn. Commun. Netw., vol. 4, no. 4, pp. 648-664.
- K. Hornik, M. Stinchcombe, and H. White, “Multilayer feedforward networks are universal approximators,” Neural Networks, 1989.
- T. O’Shea and J. Hoydis, “An introduction to deep learning for the physical layer,” IEEE Trans. Cogn. Commun. Netw., vol. 3, no. 4, pp. 563-575, 2017.
- S. Dörner, S. Cammerer, J. Hoydis and S. ten Brink, “Deep learning based communication over the air,” IEEE J. Sel. Topics Signal Process., vol. 12, no. 1, pp. 132-143, 2018.
- B. Karanov, M. Chagnon, F. Thouin, T. A. Eriksson, H. Bülow, D. Lavery, P. Bayvel and L. Schmalen, “End-to-end deep learning of optical fiber communications,” IEEE/Optica J. Lightw. Technol., vol. 36, no. 20, pp. 4843-4855, 2018.
- B. Karanov, D. Lavery, P. Bayvel and L. Schmalen, “End-to-end optimized transmission over dispersive intensity-modulated channels using bidirectional recurrent neural networks,” Opt. Express, vol. 27, no. 14, pp. 19650-19663, 2019.
- B. Karanov, L. Schmalen and A. Alvarado, “Distance-agnostic auto-encoders for short reach fiber communications,” in Optical Fiber Communications Conference (OFC), 2021, pp. 1-3.
- F. A. Aoudia and J. Hoydis, “Model-free training of end-to-end communication systems,” IEEE J. Sel. Areas Commun., vol. 37, no. 11, pp. 2503-2516, 2019.
- B. Karanov, M. Chagnon, V. Aref, D. Lavery, P. Bayvel and L. Schmalen, “Concept and Experimental demonstration of optical IM/DD end-to-end system optimization using a generative model,” in Proc. Optical Fiber Communications Conference and Exhibition (OFC), 2020, pp. 1-3.
- C. Häger and H. D. Pfister, “Physics-based deep learning for fiber-optic communication systems,” IEEE J. Sel. Areas Commun., vol. 39, no. 1, pp. 280-294, 2021.
- N. Farsad and A. Goldsmith, “Neural network detection of data sequences in communication systems,” IEEE Trans. Signal Process., vol. 66, no. 21, pp. 5663-5678, 2018.
- E. Nachmani, E. Marciano, L. Lugosch, W. J. Gross, D. Burshtein and Y. Be’ery, “Deep learning methods for improved decoding of linear codes,” IEEE J. Sel. Top. Signal Process., vol. 12, no. 1, pp. 119-131, 2018.
- G. D. Forney, “Maximum-likelihood sequence estimation of digital sequences in the presence of intersymbol interference,” IEEE Trans. Inform. Theory, vol. 18, no. 3, pp. 363-378, 1972.
- L. R. Bahl, J. Cocke, F. Jelinek, and J. Raviv, “Optimal decoding of linear codes for minimizing symbol error rate,” IEEE Trans. Inform. Theory, vol. 20, no. 2, pp. 284–287, 1974.
- N. Shlezinger, N. Farsad, Y. C. Eldar and A. J. Goldsmith, “ViterbiNet: A deep learning based Viterbi algorithm for symbol detection,” IEEE Trans. Wireless Commun., vol. 19, no. 5, pp. 3319-3331, 2020.
- N. Shlezinger, N. Farsad, Y. C. Eldar and A. J. Goldsmith, “Learned factor graphs for inference from stationary time sequences,” IEEE Trans. Signal Process., vol. 70, pp. 366-380, 2022.
- N. Shlezinger, J. Whang, Y. C. Eldar and A. G. Dimakis, “Model-based deep learning,” Proc. IEEE, vol. 111, no. 5, pp. 465-499, 2023.
- M. Mostafa, “Stability proof of iterative interference cancellation for OFDM signals with blanking nonlinearity in impulsive noise channels,” IEEE Signal Process. Lett., vol. 24, no. 2, pp. 201-205, 2017.
- P. Yang, Y. L. Guan, X. B. Liu and Z. Liu, “An improved hybrid turbo equalizer for single carrier transmission with impulsive noise and ISI,” IEEE Trans. Veh. Technol., vol. 66, no. 11, pp. 9852-9861, 2017.
- H. Oh and H. Nam, “Design and performance analysis of nonlinearity preprocessors in an impulsive noise environment,” in IEEE Trans. Veh. Technol., vol. 66, no. 1, pp. 364-376, 2017.
- S. Liu, F. Yang, W. Ding and J. Song, “Double kill: Compressive-sensing-based narrow-band interference and impulsive noise mitigation for vehicular communications,” IEEE Trans. Veh. Technol., vol. 65, no. 7, pp. 5099-5109, 2016.
- D. Middleton, “Statistical-physical model of electromagnetic interference,” IEEE Trans. Electromagn. Compat., vol. EMC-19, no. 3, pt. 1, pp. 106–127, 1977.
- D. Middleton, “Non-Gaussian noise models in signal processing for telecommunications: new methods an results for class A and class B noise models,” IEEE Trans. Inf. Theory, vol. 45, no. 4, pp. 1129-1149, 1999.
- G. Ndo, F. Labeau and M. Kassouf, “A Markov-Middleton model for bursty impulsive noise: Modeling and receiver design,” IEEE Trans. Power Del., vol. 28, no. 4, pp. 2317-2325, 2013.
- A. Mirbadin, A. Vannucci, G. Colavolpe, R. Pecori and L. Veltri, “Iterative receiver design for the estimation of Gaussian samples in impulsive noise,” Appl. Sci., vol. 11, no. 2: 557, 2021.
- L. R. Rabiner, “A tutorial on hidden Markov models and selected applications in speech recognition,” Proc. IEEE, vol. 77, no. 2, pp. 257-286, 1989.
- L. E. Baum, “An inequality and associated maximization technique in statistical estimation for probabilistic functions of Markov processes,” Inequalities III: Proceedings of the 3rd Symposium on Inequalities, pp. 1-8, 1972.
- L. R. Welch, “Hidden Markov models and the Baum-Welch algorithm,” IEEE Information Theory Society Newsletter, vol. 53, no. 4, pp. 1&10-13, 2003.
- G. K. Kaleh and R. Vallet, “Joint parameter estimation and symbol Detection for linear or nonlinear unknown channels,” IEEE Trans. Commun., vol. 42, no. 7, pp. 2406-2413, 1994.
- L. Schmid, T. Raviv, N. Shlezinger, and Laurent Schmalen, “Blind channel estimation and joint symbol detection with data-driven factor graphs,” ArXiv preprint arXiv:2401.12627, 2024.
- C.-H. Chen, B. Karanov, W. v. Houtum, W. Yan, A. Young and A. Alvarado, “On the robustness of deep learning-aided symbol detectors to varying conditions and imperfect channel knowledge,” in Proc. IEEE Wireless Communications and Networking Conference (WCNC), 2024, pp. 1-6.
- C.-H. Chen, W.-H. Huang, B. Karanov, Y. Wu, A. Young, W. v. Houtum, “Analysis of impulsive interference in digital audio broadcasting systems in electric vehicles,” in Proc. Symposium on Information Theory and Signal Procssing in the Benelux (SITB), 2024.
- K. S. Gilhousen, J. A. Heller, I. M. Jacobs, and A. J. Viterbi, “Coding systems study for high data rate telemetry links,” NASA Rep., Jan. 1971, prepared under Contract NAS2-6024, Linkabit Corp., San Diego, CA, USA, pp. 235.