LinFormer: A Linear-based Lightweight Transformer Architecture For Time-Aware MIMO Channel Prediction (2410.21351v1)
Abstract: The emergence of 6th generation (6G) mobile networks brings new challenges in supporting high-mobility communications, particularly in addressing the issue of channel aging. While existing channel prediction methods offer improved accuracy at the expense of increased computational complexity, limiting their practical application in mobile networks. To address these challenges, we present LinFormer, an innovative channel prediction framework based on a scalable, all-linear, encoder-only Transformer model. Our approach, inspired by NLP models such as BERT, adapts an encoder-only architecture specifically for channel prediction tasks. We propose replacing the computationally intensive attention mechanism commonly used in Transformers with a time-aware multi-layer perceptron (TMLP), significantly reducing computational demands. The inherent time awareness of TMLP module makes it particularly suitable for channel prediction tasks. We enhance LinFormer's training process by employing a weighted mean squared error loss (WMSELoss) function and data augmentation techniques, leveraging larger, readily available communication datasets. Our approach achieves a substantial reduction in computational complexity while maintaining high prediction accuracy, making it more suitable for deployment in cost-effective base stations (BS). Comprehensive experiments using both simulated and measured data demonstrate that LinFormer outperforms existing methods across various mobility scenarios, offering a promising solution for future wireless communication systems.
- R. M. Dreifuerst and R. W. Heath, “Massive MIMO in 5G: How beamforming, codebooks, and feedback enable larger arrays,” IEEE Communications Magazine, vol. 61, no. 12, pp. 18–23, 2023.
- C.-X. Wang, X. You, X. Gao, X. Zhu, Z. Li, C. Zhang, H. Wang, Y. Huang, Y. Chen, H. Haas et al., “On the road to 6G: Visions, requirements, key technologies, and testbeds,” IEEE Communications Surveys & Tutorials, vol. 25, no. 2, pp. 905–974, 2023.
- X. You, C.-X. Wang, J. Huang, X. Gao, Z. Zhang, M. Wang, Y. Huang, C. Zhang, Y. Jiang, J. Wang et al., “Towards 6g wireless communication networks: Vision, enabling technologies, and new paradigm shifts,” Science China Information Sciences, vol. 64, pp. 1–74, 2021.
- K. T. Truong and R. W. Heath, “Effects of channel aging in massive MIMO systems,” Journal of Communications and Networks, vol. 15, no. 4, pp. 338–351, 2013.
- H. Yin, H. Wang, Y. Liu, and D. Gesbert, “Addressing the curse of mobility in massive MIMO with prony-based angular-delay domain channel predictions,” IEEE Journal on Selected Areas in Communications, vol. 38, no. 12, pp. 2903–2917, 2020.
- D. Astely, P. Von Butovitsch, S. Faxér, and E. Larsson, “Meeting 5g network requirements with massive MIMO,” Ericsson Technology Review, vol. 2022, no. 1, pp. 2–11, 2022.
- H. P. Bui, Y. Ogawa, T. Nishimura, and T. Ohgane, “Performance evaluation of a multi-user MIMO system with prediction of time-varying indoor channels,” IEEE transactions on antennas and propagation, vol. 61, no. 1, pp. 371–379, 2012.
- I. C. Wong and B. L. Evans, “Joint channel estimation and prediction for OFDM systems,” in GLOBECOM’05. IEEE Global Telecommunications Conference, 2005., vol. 4. IEEE, 2005, pp. 5–pp.
- H. Kim, S. Kim, H. Lee, C. Jang, Y. Choi, and J. Choi, “Massive MIMO channel prediction: Kalman filtering vs. machine learning,” IEEE Transactions on Communications, vol. 69, no. 1, pp. 518–528, 2020.
- J. Yuan, H. Q. Ngo, and M. Matthaiou, “Machine learning-based channel prediction in massive MIMO with channel aging,” IEEE Transactions on Wireless Communications, vol. 19, no. 5, pp. 2960–2973, 2020.
- C. Huang, R. He, B. Ai, A. F. Molisch, B. K. Lau, K. Haneda, B. Liu, C.-X. Wang, M. Yang, C. Oestges et al., “Artificial intelligence enabled radio propagation for communications—Part II: Scenario identification and channel modeling,” IEEE Transactions on Antennas and Propagation, vol. 70, no. 6, pp. 3955–3969, 2022.
- Z. Xiao, Z. Zhang, C. Huang, X. Chen, C. Zhong, and M. Debbah, “C-GRBFnet: A physics-inspired generative deep neural network for channel representation and prediction,” IEEE Journal on Selected Areas in Communications, vol. 40, no. 8, pp. 2282–2299, 2022.
- C. Wu, X. Yi, Y. Zhu, W. Wang, L. You, and X. Gao, “Channel prediction in high-mobility massive MIMO: From spatio-temporal autoregression to deep learning,” IEEE Journal on Selected Areas in Communications, vol. 39, no. 7, pp. 1915–1930, 2021.
- J. L. Elman, “Finding structure in time,” Cognitive science, vol. 14, no. 2, pp. 179–211, 1990.
- C. Liu, X. Liu, Z. Wei, S. Hu, D. W. K. Ng, and J. Yuan, “Deep learning-empowered predictive beamforming for IRS-assisted multi-user communications,” in 2021 IEEE Global Communications Conference (GLOBECOM). IEEE, 2021, pp. 01–07.
- J. Chung, C. Gulcehre, K. Cho, and Y. Bengio, “Empirical evaluation of gated recurrent neural networks on sequence modeling,” arXiv preprint arXiv:1412.3555, 2014.
- O. Stenhammar, G. Fodor, and C. Fischione, “A comparison of neural networks for wireless channel prediction,” IEEE Wireless Communications, 2024.
- S. Hochreiter and J. Schmidhuber, “Long short-term memory,” Neural computation, vol. 9, no. 8, pp. 1735–1780, 1997.
- W. Xu, J. An, Y. Xu, C. Huang, L. Gan, and C. Yuen, “Time-varying channel prediction for RIS-assisted MU-MISO networks via deep learning,” IEEE Transactions on Cognitive Communications and Networking, vol. 8, no. 4, pp. 1802–1815, 2022.
- W. Jiang and H. D. Schotten, “Deep learning for fading channel prediction,” IEEE Open Journal of the Communications Society, vol. 1, pp. 320–332, 2020.
- A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, and I. Polosukhin, “Attention is all you need,” Advances in neural information processing systems, vol. 30, 2017.
- H. Jiang, M. Cui, D. W. K. Ng, and L. Dai, “Accurate channel prediction based on transformer: Making mobility negligible,” IEEE Journal on Selected Areas in Communications, vol. 40, no. 9, pp. 2717–2732, 2022.
- E. Eldele, M. Ragab, Z. Chen, M. Wu, and X. Li, “Tslanet: Rethinking transformers for time series representation learning,” arXiv preprint arXiv:2404.08472, 2024.
- A. Zeng, M. Chen, L. Zhang, and Q. Xu, “Are transformers effective for time series forecasting?” in Proceedings of the AAAI conference on artificial intelligence, vol. 37, no. 9, 2023, pp. 11 121–11 128.
- Z. Li, S. Qi, Y. Li, and Z. Xu, “Revisiting long-term time series forecasting: An investigation on linear mapping,” arXiv preprint arXiv:2305.10721, 2023.
- O. El Ayach, S. Rajagopal, S. Abu-Surra, Z. Pi, and R. W. Heath, “Spatially sparse precoding in millimeter wave MIMO systems,” IEEE transactions on wireless communications, vol. 13, no. 3, pp. 1499–1513, 2014.
- 3GPP, “Radio Resource Control (RCC) Protocol Specification, document TS 38.331,” 3GPP, Tech. Rep. 38.331, Jun. 2019, version 15.6.0.
- S. H. Lim, S. Kim, B. Shim, and J. W. Choi, “Deep learning-based beam tracking for millimeter-wave communications under mobility,” IEEE Transactions on Communications, vol. 69, no. 11, pp. 7458–7469, 2021.
- C. Liu, M. Li, L. Zhao, P. Whiting, S. V. Hanly, I. B. Collings, and M. Zhao, “Robust adaptive beam tracking for mobile millimetre wave communications,” IEEE Transactions on Wireless Communications, vol. 20, no. 3, pp. 1918–1934, 2020.
- C.-X. Wang, J. Huang, H. Wang, X. Gao, X. You, and Y. Hao, “6g wireless channel measurements and models: Trends and challenges,” IEEE Vehicular Technology Magazine, vol. 15, no. 4, pp. 22–32, 2020.
- C.-X. Wang, Z. Lv, X. Gao, X. You, Y. Hao, and H. Haas, “Pervasive wireless channel modeling theory and applications to 6G GBSMs for all frequency bands and all scenarios,” IEEE Transactions on Vehicular Technology, vol. 71, no. 9, pp. 9159–9173, 2022.
- J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova, “Bert: Pre-training of deep bidirectional transformers for language understanding,” arXiv preprint arXiv:1810.04805, 2018.
- Q. Cao, T. Kojima, Y. Matsuo, and Y. Iwasawa, “Unnatural error correction: Gpt-4 can almost perfectly handle unnatural scrambled text,” in Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023, pp. 8898–8913.
- R. Xiong, Y. Yang, D. He, K. Zheng, S. Zheng, C. Xing, H. Zhang, Y. Lan, L. Wang, and T. Liu, “On layer normalization in the transformer architecture,” in International Conference on Machine Learning. PMLR, 2020, pp. 10 524–10 533.
- 3GPP, “Study on channel model for frequencies from 0.5 to 100 ghz,” 3GPP, Tech. Rep. 38.901, Jan. 2020, version 16.1.0.
- MATLAB and MathWorks, “5g toolbox,” https://www.mathworks.com/products/5g.html, 2020, accessed: Sep. 15, 2020.
- I. Loshchilov and F. Hutter, “Decoupled weight decay regularization,” arXiv preprint arXiv:1711.05101, 2017.
- L. N. Smith and N. Topin, “Super-convergence: Very fast training of neural networks using large learning rates,” in Artificial intelligence and machine learning for multi-domain operations applications, vol. 11006. SPIE, 2019, pp. 369–386.
- F. Peng, S. Zhang, Z. Jiang, X. Wang, and W. Chen, “A novel mobility induced channel prediction mechanism for vehicular communications,” IEEE Transactions on Wireless Communications, vol. 22, no. 5, pp. 3488–3502, 2022.
- Y. Nie, N. H. Nguyen, P. Sinthong, and J. Kalagnanam, “A time series is worth 64 words: Long-term forecasting with transformers,” arXiv preprint arXiv:2211.14730, 2022.
Sponsor
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.