Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
125 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Improvement Of Audiovisual Quality Estimation Using A Nonlinear Autoregressive Exogenous Neural Network And Bitstream Parameters (2402.18056v1)

Published 28 Feb 2024 in eess.IV, cs.SD, and eess.AS

Abstract: With the increasing demand for audiovisual services, telecom service providers and application developers are compelled to ensure that their services provide the best possible user experience. Particularly, services such as videoconferencing are very sensitive to network conditions. Therefore, their performance should be monitored in real time in order to adjust parameters to any network perturbation. In this paper, we developed a parametric model for estimating the perceived audiovisual quality in videoconference services. Our model is developed with the nonlinear autoregressive exogenous (NARX) recurrent neural network and estimates the perceived quality in terms of mean opinion score (MOS). We validate our model using the publicly available INRS bitstream audiovisual quality dataset. This dataset contains bitstream parameters such as loss per frame, bit rate and video duration. We compare the proposed model against state-of-the-art methods based on machine learning and show our model to outperform these methods in terms of mean square error (MSE=0.150) and Pearson correlation coefficient (R=0.931)

Definition Search Book Streamline Icon: https://streamlinehq.com
References (22)
  1. Cisco, “Cisco Visual Networking Index: Forecast and Trends, 2017–2022 White Paper,” 2019.
  2. “Video artifact evaluation based on qos and objective qoe parameters,” in 2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI), 2017, pp. 1048–1054.
  3. Cisco, “Cisco visual networking index: Forecast and methodology,” 2015.
  4. “A survey on parametric qoe estimation for popular services,” Journal of Network and Computer Applications, vol. 77, pp. 1–17, 2017.
  5. “Itu-t recommendation g.1070: Opinion model for video-telephony applications,” 2018.
  6. “Itu-t recommendation p.1201: Parametric non-intrusive assessment of audiovisual media streaming quality,” 2012.
  7. “Itu-t recommendation g.107: The e model, a computational model for use in transmission planning,” 2015.
  8. “Audiovisual quality estimation for video calls in wireless applications,” in 2010 IEEE Global Telecommunications Conference GLOBECOM 2010, 2010, pp. 1–5.
  9. “Machine learning based reduced reference bitstream audiovisual quality prediction models for realtime communications,” in 2017 IEEE International Conference on Multimedia and Expo (ICME), 2017, pp. 571–576.
  10. “Inrs audiovisual quality dataset,” in Proceedings of the 24th ACM International Conference on Multimedia, New York, NY, USA, 2016, MM ’16, p. 167–171, Association for Computing Machinery.
  11. José Maria P. Menezes and Guilherme A. Barreto, “Long-term time series prediction with the narx network: An empirical evaluation,” Elsevier Science Publishers B. V., vol. 71, no. 16–18, pp. 3335–3343, oct 2008.
  12. “Research and application of data mining and narx neural networks in load forecasting,” International journal of database theory and application, vol. 7, pp. 13–24, 2014.
  13. “Learning long-term dependencies is not as difficult with narx networks,” in Proceedings of the 8th International Conference on Neural Information Processing Systems, Cambridge, MA, USA, 1995, NIPS’95, p. 577–583, MIT Press.
  14. “Computational capabilities of recurrent narx neural networks,” IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), vol. 27, no. 2, pp. 208–215, 1997.
  15. S.S. Haykin, Neural Networks and Learning Machines, Number vol. 10 in Neural networks and learning machines. Prentice Hall, 2009.
  16. “A delay damage model selection algorithm for narx neural networks,” IEEE Transactions on Signal Processing, vol. 45, no. 11, pp. 2719–2730, 1997.
  17. Kayri, “Predictive abilities of bayesian regularization and levenberg–marquardt algorithms in artificial neural networks: A comparative empirical study on social data,” MDPI AG, vol. 21, May 2016.
  18. “High-precision indoor visible light positioning using deep neural network based on the bayesian regularization with sparse training point,” IEEE Photonics Journal, vol. 11, no. 3, pp. 1–10, 2019.
  19. “Rfc3550: Rtp: A transport protocol for real-time applications,” 2003.
  20. “Itu-t recommendation p.911: Subjective audiovisual quality assessment methods for multimedia applications,” 1998.
  21. “Cross-validation techniques for determining the number of correlated components between two data sets when the number of samples is very small,” in 2016 50th Asilomar Conference on Signals, Systems and Computers, 2016, pp. 601–605.
  22. “Itu-t recommendation p.1401: Methods, metrics and procedures for statistical evaluation, qualification and comparison of objective quality prediction models,” 2012.

Summary

We haven't generated a summary for this paper yet.