Bitrate Ladder Construction using Visual Information Fidelity (2312.07780v2)
Abstract: Recently proposed perceptually optimized per-title video encoding methods provide better BD-rate savings than the fixed bitrate-ladder approaches employed in the past. A disadvantage of per-title encoding, however, is that computing bitrate ladders requires significant time and energy. Over the past few years, a variety of methods have been proposed to construct optimal bitrate ladders, including using low-level features to predict cross-over bitrates, predicting the optimal resolution for each bitrate, and predicting visual quality. Here, we use features drawn from Visual Information Fidelity (VIF), extracted from uncompressed videos, to predict the visual quality (VMAF) of compressed videos. We present multiple VIF feature sets, extracted from different scales and subbands of a video, to tackle the problem of bitrate ladder construction. Using Bjontegaard delta metrics, we compare the resulting ladders against a fixed bitrate ladder and a bitrate ladder obtained from exhaustive encoding.
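To make the ladder-construction step concrete, the following is a minimal sketch of how predicted quality scores could be turned into a bitrate ladder: for each target bitrate, keep the resolution with the highest predicted VMAF. It is an illustration under stated assumptions, not the authors' method. The function name `build_ladder`, the dictionary layout, and all numeric values are hypothetical, and the VMAF predictions are assumed to come from some upstream model (for example, one driven by VIF features).

```python
from typing import Dict, List, Tuple

# Key: (frame height in pixels, bitrate in kbps); value: predicted VMAF.
# This layout is an assumption made for illustration only.
PredictedQuality = Dict[Tuple[int, int], float]


def build_ladder(
    predicted_vmaf: PredictedQuality, bitrates: List[int]
) -> List[Tuple[int, int, float]]:
    """Return (bitrate, height, predicted VMAF) rungs, one per target bitrate.

    For each bitrate, the resolution with the highest predicted VMAF is
    selected; ties are broken in favour of the larger resolution.
    Bitrates without any prediction are skipped.
    """
    ladder = []
    for bitrate in sorted(bitrates):
        # Collect (quality, height) pairs available at this bitrate.
        candidates = [
            (quality, height)
            for (height, rate), quality in predicted_vmaf.items()
            if rate == bitrate
        ]
        if not candidates:
            continue
        quality, height = max(candidates)
        ladder.append((bitrate, height, quality))
    return ladder


if __name__ == "__main__":
    # Made-up predictions: lower resolutions win at low bitrates,
    # higher resolutions win at high bitrates.
    predictions = {
        (540, 500): 62.0, (720, 500): 58.0, (1080, 500): 48.0,
        (540, 3000): 84.0, (720, 3000): 89.0, (1080, 3000): 93.0,
    }
    for rung in build_ladder(predictions, [3000, 500]):
        print(rung)
```

With the made-up values above, the 500 kbps rung selects 540p and the 3000 kbps rung selects 1080p, which mirrors the cross-over behaviour that per-title methods try to predict without exhaustive encoding.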