TY - GEN
T1 - Continuous QoE Prediction Based on WaveNet
AU - Tan, Phan Xuan
AU - Duc, Tho Nguyen
AU - Tran, Chanh Minh
AU - Kamioka, Eiji
N1 - Publisher Copyright:
© 2020 ACM.
PY - 2020/2/14
Y1 - 2020/2/14
N2 - Continuous QoE prediction is crucial in the purpose of maximizing viewer satisfaction, by which video service providers could improve the revenue. Continuously predicting QoE is challenging since it requires QoE models that are capable of capturing the complex dependencies among QoE influence factors. The existing approaches that utilize Long-Short-Term-Memory (LSTM) network successfully model such long-term dependencies, providing the superior QoE prediction performance. However, the inherent drawback of sequential computing of LSTM will result in high computational cost in training and prediction tasks. Recently, WaveNet, a deep neural network for generating raw audio waveform, has been introduced. Immediately, it gains a great attention since it successfully leverages the characteristic of parallel computing of causal convolution and dilated convolution to deal with time-series data (e.g., audio signal). Being inspired by the success of WaveNet, in this paper, we propose WaveNet-based QoE model for continuous QoE prediction in video streaming services. The model is trained and tested upon on two publicly available databases, namely, LFOVIA Video QoE and LIVE Mobile Stall Video II. The experimental results demonstrate that the proposed model outperforms the baselines models in terms of processing time, while maintaining sufficient accuracy.
AB - Continuous QoE prediction is crucial in the purpose of maximizing viewer satisfaction, by which video service providers could improve the revenue. Continuously predicting QoE is challenging since it requires QoE models that are capable of capturing the complex dependencies among QoE influence factors. The existing approaches that utilize Long-Short-Term-Memory (LSTM) network successfully model such long-term dependencies, providing the superior QoE prediction performance. However, the inherent drawback of sequential computing of LSTM will result in high computational cost in training and prediction tasks. Recently, WaveNet, a deep neural network for generating raw audio waveform, has been introduced. Immediately, it gains a great attention since it successfully leverages the characteristic of parallel computing of causal convolution and dilated convolution to deal with time-series data (e.g., audio signal). Being inspired by the success of WaveNet, in this paper, we propose WaveNet-based QoE model for continuous QoE prediction in video streaming services. The model is trained and tested upon on two publicly available databases, namely, LFOVIA Video QoE and LIVE Mobile Stall Video II. The experimental results demonstrate that the proposed model outperforms the baselines models in terms of processing time, while maintaining sufficient accuracy.
KW - Causal Convolution
KW - Deep Learning
KW - LSTM
KW - PixelCNN
KW - Quality of Experience (QoE)
KW - Video Streaming
KW - WaveNet
UR - http://www.scopus.com/inward/record.url?scp=85085691958&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85085691958&partnerID=8YFLogxK
U2 - 10.1145/3384613.3384633
DO - 10.1145/3384613.3384633
M3 - Conference contribution
AN - SCOPUS:85085691958
T3 - ACM International Conference Proceeding Series
SP - 80
EP - 84
BT - Proceedings of the 2020 12th International Conference on Computer and Automation Engineering, ICCAE 2020
PB - Association for Computing Machinery
T2 - 12th International Conference on Computer and Automation Engineering, ICCAE 2020
Y2 - 14 February 2020 through 16 February 2020
ER -