Resource management for power-constrained HEVC transcoding using reinforcement learning

Costero, Luis (Universidad Complutense de Madrid, Madrid, Spain) ; Iranfar, Arman (EPFL, Lausanne, Switzerland) ; Zapater, Marina (EPFL, Lausanne, Switzerland ; School of Management and Engineering Vaud, HES-SO // University of Applied Sciences Western Switzerland) ; Igual, Francisco D. (Universidad Complutense de Madrid, Madrid, Spain) ; Olcoz, Katzalin (Universidad Complutense de Madrid, Madrid, Spain) ; Atienza, David (EPFL, Lausanne, Switzerland)

The advent of online video streaming applications and services, along with users' demand for high-quality content, requires High Efficiency Video Coding (HEVC), which provides higher video quality and more compression at the cost of increased complexity. On one hand, HEVC exposes a set of dynamically tunable parameters to provide trade-offs among Quality-of-Service (QoS), performance, and power consumption of multi-core servers in the video providers' data center. On the other hand, resource management of modern multi-core servers is in charge of adapting system-level parameters, such as operating frequency and multithreading, to deal with concurrent applications and their requirements. Therefore, efficient multi-user HEVC streaming necessitates joint adaptation of application- and system-level parameters. Nonetheless, dealing with such a large and dynamic design space is challenging and difficult to address through conventional resource management strategies. Thus, in this work, we develop a multi-agent Reinforcement Learning framework to jointly adjust application- and system-level parameters at runtime to satisfy the QoS of multi-user HEVC streaming in power-constrained servers. In particular, the design space, composed of all design parameters, is split into smaller independent sub-spaces. Each design sub-space is assigned to a particular agent so that it can explore it faster, yet accurately. The benefits of our approach are revealed in terms of adaptability and quality (with up to 4× improvements in terms of QoS when compared to a static resource management scheme), and learning time (6× faster than an equivalent mono-agent implementation). Finally, we show that the power-capping techniques formulated outperform the hardware-based power capping with respect to quality.
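
The idea of splitting the design space among agents can be illustrated with a minimal sketch. It is not the authors' implementation: the knob names and values, the `toy_reward` model, the single placeholder state, and the bandit-style update (a simplification of full Reinforcement Learning with no bootstrapping over future states) are all assumptions made for illustration. It shows only why per-knob agents have less to explore: a single agent faces the product of the knob ranges, while independent agents face their sum.

```python
import random
from math import prod

# Hypothetical knob values, chosen only for illustration (not from the paper).
KNOBS = {
    "qp": [22, 27, 32, 37],            # HEVC quantization parameter (application level)
    "threads": [1, 2, 4, 8],           # encoder threads
    "freq_ghz": [1.2, 1.6, 2.0, 2.4],  # core operating frequency (system level)
}


class Agent:
    """Tabular epsilon-greedy learner that owns a single knob."""

    def __init__(self, actions, alpha=0.1, epsilon=0.1):
        self.actions = actions
        self.alpha = alpha      # learning rate
        self.epsilon = epsilon  # exploration probability
        self.q = {}             # (state, action) -> estimated value

    def act(self, state, rng):
        # Explore with probability epsilon, otherwise pick the best-known action.
        if rng.random() < self.epsilon:
            return rng.choice(self.actions)
        return max(self.actions, key=lambda a: self.q.get((state, a), 0.0))

    def learn(self, state, action, reward):
        # Bandit-style update: move the estimate toward the observed reward.
        old = self.q.get((state, action), 0.0)
        self.q[(state, action)] = old + self.alpha * (reward - old)


def joint_space_size(knobs):
    """Configurations a single (mono-agent) learner must distinguish."""
    return prod(len(values) for values in knobs.values())


def split_space_size(knobs):
    """Table entries per state when each knob has its own agent."""
    return sum(len(values) for values in knobs.values())


def toy_reward(cfg, target_fps=30.0, power_cap_w=60.0):
    """Hypothetical throughput and power models; penalize FPS error and cap violations."""
    fps = 6.0 * cfg["threads"] * cfg["freq_ghz"] * (cfg["qp"] / 32.0)
    power = 5.0 + 4.0 * cfg["threads"] * cfg["freq_ghz"] ** 2
    return -abs(fps - target_fps) / target_fps - max(0.0, power - power_cap_w) / power_cap_w


def run(steps=2000, seed=0):
    """Train one agent per knob on a shared reward; return each agent's preferred value."""
    rng = random.Random(seed)
    agents = {name: Agent(values) for name, values in KNOBS.items()}
    state = "steady"  # single placeholder state for this sketch
    for _ in range(steps):
        cfg = {name: agent.act(state, rng) for name, agent in agents.items()}
        reward = toy_reward(cfg)
        for name, agent in agents.items():
            agent.learn(state, cfg[name], reward)
    return {
        name: max(agent.actions, key=lambda a: agent.q.get((state, a), 0.0))
        for name, agent in agents.items()
    }
```

With these assumed ranges, `joint_space_size(KNOBS)` is 64 while `split_space_size(KNOBS)` is 12; the gap widens quickly as knobs or values are added, which is one plausible reading of why the multi-agent formulation learns faster than a mono-agent one.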

Article Type:
Engineering and Architecture
ReDS - Reconfigurable & embedded Digital Systems
pp. 2834-2850
Published in:
IEEE Transactions on Parallel and Distributed Systems
Numeration (vol. no.):
2020, vol. 31, no. 12
Note: The status of this file is: restricted

 Record created 2021-04-06, last modified 2021-04-07
