EPtask: Deep Reinforcement Learning based Energy-efficient and Priority-aware Task Scheduling for Dynamic Vehicular Edge Computing

Li, Peisong, Xiao, Ziren, Wang, Xinheng ORCID: 0000-0001-8771-8901, Huang, Kaizhu, Huang, Yi ORCID: 0000-0001-7774-1024 and Gao, Honghao (2023) EPtask: Deep Reinforcement Learning based Energy-efficient and Priority-aware Task Scheduling for Dynamic Vehicular Edge Computing. IEEE Transactions on Intelligent Vehicles, 9 (1). pp. 1-17.

PDF
EPtask_Deep_Reinforcement_Learning_based_Energy-efficient_and_Priority-aware_Task_Scheduling_for_Dynamic_Vehicular_Edge_Computing.pdf - Author Accepted Manuscript
Download (1MB) | Preview

Official URL: http://dx.doi.org/10.1109/tiv.2023.3321679

Abstract

The increasing complexity of vehicles has led to a growing demand for in-vehicle services that rely on multiple sensors. In the Vehicular Edge Computing (VEC) paradigm, energy-efficient task scheduling is critical to achieving optimal completion time and energy consumption. Although extensive research has been conducted in this field, challenges remain in meeting the requirements of time-sensitive services and adapting to dynamic traffic environments. In this context, a novel algorithm called Multi-action and Environment-adaptive Proximal Policy Optimization algorithm (MEPPO) is designed based on the conventional PPO algorithm and then a joint task scheduling and resource allocation method is proposed based on the designed MEPPO algorithm. In specific, the method involves three core aspects. Firstly, task scheduling strategy is designed to generate task offloading decisions and priority assignment decisions for the tasks utilizing PPO algorithm, which can further reduce the completion time of service requests. Secondly, transmit power allocation scheme is designed considering the expected transmission distance among vehicles and edge servers, which can minimize transmission energy consumption by adjusting the allocated transmit power dynamically. Thirdly, the proposed MEPPO-based scheduling method can make scheduling decisions for vehicles with different numbers of tasks by manipulating the state space of the PPO algorithm, which makes the proposed method be adaptive to real-world dynamic VEC environment. At last, the effectiveness of the proposed method is demonstrated through extensive simulation and on-site experiments.

Item Type:	Article
Uncontrolled Keywords:	7 Affordable and Clean Energy
Divisions:	Faculty of Science and Engineering > School of Electrical Engineering, Electronics and Computer Science
Depositing User:	Symplectic Admin
Date Deposited:	16 Oct 2023 08:00
Last Modified:	15 Mar 2024 17:11
DOI:	10.1109/tiv.2023.3321679
Related URLs:	Publisher
URI:	https://livrepository.liverpool.ac.uk/id/eprint/3173740