
neardws / game-theoretic-deep-reinforcement-learning

215 stars · 2 watching · 46 forks · 56.92 MB

Code of Paper "Joint Task Offloading and Resource Optimization in NOMA-based Vehicular Edge Computing: A Game-Theoretic DRL Approach", JSA 2022.

Home Page: https://www.sciencedirect.com/science/article/abs/pii/S138376212200265X

License: GNU General Public License v3.0

Python 100.00%
deep-reinforcement-learning noma potential-game resource-allocation task-offloading vehicular-networks

game-theoretic-deep-reinforcement-learning's Introduction

Hi!

I am currently a Postdoctoral Research Fellow working with Prof. Shaohua Wan at the Shenzhen Institute for Advanced Study, University of Electronic Science and Technology of China (UESTC), Shenzhen, China. I received the Ph.D. degree in Computer Science from the College of Computer Science at Chongqing University (CQU), Chongqing, China, in 2023, and the B.S. degree in Network Engineering from the School of Computer and Control Engineering at the North University of China (NUC), Taiyuan, China, in 2017. I have authored and co-authored more than 10 papers.

My research interests include:

  • Vehicular Networks
  • Vehicular Cyber-Physical Systems
  • Edge Computing
  • Deep Reinforcement Learning
  • Game Theory

🔥 News

  • 2023.04: My GitHub stars have reached 100!
  • 2022.10: 🎉🎉 One paper is accepted by JSA!
  • 2022.10: 🎉🎉 My Google Scholar citations have reached 100!
  • 2022.09: One invention patent is granted!
  • 2022.06: One paper is accepted by IEEE ITSC 2022!

🕒 Recent Research

DT-VEC

Enabling Digital Twin in Vehicular Edge Computing: A Multi-Agent Multi-Objective Deep Reinforcement Learning Solution
Xincao Xu, Kai Liu, Penglin Dai, and Biwen Chen

  • We present a DT-VEC architecture, where heterogeneous information can be sensed by vehicles and uploaded to the edge node via V2I communications. The DT-VEC is modeled at the edge node, forming a logical view that reflects the physical vehicular environment.
  • We model the DT-VEC by deriving an ISAC-assisted sensing model and a reliability-guaranteed uploading model.
  • We formulate the bi-objective problem of simultaneously maximizing the system quality and minimizing the system cost. In particular, we define the quality of DT-VEC by considering timeliness and consistency, and the cost of DT-VEC by considering redundancy, sensing cost, and transmission cost.
  • We propose a multi-agent multi-objective (MAMO) deep reinforcement learning solution, deployed distributively on the vehicles and the edge nodes. Specifically, a dueling critic network is proposed to evaluate the advantage of an action over the average of random actions.
  • Submitted to IEEE Transactions on Consumer Electronics (under review)
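The dueling-critic idea above can be sketched as a one-line aggregation. This is an illustrative stand-alone helper, not the paper's network code; `q_action` and `q_random_actions` (the critic's Q-estimates for the chosen action and for a batch of randomly sampled actions) are assumed names:

```python
def dueling_advantage(q_action, q_random_actions):
    """Advantage of the chosen action's Q-value over the average Q-value
    of randomly sampled actions, as a dueling critic would estimate it."""
    return q_action - sum(q_random_actions) / len(q_random_actions)
```

A positive value means the chosen action outperforms the random-action baseline.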
VCPS

Cooperative Sensing and Heterogeneous Information Fusion in VCPS: A Multi-agent Deep Reinforcement Learning Approach
Xincao Xu, Kai Liu, Penglin Dai, Ruitao Xie, and Jiangtao Luo

  • We present a VEC architecture, in which heterogeneous information can be cooperatively sensed and uploaded via V2I communications. Logical views can be constructed by fusing the heterogeneous information at edge nodes.
  • We derive a cooperative sensing model based on the multi-class M/G/1 priority queue. On this basis, we define a novel metric, AoV, by modeling the timeliness, completeness, and consistency of the logical views.
  • We formulate the problem, which aims at maximizing the quality of VCPS.
  • We propose a multi-agent DRL solution, where a difference-reward-based credit assignment is designed to divide the system reward into difference rewards for the vehicles, reflecting their individual contributions.
  • Submitted to IEEE Transactions on Intelligent Transportation Systems (under review)
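A difference reward is a counterfactual comparison: the system reward with an agent's actual action versus the same joint action with that agent's action replaced by a default. The sketch below is generic; the names `system_reward` and `default_action` are assumptions, not the repository's API:

```python
def difference_reward(system_reward, joint_actions, agent, default_action):
    """D_i = G(a) - G(a with agent i's action replaced by a default),
    isolating agent i's individual contribution to the system reward."""
    counterfactual = list(joint_actions)
    counterfactual[agent] = default_action
    return system_reward(joint_actions) - system_reward(counterfactual)
```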
JSA 2022

Joint Task Offloading and Resource Optimization in NOMA-based Vehicular Edge Computing: A Game-Theoretic DRL Approach
Xincao Xu, Kai Liu, Penglin Dai, Feiyu Jin, Hualing Ren, Choujun Zhan, and Songtao Guo

  • We present a NOMA-based VEC architecture, where heterogeneous resources of edge nodes are coordinated for real-time task processing.
  • We derive a V2I transmission model by considering both intra-edge and inter-edge interference and formulate a cooperative resource optimization (CRO) problem by jointly optimizing real-time task offloading and heterogeneous resource allocation, aiming at maximizing the service ratio.
  • We decompose the CRO problem into two subproblems, namely, task offloading and resource allocation. The first subproblem is modeled as an exact potential game (EPG) with proofs of Nash equilibrium (NE) existence and convergence, and a multi-agent D4PG algorithm is proposed to achieve the NE by adopting the potential function as the reward function. The second subproblem is divided into two independent convex optimization problems, and an optimal solution is proposed based on a gradient-based iterative method and the KKT conditions.
  • Accepted by Journal of Systems Architecture (JCR Q1)
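In an exact potential game, any player's unilateral utility change equals the change in the potential function, so improving the potential improves that player's own utility, and repeated best-response updates converge to a Nash equilibrium; this is why the potential can serve as the shared reward. A minimal generic sketch of one best-response step (not the repository's implementation):

```python
def best_response(potential, strategies, player, actions):
    """Switch `player` to the action maximizing the global potential.
    In an exact potential game this coincides with maximizing the
    player's own utility; iterating such updates reaches a NE."""
    best = max(actions,
               key=lambda a: potential(strategies[:player] + [a] + strategies[player + 1:]))
    updated = list(strategies)
    updated[player] = best
    return updated
```

Once no player's best response changes the strategy profile, the profile is a Nash equilibrium.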

📖 Publications

JCR: Journal Citation Reports by Clarivate
SCI: Journal Partition List by National Science Library, Chinese Academy of Sciences
CCF: Recommended Publications by China Computer Federation
*: Corresponding Author

Journal

Conference

Chinese Papers

  • Xincao Xu, Kai Liu*, Chunhui Liu, Hao Jiang, Songtao Guo and Weiwei Wu, Potential Game Based Channel Allocation for Vehicular Edge Computing, Tien Tzu Hsueh Pao/Acta Electronica Sinica, volume 49, issue 5, pp.851-860, July 2021. [CCF T1]
  • Xincao Xu, Yi Zhou, Kai Liu, Chaocen Xiang, Yantao Li and Songtao Guo, Potential Game based Distributed Channel Allocation in Vehicular Fog Computing Environments, 14th China Conference on Internet of Things (CWSN’20), Dunhuang, China, September 18-21, 2020. (Best Paper Candidate)


game-theoretic-deep-reinforcement-learning's People

Contributors

neardws


game-theoretic-deep-reinforcement-learning's Issues

Windows runtime environment

Hello, I'd also like to ask whether this can run on Windows. When I set up the environment from the yml file, many packages fail to install, e.g. "Could not find a version that satisfies the requirement absl (from versions:)". Is this because I am not running it under Linux?

Closed-form solution

Hello, how is the KKT optimal solution for the Computation Resource Allocation in your paper derived? It seems a closed-form solution cannot be obtained? Looking forward to your reply.

IndexError: list index out of range

Hello, this error pops up when I run experiment.py. Could you tell me why?
Traceback (most recent call last):
File "D:\hgxdm\dm\Game-Theoretic-Deep-Reinforcement-Learning-main\Game\Experiment\experiment.py", line 7, in
tf.config.experimental.set_virtual_device_configuration(gpus[0],
IndexError: list index out of range
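The traceback means `tf.config.experimental.list_physical_devices('GPU')` returned an empty list (no CUDA GPU is visible to TensorFlow), so indexing `gpus[0]` raises `IndexError`. A defensive guard, sketched here as a plain helper rather than the repository's code, avoids the crash and lets the caller fall back to CPU:

```python
def pick_gpu(gpus):
    """Return the first entry of the list produced by
    tf.config.experimental.list_physical_devices('GPU'),
    or None when no GPU is visible, instead of raising IndexError."""
    if not gpus:
        return None
    return gpus[0]
```

In experiment.py, `set_virtual_device_configuration` would then only be called when `pick_gpu(...)` is not `None`.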

About the acme and sonnet libraries

Hello! I have recently been rereading your paper and source code. As a beginner in deep learning and reinforcement learning, I find the acme framework rather difficult to use.
Could you recommend some learning materials, or share the learning path you followed?

result files

Hello, could you provide the result files in the acme directory, such as logs.csv, because I want to compare my running results with yours. Thanks a lot.

No module named "acme.types" issue

Hello, I want to know why I get the error "No module named 'acme.types'". I cannot solve it; pip also fails with:
"ERROR: Could not find a version that satisfies the requirement acme_types (from versions: none)
ERROR: No matching distribution found for acme_types". Thank you.

An error occurred while running the code

Dear authors,

I am reading your paper and it has left a deep impression on me. However, this error occurs when I try to run the code.
Please help me resolve it.
@neardws

(environment) teddy@teddy-HP-Notebook:~/Desktop/Game-Theoretic-Deep-Reinforcement-Learning-main/Experiment$ python run_d4pg.py
2023-10-12 02:51:53.432688: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libcudart.so.11.0'; dlerror: libcudart.so.11.0: cannot open shared object file: No such file or directory
2023-10-12 02:51:53.432719: I tensorflow/stream_executor/cuda/cudart_stub.cc:29] Ignore above cudart dlerror if you do not have a GPU set up on your machine.
Traceback (most recent call last):
File "/home/teddy/Desktop/Game-Theoretic-Deep-Reinforcement-Learning-main/Experiment/run_d4pg.py", line 5, in
from Agents.D4PG_new.agent import D4PG
File "/home/teddy/Desktop/Game-Theoretic-Deep-Reinforcement-Learning-main/Agents/D4PG_new/agent.py", line 24, in
from acme import datasets
File "/home/teddy/anaconda3/envs/environment/lib/python3.9/site-packages/acme/datasets/init.py", line 18, in
from acme.datasets.reverb import make_reverb_dataset
File "/home/teddy/anaconda3/envs/environment/lib/python3.9/site-packages/acme/datasets/reverb.py", line 22, in
from acme.adders import reverb as adders
File "/home/teddy/anaconda3/envs/environment/lib/python3.9/site-packages/acme/adders/reverb/init.py", line 20, in
from acme.adders.reverb.base import DEFAULT_PRIORITY_TABLE
File "/home/teddy/anaconda3/envs/environment/lib/python3.9/site-packages/acme/adders/reverb/base.py", line 28, in
import reverb
File "/home/teddy/anaconda3/envs/environment/lib/python3.9/site-packages/reverb/init.py", line 27, in
from reverb import item_selectors as selectors
File "/home/teddy/anaconda3/envs/environment/lib/python3.9/site-packages/reverb/item_selectors.py", line 19, in
from reverb import pybind
File "/home/teddy/anaconda3/envs/environment/lib/python3.9/site-packages/reverb/pybind.py", line 6, in
_load_op_library.reraise_wrapped_error(e)
File "/home/teddy/anaconda3/envs/environment/lib/python3.9/site-packages/reverb/platform/default/load_op_library.py", line 47, in reraise_wrapped_error
raise error
File "/home/teddy/anaconda3/envs/environment/lib/python3.9/site-packages/reverb/pybind.py", line 4, in
from .libpybind import *
ImportError: libpython3.9.so.1.0: cannot open shared object file: No such file or directory
(environment) teddy@teddy-HP-Notebook:~/Desktop/Game-Theoretic-Deep-Reinforcement-Learning-main/Experiment$
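The final `ImportError` means the dynamic loader cannot find `libpython3.9.so.1.0`, which ships inside the conda environment's `lib` directory. A common workaround is to expose that directory to the loader before running the script; the path below matches the environment shown in the traceback and is an assumption — adjust it to your own installation:

```shell
# Point the dynamic loader at the conda env's lib directory,
# where libpython3.9.so.1.0 lives (example path; adjust as needed).
export LD_LIBRARY_PATH="$HOME/anaconda3/envs/environment/lib:$LD_LIBRARY_PATH"
```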

A question about dynamic libraries

I ran into this problem when executing python expriment.py.
I'd like to ask how to deal with this undefined symbol in the dynamic library. I have spent a long time configuring the environment, and I have tried every method I could find online, but none of them works.

code help

Hello, I am studying the code and would like to ask a question. First the task-offloading decision is made, then communication resources are allocated, and computation can only proceed after the wired transmission completes. Suppose task k is to be offloaded to edge node e in time slot t, but the communication resource allocated to k in slot t is 0, so the upload delay is punish_time, plus the task's wired transmission delay. If node e still has computation resources to allocate, the computation delay can be calculated. However, for edge node e, the occupied_time of the resources held by this vehicle is written in the code as running from the offloading slot t until the slot when computation finishes; doesn't that overlook the task upload and offloading delays?

Request for clarification on the paper and experiments

Hello, I am a graduate student and was very fortunate to read your paper. I want to study it in depth, but I ran into some points of confusion in the paper and the experiments that I could not resolve even after consulting a lot of material, so I would like to ask you the following; many thanks in advance.
1. Is each edge node's utility function the sum of the service ratios of all edge nodes?
(screenshot)
2. How is the potential function constructed? I do not understand the meaning of the second term Ue(-Se, S-e), or how it is computed.
(screenshot)
3. In the proof of the potential function, does the last step cancel Ue(-Se, S-e) against Ue(-Se', S-e)? I do not quite see how they cancel.
(screenshot)
4. When running the D4PG algorithm from your experiments, the average_execution_times and average_service_times I obtain are very small,
(screenshot)
using the default global data, which does not match the paper, where D4PG's APT and AST look to be around 7 and 10. All the other algorithms match the values in your figures; I could not find where the problem lies.
5. Do you have any reading recommendations on potential games? I find the original paper hard going.
I hope you can offer some guidance in your spare time; I would be very grateful.

A question for the author about the MAD4PG experiments


Request for clarification on the simulation code

Hello, thank you very much for open-sourcing such excellent code for us to learn from. I work on task offloading from UAVs to fixed IoT devices and want to learn from your code and ideas to build my own simulation environment and experiments, but due to limited experience I have many questions while reading the code; I would be grateful for any explanations.

  1. What is the data structure of a vehicle's trajectories_file_name, and how is it packaged?

  2. In reaed_vehicle_trajectories, new_df = df[df['vehicle_id'] == vehicle_id] retrieves all rows for the given vehicle_id, then takes the maxima and minima of the longitude and latitude columns. Is the resulting max (min) distance computation wrong, given that the obtained (x, y) are not coordinates from the same timestamp?

  3. In reaed_vehicle_trajectories, is the edge's communication range subtracted when computing the maximum and minimum distances, and are these distances measured to the origin?

  4. What does self._vehicle_number_in_edge = int(self._vehicle_number / self._edge_number) refer to: the ratio of vehicles to edges, or the number of vehicles inside edge e at time t?

  5. Is self._vehicle_number_in_edge from question 4 the same concept as vehicle_index_within_edges in environment.py?

  6. What is the difference between environment.py and make_environment.py, and what are their roles?

Data

What is the original dataset, i.e. the pkl files packaging the task, vehicle, and edge information? If I want to modify the environment in the code, do I need to rebuild the dataset?
