Delay-optimal dynamic mode selection and resource allocation in device-to-device communications - part I: optimal policy

Lei, Lei, Kuang, Yiru, Cheng, Nan, Shen, Xuemin, Zhong, Zhangdui, and Lin, Chuang (2016) Delay-optimal dynamic mode selection and resource allocation in device-to-device communications - part I: optimal policy. IEEE Transactions on Vehicular Technology, 65 (5). pp. 3474-3490.

PDF (Published Version) - Published Version
Restricted to Repository staff only

DOI: 10.1109/TVT.2015.2444795

View at Publisher Website: https://doi.org/10.1109/TVT.2015.2444795

Abstract

In this paper (Part I and Part II), we investigate the optimal dynamic mode selection and resource allocation to minimize the average end-to-end delay under a dropping probability constraint for an orthogonal frequency-division multiple-access (OFDMA) cellular network with device-to-device (D2D) communications. Different from the previous studies, which mostly focus on an infinite-backlog traffic model, we consider dynamic data arrival with nonsaturated buffers and formulate the resource control problem in D2D communications into an infinite-horizon average-reward constrained Markov decision process (CMDP) in Part I. The CMDP characterizes the dynamic interference between D2D links and cellular links based on their varying backlogged states, the dynamic route selection, and the coupled interactions between uplink and downlink resource allocations. We propose the general form of the optimal policy. In particular, it is proved that the optimal delay, respective of all feasible randomized policies, is attained by either a deterministic policy or a simple mixed policy, which randomizes between two deterministic policies. Therefore, the determination of an optimal randomized policy essentially becomes the determination of one or two deterministic policies, which can be obtained by an equivalent Bellman's equation with reduced state space. Simulation results show that the optimal policy based on the CMDP model outperforms the conventional channel-state-information-only scheme and the throughput-optimal scheme in stability sense.


Item ID:	53199
Item Type:	Article (Research - C1)
ISSN:	1939-9359
Keywords:	device-to-device (D2D) communication; Markov decision process (MDP); mode selection; resource allocation
Date Deposited:	19 Jun 2018 02:06
FoR Codes:	40 ENGINEERING > 4006 Communications engineering > 400608 Wireless communication systems and technologies (incl. microwave and millimetrewave) @ 100%
SEO Codes:	89 INFORMATION AND COMMUNICATION SERVICES > 8901 Communication Networks and Services > 890103 Mobile Data Networks and Services @ 100%
Downloads:	Total: 1
	More Statistics

Actions (Repository Staff Only)

Item Control Page