This paper investigates a learning-based approach for autonomously and jointly optimizing the trajectory of an unmanned aerial vehicle (UAV), the phase shifts of reconfigurable intelligent surfaces (RISs), and the aggregation weights for federated learning (FL) in wireless communications, forming an autonomous RIS-assisted UAV-enabled network. The proposed network accounts for practical RIS reflection models and FL transmission errors in wireless communications. To optimize the RIS phase shifts, a double cascade correlation network (DCCN) is introduced. Additionally, the deep deterministic policy gradient (DDPG) algorithm is employed to solve the optimization problem of the UAV trajectory and FL aggregation weights based on the results obtained from the DCCN. Simulation results demonstrate that the proposed algorithms substantially improve FL performance in the autonomous RIS-assisted UAV-enabled network compared to the benchmarks.
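A core ingredient of the DDPG algorithm named above is the Polyak soft update that makes the target actor and critic track their online counterparts slowly. The sketch below is a generic illustration of that update, not the paper's implementation; the parameter layout and the value of tau are assumptions.

```python
import numpy as np

def soft_update(target_params, online_params, tau=0.005):
    """Polyak-average online parameters into the target network,
    as done for both the actor and critic targets in DDPG."""
    return [tau * w + (1.0 - tau) * wt
            for w, wt in zip(online_params, target_params)]

# Toy example: one weight matrix per network.
online = [np.ones((2, 2))]
target = [np.zeros((2, 2))]
target = soft_update(target, online, tau=0.1)
print(target[0][0, 0])  # 0.1
```

The slow tracking (small tau) stabilizes the bootstrapped critic targets, which is why DDPG is a common fit for continuous controls such as UAV waypoints and aggregation weights.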
In recent years, the amalgamation of satellite communications and aerial platforms into space-air-ground integrated networks (SAGINs) has emerged as an indispensable area of research for future communications, owing to the global coverage capacity of low Earth orbit (LEO) satellites and the flexible deployment of aerial platforms. This paper presents a deep reinforcement learning (DRL)-based approach for the joint optimization of offloading and resource allocation in hybrid cloud and multi-access edge computing (MEC) scenarios within SAGINs. The proposed system considers the presence of multiple satellites, clouds, and unmanned aerial vehicles (UAVs). The multiple tasks from ground users are modeled as directed acyclic graphs (DAGs). With the goal of reducing energy consumption and latency in MEC, we propose a novel multi-agent DRL-based algorithm that optimizes both the offloading strategy and the allocation of resources in the MEC infrastructure within the SAGIN. A hybrid action algorithm is utilized to address the challenge of the hybrid continuous and discrete action space in the proposed problems, and a decision-assisted DRL method is adopted to reduce the impact of unavailable actions in the training process of DRL. Extensive simulations demonstrate the efficacy of the proposed learning-based scheme: it consistently outperforms benchmark schemes, highlighting its superior performance and potential for practical applications. Index Terms—Space-air-ground integrated networks, edge computing, resource allocation, unmanned aerial vehicle, deep reinforcement learning.
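The hybrid action challenge mentioned above arises because the policy must emit both a discrete choice (e.g., which node to offload to) and continuous quantities (e.g., resource fractions). As a hedged sketch of one common way to split such an action, assuming a flat policy output vector (the split and names are illustrative assumptions, not the paper's design):

```python
import numpy as np

def split_hybrid_action(raw, n_targets):
    """Map a flat policy output to a hybrid action:
    the first n_targets logits select a discrete offloading target,
    the remainder is squashed into continuous resource fractions."""
    target = int(np.argmax(raw[:n_targets]))          # discrete part
    alloc = 1.0 / (1.0 + np.exp(-raw[n_targets:]))    # continuous part in (0, 1)
    return target, alloc

raw = np.array([0.2, 1.5, -0.3, 0.0, 2.0])  # 3 targets + 2 allocations
target, alloc = split_hybrid_action(raw, n_targets=3)
print(target)  # 1
```

Keeping both parts in one vector lets a single actor network drive the hybrid space while each part is decoded with its natural operation (argmax vs. sigmoid).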
This paper proposes a new framework for reconfigurable intelligent surface (RIS)-equipped unmanned aerial vehicles (UAVs) in free-space optical (FSO) communication. To ensure practicality, we consider the atmospheric loss caused by fog, which leads to an inhomogeneous medium for laser propagation. In addition, we incorporate the pointing error loss caused by the power fraction on the photodetector (PD) into the system and derive a closed-form expression for the elliptical beam footprint in the pointing error loss. We then propose a leading-angle-assisted particle swarm optimization (PSO) method to efficiently obtain numerical results for the pointing error loss. Furthermore, with these numerical results as a precondition, the UAV trajectory is optimized using the proximal policy optimization (PPO) method to achieve the maximum average capacity. Numerical simulations demonstrate that the proposed optimization method achieves greater efficiency and accuracy compared to the decode-and-forward (DF) relay and deep Q-network (DQN) methods.
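To make the PSO step concrete, the following is a minimal, generic particle swarm optimizer on a toy objective; it illustrates only the textbook velocity/position update, without the leading-angle assistance the paper adds, and all hyperparameter values are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def pso(f, dim=2, n_particles=20, iters=100, w=0.7, c1=1.5, c2=1.5):
    """Minimal particle swarm optimizer: each particle is pulled toward
    its personal best and the swarm's global best with random weights."""
    x = rng.uniform(-5, 5, (n_particles, dim))
    v = np.zeros_like(x)
    pbest, pbest_val = x.copy(), np.array([f(p) for p in x])
    g = pbest[np.argmin(pbest_val)]
    for _ in range(iters):
        r1, r2 = rng.random(x.shape), rng.random(x.shape)
        v = w * v + c1 * r1 * (pbest - x) + c2 * r2 * (g - x)
        x = x + v
        vals = np.array([f(p) for p in x])
        improved = vals < pbest_val
        pbest[improved], pbest_val[improved] = x[improved], vals[improved]
        g = pbest[np.argmin(pbest_val)]
    return g

# Minimize the sphere function; the optimum is the origin.
best = pso(lambda p: float(np.sum(p ** 2)))
print(np.linalg.norm(best))
```

Because PSO needs only objective evaluations, it suits losses like the pointing error integral that are available numerically but not in a differentiable closed form.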
In this paper, we study a simultaneous wireless information and power transfer (SWIPT) cooperative system, where one source forwards information to one destination with the assistance of multiple relays. Each relay is equipped with a finite data buffer and a finite energy buffer storing energy harvested from radio-frequency (RF) signals. An optimization problem is formulated for throughput maximization of the SWIPT cooperative system, taking into consideration the strict delay constraint, dynamic channel conditions, time-varying discrete data buffer states, and time-varying continuous energy buffer states. A discrete-time Markov decision process (MDP) is adopted to model the relay selection process with reference to the data buffer and energy buffer states. Two deep Q-network (DQN)-based methods, named invalid action penalty (IAP) and invalid action mask (IAM), are proposed. The simulation results show that the proposed IAM method achieves better convergence and throughput performance than the IAP method.
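The core idea behind an invalid action mask can be sketched generically: before greedy selection, the Q-values of actions that are currently infeasible (e.g., a relay whose data buffer is empty) are set to negative infinity so they can never be chosen. This is an illustrative sketch of the masking principle, not the paper's IAM implementation; the scenario in the comment is an assumption.

```python
import numpy as np

def masked_greedy_action(q_values, valid_mask):
    """Invalid action mask (IAM): overwrite the Q-values of invalid
    actions with -inf so greedy selection can never pick them."""
    masked = np.where(valid_mask, q_values, -np.inf)
    return int(np.argmax(masked))

q = np.array([2.0, 5.0, 1.0])
valid = np.array([True, False, True])  # e.g. relay 1 currently infeasible
print(masked_greedy_action(q, valid))  # 0
```

Compared with penalizing invalid picks after the fact (the IAP idea), masking removes them from the choice set entirely, which is consistent with the reported convergence advantage.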
This letter investigates a machine learning approach for the joint optimization of phase shifts and beamforming in a reconfigurable intelligent surface (RIS)-assisted multiple-input multiple-output (MIMO) network, consisting of one source node, one RIS panel, and one destination node. If the individual source-to-RIS and RIS-to-destination channels are known, the joint optimization is similar to that in the traditional MIMO network, which has been well studied. However, the channel estimation for the individual channels is complicated and often inaccurate. On the other hand, while estimating the cascaded channels for the source-RIS-destination links is more accessible, the corresponding joint optimization is complicated. In this letter, we propose a novel double deep learning network model that outperforms conventional reinforcement learning in the RIS joint optimization. Numerical simulations are given to verify the proposed algorithm.
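The distinction between individual and cascaded channels can be made explicit with the standard RIS signal model: the effective channel through the surface is a phase-weighted sum of per-element cascaded components, which is exactly what cascaded estimation recovers without separating the two hops. A minimal numerical check of this identity, with randomly drawn channels (dimensions are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(1)
n_t, n_r, n_ris = 2, 2, 8  # source/destination antennas, RIS elements

H = rng.standard_normal((n_ris, n_t)) + 1j * rng.standard_normal((n_ris, n_t))  # source-to-RIS
G = rng.standard_normal((n_r, n_ris)) + 1j * rng.standard_normal((n_r, n_ris))  # RIS-to-destination
theta = np.exp(1j * rng.uniform(0, 2 * np.pi, n_ris))  # unit-modulus phase shifts

# Effective channel via the individual links ...
H_eff = G @ np.diag(theta) @ H
# ... equals a phase-weighted sum of per-element cascaded channels,
# so only the outer products G[:, k] H[k, :] need to be estimated.
H_casc = sum(theta[k] * np.outer(G[:, k], H[k, :]) for k in range(n_ris))
print(np.allclose(H_eff, H_casc))  # True
```

This is why optimizing directly over the cascaded components is attractive despite making the joint phase/beamforming problem harder.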
This article proposes a novel machine-learning-based routing optimization for multiple reconfigurable intelligent surfaces (M-RIS)-assisted multihop cooperative networks, in which a practical phase model for the reconfigurable intelligent surface (RIS), with the amplitude variation depending on the corresponding discrete phase shift, is considered. We aim to maximize the end-to-end data rate in the proposed network by jointly optimizing the data transmission path, the passive beamforming design of the RIS, and the transmit power allocation. To tackle this complicated nonconvex problem, we divide it into two subtasks: 1) the passive beamforming design of the RIS and 2) joint routing and power allocation optimization. First, for the passive beamforming design of the RIS, we develop a distributed learning algorithm that employs a cascade forward backpropagation network in each relay node to solve the RIS coefficient optimization problem by directly using the optimization target to train the cascade networks. This solution avoids the curse of dimensionality that traditional reinforcement learning algorithms face in the RIS optimization problem. Then, based on the result of the RIS optimization, we introduce the proximal policy optimization (PPO) algorithm with the clipping method to find solutions for the joint optimization of routing and power allocation by achieving the long-term benefit in the Markov decision process (MDP). Simulation results show that the proposed learning-based scheme can learn from the environment to improve its policy stability and efficiency in the iterative training process for optimizing routing and the RIS, and significantly outperforms the benchmark schemes.
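The clipping method referenced for PPO is its clipped surrogate objective, which takes the pessimistic minimum of the unclipped and clipped policy-ratio terms so that a single update cannot move the policy too far. A minimal, generic sketch of that objective (the epsilon value is an assumption; this is the standard PPO form, not the paper's full training loop):

```python
import numpy as np

def ppo_clip_objective(ratio, advantage, eps=0.2):
    """PPO clipped surrogate: min of the unclipped and clipped
    policy-ratio terms, each scaled by the advantage."""
    return np.minimum(ratio * advantage,
                      np.clip(ratio, 1 - eps, 1 + eps) * advantage)

# A large ratio with positive advantage is clipped to (1 + eps) * A.
print(ppo_clip_objective(np.array([1.5]), np.array([2.0]))[0])  # 2.4
```

The clip keeps policy updates conservative, which matches the stability the abstract reports during iterative training.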
This paper investigates a deep learning-based algorithm to optimize the unmanned aerial vehicle (UAV) trajectory and reconfigurable intelligent surface (RIS) reflection coefficients in UAV-RIS-aided cell-free (CF) hybrid non-orthogonal multiple-access (NOMA)/orthogonal multiple-access (OMA) networks. A practical RIS reflection model and user grouping optimization are considered in the proposed network. A double cascade correlation network (DCCN) is proposed to optimize the RIS reflection coefficients, and based on the results from the DCCN, an inverse-variance deep reinforcement learning (IV-DRL) algorithm is introduced to address the UAV trajectory optimization problem. Simulation results show that the proposed algorithms significantly improve the performance of UAV-RIS-assisted CF networks.
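As background for the inverse-variance idea in IV-DRL, the generic inverse-variance weighting rule assigns each estimate a weight proportional to the reciprocal of its variance, so noisier contributions count less. This sketch shows only that generic rule; how the paper applies it inside DRL is not specified here, and the example values are assumptions.

```python
import numpy as np

def inverse_variance_weights(variances):
    """Weight each estimate by 1/variance, normalized to sum to one,
    so less noisy contributions dominate the combination."""
    inv = 1.0 / np.asarray(variances, dtype=float)
    return inv / inv.sum()

w = inverse_variance_weights([1.0, 4.0])
print(w)  # [0.8 0.2]
```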