A comprehensive review of intelligent end-to-end networking solutions through the integration of graph neural networks and deep reinforcement learning

Muhammad Kamran^1,2,3†, Salwa Muhammad Akhtar^4†, Muhammad Zain ul Abideen⁵, Junaid Asghar⁶, Muhammad Farman^1,7∗, Aseel Smerat^8,9, Mohamad Hafez^2,10

Show Less

¹ Mathematics Research Center, Department of Mathematics Near East University, Mersin, Turkey

² Department of Mathematics, Faculty of Engineering and Quantity Surviving, INTI International University Colleges, Nilai, Negeri Sembilan, Malaysia

³ International Center for Interdisciplinary Research in Sciences, The University of Lahore, Lahore, Pakistan

⁴ Department of Information Systems, University of Management and Technology, Lahore, Punjab, Pakistan

⁵ Department of Mechanical Engineering, Faculty of Engineering, University of Central Punjab, Lahore, Punjab, Pakistan

⁶ Department of Computer Science Information Technology, Faculty of Information Technology, University of Lahore, Lahore, Punjab, Pakistan

⁷ Research Center of Applied Mathematics, Khazar University, Baku, Azerbaijan

⁸ Department of Mathematics, Faculty of Educational Sciences, Al-Ahliyya Amman University, Amman, Jordan

⁹ Department of Biosciences, Saveetha School of Engineering, Saveetha Institute of Medical and Technical Sciences, Chennai, Tamil Nadu, India

¹⁰ Department of Management, Faculty of Management, Shinawatra University, Sam Khok, Pathum Thani, Thailand

†These authors contributed equally to this work.

IJOCTA 2026, 16(3), 794–815; https://doi.org/10.36922/IJOCTA025500230

Received: 14 December 2025 | Revised: 19 January 2026 | Accepted: 26 January 2026 | Published online: 30 April 2026

© 2026 by the Author(s). This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution -Noncommercial 4.0 International License (CC-by the license) ( https://creativecommons.org/licenses/by-nc/4.0/ )

Download PDF

XML

Cite

Abstract

Topology awareness and scalable, adaptive network control have become critical with the development of 5G/6G, the Internet-of-Things, vehicular networks, and edge computing. Traditional rule-based and centralized networking models are unable to support dynamic topologies, heterogeneous traffic models, and demands with strict quality-of-service requirements. Structural and topological dependencies are encoded using graph neural networks (GNNs) and combined with deep reinforcement learning (DRL) to make decisions sequentially. Exploiting rewards is an avenue toward intelligent end-to-end network optimization. This review is a systematic examination of modern GNN–DRL models implemented in routing, congestion control, chaining of service functions, vehicular communication, and the optimization of optical networks. It also highlights their performance strengths, including topology awareness, cross-topology generalization, high sample efficiency, and high scalability, as well as their weaknesses, such as inference overhead, inconsistent benchmarking practices, low real-time deployability, and sensitivity to noisy or partial state observations. The main findings of this review are: (i) a coherent taxonomy of GNN-based, DRL-based, and hybrid GNN–DRL effective designs; (ii) comparative analysis of algorithms, architecture components, and learning pipelines; (iii) generalized performance trends in major areas of intelligent networking; and (iv) a collection of grounded research directions to be followed in the future, lightweight architecture, transfer learning pipeline, fault tolerant learning, and unified evaluation frameworks. Finally, this review focuses on enabling resilient infrastructure through intelligent, scalable, and autonomous end-to-end networking solutions.

Keywords

Graph neural networks

Deep reinforcement learning

Intelligent networking

End-to-end network optimization

Autonomous network management

Resilient infrastructure

Funding

This work was funded by the Faculty of Engineering and Quantity Surveying, INTI International University Colleges, Nilai, Negeri Sembilan, Malaysia.

Conflict of interest

The authors declare they have no competing interests.

References

Ksentini A, Nikaein N. Toward enforcing network slicing on RAN: flexibility and resource abstraction. IEEE Commun Mag. 2017;55(6):102-108. https://www.doi.org/10.1109/MCOM.2017. 1601119

Giordani M, Polese M, Mezzavilla M, Rangan S, Zorzi M. Toward 6G networks: use cases and technologies. IEEE Commun Mag. 2020;58(3):55-61. https://www.doi.org/10.1109/MCOM.001.1900411

Chen Z, Ma X, Zhang C, Wen Z, Li L. Tera-hertz wireless communications for 2030 and beyond: a cutting-edge frontier. IEEE Commun Mag. 2021;59(11):66-72. https://www.doi.org/10.1109/MCOM.011.2100195

Gong S, Lu X, Hoang DT, et al. Toward smart wireless communications via intelligent reflecting surfaces: a contemporary survey. IEEE Commun Surv Tutor. 2020;22(4):2283-2314. https://www.doi.org/10.1109/COMST.2020.3004197

Gkarmpounis G, Vranis C, Vretos N, Daras P. Survey on graph neural networks. IEEE Access. 2024;12:128816-128832. https://www.doi.org/10.1109/ACCESS.2024.3456913

Akyildiz IF, Kak A, Nie S. 6G and beyond: the future of wireless communications systems. IEEE Access. 2020;8:133995-134030. https://www.doi.org/10.1109/ACCESS.2020.3010896

Tam P, Ros S, Song I, Kang S, Kim S. A sur- vey of intelligent end-to-end networking solutions: integrating graph neural networks and deep reinforcement learning approaches. Electronics. 2024;13(5):994. https://www.doi.org/10.3390/electronics13050994

Li X, Chen M, Liu Y, Wang L. Federated multi- agent deep reinforcement learning for resource allocation of vehicle-to-vehicle communications. IEEE Trans Veh Technol. 2022;71(8):8810-8824. https://www.doi.org/10.1109/TVT.2022.3173057

Chen H, Wang J, Li D, Zhang Z. A tutorial on terahertz-band localization for 6G communication systems. IEEE Commun Surv Tutor. 2022;24(3):1780-1815. https://www.doi.org/10.1109/COMST.2022.3178209

Yang X, Chen J, Wang H, Liu X. A survey on smart agriculture: development modes, technologies, and security and privacy challenges. IEEE/CAA J Autom Sinica. 2021;8(2):273-302. https://www.doi.org/10.1109/JAS.2020.1003536

Z˙ arski M, Wysocki T, Kulesza J. Computer vision- based inspection on post-earthquake with UAV synthetic dataset. IEEE Access. 2022;10:108134- 108144. https://www.doi.org/10.1109/ACCESS.2022.3212918

Alencar D, Barreto R, Santos A. Dynamic microservice allocation for virtual reality distribution with QoE support. IEEE Trans Netw Serv Manag. 2022;19(1):729-740. https://www.doi.org/10.1109/TNSM.2021.3076922

Xiao L, Zhang Y, Li H. A segmented variable- parameter ZNN for dynamic quadratic minimization with improved convergence and robustness. IEEE Trans Neural Netw Learn Syst. 2023;34(5):2413-2424. https://www.doi.org/10.1109/TNNLS.2021.3106640

Salam MA, Azar AT, Hussien R. Swarm-based extreme learning machine models for global optimization. Comput Mater Contin. 2022;70(3). https://www.doi.org/10.32604/cmc.2022.020583

Ding Z, Feng B, Jiang C. COIN: a container workload prediction model focusing on common and individual changes in workloads. IEEE Trans Parallel Distrib Syst. 2022;33(12):4738-4751. https://www.doi.org/10.1109/TPDS.2022.3202833

Naderializadeh N, Avestimehr AS, Jafar SA. Re- source management in wireless networks via multi- agent deep reinforcement learning. IEEE Trans Wireless Commun. 2021;20(6):3507-3523. https://www.doi.org/10.1109/TWC.2021.3051163

Tam P, Song I, Kang S, Ros S, Kim S. Graph neural networks for intelligent modelling in network management and orchestration: a survey on communications. Electronics. 2022;11(20):3371. https://www.doi.org/10.3390/electronics11203371

Hu W, Chen J, Zhang S. Graph signal processing for geometric data and beyond: theory and applications. IEEE Trans Multimed. 2022;24:3961- 3977. https://www.doi.org/10.1109/TMM.2021.3111440

He Q, Liu L, Zhang J. Routing optimization with deep reinforcement learning in knowledge- defined networking. IEEE Trans Mob Comput. 2024;23(2):1444-1455. https://www.doi.org/10.1109/TMC.2023.3235446

Li M, Li H. Application of deep neural network and deep reinforcement learning in wireless communication. PLoS One. 2020;15(7):e0235447. https://www.doi.org/10.1371/journal.pone. 0235447

Casas-Velasco DM, Rendon OMC, da Fonseca NLS. Intelligent routing based on reinforcement learning for software-defined networking. IEEE Trans Netw Serv Manag. 2021;18(1):870-881. https://www.doi.org/10.1109/TNSM.2020. 3036911

Zhao Z, Wang X, Liu Y. A transmission-reliable topology control framework based on deep reinforcement learning for UWSNs. IEEE Internet Things J. 2023;10(15):13317-13332. https://www.doi.org/10.1109/JIOT.2023.3262690

Sua´rez-Varela J, Ferriol-Galm´es M, Lo´pez A, et al. The graph neural networking challenge: a world- wide competition for education in AI/ML for net- works. ACM SIGCOMM Comput Commun Rev. 2021;51(3):9-16. https://www.doi.org/10.1145/3477482.3477485

Wu Z, Pan S, Chen F, et al. A comprehensive survey on graph neural networks. IEEE Trans Neural Netw Learn Syst. 2021;32(1):4-24. https://www.doi.org/10.1109/TNNLS.2020.2978386

Ji M, Zhang Y, Li X. Graph neural networks and deep reinforcement learning based resource allocation for V2X communications. IEEE Internet Things J. 2024;12(4):3613-3628. https://www.doi.org/10.1109/JIOT.2024.3469547

Khemani B, Patil S, Kotecha K, Tanwar S. A review of graph neural networks: concepts, architectures, techniques, challenges, datasets, ap- plications, and future directions. J Big Data. 2024;11(1):18. https://www.doi.org/10.1186/s40537-023-00876- 4

Munikoti S, Nunes I, Rao A. Challenges and opportunities in deep reinforcement learning with graph neural networks: a comprehensive review of algorithms and applications. IEEE Trans Neural Netw Learn Syst. 2024;35(11):15051-15071. https://www.doi.org/10.1109/TNNLS.2023. 3283523

Jian C, Wang Y, Zhang L. Online-learning task scheduling with GNN-RL scheduler in collaborative edge computing. Cluster Comput. 2024;27(1):589-605. https://www.doi.org/10.1007/s10586-022-03957- w

Lai Y, Liu H, Zhang Q. Toward adversarially robust recommendation from adaptive fraudster detection. IEEE Trans Inf Forensics Secur. 2024;19:907-919. https://www.doi.org/10.1109/TIFS.2023.3327876

Han J, Cen J, Wu L, et al. A survey of geo- metric graph neural networks: data structures, models and applications. Front Comput Sci. 2025;19(11):1911375. https://www.doi.org/10.1007/s11704-025-41426- w

Idris NF, Ismail MA, Kasim S, et al. A review of feature selection methods on diabetes mellitus classification. Int J Adv Sci Eng Inf Technol. 2025;15(3):686-692. https://www.doi.org/10.18517/ijaseit.15.3.12652

Yogeesh N, Mohammad SI, Raja N, et al. From crisp to fuzzy: a comparative review of statistical and fuzzy approaches to problem solving. Appl Math Inf Sci. 2025;19(3):647-658. https://www.doi.org/10.18576/amis/190313

Al-Daoud KI, Yogeesh N, Mohammad SI, et al. Explainability in AI using fuzzy inference systems for the regression problem. Appl Math Inf Sci. 2025;19(5):973-987. https://www.doi.org/10.18576/amis/190501

Previous article in this issue

Next article in this issue

An International Journal of Optimization and Control: Theories & Applications, Electronic ISSN: 2146-5703 Print ISSN: 2146-0957, Published by AccScience Publishing