Funding information: National Research Foundation of Korea, NRF-2017R1A2B1009709; Institute for Information & Communications Technology Promotion, 2016-0-00160

Summary

Reinforcement learning (RL) is used in a wide range of real-world applications, and typical applications rely on a single agent. However, most practical tasks require multiple agents that cooperate in the control process. Multi-agent RL is complicated to design, and many design alternatives must be weighed before it becomes practically useful. We propose two RL implementations for a Message Queuing Telemetry Transport (MQTT) protocol system, both intended to improve the communication efficiency of MQTT: (i) a single-broker-agent implementation and (ii) a multiple-publisher-agents implementation. Each implementation handles messages of different priorities in a dynamic environment. The proposed implementations improve communication efficiency by adjusting the loop cycle time of the broker or by learning the message importance. The proposed MQTT control scheme improves the battery efficiency of Internet-of-Things (IoT) devices that have limited battery power.
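As a rough illustration of how an RL agent could adjust a broker's loop cycle time, the following is a minimal tabular Q-learning sketch in Python. It is not the authors' implementation: the candidate cycle times, the three traffic-load states, the urgency weights, the reward function, and the deterministic load cycle are all hypothetical choices made for this example, and no real MQTT broker or client library is involved.

```python
import random

# All constants below are hypothetical and chosen only for illustration.
CYCLE_TIMES = [0.2, 1.0, 5.0]   # candidate broker loop cycle times (seconds)
URGENCY = [0.0, 1.0, 10.0]      # delay penalty weight per load state (low, medium, high)
N_STATES = len(URGENCY)
N_ACTIONS = len(CYCLE_TIMES)


def reward(load, cycle_time):
    """Assumed reward: short cycles cost energy, long cycles delay urgent messages."""
    energy_cost = 1.0 / cycle_time          # more wake-ups when the cycle is short
    delay_cost = URGENCY[load] * cycle_time  # urgent traffic suffers from long cycles
    return -(energy_cost + delay_cost)


def train(steps=6000, alpha=0.5, gamma=0.5, epsilon=0.2, seed=0):
    """Tabular Q-learning over (load state, cycle-time action) pairs."""
    rng = random.Random(seed)
    q = [[0.0] * N_ACTIONS for _ in range(N_STATES)]
    state = 0
    for _ in range(steps):
        # Epsilon-greedy action selection.
        if rng.random() < epsilon:
            action = rng.randrange(N_ACTIONS)
        else:
            action = max(range(N_ACTIONS), key=lambda a: q[state][a])
        r = reward(state, CYCLE_TIMES[action])
        # Toy environment: the load cycles low -> medium -> high -> low,
        # independently of the chosen action.
        next_state = (state + 1) % N_STATES
        # Standard Q-learning update toward the one-step TD target.
        td_target = r + gamma * max(q[next_state])
        q[state][action] += alpha * (td_target - q[state][action])
        state = next_state
    return q


def greedy_policy(q):
    """Return the preferred cycle time for each load state."""
    return [CYCLE_TIMES[max(range(N_ACTIONS), key=lambda a: row[a])] for row in q]


if __name__ == "__main__":
    print(greedy_policy(train()))
```

Under these assumed costs, the learned policy favors long cycles when traffic is light, which saves energy, and short cycles when urgent messages dominate. A real deployment would need a state and reward derived from measured queue and battery data.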

Volume 33, Issue 15

Special Issue: 13th IEEE International Conference on Networking, Architecture, and Storage (NAS2018). Cognitive Computing for Intelligence Web Systems (ISAIR2018)

10 August 2021

e5727

Context-aware pub/sub control method using reinforcement learning
