International Journal of Scientific & Technology Research

Home About Us Scope Editorial Board Blog/Latest News Contact Us
10th percentile
Powered by  Scopus
Scopus coverage:
Nov 2018 to May 2020


IJSTR >> Volume 9 - Issue 10, October 2020 Edition

International Journal of Scientific & Technology Research  
International Journal of Scientific & Technology Research

Website: http://www.ijstr.org

ISSN 2277-8616

The Era Of Big Data: A Thorough Inspection In The Building Blocks Of Future Generation Data Management

[Full Text]



Zeinab Lashkaripour



Big Data (BD), Big Data Analytic (BDA), Cloud Computing (CC), future generation, Internet of Things (IoT), Machine Learning (ML), storage infrastructure, technology.



Data as one of the main assets in any organization, is generated at a constantly increasing pace from various sources of network devices such as smart appliances and embedded sensors. This high pace in device expansion and data generation indicates the dawn of Big Data (BD) era. Thus, this paper is aimed at providing an extensive knowledge on this ever increasing pool of data. Accordingly, a variety of events leading to BD and definitions given for it through the years are demonstrated and analyzed based on different factors. Furthermore, the infrastructures and architectures for storing, processing, manipulating, and analyzing such large-scale scheme-free datasets are compared with respect to criteria such as usage, performance, flexibility, scalability, and complexity. Moreover, for better understanding of BD, the related technologies named Cloud Computing (CC) and Internet of Things (IoT) and the broad sources of data generation are also presented. Finally, the challenges that rise beside all the gains are discussed and to conclude, a novel summarize of the issues in CC, IoT, and BD is also given. This paper would be of great value to those who seek to study, research, and work in this scientific field and demand a full dimensional perspective.



[1] EMC Education Services., Data Science and Big Data Analytics: Discovering, Analyzing, Visualizing and Presenting Data, Indiana: John Wiley & Sons, pp. 3-9, 2015.
[2] Z.D. Stephens, S.Y. Lee, F. Faghri, R.H. Campbell, C. Zhai, M.J. Efron, R. Iyer, M.C. Schatz, S. Sinha, and G.E. Robinson, “Big Data: Astronomical or Genomical?,” PLoS biology, vol. 13, no. 7, p.e1002195, 2015.
[3] P. Zikopoulos, D. Deroos, K. Parasuraman, T. Deutsch, J. Giles, and D. Corrigan, Harness the Power of Big Data The IBM Big Data Platform, McGraw Hill Professional, pp. 5-14, 2012.
[4] W.N. Price and I.G. Cohen, “Privacy in the Age of Medical Big Data,” Nature medicine, vol. 25, no. 1, p. 37, 2019.
[5] V. Marx, “Biology: The Big Challenges of Big Data,” Nature, vol. 498, no. 7453, pp. 255-260, 2013.
[6] M. Chen, S. Mao, and Y. Liu, “Big Data: A Survey,” Mobile Networks and Applications, vol. 19, no. 2, pp. 171-209, 2014.
[7] D. Zhang, October. “Big Data Security and Privacy Protection,” In 8th International Conference on Management and Computer Science (ICMCS 2018). Atlantis Press, 2018
[8] J. Dean, Big Data, Data Mining, and Machine Learning: Value Creation for Business Leaders and Practitioners, New Jersey: John Wiley& Sons, pp. 5-8, 2014.
[9] R.L. Villars, C.W. Olofson, and M. Eastwood, “Big Data: What it is and Why You Should Care,” White Paper, MA, USA, IDC, 2011.
[10] A. L’heureux, K. Grolinger, H.F. Elyamany, and M.A. Capretz, “Machine Learning With Big Data: Challenges and Approaches,” IEEE Access, vol. 5, pp. 7776-7797, 2017.
[11] M. Mohammadi, A. Al-Fuqaha, S. Sorour, and M. Guizani, “Deep Learning For IoT Big Data and Streaming Analytics: A survey,” IEEE Communications Surveys & Tutorials, vol. 20, no. 4, pp.2923-2960, 2018.
[12] Intel IT center, “Peer Research on Big Data Analysis,” https://www.intel.com/content/dam/www/public/us/en/documents/reports/data-insights-peer-research-report.pdf. 2012.
[13] N. Mehta and A. Pandit, “Concurrence of Big Data Analytics and Healthcare: A Systematic Review,” International journal of medical informatics, vol. 114, pp.57-65, 2018.
[14] D. Cirillo and A. Valencia, “Big Data Analytics For Personalized Medicine,” Current opinion in biotechnology, vol. 58, pp.161-167, 2019.
[15] C. Dremel, M.M. Herterich, J. Wulf, and J. Vom Brocke, “Actualizing big data analytics affordances: A revelatory case study,” Information & Management, vol. 57, no. 1, pp. 103121, 2020.
[16] Q. Song, H. Ge, J. Caverlee, and X. Hu, “Tensor Completion Algorithms in Big Data Analytics,” ACM Transactions on Knowledge Discovery from Data (TKDD), vol. 13, no. 1, p. 6, 2019.
[17] L. Atzori, A. Iera, and G. Morabito, “The Internet of Things: A Survey,” Computer networks, vol. 54, no. 15: pp. 2787-2805, 2010.
[18] Statista Research Department, “Numbers of LinkedIn Members From 1st Quarter 2009 to 3rd Quarter 2016 (in Millions),” https://www.statista.com/statistics/274050/quarterly-numbers-of-linkedin-members/. 2017, (accessed 24 Jan 2020).
[19] Statista Research Department, “Most Popular Social Networks Worldwide as of October 2019, Ranked by Number of Active Users (in Millions),” https://www.statista.com/statistics/272014/global-social-networks-ranked-by-number-of-users/. 2019, (accessed 24 Jan 2020).
[20] Gartner, “Gartner Says 4.9 Billion Connected "Things" Will Be in Use in 2015,” https://www.gartner.com/en/newsroom/press-releases/2014-11-11-gartner-says-nearly-5-billion-connected-things-will-be-in-use-in-2015. 2014, (accessed 24 Jan 2020).
[21] Statista Research Department, “Size of the Bitcoin Blockchain From 2010 to 2019, by Quarter (in Megabytes),” https://www.statista.com/statistics/647523/worldwide-bitcoin-blockchain-size/. 2019, (accessed 28 Jan 2020).
[22] World Economic Forum, “The Future of Jobs Report 2018,” World Economic Forum, Geneva, Switzerland, 2018.
[23] M. James, M. Chui, B. Brown, J. Bughin, R. Dobbs, C. Roxburgh, and A.H. Byers, “Big Data: The Next Frontier For Innovation, Competition, and Productivity,” 2011.
[24] W.L. Chang and N. Grady, “NIST Big Data Interoperability Framework,” vol. 1, Big Data Definitions(No. Special Publication (NIST SP)-1500-1), 2015.
[25] J.S. Ward, and A. Barker, “Undefined by Data: A Survey of Big Data Definitions,” arXiv preprint arXiv:1309.5821, 2013.
[26] P. Goswami and S. Madan, “A Survey on Big Data & Privacy Preserving Publishing Techniques,” Advances in Computational Sciences and Technology, vol. 10, no. 3, pp. 395-408, 2017.
[27] J.J. Seddon, and W.L. Currie, “A Model For Unpacking Big Data Analytics in High-Frequency Trading,” Journal of Business Research, vol. 70, pp. 300-307, 2017.
[28] P. Mikalef, I.O. Pappas, J. Krogstie and M. Giannakos, “Big Data Analytics Capabilities: A Systematic Literature Review and Research Agenda,” Information Systems and e-Business Management, vol. 16, no. 3, pp. 5 47-578, 2018.
[29] E. Brewer, “CAP Twelve Years Later: How the ‘Rules’ Have Changed,” Computer, vol. 2, pp. 23-29, 2012.
[30] P. Murthy, A. Bharadwaj, P.A. Subrahmanyam, A. Roy, and S. Rajan, “Big Data Taxonomy,” Big Data Working Group, Cloud Security Alliance, 2014.
[31] F. Chang, J. Dean, S. Ghemawat, W.C. Hsieh, D.A. Wallach, M Burrows, T. Chandra, A. Fikes, and R.E. Gruber, “Bigtable: A Distributed Storage System For Structured Data,” ACM Transactions on Computer Systems (TOCS), vol. 26, no. 2, pp. 1-26, 2008.
[32] M. Stonebraker, S., Madden, D.J. Abadi, S. Harizopoulos, N. Hachem, and P. Helland, September. “The End of an Architectural Era: (It's Time For a Complete Rewrite),” In Proceedings of the 33rd international conference on Very large data bases, pp. 1150-1160, VLDB Endowment, 2007.
[33] A. Pavlo and M. Aslett, “What's Really New With NewSQL?,” ACM Sigmod Record, vol. 45, no. 2, pp. 45-55, 2016.
[34] B. Scofield, “NoSQL–Death to Relational Databases,”CodeMash Presentation, January, 2010.
[35] P. Mell and T. Grance, “The NIST Definition of Cloud Computing,” 2011.
[36] B. Grobauer, T. Walloschek, and E. Stocker, “Understanding cloud computing vulnerabilities,” IEEE Security & privacy, vol. 9, no. 2, pp. 50-57, 2010.
[37] Q. Zhang, L. Cheng, and R. Boutaba, “Cloud Computing: State-of-the-Art and Research Challenges,” Journal of internet services and applications, vol. 1, no. 1, pp. 7-18, 2010.
[38] R.L. Grossman, “The Case For Cloud Computing,” IT professional, vol. 11, no. 2, pp. 23-27, 2009.
[39] Z. Allam and Z.A. Dhunny, “On Big Data, Artificial Intelligence and Smart Cities,” Cities, vol. 89, pp. 80-91, 2019.
[40] S.K. Lakshmanaprabu, K. Shankar, M. Ilayaraja, A.W. Nasir, V. Vijayakumar, and N. Chilamkurti, “Random Forest For Big Data Classification in the Internet of Things Using Optimal Features,” International Journal of Machine Learning and Cybernetics, pp. 1-10, 2019.
[41] E.D. Feigelson and G.J. Babu, “Big Data in Astronomy,” Significance, vol. 9, no. 4, pp. 22-25, 2012.
[42] J. Winter, “Algorithmic Discrimination: Big Data Analytics and the Future of the Internet,” In The future internet, (pp. 125-140). Springer, Cham, 2015.
[43] H.D. Ma, “Internet of Things: Objectives and Scientific Challenges,” Journal of Computer science and Technology, vol. 26, no. 6, pp. 919-924, 2011.
[44] R.A.A. Habeeb, F. Nasaruddin, A. Gani, I.A.T. Hashem, E. Ahmed, and M. Imran, “Real-time Big Data Processing For Anomaly Detection: A Survey,” International Journal of Information Management, vol. 45, pp. 289-307, 2019.
[45] I.A.T. Hashem, I. Yaqoob, N.B. Anuar, S. Mokhtar, A. Gani, and S.U. Khan, “The Rise of “Big Data” on Cloud Computing: Review and Open Research Issues,” Information Systems, vol. 47, pp. 98-115, 2015.
[46] C. Dobre, and F. Xhafa, “Intelligent sServices For Big Data Science,” Future Generation Computer Systems, vol. 37, pp. 267-281, 2014.
[47] S. Wolfert, L. Ge, C. Verdouw, and M.J. Bogaardt, “Big Data in Smart Farming–A Review,” Agricultural Systems, vol. 153, pp. 69-80, 2017.
[48] J. Fan, F. Han, and H. Liu, “Challenges of Big Data Analysis,” National science review, vol. 1, no. 2, pp. 293-314, 2014.
[49] R. Iqbal, F. Doctor, B. More, S. Mahmud, and U. Yousuf, “Big Data Analytics: Computational Intelligence Techniques and Application Areas,” Technological Forecasting and Social Change, p. 119253, 2018.
[50] X. Jin, B.W. Wah, X. Cheng, and Y. Wang, “Significance and Challenges of Big Data Research,” Big Data Research, vol. 2, no. 2, pp. 59-64, 2015.
[51] I. Lee, “Big Data: Dimensions, Evolution, Impacts, and Challenges,” Business Horizons, vol. 60, no. 3, pp. 293-303, 2017.
[52] H.V. Jagadish, J. Gehrke, A. Labrinidis, Y. Papakonstantinou, J.M. Patel, R. Ramakrishnan, and C. Shahabi, “Big Data and its Technical Challenges,” Communications of the ACM, vol. 57, no. 7, pp. 86-94, 2014.
[53] C.P. Chen and C.Y. Zhang, “Data-Intensive Applications, Challenges, Techniques and Technologies: A Survey on Big Data,” Information sciences, vol. 275, pp. 314-347, 2014.
[54] E. Bertino and E. Ferrari, “Big Data Security and Privacy,” In A Comprehensive Guide Through the Italian Database Research Over the Last 25 Years, (pp. 425-439). Springer, Cham, 2018.
[55] U. Sivarajah, M.M. Kamal, Z. Irani, and V. Weerakkody, “Critical Analysis of Big Data Challenges and Analytical Methods,” Journal of Business Research, vol. 70, pp. 263-286, 2017.
[56] Zhou, L., Pan, S., Wang, J. and Vasilakos, A.V., “Machine Learning on Big Data: Opportunities and Challenges,” Neurocomputing, vol. 237, pp. 350-361, 2017.
[57] S.M. Idrees, M.A. Alam, and P. Agarwal, “A study of Big Data and its Challenges,” International Journal of Information Technology, vol. 11, no. 4, pp. 841-846, 2019.