Data management in cloud environments: NoSQL and NewSQL data stores

被引:156
作者
Grolinger K. [1 ]
Higashino W.A. [1 ,2 ]
Tiwari A. [1 ]
Capretz M.A.M. [1 ]
机构
[1] Department of Electrical and Computer Engineering, Western University, London
[2] Instituto de Computação, Universidade Estadual de Campinas, Campinas, SP
来源
Journal of Cloud Computing: Advances, Systems and Applications | / 2卷 / 1期
关键词
Big Data; Cloud computing; Data management; Distributed storage; NewSQL; NoSQL;
D O I
10.1186/2192-113X-2-22
中图分类号
学科分类号
摘要
Advances in Web technology and the proliferation of mobile devices and sensors connected to the Internet have resulted in immense processing and storage requirements. Cloud computing has emerged as a paradigm that promises to meet these requirements. This work focuses on the storage aspect of cloud computing, specifically on data management in cloud environments. Traditional relational databases were designed in a different hardware and software era and are facing challenges in meeting the performance and scale requirements of Big Data. NoSQL and NewSQL data stores present themselves as alternatives that can handle huge volume of data. Because of the large number and diversity of existing NoSQL and NewSQL solutions, it is difficult to comprehend the domain and even more challenging to choose an appropriate solution for a specific task. Therefore, this paper reviews NoSQL and NewSQL solutions with the objective of: (1) providing a perspective in the field, (2) providing guidance to practitioners and researchers to choose the appropriate data store, and (3) identifying challenges and opportunities in the field. Specifically, the most prominent solutions are compared focusing on data models, querying, scaling, and security related capabilities. Features driving the ability to scale read requests and write requests, or scaling data storage are investigated, in particular partitioning, replication, consistency, and concurrency control. Furthermore, use cases and scenarios in which NoSQL and NewSQL data stores have been used are discussed and the suitability of various solutions for different sets of applications is examined. Consequently, this study has identified challenges in the field, including the immense diversity and inconsistency of terminologies, limited documentation, sparse comparison and benchmarking criteria, and nonexistence of standardized query languages. © 2013 Grolinger et al.
引用
收藏
相关论文
共 74 条
[1]  
Facebook Newsroom A New data center for Iowa., , http://newsroom.fb.com/News/606/ANew-Data-Center-for-Iowa, Accessed 29 Sep 2013
[2]  
Ohlhorst, F.J., (2013) Big Data Analytics: Turning Big Data into Big Money, , John Wiley & Sons, Inc, Hoboken, New Jersey, USA
[3]  
Stonebraker, M., Madden, S., Abadi, D.J., Harizopoulos, S., Hachem, N., Helland, P., The end of an architectural era: (it's time for a complete rewrite) (2007) Proc 33rd Int Conf Large Data Bases, pp. 1150-1160
[4]  
Beyer, M.A., Laney, D., (2012) The Importance of "Big Data": A Definition, , http://www.gartner.com/id=2057415, Accessed 29 Sep 2013
[5]  
Agrawal, D., Das, S., El Abbadi, A., Big data and cloud computing: Current State and Future Opportunities (2011) Proceedings of the 14th International Conference on Extending Database Technology-EDBT/ICDT'11., pp. 530-533. , ACM Press, New York, NY, USA
[6]  
Bughin, J., Chui, M., Manyika, J., Clouds, big data, and smart assets: Ten tech-enabled business trends to watch (2010) McKinsey Quarterly, 2010, pp. 1-14
[7]  
Mell, P., Grance, T., (2011) The NIST definition of cloud computing, , http://csrc.nist.gov/publications/nistpubs/800-145/SP800-145.pdf, NIST special publication 800-145. Accessed on 29 Sep 2013
[8]  
Zhang, Q., Cheng, L., Boutaba, R., Cloud computing: State-of-the-art and research challenges (2010) J Intern Serv Appl, 1, pp. 7-18. , 10.1007/s13174-010-0007-6
[9]  
Venters, W., Whitley, E.A., A critical review of cloud computing: Researching desires and realities (2012) J Info Technol, 27, pp. 179-197. , 10.1057/jit.2012.17
[10]  
Tudorica, B.G., Bucur, C., (2011) A comparison between several NoSQL databases with comments and notes, pp. 1-5. , 2011 10th International Conference RoEduNet. IEEE