Bigdata Analytics: Comparative Study of Tools
Sri Vasavi College, Erode Self-Finance Wing, 3rd February 2017. National Conference on Computer and Communication, NCCC’17. International Journal of Computer Science (IJCS) Published by SK Research Group of Companies (SKRGC)
Download this PDF format
Abstract
The term “Big Data” was first introduced to the computing world by Roger Magoulas from O’Reilly media in 2005 in order to define a great amount of data that traditional data management techniques cannot manage and process due to the complexity and size of this data. A study on the Evolution of Big Data as a Research and Scientific Topic shows that the term “Big Data” was present in research starting with 1970s but has been comprised in publications in 2008. Nowadays the Big Data concept is treated from different points of view covering its implications in many fields. In the information era, enormous amounts of data have become available on hand to decision makers. Big data refers to datasets that are not only big, but also high in variety and velocity, which makes them difficult to handle using traditional tools and techniques. Due to the rapid growth of such data, solutions need to be studied and provided in order to handle and extract value and knowledge from these datasets. Furthermore, decision makers need to be able to gain valuable insights from such varied and rapidly changing data, ranging from daily transactions to customer interactions and social network data. Such value can be provided using big data analytics, which is the application of advanced analytics techniques on big data. This paper aims to analyze some of the different analytics methods and tools which can be applied to big data, as well as the opportunities provided by the application of big data analytics in various decision domains.
References
[1] https://www.slideshare.net/HarshMishra3/ harsh-big-data-seminar-report
[2] https://www.infoworld.com/d/business- intelligence/7-top-tools-tamingbig- data-191131.
[3] J. Venner, ?Pro Hadoop?, a press, (2009).
[4]T. White,? Hadoop: The Definitive Guide?, third ed., O'Reilly Media, Yahoo Press, (2012).
[5] W. Tantisiriroj, S. Patil and G Gibson, ?Data-intensive File Systems for Internet Services?, A Rose by Any Other Name (CMU-PDL-08-114). Research Centers and Institutes at Research
[6] M. K. McKusick and S. Quinlan, ?GFS: Evolution on Fast-forward?, ACM Queue, New York, vol. 7, no. 7, (2009).
[7] K. Shvachko, H. Kuang, S. Radia and R Chansler, ?The Hadoop Distributed File System?, Proceedings of IEEE Conference, 978-1-4244-7153-9/10, (2010).
[8] J. Dean and S. Ghemawat, ?Mapreduce: Simplified data processing on large clusters?, commun. ACM, vol. 51, no. 1, (2008), pp. 107–113.
[9] J. Dean and S. Ghemawat, ?Mapreduce: A flexible data processing tool?, commun. ACM, vol. 53, no. 1, (2010), pp. 72–77.
[10] Hewitt, Eben (December 15, 2010). Cassandra: The Definitive Guide (1st ed.).
[11] Brown, MC (October 31, 2011), Getting Started with CouchDB (1st ed.), O'Reilly Media.
[12] Pirtle, Mitch (March 3, 2011), MongoDB for Web Development (1st ed.), Addison-Wesley Professional.
Keywords
Big data, Big data analytics, Business analytics, Bid data tools.