Big Data Analytics: A Theoretical Study

Bi-Annual Double Blind Peer Reviewed Refereed Journal

Open Access Journal

Category: 
Vol9_Issue1
Authors: 
Bikash Jha, M. Tech (IT), Student, USICT, GGSIPU, Dwarka 16-C, New Delhi
Dr. Amit Prakash Singh, Associate Professor, USICT, GGSIPU, Dwarka 16-C, New Delhi
Abstract: 

Data that has grown too large to be processed with traditional methods is termed “Big Data”. Big data is concerned with very high-volume, growing, and complex data drawn from many independent sources. The Hadoop architecture was introduced to organize and store large amounts of data in different shapes, sizes, and formats; the data may be structured, semi-structured, or unstructured. Hadoop is a framework for managing very large amounts of heterogeneous data. Big data analytics is, in general, the identification of hidden patterns and unknown correlations in such data. Hive is an open-source project, originally developed at Facebook, for data analysis. Hive uses HQL, the Hive Query Language. This paper introduces the Hive tools, which are built on the Hadoop ecosystem. We use different SQL/HQL queries to process the data and obtain results with the Hive tools of Hadoop, which rely on the MapReduce algorithm.
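As a rough illustration of how an HQL aggregate query can be expressed in MapReduce terms, the sketch below simulates the map, shuffle, and reduce phases in plain Python for a query of the form SELECT dept, COUNT(*) FROM employees GROUP BY dept. The table name employees, the column dept, and all function names are hypothetical and chosen only for this example; the code runs in memory and does not reproduce the execution plan that Hive actually generates.

```python
from collections import defaultdict

# Hypothetical in-memory stand-in for a Hive table named "employees".
ROWS = [
    {"name": "a", "dept": "IT"},
    {"name": "b", "dept": "HR"},
    {"name": "c", "dept": "IT"},
]


def map_phase(rows):
    """Emit one (key, 1) pair per row, keyed by the GROUP BY column."""
    for row in rows:
        yield row["dept"], 1


def shuffle(pairs):
    """Group all emitted values by key, as the shuffle step would."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Sum the values for each key, which corresponds to COUNT(*)."""
    return {key: sum(values) for key, values in groups.items()}


def count_by_dept(rows):
    """Chain the three phases for the hypothetical GROUP BY query."""
    return reduce_phase(shuffle(map_phase(rows)))


result = count_by_dept(ROWS)
print(result)
```

In this sketch the mapper does no aggregation itself; all counting happens in the reducer, which keeps the three phases easy to tell apart at the cost of moving more intermediate pairs through the shuffle.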
 
