Please I need written answers for these question dont post a
Please I need written answers for these question (don\'t post as image)
Question 1: List and describe the three main \"V\"s that characterize Big Data.
Question 2: Define MapReduce.
Question 3: What is a data scientist and what does the job involve?
Question 4: What are recommender systems, how are they developed, and how is the data used to build a recommendation system obtained?
Question 5: In what ways can communications companies use geospatial analysis to harness their data effectively?
Solution
the three \'v\'s which are characterstcs of big data are
1)velocity:it describes about the rate of data increasing day by day
2)verasity: describes about the different types of data
3)volume:describes the amount of data increasing day by day
4)variety:describes about the accuracy of data
so A data is said to be big data if satfies all the above qualities.
***********************************************************************************************************************************
mapreduce: it is a programming model for data processing .this mapreduce can return in java,python,c,ruby
there are two generations in mapreduce
i)mapreduce1: it contains task trackers and job trackers as main components
ii)mapreduce2: it uses yarn as scheduler,it mainly contains two components they are node manager and resource manager
***********************************************************************************************************************************
datascientist:they are the wranglers of big data .they will get large amount of messy data and apply there statistics and programming to get the clean data to analayze the data and get the required output
responsibilities:
i)they conduct undirect reasearch
ii)they will extract the messy data from internal and external sources
iii)they will the filter the messy data
iv)they will invent new algorithms to solve problems in analyzing the data
