Big Data and Data Mining

 

Welcome to the big data and data mining lesson summary video. 
In this lesson, you gained insights into the impact of big data on various aspects 
of society, from business operations to sports, 
and developed an understanding of the key attributes and 
challenges associated with big data. 
In this video, we will recap the fundamentals of big data, 
how big data drives digital transformation, 
how data scientists leverage the essential characteristics of the cloud to gain 
insights from big data, the data mining process, and 
common tools used to process big data. 
The availability of vast amounts of data, resulting in what we now call big data, 
is driving transformation in business and industry and consequently, 
how we live our daily lives. 
Organizations realize that they require fundamental changes to their approach to 
business, impacting every aspect of the organization. 
The availability of so much disparate data created by people, 
tools, and machines requires new, innovative, and scalable technology. 
Big data drives us to derive real-time business insights 
relating to consumers, risk, profit, performance, and productivity management, 
ultimately enhancing business value. 
Not everyone agrees on the definition of big data, but 
people generally agree on five characteristics of this data: value, 
volume, velocity, variety, and veracity. 
Value reflects the expectation that investing time in studying big data will pay off. 
Volume refers to the scale of the data. Drivers of volume include an increasing 
number of collectible data sources and scalable infrastructure. 
Velocity refers to how quickly data is generated, driven by an ever-increasing 
number of sources and nonstop processes. 
Variety reflects that related data comes from different sources, 
both structured and unstructured. 
Veracity refers to the quality and origin of data and 
how accurately it conforms to facts. 
The development of cloud and 
cloud technologies enables us to work with big data. 
Cloud refers to the delivery of on-demand computing resources on 
a pay-for-use basis. 
Cloud computing has five essential characteristics: on-demand, 
network access, resource pooling, elasticity, and measured service. 
On-demand means access to the processing power, storage, and network resources that you need. 
Network access means these computing resources can be accessed over a network such as the Internet. 
Resource pooling allows providers to service multiple consumers with 
the resources dynamically assigned according to demand, 
making cloud computing cost efficient. 
Elasticity means that you can access more resources when you need them and 
automatically scale back when you don't. 
With measured service, you only pay for what you use or reserve as you go.
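
To make elasticity and measured service concrete, here is a minimal Python sketch. It is an illustration only, not any cloud provider's API: the function names, the demand figures, the per-instance capacity, and the price are all hypothetical. It compares the bill when capacity follows demand hour by hour with the bill when enough capacity for the peak is kept running the whole time.

```python
import math

def instances_needed(requests_per_sec, capacity_per_instance, min_instances=1):
    """Elasticity: size the fleet to the current demand, never below a minimum."""
    return max(min_instances, math.ceil(requests_per_sec / capacity_per_instance))

def metered_cost_cents(hourly_demand, capacity_per_instance, cents_per_instance_hour):
    """Measured service: bill only for the instance-hours provisioned each hour."""
    instance_hours = sum(
        instances_needed(demand, capacity_per_instance) for demand in hourly_demand
    )
    return instance_hours * cents_per_instance_hour

# Hypothetical workload: requests per second in four consecutive hours.
hourly_demand = [50, 120, 400, 90]
CAPACITY = 100      # assumed requests/sec one instance can serve
PRICE_CENTS = 10    # assumed price per instance-hour, in cents

# Elastic: capacity follows demand each hour.
elastic_cost = metered_cost_cents(hourly_demand, CAPACITY, PRICE_CENTS)

# Fixed: enough instances for the peak hour, kept running for all hours.
peak_instances = instances_needed(max(hourly_demand), CAPACITY)
fixed_cost = peak_instances * len(hourly_demand) * PRICE_CENTS

print(f"elastic: {elastic_cost} cents, fixed at peak: {fixed_cost} cents")
```

With these made-up numbers the elastic fleet uses 1, 2, 4, and 1 instances across the four hours, so it is billed for 8 instance-hours, while the fixed fleet is billed for 16.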
