Big Data and the Internet of Things

advertisement
BIG DATA AND THE INTERNET OF THINGS:
TWO SIDES OF THE SAME COIN?
TAMARA DULL, DIRECTOR OF EMERGING TECHNOLOGIES
C op yr i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r es er v e d .
QUIZ #1: BIG DATA OR THE INTERNET OF THINGS?
Every minute, we:
• send 204M emails
• generate 1.8M Facebook
likes
• send 278K tweets
• upload 200K photos to
Facebook
BIG DATA
Source: http://blog.qmee.com/wp-content/uploads/2013/07/Qmee-Online-In-60-Seconds2.png
C op yr i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r es er v e d .
QUIZ #2: BIG DATA OR THE INTERNET OF THINGS?
12M RFID tags – used to capture
data and track movement of
objects in the physical world – had
been sold in 2011.
By 2021, it is estimated that
number will have risen to 209B as
____________ takes off.
INTERNET OF THINGS
Source: http://www.chassis-plans.com/blog/big-data-interesting-facts-and-figures/
C op yr i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r es er v e d .
QUIZ #3: BIG DATA OR THE INTERNET OF THINGS?
The boom of _____________
will mean that the amount of
devices that connect to the
Internet will rise from about
13 billion today to 50 billion
by 2020.
INTERNET OF THINGS
Source: http://newsroom.cisco.com/ioe
C op yr i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r es er v e d .
QUIZ #4: BIG DATA OR THE INTERNET OF THINGS?
The ____________
industry is expected to
grow from US$10.2 billion
in 2013 to about US$54.3
billion by 2017.
BIG DATA
Source: http://www.dazeinfo.com/2014/05/02/rise-big-data-industry-market-worth-53-4-billion-2017/
C op yr i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r es er v e d .
TODAY’S
ROADMAP
BIG DATA IS A SUBSET OF THE INTERNET OF THINGS
DISCUSSION
•
The Definitions
• The Comparison
• What Really Matters
C op yr i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r es er v e d .
THE DEFINITIONS
C op yr i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r es er v e d .
THE DEFINITIONS GARTNER’S BIG DATA DEFINITION 2012
Big data is high volume, high velocity, and/or high variety
information assets that require new forms of processing to
enable enhanced decision making, insight discovery and
process optimization.
C op yr i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r es er v e d .
THE DEFINITIONS GARTNER’S BIG DATA DEFINITION 2001
Date: 6 February 2001
Author: Doug Laney
3-D Data Management: Controlling Data Volume, Velocity and
Variety. Current business conditions and mediums are pushing traditional data
management principles to their limits, giving rise to novel and more formalized
approaches.
META Trend: During 2001/02, leading enterprises will increasingly use a
centralized data warehouse to define a common business vocabulary that
improves internal and external collaboration. Through 2003/04, data quality and
integration woes will be tempered by data profiling technologies (for generating
metadata, consolidated schemas, and integration logic) and information
logistics agents. By 2005/06, data, document, and knowledge management will
coalesce, driven by schema-agnostic indexing strategies and portal maturity.
Source: http://blogs.gartner.com/doug-laney/deja-vvvue-others-claiming-gartners-volume-velocity-variety-construct-for-big-data/
C op yr i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r es er v e d .
THE DEFINITIONS “WHAT IS BIG DATA?” ASKS DATASCIENCE@BERKELEY
Source: http://datascience.berkeley.edu/what-is-big-data/
C op yr i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r es er v e d .
THE DEFINITIONS “WHAT IS BIG DATA?”: THREE RESPONSES
Big Data is the result of
collecting information at its
most granular level — it’s
what you get when you
instrument a system and
keep all of the data that your
instrumentation is able to
gather.
What’s “big” in big data isn’t necessarily the size
of the databases, it’s the big number of data
sources we have, as digital sensors and
behavior trackers migrate across the world. As
we triangulate information in more ways, we will
discover hitherto unknown patterns in nature and
society — and pattern-making is the wellspring
of new art, science, and commerce.
Jon Bruner, Editor-atLarge, O’Reilly Media
Quentin Hardy, Deputy Tech Editor, The New York
Times
Big data is data that even when efficiently compressed still contains 5-10 times
more information (measured in entropy or predictive power, per unit of time) than
what you are used to right now. It may require a different approach to extract
value.
Vincent Granville, Co-Founder, Data Science Central
Source: http://datascience.berkeley.edu/what-is-big-data/
C op yr i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r es er v e d .
THE DEFINITIONS INTERNET OF THINGS
Networks of low-cost sensors and actuators for data collection, monitoring,
decision making, and process optimization.
McKinsey & Company
The Internet of Everything (IoE) brings
together people, process, data, and
things to make networked connections
more relevant and valuable than ever
before-turning information into actions
that create new capabilities, richer
experiences, and unprecedented
economic opportunity for businesses,
individuals, and countries.
The Internet of Things – which
excludes PC, tablets, and
smartphones - is the network
of physical objects that contain
embedded technology to
communicate and sense or
interact with their internal
states or the external
environment.
Cisco
Gartner
C op yr i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r es er v e d .
THE DEFINITIONS GARTNER HYPE CYCLE 2011
C op yr i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r es er v e d .
THE DEFINITIONS GARTNER HYPE CYCLE 2014
C op yr i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r es er v e d .
THE COMPARISON
C op yr i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r es er v e d .
THE COMPARISON A SIMPLIFIED PERSPECTIVE
Big Data
data
Internet
of Things
data
devices
connectivity
C op yr i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r es er v e d .
THE COMPARISON MAURITIUS RESOLUTION ON BIG DATA
•
•
•
•
•
•
•
•
•
•
•
•
Implement privacy by design.
Be transparent about what data is collected, how data is processed, for what purposes
data will be used, and whether data will be distributed to third parties.
Define the purpose of collection at the time of collection and, at all times, limit use of
the data to the defined purpose.
Obtain consent.
Collect and store only the amount of data necessary for the intended lawful purpose.
Allow individuals access to data maintained about them, information on the source of
the data, key inputs into their profile, and any algorithms used to develop their profile.
Allow individuals to correct and control their information.
Conduct a privacy impact assessment.
Consider data anonymization.
Limit and carefully control access to personal data.
Conduct regular reviews to verify if results from profiling are “responsible, fair and
ethical and compatible with and proportionate to the purpose for which the profiles are
being used.”
Allow for manual assessments of any algorithmic profiling outcomes with “significant
effects to individuals.”
Source: http://www.insideprivacy.com/international/data-protection-officials-adopt-internet-of-things-declaration-and-big-data-resolution/
C op yr i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r es er v e d .
THE COMPARISON MAURITIUS DECLARATION ON THE INTERNET OF THINGS
•
•
•
•
•
•
•
Self-determination is an inalienable right for all human beings.
Data obtained from connected devices is “high in quantity, quality and
sensitivity” and, as such, “should be regarded and treated as personal data.”
Those offering connected devices “should be clear about what data they
collect, for what purposes and how long this data is retained.”
Privacy by design should become a key selling point of innovative
technologies.
Data should be processed locally, on the connected device itself. Where it is
not possible to process data locally, companies should ensure end-to-end
encryption.
Data protection and privacy authorities should seek appropriate enforcement
action when the law has been breached.
All actors in the internet of things ecosystem “should engage in a strong,
active and constructive debate” on the implications of the internet of things
and the choices to be made.
Source: http://www.insideprivacy.com/international/data-protection-officials-adopt-internet-of-things-declaration-and-big-data-resolution/
C op yr i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r es er v e d .
THE COMPARISON TAKEAWAYS FROM MAURITIUS CONFERENCE
•
•
•
•
Objective: Set out principles and recommendations designed to reduce the
risks associated with the collection and use of data for players in the
connected devices and big data ecosystems.
These two documents are not binding; however, they serve as relevant
indicators of direction of data privacy policies and trends.
The concerns addressed at the conference echo those of the White House
reports on big data privacy, including discrimination, personalization, and data
anonymization.
The adoption of the Declaration and Resolution serves as a reminder of the
importance of ethics in tackling issues of data privacy and “the potential risks
of big data-driven digital predestination.”
C op yr i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r es er v e d .
WHAT REALLY MATTERS
C op yr i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r es er v e d .
QUIZ WHAT DO THESE COMPANIES HAVE IN COMMON?
Google
Facebook
Twitter
LinkedIn
Orbitz
Airbnb
Angie’s List
Match.com
OpenTable
Uber
C op yr i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r es er v e d .
QUIZ RESPONSE
WHAT THESE COMPANIES HAVE IN COMMON:
•
Google
Facebook
Twitter
LinkedIn
Orbitz
Airbnb
Angie’s List
Match.com
OpenTable
Uber
•
•
•
•
Presence. They’re all online; they have no brick-&-mortar
presence. If their website or mobile app is unavailable, it’s as
if they don’t exist.
Primary asset. Their primary corporate asset is data,
specifically data (mostly big) created by users. They do not
sell physical inventory.
Business model. Their primary function is to connect users
with the right web page, person, and/or service. If
connections aren’t relevant, quick, or easy, users move on.
Revenue model. They make their money by connecting the
data dots, primarily through advertising or service fees. If
connections aren’t made, money is lost.
Top companies:
The Bottom Line
If any of these companies fail
to keep collecting and
connecting the data dots for
its users, for whatever reason,
they will go out of business.
•
•
C op yr i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r es er v e d .
Google, Facebook, Twitter, LinkedIn, and Orbitz are included
in Forbes’ 2014 ten best companies to work for list.
Google, Facebook, and Twitter are included in Glassdoor’s Top 25
Companies for Culture & Values 2014 list.
WHAT REALLY
DATA AS A CORPORATE ASSET
MATTERS

"To understand what it means to manage data as an
asset, you first have to understand the business
definition of the term ‘asset.’ First, the asset needs to
have a value; second, that value needs to be
measurable; and third, it helps a company achieve its
strategic objectives. When managed the right way—
that is, as an asset—a company’s data meets these
three criteria.“
Jill Dyché, co-founder of Baseline Consulting and current
VP of SAS Best Practices
C op yr i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r es er v e d .
Data is driving
your business
 Good news: It
can provide
insight into your
business
performance and
strategic
direction
 Bad news: You
could drown in it
 Manage your data
before it manages
you
WHAT REALLY
FIVE QUESTIONS JILL ASKS EXECUTIVES
MATTERS
1.
2.
3.
4.
5.
Are you giving your corporate data – big and small – resources comparable
to your other corporate assets?
Are you dedicating technology comparable to your other corporate assets?
Are you allocating funding relative to your other corporate assets?
Do you measure the cost of poor, missing or inaccurate data?
Do you understand the “opportunity cost” of not delivering timely and
relevant data to the business?
C op yr i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r es er v e d .
WHAT REALLY
MATTERS
THREE KEY TAKEAWAYS
Big data is about making money. If you’re not
using big data to improve your business – e.g.,
revenues, profits, operational efficiencies, decision
making, etc. – then don’t do big data. It’s not worth
the time, money or hassle.
2. Manage your data as a corporate asset. Even if big
data or the Internet of Things is not your core
business, managing your data – big and small – as a
corporate asset is absolutely vital as your company
makes its way in today’s connected marketplace. To
not manage your corporate data is to rob your
company of its strategic edge.
3. Two sides of the same coin? Big data is a subtopic
of the Internet of Things. Big data is about data, while
IoT is about data, devices, and connectivity.
1.
C op yr i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r es er v e d .
FINAL QUIZ BIG DATA OR THE INTERNET OF THINGS?
The value of the ________
market is expected to soar
from $2 billion in 2013 to $50
billion by 2020, according to
market research firm Allied
Market Research.
HADOOP
Source: http://www.datanami.com/2014/05/29/hadoop-market-grow-58-2020-report-says/
C op yr i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r es er v e d .
IT’S A BIG DATA WORLD OUT THERE.
NOW LET’S BE SAFE.
Tamara.Dull@sas.com
@tamaradull
C op yr i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r es er v e d .
sas.com
Download