big data

Post on 17-Jun-2015

Category: Data & Analytics

Big Data

What is Big data?

Big Data refers to massive amounts of data that accumulate over time and are difficult to analyze and handle using common database management tools.

The data are analyzed for marketing trends in business as well as in the fields of manufacturing, medicine and science.

The types of data include business transactions, e-mail messages, photos, surveillance videos, activity logs, and unstructured text from blogs and social media, as well as the huge amounts of data that can be collected from sensors of all varieties.

Who's Generating Big Data?

Social media and networks (all of us are generating data)

Scientific instruments (collecting all sorts of data)

Mobile devices (tracking all objects all the time)

Sensor technology and networks (measuring all kinds of data)

Most analysts and practitioners currently refer to data sets ranging from 30-50 terabytes (1000 gigabytes per terabyte) to multiple petabytes (1000 terabytes per petabyte) as big data.
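The unit arithmetic behind those figures can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, the 1000-based factors are the ones quoted above, and the 30 TB default simply takes the lower end of the quoted range as a cut-off.

```python
# Decimal storage units, using the 1000-based factors quoted in the text.
GB_PER_TB = 1000
TB_PER_PB = 1000


def tb_to_gb(terabytes):
    """Convert terabytes to gigabytes."""
    return terabytes * GB_PER_TB


def tb_to_pb(terabytes):
    """Convert terabytes to petabytes."""
    return terabytes / TB_PER_PB


def meets_quoted_threshold(terabytes, lower_tb=30):
    """Return True if a data set is at least `lower_tb` terabytes.

    The 30 TB default is an assumption: it is the low end of the
    30-50 TB range mentioned above, not a formal definition.
    """
    return terabytes >= lower_tb


if __name__ == "__main__":
    print(tb_to_gb(30))                 # 30 TB expressed in GB
    print(tb_to_pb(2500))               # 2500 TB expressed in PB
    print(meets_quoted_threshold(12))   # a 12 TB data set
```

The threshold is a parameter because the text gives a range rather than a single number.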

Big Data: The 3 V's

Volume: The massive scale and growth of unstructured data outstrips traditional storage and analytical solutions.

Velocity: Data is generated in real time, with demands for usable information to be served up immediately.

Variety: Data is generated in many forms, such as relational data, text data, semi-structured data, and graph data.

Examples of Big Data Projects

Consumer product companies and retail organizations are monitoring social media like Facebook and Twitter to get an unprecedented view into customer behavior, preferences, and product perception.

Manufacturers are monitoring minute vibration data from their equipment, which changes slightly as the equipment wears down, to predict the optimal time to replace or maintain it. Replacing it too soon wastes money; replacing it too late triggers an expensive work stoppage.

Advertising and marketing agencies are tracking social media to understand responsiveness to campaigns, promotions, and other advertising mediums.
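The equipment-vibration example above can be illustrated with a very small sketch. It assumes a simple rule that the slides do not specify: flag a machine when the average of its most recent vibration readings drifts above a known healthy baseline by more than a chosen tolerance. The function name, window size, and tolerance are all hypothetical, and a real system would likely use a richer model.

```python
from statistics import mean


def needs_maintenance(readings, baseline, window=5, tolerance=0.2):
    """Flag equipment whose recent vibration level has drifted upward.

    readings  -- vibration amplitudes, oldest first (units are arbitrary)
    baseline  -- amplitude measured when the equipment was healthy
    window    -- how many of the latest readings to average (assumed value)
    tolerance -- allowed fractional rise above baseline (assumed value)
    """
    if len(readings) < window:
        # Not enough data yet to judge a trend.
        return False
    recent_level = mean(readings[-window:])
    return recent_level > baseline * (1 + tolerance)


if __name__ == "__main__":
    healthy = [1.0, 1.0, 1.0, 1.0, 1.0]
    worn = [1.0, 1.1, 1.3, 1.3, 1.3, 1.3, 1.3]
    print(needs_maintenance(healthy, baseline=1.0))
    print(needs_maintenance(worn, baseline=1.0))
```

Averaging over a window rather than reacting to a single reading is one way to avoid replacing equipment too soon because of one noisy measurement.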

Yahoo!: one of the largest destinations on the web

80% of the U.S. Internet population uses Yahoo!

Global network of content, commerce, media, search, and access products.

100+ properties including mail, TV, news, shopping, finance, autos, travel, games, movies, health, etc.

25+ terabytes of data collected each day, representing thousands of cataloged consumer behaviors

Yahoo! Big Data: A league of its own

Grand challenge problems of data processing

Travel, credit card processing, stock exchange, retail, Internet

The Yahoo! data challenge exceeds the others by 2 orders of magnitude

Behavioral Targeting (BT)

Yahoo! User DNA

On a per-consumer basis: maintain a behavioral/interests profile and profitability (user value and LTV) metrics
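A per-consumer record of this kind might look like the following sketch. It is not Yahoo!'s actual schema: the class, field names, and update rule are hypothetical, chosen only to show a behavioral/interests profile stored next to value metrics for one user.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class UserProfile:
    """Hypothetical per-consumer record: interests plus value metrics."""
    user_id: str
    # Interest category -> accumulated score (higher means more activity).
    interests: Dict[str, float] = field(default_factory=dict)
    # Placeholder for a current user-value metric, set by the caller.
    user_value: float = 0.0
    # Running lifetime value (LTV), here simply summed revenue.
    lifetime_value: float = 0.0

    def record_event(self, category: str, weight: float = 1.0,
                     revenue: float = 0.0) -> None:
        """Update the profile after one observed behavior."""
        self.interests[category] = self.interests.get(category, 0.0) + weight
        self.lifetime_value += revenue

    def top_interests(self, n: int = 3) -> List[str]:
        """Return the n highest-scoring interest categories."""
        ranked = sorted(self.interests, key=self.interests.get, reverse=True)
        return ranked[:n]


if __name__ == "__main__":
    profile = UserProfile("user-1")
    profile.record_event("autos", weight=2.0, revenue=0.5)
    profile.record_event("travel")
    profile.record_event("autos")
    print(profile.top_interests(1))
    print(profile.lifetime_value)
```

Keeping interests as a category-to-score mapping makes it cheap to update on every event, which matters at the collection rates described above.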

Thank you
